Google DeepMind unveils AI agent that learns in real time

Source Cryptopolitan

Google DeepMind on Thursday debuted SIMA 2 – its reasoning AI agent that the firm claims behaves like a human inside virtual worlds. The tech company said SIMA 2 helps DeepMind advance beyond simple on-screen actions and move toward AI that plans and explains itself, as well as learn through experience.

The firm said the launch marked a significant step toward Artificial General Intelligence (AGI). DeepMind also warned that SIMA 2 has important general implications for the future of robotics and AI-embodiment.

SIMA 2 thinks for itself and takes actions in interactive environments

The tech company released the first version of SIMA (Scalable Instructable MultiWorld Agent) in March. Google said the AI agent learned hundreds of basic skills by watching the screen and using virtual keyboard and mouse controls. The firm also acknowledged that the latest version of the AI agent takes things a step further by allowing the AI to think for itself.

Google DeepMind also revealed that Gemini powers the AI agent. The tech company stated that integrating SIMA 2 and Gemini helps the AI agent understand a user’s high-level goal, perform complex reasoning, and skillfully execute goal-oriented actions in games. 

The firm said SIMA 2 is the company’s most capable AI agent for virtual 3D worlds. DeepMind found that interacting with the agent felt less like giving it commands and more like collaborating with a reasoning companion about the task at hand.

According to the announcement, SIMA 2 goes beyond following basic instructions to thinking, understanding, and taking actions in interactive environments. The AI agent will allow users to interact with it through text, voice, or even images.

Google said its Gemini AI model helps SIMA 2 interpret high-level goals and talk through the steps it intends to take. The firm added that Gemini helps the new human-centered agent collaborate within games with a level of reasoning the original system could not achieve.

The tech company also reported stronger generalization across virtual environments. DeepMind confirmed that SIMA 2 completed longer, more complex tasks, including logic prompts, screen-drawn sketches, and emojis. Google said the ability gets SIMA 2’s performance closer to that of a human player on a wide range of tasks. The firm also noted that the AI agent had a 65% task completion rate, compared to 31% by SIMA 1.

DeepMind found that SIMA 2 interpreted instructions and acted inside entirely new 3D worlds generated by Genie 3. The project was released last year, which creates interactive environments from a single image or text prompt. The tech company said SIMA 2 could orient itself, understand goals, and take meaningful action in worlds it had never encountered until before testing.

Google argued that the human-centered agent is now far better at carrying out detailed instructions, even in worlds it’s never experienced before. The firm said SIMA 2 can transfer learned concepts from one game to another, connecting the dots between similar tasks.

DeepMind finds gaps in SIMA 2 that need to be addressed

Researchers noted that the agent switched into self-directed play after learning from human demonstrations. The agent used trial and error, along with feedback generated by Gemini, to create new experience data. The new experience data includes a training loop where SIMA 2 attempted the tasks it generated and fed its own trajectory data back into the next version of the model.

Although DeepMind hailed SIMA 2 as an advancement in artificial intelligence, the research also found gaps that need to be addressed. Google identified gaps, including working within a limited memory window, struggling with very long, multi-step tasks, and facing visual-interpretation challenges seen in 3D AI systems.

DeepMind revealed that SIMA 2 served as a testbed for skills that could be used in robotics and navigation in the future. The firm said its SIMA 2 research offers a strong path towards applications in robotics and also AGI in the real world.

Get seen where it counts. Advertise in Cryptopolitan Research and reach crypto’s sharpest investors and builders.

Disclaimer: For information purposes only. Past performance is not indicative of future results.
placeholder
Why a Quiet 2025 Signals a Massive 2026 Crypto Bull Run: Bitwise CIO ExplainsBitwise's Matt Hougan Predicts a Crypto Boom in 2026 Amid Current Market Struggles
Author  Mitrade
Yesterday 04: 03
Bitwise's Matt Hougan Predicts a Crypto Boom in 2026 Amid Current Market Struggles
placeholder
Gold hits three-week top as dovish Fed bets offset US government reopening optimismGold (XAU/USD) reverses a modest Asian session dip and climbs to an over three-week high, around the $4,213 region, on Thursday.
Author  FXStreet
Yesterday 06: 22
Gold (XAU/USD) reverses a modest Asian session dip and climbs to an over three-week high, around the $4,213 region, on Thursday.
placeholder
Bitcoin vs. Ethereum: Distinct Monetary UniversesBitcoin and Ethereum are diverging significantly in their monetary roles, according to a joint report from Glassnode and Keyrock.
Author  Mitrade
7 hours ago
Bitcoin and Ethereum are diverging significantly in their monetary roles, according to a joint report from Glassnode and Keyrock.
placeholder
Ethereum slides 5% as bears lean on $3,500 cap and put $3,150 support in focusEthereum (ETH) drops more than 5% after a failed push above $3,550, with price sliding to $3,153 and now holding below $3,350, the 100-hour SMA and a bearish trend line at $3,500; unless bulls reclaim the $3,350–$3,500 zone, the short-term bias stays bearish and a clean break under $3,150 could expose $3,050, $3,000 and even the $2,880–$2,850 support area.
Author  Mitrade
6 hours ago
Ethereum (ETH) drops more than 5% after a failed push above $3,550, with price sliding to $3,153 and now holding below $3,350, the 100-hour SMA and a bearish trend line at $3,500; unless bulls reclaim the $3,350–$3,500 zone, the short-term bias stays bearish and a clean break under $3,150 could expose $3,050, $3,000 and even the $2,880–$2,850 support area.
placeholder
Top 3 Price Prediction: Bitcoin, Ethereum, Ripple – BTC, ETH, and XRP flash deeper downside risks as market selloff intensifiesBitcoin (BTC), Ethereum (ETH) and Ripple (XRP) trade in red on Friday after correcting more than 5%, 10% and 2%, respectively, so far this week.
Author  FXStreet
1 hour ago
Bitcoin (BTC), Ethereum (ETH) and Ripple (XRP) trade in red on Friday after correcting more than 5%, 10% and 2%, respectively, so far this week.
goTop
quote