Eltaller Digital

Google DeepMind’s Participation at NeurIPS 2024

December 21, 2024
in Artificial Intelligence


Research

Published 5 December 2024

Advancing adaptive AI agents, empowering 3D scene creation, and innovating LLM training for a smarter, safer future

Next week, AI researchers from around the globe will convene for the 38th Annual Conference on Neural Information Processing Systems (NeurIPS), taking place December 10-15 in Vancouver. Two papers by Google DeepMind researchers will receive Test of Time awards for their lasting impact on the field. Ilya Sutskever will present on Sequence to Sequence Learning with Neural Networks, co-authored with Google DeepMind’s VP of Drastic Research, Oriol Vinyals, and Distinguished Scientist Quoc V. Le. Additionally, Ian Goodfellow and David Warde-Farley from Google DeepMind will present on Generative Adversarial Nets.

We’ll also demonstrate how our foundational research translates into real-world applications, showcasing live demonstrations such as Gemma Scope, AI for music generation, weather forecasting, and more. Google DeepMind teams will present over 100 new papers on topics including AI agents, generative media, and innovative learning approaches.

Building adaptive, smart, and safe AI Agents

AI agents based on large language models (LLMs) are proving effective in executing digital tasks through natural language commands. However, their success hinges on precise interactions with complex user interfaces, necessitating extensive training data. With AndroidControl, we provide the most diverse control dataset to date, featuring over 15,000 human-collected demonstrations across more than 800 apps. AI agents trained with this dataset exhibited notable performance improvements, which we hope will further research into more general AI agents.

For AI agents to generalize across tasks, they must learn from each experience. We introduce a method for in-context abstraction learning (ICAL), which helps agents identify key task patterns and relationships from imperfect demonstrations and natural language feedback, improving their performance and adaptability.

A frame from a video demonstration of someone making a sauce, with individual elements identified and numbered. ICAL can extract the important aspects of the process.

Developing AI that fulfills users’ goals can increase its usefulness, but aligning AI systems is crucial. We propose a theoretical method to measure an AI system’s goal-directedness and demonstrate how a model’s perception of its user can affect its safety filters. These insights highlight the importance of robust safeguards to prevent unintended or unsafe behaviors, ensuring AI agents’ actions align with intended safe uses.

Advancing 3D scene creation and simulation

The demand for high-quality 3D content is rising in industries like gaming and visual effects, but creating lifelike 3D scenes remains costly and time-consuming. Our latest work introduces innovative 3D generation, simulation, and control methods to streamline content creation for faster, more flexible workflows.

Producing high-quality, realistic 3D assets and scenes often requires capturing and modeling thousands of 2D photos. We present CAT3D, a system capable of creating 3D content in as little as a minute from any number of images, even a single image or a text prompt. CAT3D uses a multi-view diffusion model to generate additional, consistent 2D images from various viewpoints, then feeds those images into traditional 3D modeling techniques. This approach surpasses previous methods in both speed and quality.

CAT3D enables 3D scene creation from any number of generated or real images.

Left to right: Text-to-image-to-3D, a real photo to 3D, several photos to 3D.

Simulating scenes with numerous rigid objects, such as a cluttered tabletop or tumbling Lego bricks, remains computationally demanding. To tackle this challenge, we introduce SDF-Sim, a technique that represents object shapes in a scalable way, accelerating collision detection and enabling efficient simulation of large, complex scenes.
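To make the idea of a signed distance function (SDF) concrete, here is a minimal sketch of SDF-based collision detection. It is not the SDF-Sim method itself, which the text describes only at a high level: the analytic sphere SDF and the helper names `sphere_sdf`, `sphere_surface_points`, and `max_penetration` are assumptions chosen for illustration. The point is that once a shape is described by an SDF, checking contact reduces to evaluating one function at sample points on the other object.

```python
import math


def sphere_sdf(point, center, radius):
    """Signed distance to a sphere: negative inside, zero on the surface, positive outside."""
    return math.dist(point, center) - radius


def sphere_surface_points(center, radius, n=500):
    """Sample n roughly evenly spaced points on a sphere (Fibonacci lattice)."""
    cx, cy, cz = center
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    points = []
    for i in range(n):
        z = 1.0 - 2.0 * (i + 0.5) / n      # height in (-1, 1)
        r = math.sqrt(1.0 - z * z)         # ring radius at that height
        theta = golden_angle * i
        points.append((cx + radius * r * math.cos(theta),
                       cy + radius * r * math.sin(theta),
                       cz + radius * z))
    return points


def max_penetration(surface_points, sdf):
    """Deepest penetration of the sampled points into the shape described by sdf.

    Returns 0.0 when no sampled point lies inside the shape.
    """
    return max(0.0, -min(sdf(p) for p in surface_points))


# Shape A is described only by its SDF; shape B only by surface samples.
def sdf_a(p):
    return sphere_sdf(p, (0.0, 0.0, 0.0), 1.0)


overlapping = sphere_surface_points((1.5, 0.0, 0.0), 1.0)
separated = sphere_surface_points((3.0, 0.0, 0.0), 1.0)

print(max_penetration(overlapping, sdf_a) > 0.0)  # True: the spheres intersect
print(max_penetration(separated, sdf_a) > 0.0)    # False: no contact
```

Because each query is a single function evaluation, the cost of a contact check depends on the number of sample points rather than on the geometric complexity of the shape behind the SDF.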

A complex simulation of shoes falling and colliding, accurately modeled using SDF-Sim.

AI image generators based on diffusion models struggle to control the 3D position and orientation of multiple objects. Our solution, Neural Assets, introduces object-specific representations that capture both appearance and 3D pose, learned through training on dynamic video data. Neural Assets allows users to move, rotate, or swap objects across scenes, making it a valuable tool for animation, gaming, and virtual reality.

Given a source image and object 3D bounding boxes, we can translate, rotate, and rescale the object, or transfer objects or backgrounds between images.

Improving how LLMs learn and respond

We are enhancing the way LLMs train, learn, and respond to users, focusing on improving performance and efficiency.

With larger context windows, LLMs can now learn from thousands of examples at once, a technique known as many-shot in-context learning (ICL). This process enhances model performance on tasks like math, translation, and reasoning, although it typically requires high-quality, human-generated data. To make training more cost-effective, we explore methods to adapt many-shot ICL that reduce the need for manually curated data.

With abundant data available for training language models, the main constraint for teams is the available compute. We address the critical question: with a fixed compute budget, how do you choose the right model size to achieve the best results?
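The compute question can be illustrated with a back-of-the-envelope calculation. The sketch below is not the method from the work described here. It relies on two heuristics commonly cited in the scaling-law literature, both labeled as assumptions: training cost is roughly C ≈ 6·N·D FLOPs for N parameters and D training tokens, and a fixed tokens-per-parameter ratio (about 20 is often quoted). The function name `compute_optimal_size` is hypothetical.

```python
import math


def compute_optimal_size(flops_budget, tokens_per_param=20.0):
    """Split a fixed FLOP budget between model size N and training tokens D.

    Assumptions (heuristics, not taken from the source article):
      * training cost C is approximately 6 * N * D FLOPs
      * D = tokens_per_param * N
    Substituting gives C = 6 * tokens_per_param * N**2, solved here for N.
    """
    n_params = math.sqrt(flops_budget / (6.0 * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens


n, d = compute_optimal_size(1.2e20)
print(f"parameters: {n:.3g}, training tokens: {d:.3g}")
```

Under these assumptions, model size grows with the square root of the budget, so a 100× increase in compute suggests a model about 10× larger trained on about 10× more tokens. The best ratio in practice depends on the data and training setup.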

Another innovative approach, Time-Reversed Language Models (TRLM), explores pretraining and finetuning an LLM to work in reverse. When given traditional LLM responses as input, a TRLM generates queries that might have produced those responses. When paired with a traditional LLM, this method not only helps ensure responses follow user instructions better but also improves the generation of citations for summarized text and enhances safety filters against harmful content.

Curating high-quality data is essential for training large AI models, but manual curation is challenging at scale. To address this, our Joint Example Selection (JEST) algorithm optimizes training by identifying the most learnable data within larger batches, enabling up to 13× fewer training rounds and 10× less computation, outperforming state-of-the-art multimodal pretraining baselines.
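As a rough illustration of "most learnable data", the sketch below scores each example by how much worse the model being trained does than a pretrained reference model, then keeps the top-scoring examples from a larger super-batch. This is a deliberate simplification: it ranks examples independently, whereas JEST, as its name says, selects examples jointly. The function names `learnability` and `select_sub_batch` and the loss values are hypothetical.

```python
def learnability(learner_losses, reference_losses):
    """Per-example learnability score.

    High when the learner's loss is still large (not yet learned) but the
    reference model's loss is small (learnable, unlikely to be noise).
    """
    return [l - r for l, r in zip(learner_losses, reference_losses)]


def select_sub_batch(learner_losses, reference_losses, k):
    """Return the indices of the k most learnable examples in a super-batch."""
    scores = learnability(learner_losses, reference_losses)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Made-up losses for a super-batch of four examples.
learner = [2.0, 0.5, 3.0, 1.0]     # loss under the model being trained
reference = [0.5, 0.4, 2.9, 0.2]   # loss under a pretrained reference model

print(select_sub_batch(learner, reference, k=2))  # [0, 3]
```

Example 2 has the highest learner loss but is skipped, because the reference model also finds it hard, which suggests noisy or unlearnable data rather than a useful training signal.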

Planning tasks are another challenge for AI, especially in stochastic environments where randomness or uncertainty influences outcomes. Researchers use various inference types for planning, but there is no consistent approach. We demonstrate that planning itself can be seen as a distinct type of probabilistic inference and propose a framework for ranking different inference techniques based on their planning effectiveness.

Bringing together the global AI community

We are proud to be a Diamond Sponsor of the conference and support Women in Machine Learning, LatinX in AI, and Black in AI in building communities worldwide working in AI, machine learning, and data science.

If you’re attending NeurIPS this year, visit the Google DeepMind and Google Research booths to explore cutting-edge research in demos, workshops, and more throughout the conference.



Copyright © 2024 Eltaller Digital.
Eltaller Digital is not responsible for the content of external sites.
