This post is cowritten by Jeff Boudier, Simon Pagezy, and Florent Gbelidji from Hugging Face. Agentic AI systems represent an evolution from conversational AI to autonomous agents capable of complex …
AI Research Dispatch
Updates from labs, pre-print servers, and academic AI research groups.
LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (as reported for models such as Sonnet 3.5 and GPT-4o). However, as these …
Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
What is the Role of Mathematics in Modern Machine Learning? The past decade has witnessed a shift in how progress is made in machine learning. Research involving carefully designed and mathematically …
“In projecting language back as the model for thought, we lose sight of the tacit embodied understanding that undergirds our intelligence.” –Terry Winograd The recent successes of …
Gemma Scope 2: helping the AI safety community deepen understanding of complex language model behavior
Announcing a new, open suite of tools for language model interpretability. Large Language Models (LLMs) are capable of incredible feats of reasoning, yet their internal decision-making processes remain …
Today, we’re expanding the Gemini 3 model family with the release of Gemini 3 Flash, which offers frontier intelligence built for speed at a fraction of the cost. With this release, we’re making …
Tackling global challenges and opportunities at scale. Our work throughout 2025 demonstrates how AI-enabled scientific progress is being directly applied to address the world’s most critical and …
Today, Veo is getting more expressive, with improvements that help you create more fun, creative, high-quality videos based on ingredient images, built directly for the mobile format. We’re excited to …
Introducing D4RT, a unified AI model for 4D scene reconstruction and tracking across space and time. Anytime we look at the world, we perform an extraordinary feat of memory and prediction. We see and …
In August, we previewed Genie 3, a general-purpose world model capable of generating diverse, interactive environments. Even in this early form, trusted testers were able to create an impressive …
Collaborating with experts on 18 research problems, an advanced version of Gemini Deep Think helped resolve long-standing bottlenecks across algorithms, ML and combinatorial optimization, information …
Today, we’re releasing a major upgrade to Gemini 3 Deep Think, our specialized reasoning mode, built to push the frontier of intelligence and solve modern challenges across science, research, and …
Sample language model responses to different varieties of English and native speaker reactions. ChatGPT does amazingly well at communicating with people in English. But whose English? Only 15% of …
We introduce Anthology, a method for conditioning LLMs to representative, consistent, and diverse virtual personas by generating and utilizing naturalistic backstories with rich details of …
Training Diffusion Models with Reinforcement Learning. We deployed 100 reinforcement learning (RL)-controlled cars into …
Whether you’re a scientist brainstorming research ideas or a CEO hoping to automate a task in human resources or finance, you’ll find that artificial intelligence tools are becoming the assistants you …
A firm that wants to use a large language model (LLM) to summarize sales reports or triage customer inquiries can choose between hundreds of unique LLMs with dozens of model variations, each with …
“This is science!” – MIT president talks about the importance of America’s research enterprise on GBH’s Boston Public Radio
In a wide-ranging conversation, MIT President Sally Kornbluth joined Jim Braude and Margery Eagan live in studio for GBH’s Boston Public Radio on Thursday, February 5. They talked about MIT, the …
James J. Collins, the Termeer Professor of Medical Engineering and Science at MIT and faculty co-lead of the Abdul Latif Jameel Clinic for Machine Learning in Health, is embarking on a …
The signals that drive many of the brain and body’s most essential functions — consciousness, sleep, breathing, heart rate, and motion — course through bundles of “white matter” fibers in the …
For more than a decade, MIT Associate Professor Rafael Gómez-Bombarelli has used artificial intelligence to create new materials. As the technology has expanded, so have his ambitions. Now, the newly …
Olympic figure skating looks effortless. Athletes sail across the ice, then soar into the air, spinning like a top, before landing on a single blade just 4-5 millimeters wide. To help figure skaters …
New J-PAL research and policy initiative to test and scale AI innovations to fight poverty
The Abdul Latif Jameel Poverty Action Lab (J-PAL) at MIT has awarded funding to eight new research studies to understand how artificial intelligence innovations can be used in the fight against …
Mercedes-Benz Unveils New S-Class Built on NVIDIA DRIVE AV, Which Enables an L4-Ready Architecture
Mercedes-Benz is marking 140 years of automotive innovation with a new S-Class built for the AI era, bringing together automotive safety and NVIDIA’s advanced autonomous driving platform to enable a …
Nemotron Labs: How AI Agents Are Turning Documents Into Real-Time Business Intelligence
Editor’s note: This post is part of the Nemotron Labs blog series, which explores how the latest open models, datasets and training techniques help businesses build specialized AI systems and …