At a glance: Today’s multimodal AI systems can give answers that sound right but may not be grounded in what they actually observe over time, leading to unpredictable errors and safety risks in …
AI Research Dispatch
Updates from labs, pre-print servers, and academic AI research groups.
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
At a glance: AI-driven medical image report generation can help medical providers become more efficient and productive. Current models are difficult to train because reporting practices vary widely …
Paza: Introducing automatic speech recognition benchmarks and models for low resource languages
At a glance: Microsoft Research releases PazaBench and Paza automatic speech recognition models, advancing speech technology for low resource languages. Human-centered pipeline for low-resource …
At a glance: Imitation learning becomes easier when an AI agent understands why an action is taken. Predictive Inverse Dynamics Models (PIDMs) predict plausible future states, clarifying the direction …
Introducing Segment Anything: Working toward the first foundation model for image segmentation
Segmentation — identifying which image pixels belong to an object — is a core task in computer vision and is used in a broad array …
The Meta Research PhD Fellowship program awards PhD candidates conducting research on the cusp of emerging topics across computer science, engineering, and behavioral science. To support their …
Buck2, our new open source, large-scale build system, is now available on GitHub. Buck2 is an extensible and performant build system written in Rust and designed to make your build experience faster …
Health equity is a major societal concern worldwide with disparities having many causes. These sources include limitations in access to healthcare, differences in clinical treatment, and even …
We leverage two key techniques to aid convergence of this ill-posed problem. The first is a very lightweight, dynamically trained convolutional neural network (CNN) encoder that regresses camera poses …
Health datasets play a crucial role in research and medical education, but it can be challenging to create a dataset that represents the real world. For example, dermatology conditions are diverse in …
Building real-time voice assistants with Amazon Nova Sonic compared to cascading architectures
Voice AI agents are reshaping how we interact with technology. From customer service and healthcare assistance to home automation and personal productivity, these intelligent virtual assistants are …
Today, we are publishing a new open source sample chatbot that shows how to use feedback from Automated Reasoning checks to iterate on the generated content, ask clarifying questions, and prove the …
Brian Hedden named co-associate dean of Social and Ethical Responsibilities of Computing
Brian Hedden PhD ’12 has been appointed co-associate dean of the Social and Ethical Responsibilities of Computing (SERC) at MIT, a cross-cutting initiative in the MIT Schwarzman College of Computing, …
Antonio Torralba, Delta Electronics Professor of Electrical Engineering and Computer Science and faculty head of artificial intelligence and decision-making at MIT, has been named to the 2025 cohort …
In the pursuit of solutions to complex global challenges including disease, energy demands, and climate change, scientific researchers, including at MIT, have turned to artificial intelligence, and to …
Accelerating your marketing ideation with generative AI – Part 2: Generate custom marketing images from historical references
Marketing teams face major challenges creating campaigns in today’s digital environment. They must navigate through complex data analytics and rapidly changing consumer preferences to produce …
MIT senior Katie Spivakovsky has been selected as a 2026-27 Churchill Scholar and will undertake an MPhil in biological sciences at the Wellcome Sanger Institute at Cambridge University in the U.K. …
Democratizing business intelligence: BGL’s journey with Claude Agent SDK and Amazon Bedrock AgentCore
This post is cowritten with James Luo from BGL. Data analysis is emerging as a high-impact use case for AI agents. According to Anthropic’s 2026 State of AI Agents Report, 60% of organizations rank …
How can artificial intelligence step out of a screen and become something we can physically touch and interact with? That question formed the foundation of class 4.043/4.044 (Interaction …
Use Amazon Quick Suite custom action connectors to upload text files to Google Drive using OpenAPI specification
Many organizations need to manage file uploads across different cloud storage systems while maintaining security and compliance. Although Google Drive provides APIs for integration, organizations …
Building production-ready AI agents requires careful planning and execution across the entire development lifecycle. The difference between a prototype that impresses in a demo and an agent that …
Performing research and clinical analytics on vast amounts of clinical data can be difficult. Healthcare data scientists and epidemiologists possess deep domain expertise in patient care, disease …
What if ultrasound imaging were no longer confined to hospitals? Patients with chronic conditions, such as hypertension and heart failure, could be monitored continuously in real time at home or on the …
How Clarus Care uses Amazon Bedrock to deliver conversational contact center interactions
This post was cowritten by Rishi Srivastava and Scott Reynolds from Clarus Care. Many healthcare practices today struggle with managing high volumes of patient calls efficiently. From appointment …
Generative artificial intelligence models have been used to create enormous libraries of theoretical materials that could help solve all kinds of problems. Now, scientists just have to figure out how …