AI Research Dispatch

Updates from labs, pre-print servers, and academic AI research groups.

ai-research EN

ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text

Mar 19, 2026 • Source • importai.substack.com

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe. Can LLMs autonomously refine other LLMs for …

ai-research EN

Roche Scales NVIDIA AI Factories Globally to Accelerate Drug Discovery, Diagnostic Solutions and Manufacturing Breakthroughs

Mar 18, 2026 • Source • feeds.feedburner.com

Roche is deploying more than 3,500 NVIDIA Blackwell GPUs across hybrid cloud and on‑premises environments in the U.S. and Europe, expanding on an existing collaboration to turn AI and accelerated …

ai-research EN

NVIDIA DSX Air Boosts Time to Token With Accelerated Simulation for AI Factories

Mar 18, 2026 • Source • feeds.feedburner.com

Setting up AI factories in simulation — decreasing deployment time from months to days — is accelerating the next industrial revolution. Nowhere was that more apparent than at GTC 2026, in San Jose, …

ai-research EN

MIT-IBM Watson AI Lab seed to signal: Amplifying early-career faculty impact

Mar 18, 2026 • Source • news.mit.edu

The early years of faculty members’ careers are a formative and exciting time in which to establish a firm footing that helps determine the trajectory of researchers’ studies. This includes building a …

ai-research EN

Snap Decisions: How Open Libraries for Accelerated Data Processing Boost A/B Testing for Snapchat

Mar 18, 2026 • Source • feeds.feedburner.com

The features on social media apps like Snapchat evolve nearly as fast as what’s trending. To keep pace, its parent company Snap has adopted open data processing libraries from NVIDIA on Google Cloud …

ai-research EN

NVIDIA, Telecom Leaders Build AI Grids to Optimize Inference on Distributed Networks

Mar 18, 2026 • Source • feeds.feedburner.com

As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI. At NVIDIA GTC 2026, leading operators in the U.S. …

ai-research EN

GTC Spotlights NVIDIA RTX PCs and DGX Sparks Running Latest Open Models and AI Agents Locally

Mar 18, 2026 • Source • feeds.feedburner.com

The paradigm of consumer computing has revolved around the concept of a personal device — from PCs to smartphones and tablets. Now, generative AI — particularly OpenClaw — has introduced a new …

ai-research EN

More Than Meets the Eye: NVIDIA RTX-Accelerated Computers Now Connect Directly to Apple Vision Pro

Mar 18, 2026 • Source • feeds.feedburner.com

Creating digital twins of AI factories and healthcare labs. Designing sleek car exteriors in extended reality (XR) with physically accurate color and lighting. Fully immersing in high-resolution …

ai-research EN

Build an offline feature store using Amazon SageMaker Unified Studio and SageMaker Catalog

Mar 18, 2026 • Source • aws.amazon.com

Building and managing machine learning (ML) features at scale is one of the most critical and complex challenges in modern data science workflows. Organizations often struggle with fragmented feature …

ai-research EN

Introducing Disaggregated Inference on AWS powered by llm-d

Mar 18, 2026 • Source • aws.amazon.com

We thank Greg Pereira and Robert Shaw from the llm-d team for their support in bringing llm-d to AWS. In the agentic and reasoning era, large language models (LLMs) generate 10x more tokens and …

ai-research EN

How Workhuman built multi-tenant self-service reporting using Amazon Quick Sight embedded dashboards

Mar 18, 2026 • Source • aws.amazon.com

This post is cowritten with Ilija Subanovic and Michael Rice from Workhuman. Workhuman’s customer service and analytics team were drowning in one-time reporting requests from seven million users …

ai-research EN

AWS and NVIDIA deepen strategic collaboration to accelerate AI from pilot to production

Mar 18, 2026 • Source • aws.amazon.com

AI is moving fast, and for most of our customers, the real opportunity isn’t in experimenting with it—it’s in running AI in production where it drives meaningful business outcomes. This means …

ai-research EN

AWS AI League: Atos fine-tunes approach to AI education

Mar 18, 2026 • Source • aws.amazon.com

This post is co-written with Mark Ross from Atos. Organizations pursuing AI transformation can face a familiar challenge: how to upskill their workforce at scale in a way that changes how teams build, …

ai-research EN

Agentic AI in the Enterprise Part 2: Guidance by Persona

Mar 18, 2026 • Source • aws.amazon.com

This is Part II of a two-part series from the AWS Generative AI Innovation Center. If you missed Part I, refer to Operationalizing Agentic AI Part 1: A Stakeholder’s Guide . The biggest barrier to …

ai-research EN

Information-Driven Design of Imaging Systems

Mar 14, 2026 • Source • bair.berkeley.edu

An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how …

ai-research EN

Identifying Interactions at Scale for LLMs

Mar 14, 2026 • Source • bair.berkeley.edu

Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to …

ai-research EN

P-EAGLE: Faster LLM inference with Parallel Speculative Decoding in vLLM

Mar 13, 2026 • Source • aws.amazon.com

EAGLE is the state-of-the-art method for speculative decoding in large language model (LLM) inference, but its autoregressive drafting creates a hidden bottleneck: the more tokens that you speculate, …

ai-research EN

Into the Omniverse: How Industrial AI and Digital Twins Accelerate Design, Engineering and Manufacturing Across Industries

Mar 13, 2026 • Source • feeds.feedburner.com

Editor’s note: This post is part of Into the Omniverse , a series focused on how developers, 3D practitioners and enterprises can transform their workflows using the latest advancements in OpenUSD and …

ai-research EN

GeForce NOW Raises the Game at the Game Developers Conference

Mar 13, 2026 • Source • feeds.feedburner.com

GeForce NOW is bringing the game to the Game Developers Conference (GDC), running this week in San Francisco. While developers build the future of gaming, GeForce NOW is delivering it to gamers. The …

ai-research EN

Can AI help predict which heart-failure patients will worsen within a year?

Mar 13, 2026 • Source • news.mit.edu

Characterized by weakened or damaged heart musculature, heart failure results in the gradual buildup of fluid in a patient’s lungs, legs, feet, and other parts of the body. The condition is chronic …

ai-research EN

Secure AI agents with Policy in Amazon Bedrock AgentCore

Mar 12, 2026 • Source • aws.amazon.com

Deploying AI agents safely in regulated industries is challenging. Without proper boundaries, agents that access sensitive data or execute transactions can pose significant security risks. Unlike …

ai-research EN

Multimodal embeddings at scale: AI data lake for media and entertainment workloads

Mar 12, 2026 • Source • aws.amazon.com

This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service . You …

ai-research EN

Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

Mar 12, 2026 • Source • aws.amazon.com

This post is a collaboration between AWS, NVIDIA and Heidi . Automatic speech recognition (ASR), often called speech-to-text (STT) is becoming increasingly critical across industries like healthcare, …

ai-research EN

Systematic debugging for AI agents: Introducing the AgentRx framework

Mar 12, 2026 • Source • microsoft.com

At a glance Problem: Debugging AI agent failures is hard because trajectories are long, stochastic, and often multi-agent, so the true root cause gets buried. Solution: AgentRx (opens in new tab) …

ai-research EN

Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

Mar 12, 2026 • Source • aws.amazon.com

As organizations scale their generative AI workloads on Amazon Bedrock , operational visibility into inference performance and resource consumption becomes critical. Teams running latency-sensitive …