Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe.
Can LLMs autonomously refine other LLMs for …
Roche is deploying more than 3,500 NVIDIA Blackwell GPUs across hybrid cloud and on‑premises environments in the U.S. and Europe, expanding on an existing collaboration
to turn AI and accelerated …
Setting up AI factories in simulation — decreasing deployment time from months to days — is accelerating the next industrial revolution.
Nowhere was that more apparent than at GTC 2026, in San Jose, …
The early years of faculty members’ careers are a formative and exciting time in which to establish a firm footing that helps determine the trajectory of researchers’ studies. This includes building a …
The features on social media apps like Snapchat evolve nearly as fast as what’s trending. To keep pace, its parent company Snap has adopted open data processing libraries from NVIDIA on Google Cloud …
As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI.
At NVIDIA GTC 2026, leading operators in the U.S. …
The paradigm of consumer computing has revolved around the concept of a personal device — from PCs to smartphones and tablets. Now, generative AI — particularly OpenClaw — has introduced a new …
Creating digital twins
of AI factories
and healthcare labs. Designing sleek car exteriors in extended reality (XR) with physically accurate color and lighting. Fully immersing in high-resolution …
Building and managing machine learning (ML) features at scale is one of the most critical and complex challenges in modern data science workflows. Organizations often struggle with fragmented feature …
We thank Greg Pereira and Robert Shaw from the llm-d team for their support in bringing llm-d to AWS.
In the agentic and reasoning era, large language models (LLMs) generate 10x more tokens and …
This post is cowritten with Ilija Subanovic and Michael Rice from Workhuman.
Workhuman’s customer service and analytics team were drowning in one-time reporting requests from seven million users …
AI is moving fast, and for most of our customers, the real opportunity isn’t in experimenting with it—it’s in running AI in production where it drives meaningful business outcomes. This means …
This post is co-written with Mark Ross from Atos.
Organizations pursuing AI transformation can face a familiar challenge: how to upskill their workforce at scale in a way that changes how teams build, …
This is Part II of a two-part series from the AWS Generative AI Innovation Center. If you missed Part I, refer to Operationalizing Agentic AI Part 1: A Stakeholder’s Guide .
The biggest barrier to …
An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how …
Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to …
EAGLE is the state-of-the-art method for speculative decoding in large language model (LLM) inference, but its autoregressive drafting creates a hidden bottleneck: the more tokens that you speculate, …
Editor’s note: This post is part of Into the Omniverse , a series focused on how developers, 3D practitioners and enterprises can transform their workflows using the latest advancements in OpenUSD and …
GeForce NOW
is bringing the game to the Game Developers Conference (GDC), running this week in San Francisco. While developers build the future of gaming, GeForce NOW is delivering it to gamers. The …
Characterized by weakened or damaged heart musculature, heart failure results in the gradual buildup of fluid in a patient’s lungs, legs, feet, and other parts of the body. The condition is chronic …
Deploying AI agents safely in regulated industries is challenging. Without proper boundaries, agents that access sensitive data or execute transactions can pose significant security risks. Unlike …
This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service . You …
This post is a collaboration between AWS, NVIDIA and Heidi .
Automatic speech recognition (ASR), often called speech-to-text (STT) is becoming increasingly critical across industries like healthcare, …
At a glance Problem: Debugging AI agent failures is hard because trajectories are long, stochastic, and often multi-agent, so the true root cause gets buried. Solution: AgentRx (opens in new tab) …
As organizations scale their generative AI workloads on Amazon Bedrock , operational visibility into inference performance and resource consumption becomes critical. Teams running latency-sensitive …