The features on social media apps like Snapchat evolve nearly as fast as what’s trending. To keep pace, its parent company Snap has adopted open data processing libraries from NVIDIA on Google Cloud …
As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI.
At NVIDIA GTC 2026, leading operators in the U.S. …
The paradigm of consumer computing has revolved around the concept of a personal device — from PCs to smartphones and tablets. Now, generative AI — particularly OpenClaw — has introduced a new …
Creating digital twins of AI factories and healthcare labs. Designing sleek car exteriors in extended reality (XR) with physically accurate color and lighting. Fully immersing in high-resolution …
Building and managing machine learning (ML) features at scale is one of the most critical and complex challenges in modern data science workflows. Organizations often struggle with fragmented feature …
We thank Greg Pereira and Robert Shaw from the llm-d team for their support in bringing llm-d to AWS.
In the agentic and reasoning era, large language models (LLMs) generate 10x more tokens and …
This post is cowritten with Ilija Subanovic and Michael Rice from Workhuman.
Workhuman’s customer service and analytics teams were drowning in one-time reporting requests from seven million users …
AI is moving fast, and for most of our customers, the real opportunity isn’t in experimenting with it—it’s in running AI in production where it drives meaningful business outcomes. This means …
This post is co-written with Mark Ross from Atos.
Organizations pursuing AI transformation can face a familiar challenge: how to upskill their workforce at scale in a way that changes how teams build, …
This is Part II of a two-part series from the AWS Generative AI Innovation Center. If you missed Part I, refer to Operationalizing Agentic AI Part 1: A Stakeholder’s Guide.
The biggest barrier to …
An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how …
Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to …
EAGLE is the state-of-the-art method for speculative decoding in large language model (LLM) inference, but its autoregressive drafting creates a hidden bottleneck: the more tokens that you speculate, …
Editor’s note: This post is part of Into the Omniverse, a series focused on how developers, 3D practitioners and enterprises can transform their workflows using the latest advancements in OpenUSD and …
GeForce NOW is bringing the game to the Game Developers Conference (GDC), running this week in San Francisco. While developers build the future of gaming, GeForce NOW is delivering it to gamers. The …
Characterized by weakened or damaged heart musculature, heart failure results in the gradual buildup of fluid in a patient’s lungs, legs, feet, and other parts of the body. The condition is chronic …
Deploying AI agents safely in regulated industries is challenging. Without proper boundaries, agents that access sensitive data or execute transactions can pose significant security risks. Unlike …
This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service. You …
This post is a collaboration between AWS, NVIDIA and Heidi.
Automatic speech recognition (ASR), often called speech-to-text (STT), is becoming increasingly critical across industries like healthcare, …
At a glance Problem: Debugging AI agent failures is hard because trajectories are long, stochastic, and often multi-agent, so the true root cause gets buried. Solution: AgentRx …
As organizations scale their generative AI workloads on Amazon Bedrock, operational visibility into inference performance and resource consumption becomes critical. Teams running latency-sensitive …
Curiosity-driven research has long sparked technological transformations. A century ago, curiosity about atoms led to quantum mechanics, and eventually the transistor at the heart of modern computing. …
Agentic AI isn’t a feature you turn on. It’s a shift in how work is defined, who does it, and how decisions get made.
Most enterprises learn this the hard way. They launch pilots that stall the moment …
Wednesday, March 11, 8 a.m. PT
GTC 2026: What to Expect From the Biggest AI Conference of the Year
Every March, San Jose gets a little electric.
NVIDIA founder and CEO Jensen Huang will walk onto …
Launched today, NVIDIA Nemotron 3 Super is a 120‑billion‑parameter open model with 12 billion active parameters designed to run complex agentic AI systems at scale.
Available now, the model combines …