AI Research Dispatch

Updates from labs, pre-print servers, and academic AI research groups.

ai-research EN

Teaching AI models to say “I’m not sure”

Confidence is persuasive. In artificial intelligence systems, it is often misleading. Today’s most capable reasoning models share a trait with the loudest voice in the room: They deliver every …

ToolSimulator: scalable tool testing for AI agents

You can use ToolSimulator, an LLM-powered tool simulation framework within Strands Evals, to test AI agents that rely on external tools thoroughly, safely, and at scale. Instead of risking live API …

Can we AI our way to a more sustainable world?

So maybe I’ll first turn it over to Amy. Can you tell us a little bit about your job at Microsoft and what got you into this space, maybe a little bit of your story? AMY LUERS: So as you said, I lead …

Introducing granular cost attribution for Amazon Bedrock

As AI inference grows into a significant share of cloud spend, understanding who and what are driving costs is essential for chargebacks, cost optimization, and financial planning. Today, we’re …