Reasoning models can generate seven to 10 times as many tokens as necessary on simple tasks, creating unsustainable costs at scale. Amazon's vision for metacognitive AI could fundamentally shift how models allocate computational resources.

How Amazon uses AI agents to anticipate and counter cyber threats

Amazon's competitive-agent architecture creates a continuous improvement cycle that develops security protections at machine speed, reducing what typically takes weeks down to hours.

SupplyChainEmissions-Breakdown-Homepage (1).png

A new view of supply chain emissions

A new approach to reducing carbon emissions reveals previously hidden emission “hotspots” within value chains, helping organizations make more detailed and dynamic decisions about their future carbon footprints.

Demystifying AI agents

How agentic systems work under the hood — and how AWS’s new AgentCore framework implements their essential components.

The overthinking problem in AI

How Amazon uses AI agents to anticipate and counter cyber threats

A new view of supply chain emissions

Demystifying AI agents

Customer-obsessed science

Research areas

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Making fairness in LLMs observable, quantifiable, and governable
November 20, 2025
4 min read
A new evaluation pipeline called FiSCo uncovers hidden biases and offers an assessment framework that evolves alongside language models.
Conversational AI
Introducing Chronos-2: From univariate to universal forecasting
October 20, 2025
4 min read
Machine learning
Why AI for good depends on good data
October 14, 2025
7 min read
Information and knowledge management
Novel “Kaputt” dataset sets new benchmark for large-scale visual defect detection
October 2, 2025
3 min read
Computer vision
Science in the age of foundation models
September 26, 2025
9 min read
Machine learning

View all

Featured news

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Amazon - CMU AI Innovation Hub

The collaboration will advance research in generative AI, robotics, natural language processing and cloud computing while fostering innovation in foundational and emerging technologies.

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

Publications

View all View all

SABER: Small actions, big errors — Safe-guarding mutating steps in LLM agents
Alex Cuadron Lafuente,Pengfei Yu,Yang Liu,Arpit Gupta
arXiv
2025
Despite rapid progress in LLM agents, performance on long-horizon, tool-using tasks remains fragile. To better understand this fragility, we ask a simple question: do all actions contribute equally to failure? Analyzing execution traces on τ-Bench (Airline/Retail) and SWE-Bench Verified, we decompose trajectories into mutating (environment-changing) vs. non-mutating steps and formalize de-cisive deviations—earliest
Conversational AI
Where did it all go wrong? A hierarchical look into multi-agent error attribution
Adi Banerjee,Anirudh Nair,Tarik Borogovac
NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle
2025
Error attribution in Large Language Model (LLM) multi-agent systems presents a significant challenge in debugging and improving collaborative AI systems. Current approaches to pinpointing agent and step level failures in multi-agent interaction traces—whether using all-at-once evaluation, step-by-step analysis, or binary search—fall short when analyzing complex patterns, struggling with both accuracy and
Conversational AI
Efficiently generating correlated sample paths from multi-step time series foundation models
Ethan Baron,Boris Oreshkin,Ruijun Ma,Hanyu Zhang,Kari Torkkola,Michael Mahoney,Andrew Gordon Wilson,Tatiana Konstantinova
NeurIPS 2025 Workshop on Recent Advances in Time Series Foundation Models
2025
Many time series applications require access to multi-step forecast trajectories in the form of sample paths. Recently, time series foundation models have leveraged multi-step lookahead predictions to improve the quality and efficiency of multi-step forecasts. However, these models only predict independent marginal distributions for each time step, rather than a full joint predictive distribution. To generate
Machine learning
Beyond collaborative filtering: Using transformers for personalized music recommendation
Tim Greer,Nicholas Capel,Yannik Stein,Giuseppe Di Benedetto,Emanuele Coviello,Amina Shabbeer
NeurIPS 2025
2025
Music recommendation systems face the dual challenge of capturing both immediate context and long-term preferences in users' listening patterns. We adapt a generalized sequential model architecture for music recommendation, introducing modifications that acknowledge how music preferences combine temporal patterns and stable tastes. By removing causal masking constraints typically used in sequential models
Machine learning
Structuring the unstructured: A multi-agent LLM framework for transforming ambiguous SOPs into code
Sachin Kumar Giroh,Pushpendu Ghosh,Aryan Jain,Harshal Paunikar,Anish Nediyanchath,Aditi Rastogi,Promod Yenigalla
EMNLP 2025
2025
This paper introduces, a three-stage multi agent LLM framework designed to transform unstructured and ambiguous Standard Operating Procedure (SOP) into a structured plan and an executable code template. Unstructured SOPs—common across industries such as finance, retail, and logistics—frequently suffer from ambiguity, missing information, and inconsistency, all of which hinder automation. We address this
Conversational AI

Collaborations

View all

Whether you're a faculty member or student, there are number of ways you can engage with Amazon.

View all

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

View all

Movatterモバイル変換

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us