
The Top AI Papers of the Week (December 29 - January 4)

LLMs in 2025, YOLO in the Sandbox, Plan Caching for Agents, DeepTutor

The Top AI Papers of the Week (December 22-28)

MiniMax-M2.1, LLM Coding Workflows, GLM-4.7, MiniMax-M2.1, LaMer Meta-RL, Google's 2025 AI Breakthroughs

The Top AI Papers of the Week (December 15-21)
Gemini 3 Flash, GPT Image 1.5, Mistral OCR 3, GPT-5.2-Codex,NVIDIA Nemotron 3, Budget-aware Agent Scaling

The Top AI Papers of the Week (December 8-14)

GPT-5.2, Devstral 2, Measuring Agents in Production, Gemini Deep Research API, Deep RL Course

The Top AI Papers of the Week (Dec 1 - 7)

OpenRouter State of AI, Mistral 3, DeepSeek-V3.2, Google Workspace Studio, Puppeteer Multi-Agent RL, and more

The Top AI Papers of the Week (November 24 - 30)

Claude Opus 4.5, OmniScientist, FLUX.2, General Agentic Memory

The Top AI Papers of the Week (November 17 - 23)

Gemini 3, Nano Banana Pro, Antigravity, Agent-R1 RL Framework, Meta's SAM 3, OLMo 3

The Top AI Papers of the Week (November 10 - 16)

Omnilingual ASR, GPT-5.1, SIMA 2, Context Engineering Whitepaper, Mini-Agent, Marble World Model

The Top AI Papers of the Week (November 3 - 9)

Context Engineering 2.0, Kimi K2 Thinking, Windsurf Codemaps, Google File Search, Tool-to-Agent Retrieval

The Top AI Papers of the Week (October 27 - November 2)

MiniMax-M2, Cursor 2.0, SWE-1.5, Agent Data Protocol, Kimi CLI

The Top AI Papers of the Week (October 20-26)

DeepSeek-OCR, Claude Code on the Web, ChatGPT Atlas Browser

The Top AI Papers of the Week (October 13-19)

Claude Haiku 4.5, Deep Agents, SWE-grep, nanochat, Agent Skills, Veo 3.1 Fast, n8n AI Workflow Builder

On the future of AI Agents.

The Top AI Papers of the Week (October 6-12)

AgentKit, Gemini 2.5 Computer Use, State of AI Report 2025, Agentic Context Engineering, CodeMender

The Top AI Papers of the Week (September 29 - October 5)

Claude Agent SDK, Sora 2, Claude Sonnet 4.5, Microsoft Agent Framework, GLM-4.6, Agentic Commerce Protocol

The Top AI Papers of the Week (September 22-28)

Code World Model, Gemini Robotics-ER 1.5, Figma MCP server, Overhearing LLM Agents, Qwen3-Max, Gamma API

The Top AI Papers of the Week (September 15-21)

GPT-5-Codex, Grok 4 Fast, Tongyi DeepResearch, Magistral Small 1.2, Agent Payments Protocol (AP2)

The Top AI Papers of the Week (September 8-14)

Agent 3, ChatGPT Developer Mode, MCP Registry, Writing Effective Tools for Agents, Qodo Aware

The Top AI Papers of the Week (September 1-7)

Universal Deep Research, GPT-4b micro, Self-Evolving Agents, Tracking Multi-Agent Failures

The Top AI Papers of the Week (August 25-31)

Gemini 2.5 Flash Image, gpt-realtime, Anemoi Agent, Fine-tuning LLM Agents, Codex Updates, Agent Client Protocol

The Top AI Papers of the Week (August 18-24)

DeepSeek-V3.1, AGENTS.md, URL Context, Context Engineering Tips, Qwen-Image-Edit

The Top AI Papers of the Week (August 11-17)

DINOv3, Claude Sonnet-1M, GLM-4.5V, Benchmarking AI Agent Memory, Deep Agents, Claude Code Output Styles

The Top AI Papers of the Week (August 4-10)

GPT-5, Genie 3, gpt-oss, Cursor CLI, Opus 4.1, Efficient AI Agents

The Top AI Papers of the Week (July 28 - August 3)

GLM-4.5, AI SDK 5, Video Overviews, ChatGPT Study Mode, Context engineering Tips, AlphaEarth Foundations

The Top AI Papers of the Week (July 21 - 27)

Lovable Agents, GitHub Spark, Qwen3-Coder, Search Arena, Awesome Context Engineering

The Top AI Papers of the Week (July 14 - 20)

ChatGPT Agent, Gemini Embeddings, Agent Leaderboard v2, Voxtral, CRMAgent

The Top AI Papers of the Week (July 7 - 13)

Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5

The Top AI Papers of the Week (June 30 - July 6)

Prompt engineering is being rebranded as context engineering

DeepSWE, Cursor 1.2, Evaluating Multi-Agent Systems, Prover Agent, Top AI Devs News

The Top AI Papers of the Week (June 23 - 29)
Gemini CLI, Qodo Gen CLI, Context Engineering, Claude Apps, AlphaGenome

The Top AI Papers of the Week (June 16 - 22)

Software 3.0, Gemini 2.5 Updates, Safer AI Agents, Deep Research Tutorial & Benchmark

The Top AI Papers of the Week (June 9 - 15)

Magistral, Agent Bricks, Code Researcher, Automating Workflow Generation, Verified Superintelligence

The Top AI Papers of the Week (June 2 - 8)

Self-Improving Agents, Eleven v3, /Search, Deep Research Updates, Top AI Devs News, Agents SDK for TypeScript

The Top AI Papers of the Week (May 26 - June 1)

Mistral Agents API, FLUX.1 Kontext, DeepSeek-R1 Update, Codestral Embed, AgentSeek

Best prompting techniques for building AI agents

The Top AI Papers of the Week (May 19 - 25)
Gemini 2.5 Updates, Claude 4, II-Agent, Gemma 3n, MCIP, Veo 3

The Top AI Papers of the Week (May 12 - 18)

AlphaEvolve, codex-1, SWE-1, AI Agents vs. Agentic AI, OpenMemory MCP

Main reasons LLMs get "lost" in multi-turn conversations and mitigation strategies

The Top AI Papers of the Week (May 5 - 11)

Mistral Medium 3, Gemini 2.5 Pro (Update), Deep Research Guide, Wave 8, Kevin-32B, Top AI Dev News

The Top AI Papers of the Week (April 28 - May 4)

Qwen3, mem0, Llama API, Bamba, Qwen2.5-Omni-3B

The Top AI Papers of the Week (April 21 - 27)

GPT-Image-1, ADK Guide, Multi-Agent Builder, UXAgent, Building Code Agents

The Top AI Papers of the Week (April 14 - April 20)

o4, Gemini 2.5 Flash, Embed 4, GUI-R1, FastAPI-MCP