Elvis Saravia's NLP Blog
nlp.elvissaravia.com/
🥇Top AI Papers of the Week
The Top AI Papers of the Week (November 3 - 9)
🤖 AI Agents Weekly: Context Engineering 2.0, Kimi K2 Thinking, Windsurf Codemaps, Google File Search, Tool-to-Agent Retrieval
Context Engineering 2.0, Kimi K2 Thinking, Windsurf Codemaps, Google File Search, Tool-to-Agent Retrieval
🥇Top AI Papers of the Week
The Top AI Papers of the Week (October 27 - November 2)
🤖 AI Agents Weekly: MiniMax-M2, Cursor 2.0, SWE-1.5, Agent Data Protocol, Kimi CLI
MiniMax-M2, Cursor 2.0, SWE-1.5, Agent Data Protocol, Kimi CLI
🥇Top AI Papers of the Week
The Top AI Papers of the Week (October 20-26)
🤖 AI Agents Weekly: DeepSeek-OCR, Claude Code on the Web, ChatGPT Atlas Browser,...
DeepSeek-OCR, Claude Code on the Web, ChatGPT Atlas Browser
🥇Top AI Papers of the Week
The Top AI Papers of the Week (October 13-19)
🤖 AI Agents Weekly: Claude Haiku 4.5, Deep Agents, SWE-grep, nanochat, Agent Skills, Veo 3.1 Fast, n8n AI Workflow Builder
Claude Haiku 4.5, Deep Agents, SWE-grep, nanochat, Agent Skills, Veo 3.1 Fast, n8n AI Workflow Builder
Deep Agents
On the future of AI Agents.
🥇Top AI Papers of the Week
The Top AI Papers of the Week (October 6-12)
🤖 AI Agents Weekly: AgentKit, Gemini 2.5 Computer Use, State of AI Report 2025, Agentic Context Engineering, CodeMender
AgentKit, Gemini 2.5 Computer Use, State of AI Report 2025, Agentic Context Engineering, CodeMender
🥇Top AI Papers of the Week
The Top AI Papers of the Week (September 29 - October 5)
🤖 AI Agents Weekly: Claude Agent SDK, Sora 2, Claude Sonnet 4.5, Microsoft Agent Framework, GLM-4.6, Agentic Commerce Protocol
Claude Agent SDK, Sora 2, Claude Sonnet 4.5, Microsoft Agent Framework, GLM-4.6, Agentic Commerce Protocol
🥇Top AI Papers of the Week
The Top AI Papers of the Week (September 22-28)
🤖 AI Agents Weekly: Code World Model, Gemini Robotics-ER 1.5, Figma MCP server, Overhearing LLM Agents, Qwen3-Max, Gamma API
Code World Model, Gemini Robotics-ER 1.5, Figma MCP server, Overhearing LLM Agents, Qwen3-Max, Gamma API
🥇Top AI Papers of the Week
The Top AI Papers of the Week (September 15-21)
🤖 AI Agents Weekly: GPT-5-Codex, Grok 4 Fast, Tongyi DeepResearch, Magistral Small 1.2, Agent Payments Protocol (AP2)
GPT-5-Codex, Grok 4 Fast, Tongyi DeepResearch, Magistral Small 1.2, Agent Payments Protocol (AP2)
🥇Top AI Papers of the Week
The Top AI Papers of the Week (September 8-14)
🤖 AI Agents Weekly: Agent 3, ChatGPT Developer Mode, MCP Registry, Writing Effective Tools for Agents, Qodo Aware
Agent 3, ChatGPT Developer Mode, MCP Registry, Writing Effective Tools for Agents, Qodo Aware
🥇Top AI Papers of the Week
The Top AI Papers of the Week (September 1-7)
🤖 AI Agents Weekly: Universal Deep Research, GPT-4b micro, Self-Evolving Agents, Tracking Multi-Agent Failures
Universal Deep Research, GPT-4b micro, Self-Evolving Agents, Tracking Multi-Agent Failures
🥇Top AI Papers of the Week
The Top AI Papers of the Week (August 25-31)
🤖 AI Agents Weekly: Gemini 2.5 Flash Image, gpt-realtime, Anemoi Agent, Fine-tuning LLM Agents, Codex Updates, Agent Client Protocol
Gemini 2.5 Flash Image, gpt-realtime, Anemoi Agent, Fine-tuning LLM Agents, Codex Updates, Agent Client Protocol
🥇Top AI Papers of the Week
The Top AI Papers of the Week (August 18-24)
🤖 Agents Weekly: DeepSeek-V3.1, AGENTS.md, URL Context, Context Engineering Tips, Qwen-Image-Edit
DeepSeek-V3.1, AGENTS.md, URL Context, Context Engineering Tips, Qwen-Image-Edit
🥇Top AI Papers of the Week
The Top AI Papers of the Week (August 11-17)
🤖 AI Agents Weekly: DINOv3, Claude Sonnet-1M, GLM-4.5V, Benchmarking AI Agent Memory, Deep Agents, Claude Code Output Styles
DINOv3, Claude Sonnet-1M, GLM-4.5V, Benchmarking AI Agent Memory, Deep Agents, Claude Code Output Styles
🥇Top AI Papers of the Week
The Top AI Papers of the Week (August 4-10)
🤖 AI Agents Weekly: GPT-5, Genie 3, gpt-oss, Cursor CLI, Opus 4.1, Efficient AI Agents
GPT-5, Genie 3, gpt-oss, Cursor CLI, Opus 4.1, Efficient AI Agents
🥇Top AI Papers of the Week
The Top AI Papers of the Week (July 28 - August 3)
🤖 AI Agents Weekly: GLM-4.5, AI SDK 5, Video Overviews, ChatGPT Study Mode, Context engineering Tips, AlphaEarth Foundations
GLM-4.5, AI SDK 5, Video Overviews, ChatGPT Study Mode, Context engineering Tips, AlphaEarth Foundations
🥇Top AI Papers of the Week
The Top AI Papers of the Week (July 21 - 27)
🤖 AI Agents Weekly: Lovable Agents, GitHub Spark, Qwen3-Coder, Search Arena, Awesome Context Engineering
Lovable Agents, GitHub Spark, Qwen3-Coder, Search Arena, Awesome Context Engineering
🥇Top AI Papers of the Week
The Top AI Papers of the Week (July 14 - 20)
🤖 AI Agents Weekly: ChatGPT Agent, Gemini Embeddings, Agent Leaderboard v2, Voxtral, CRMAgent
ChatGPT Agent, Gemini Embeddings, Agent Leaderboard v2, Voxtral, CRMAgent
🥇Top AI Papers of the Week
The Top AI Papers of the Week (July 7 - 13)
🤖 AI Agents Weekly: Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5
Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5
🥇Top AI Papers of the Week
The Top AI Papers of the Week (June 30 - July 6)
Context Engineering Guide
Prompt engineering is being rebranded as context engineering
🤖 AI Agents Weekly: DeepSWE, Cursor 1.2, Evaluating Multi-Agent Systems, Prover Agent, Top AI Devs News
DeepSWE, Cursor 1.2, Evaluating Multi-Agent Systems, Prover Agent, Top AI Devs News
🥇Top AI Papers of the Week
The Top AI Papers of the Week (June 23 - 29)
🤖 AI Agents Weekly: Gemini CLI, Qodo Gen CLI, Context Engineering, Claude Apps, AlphaGenome
Gemini CLI, Qodo Gen CLI, Context Engineering, Claude Apps, AlphaGenome
🥇Top AI Papers of the Week
The Top AI Papers of the Week (June 16 - 22)
🤖 AI Agents Weekly: Software 3.0, Gemini 2.5 Updates, Safer AI Agents, Deep Research Tutorial & Benchmark
Software 3.0, Gemini 2.5 Updates, Safer AI Agents, Deep Research Tutorial & Benchmark
🥇Top AI Papers of the Week
The Top AI Papers of the Week (June 9 - 15)
🤖AI Agents Weekly: Magistral, Agent Bricks, Code Researcher, Automating Workflow Generation, Verified Superintelligence
Magistral, Agent Bricks, Code Researcher, Automating Workflow Generation, Verified Superintelligence
🥇Top AI Papers of the Week
The Top AI Papers of the Week (June 2 - 8)
🤖 AI Agents Weekly: Self-Improving Agents, Eleven v3, /Search, Deep Research Updates, Top AI Devs News, Agents SDK for TypeScript
Self-Improving Agents, Eleven v3, /Search, Deep Research Updates, Top AI Devs News, Agents SDK for TypeScript
🥇Top AI Papers of the Week
The Top AI Papers of the Week (May 26 - June 1)
⚡AI Agents Weekly: Mistral Agents API, FLUX.1 Kontext, DeepSeek-R1 Update, Codestral Embed, AgentSeek
Mistral Agents API, FLUX.1 Kontext, DeepSeek-R1 Update, Codestral Embed, AgentSeek
State-Of-The-Art Prompting For AI Agents
Best prompting techniques for building AI agents
🥇Top AI Papers of the Week
The Top AI Papers of the Week (May 19 - 25)
🤖AI Agents Weekly: Gemini 2.5 Updates, Claude 4, II-Agent, Gemma 3n, MCIP, Veo 3
Gemini 2.5 Updates, Claude 4, II-Agent, Gemma 3n, MCIP, Veo 3
🥇Top AI Papers of the Week
The Top AI Papers of the Week (May 12 - 18)
🐙AI Agents Weekly: AlphaEvolve, codex-1, SWE-1, AI Agents vs. Agentic AI, OpenMemory MCP
AlphaEvolve, codex-1, SWE-1, AI Agents vs. Agentic AI, OpenMemory MCP
LLMs Get Lost in Multi-turn Conversation
Main reasons LLMs get "lost" in multi-turn conversations and mitigation strategies
🥇Top AI Papers of the Week
The Top AI Papers of the Week (May 5 - 11)
🔥AI Agents Weekly: Mistral Medium 3, Gemini 2.5 Pro (Update), Deep Research Guide, Wave 8, Kevin-32B, Top AI Dev News
Mistral Medium 3, Gemini 2.5 Pro (Update), Deep Research Guide, Wave 8, Kevin-32B, Top AI Dev News
🥇Top AI Papers of the Week
The Top AI Papers of the Week (April 28 - May 4)
🤖AI Agents Weekly: Qwen3, mem0, Llama API, Bamba, Qwen2.5-Omni-3B
Qwen3, mem0, Llama API, Bamba, Qwen2.5-Omni-3B
🥇Top AI Papers of the Week
The Top AI Papers of the Week (April 21 - 27)
🔥AI Agents Weekly: GPT-Image-1, ADK Guide, Multi-Agent Builder, UXAgent, Building Code Agents
GPT-Image-1, ADK Guide, Multi-Agent Builder, UXAgent, Building Code Agents
🥇Top AI Papers of the Week
The Top AI Papers of the Week (April 14 - April 20)
⚡AI Agents Weekly: o4, Gemini 2.5 Flash, Embed 4, GUI-R1, FastAPI-MCP
o4, Gemini 2.5 Flash, Embed 4, GUI-R1, FastAPI-MCP