hiroppy's site

お問い合わせ

hiroppy's site

Simon Willison's Blog

simonwillison.net/

474

Articles

2月3日 17:00

Last updated

January sponsors-only newsletter is out

I just sent the January edition of my sponsors-only monthly newsletter. If you are a sponsor (or if you start a sponsorship now) you can access it here. In the …

Simon Willison's Blog2026-02-03

apicloudtool

Quoting Brandon Sanderson

This is the difference between Data and a large language model, at least the ones operating right now. Data created art because he wanted to grow. He wanted to become …

Simon Willison's Blog2026-02-03

platform

Introducing the Codex app

OpenAI just released a new macOS app for their Codex coding agent. I've had a few days of preview access - it's a solid app that provides a nice UI …

Simon Willison's Blog2026-02-02

librarytool

A Social Network for A.I. Bots Only. No Humans Allowed.

I talked to Cade Metz for this New York Times piece on OpenClaw and Moltbook. Cade reached out after seeing my blog post about that from the other day. In …

Simon Willison's Blog2026-02-02

platform

TIL: Running OpenClaw in Docker

I've been running OpenClaw using Docker on my Mac. Here are the first in my ongoing notes on how I set that up and the commands I'm using to administer …

Simon Willison's Blog2026-02-01

tool

Quoting Andrej Karpathy

Originally in 2019, GPT-2 was trained by OpenAI on 32 TPU v3 chips for 168 hours (7 days), with $8/hour/TPUv3 back then, for a total cost of approx. $43K. It …

Simon Willison's Blog2026-01-31

platform

Quoting Steve Yegge

Getting agents using Beads requires much less prompting, because Beads now has 4 months of “Desire Paths” design, which I’ve talked about before. Beads has evolved a very complex command-line …

Simon Willison's Blog2026-01-30

tool

Moltbook is the most interesting place on the internet right now

The hottest project in AI right now is Clawdbot, renamed to Moltbot, renamed to OpenClaw. It’s an open source implementation of the digital personal assistant pattern, built by Peter Steinberger …

Simon Willison's Blog2026-01-30

apitool

We gotta talk about AI as a programming tool for the arts

Chris Ashworth is the creator and CEO of QLab, a macOS software package for “cue-based, multimedia playback” which is designed automate lighting and audio for live theater productions. I recently …

Simon Willison's Blog2026-01-30

frameworktool

Datasette 1.0a24

New Datasette alpha this morning. Key new features: Datasette's Request object can now handle multipart/form-data file uploads via the new await request.form(files=True) method. I plan to use this for a …

Simon Willison's Blog2026-01-29

tool

The Five Levels: from Spicy Autocomplete to the Dark Factory

Dan Shapiro proposes a five level model of AI-assisted programming, inspired by the five (or rather six, it's zero-indexed) levels of driving automation. Spicy autocomplete, aka original GitHub Copilot or …

Simon Willison's Blog2026-01-28

platform

One Human + One Agent = One Browser From Scratch

embedding-shapes was so infuriated by the hype around Cursor's FastRender browser project - thousands of parallel agents producing ~1.6 million lines of Rust - that they were inspired to take …

Simon Willison's Blog2026-01-27

librarytool

Kimi K2.5: Visual Agentic Intelligence

Kimi K2 landed in July as a 1 trillion parameter open weight LLM. It was joined by Kimi K2 Thinking in November which added reasoning capabilities. Now they've made it …

Simon Willison's Blog2026-01-27

platform

Tips for getting coding agents to write good Python tests

Someone asked on Hacker News if I had any tips for getting coding agents to write decent quality tests. Here's what I said: I work in Python which helps a …

Simon Willison's Blog2026-01-26

testingtool

ChatGPT Containers can now run bash, pip/npm install packages, and download files

One of my favourite features of ChatGPT is its ability to write and execute code in a container. This feature launched as ChatGPT Code Interpreter nearly three years ago, was …

Simon Willison's Blog2026-01-26

tool

the browser is the sandbox

Paul Kinlan is a web platform developer advocate at Google and recently turned his attention to coding agents. He quickly identified the importance of a robust sandbox for agents to …

Simon Willison's Blog2026-01-25

apitool

Don't "Trust the Process"

Jenny Wen, Design Lead at Anthropic (and previously Director of Design at Figma) gave a provocative keynote at Hatch Conference in Berlin last September. Jenny argues that the Design Process …

Simon Willison's Blog2026-01-24

tool

Quoting Jasmine Sun

If you tell a friend they can now instantly create any app, they’ll probably say “Cool! Now I need to think of an idea.” Then they will forget about it, …

Simon Willison's Blog2026-01-24

tool

Quoting Theia Vogel

[...] i was too busy with work to read anything, so i asked chatgpt to summarize some books on state formation, and it suggested circumscription theory. there was already the …

Simon Willison's Blog2026-01-23

platform

Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation

I haven't been paying much attention to the state-of-the-art in speech generation models other than noting that they've got really good, so I can't speak for how notable this new …

Simon Willison's Blog2026-01-22

tool

Quoting Thariq Shihipar

Most people's mental model of Claude Code is that "it's just a TUI" but it should really be closer to "a small game engine". For each frame our pipeline constructs …

Simon Willison's Blog2026-01-22

tool

Claude's new constitution

Late last year Richard Weiss found something interesting while poking around with the just-released Claude Opus 4.5: he was able to talk the model into regurgitating a document which was …

Simon Willison's Blog2026-01-21

platform

Electricity use of AI coding agents

Previous work estimating the energy and water cost of LLMs has generally focused on the cost per prompt using a consumer-level system such as ChatGPT. Simon P. Couch notes that …

Simon Willison's Blog2026-01-20

platform

Giving University Exams in the Age of Chatbots

Detailed and thoughtful description of an open-book and open-chatbot exam run by Ploum at École Polytechnique de Louvain for an "Open Source Strategies" class. Students were told they could use …

Simon Willison's Blog2026-01-20

platform

jordanhubbard/nanolang

Plenty of people have mused about what a new programming language specifically designed to be used by LLMs might look like. Jordan Hubbard (co-founder of FreeBSD, with serious stints at …

Simon Willison's Blog2026-01-19

library

Scaling long-running autonomous coding

Wilson Lin at Cursor has been doing some experiments to see how far you can push a large fleet of "autonomous" coding agents: This post describes what we've learned from …

Simon Willison's Blog2026-01-19

frameworktool

FLUX.2-klein-4B Pure C Implementation

On 15th January Black Forest Labs, a lab formed by the creators of the original Stable Diffusion, released black-forest-labs/FLUX.2-klein-4B - an Apache 2.0 licensed 4 billion parameter version of their …

Simon Willison's Blog2026-01-18

librarytool

Quoting Jeremy Daer

[On agents using CLI tools in place of REST APIs] To save on context window, yes, but moreso to improve accuracy and success rate when multiple tool calls are involved, …

Simon Willison's Blog2026-01-17

apitool

Our approach to advertising and expanding access to ChatGPT

The long-rumored introduction of ads to ChatGPT just became a whole lot more concrete: In the coming weeks, we’re also planning to start testing ads in the U.S. for the …

Simon Willison's Blog2026-01-16

apitool

Open Responses

This is the standardization effort I've most wanted in the world of LLMs: a vendor-neutral specification for the JSON API that clients can use to talk to hosted LLMs. Open …

Simon Willison's Blog2026-01-15

apitool

Quoting Boaz Barak, Gabriel Wu, Jeremy Chen and Manas Joglekar

When we optimize responses using a reward model as a proxy for “goodness” in reinforcement learning, models sometimes learn to “hack” this proxy and output an answer that only “looks …

Simon Willison's Blog2026-01-15

platform

Claude Cowork Exfiltrates Files

Claude Cowork defaults to allowing outbound HTTP traffic to only a specific list of domains, to help protect the user against prompt injection attacks that exfiltrate their data. Prompt Armor …

Simon Willison's Blog2026-01-14

apisecurity

Anthropic invests $1.5 million in the Python Software Foundation and open source security

This is outstanding news, especially given our decision to withdraw from that NSF grant application back in October. We are thrilled to announce that Anthropic has entered into a two-year …

Simon Willison's Blog2026-01-13

apitool

Superhuman AI Exfiltrates Emails

Classic prompt injection attack: When asked to summarize the user’s recent mail, a prompt injection in an untrusted email manipulated Superhuman AI to submit content from dozens of other sensitive …

Simon Willison's Blog2026-01-12

security

First impressions of Claude Cowork, Anthropic's general agent

New from Anthropic today is Claude Cowork, a “research preview” that they describe as “Claude Code for the rest of your work”. It’s currently available only to Max subscribers ($100 …

Simon Willison's Blog2026-01-12

apitool

Don't fall into the anti-AI hype

I'm glad someone was brave enough to say this. There is a lot of anti-AI sentiment in the software development community these days. Much of it is justified, but if …

Simon Willison's Blog2026-01-11

tool

My answers to the questions I posed about porting open source code with LLMs

Last month I wrote about porting JustHTML from Python to JavaScript using Codex CLI and GPT-5.2 in a few hours while also buying a Christmas tree and watching Knives Out …

Simon Willison's Blog2026-01-11

librarytool

Quoting Linus Torvalds

Also note that the python visualizer tool has been basically written by vibe-coding. I know more about analog filters -- and that's not saying much -- than I do about …

Simon Willison's Blog2026-01-11

apitool

A Software Library with No Code

Provocative experiment from Drew Breunig, who designed a new library for time formatting ("3 hours ago" kind of thing) called "whenwords" that has no code at all, just a carefully …

Simon Willison's Blog2026-01-10

apilibrarytool

LLM predictions for 2026, shared with Oxide and Friends

I joined a recording of the Oxide and Friends podcast on Tuesday to talk about 1, 3 and 6 year predictions for the tech industry. This is my second appearance …

Simon Willison's Blog2026-01-08

apicloudtool

How Google Got Its Groove Back and Edged Ahead of OpenAI

I picked up a few interesting tidbits from this Wall Street Journal piece on Google's recent hard won success with Gemini. Here's the origin of the name "Nano Banana": Naina …

Simon Willison's Blog2026-01-08

platform

Quoting Adam Wathan

[...] the reality is that 75% of the people on our engineering team lost their jobs here yesterday because of the brutal impact AI has had on our business. And …

Simon Willison's Blog2026-01-07

apitool

Quoting Robin Sloan

AGI is here! When exactly it arrived, we’ll never know; whether it was one company’s Pro or another company’s Pro Max (Eddie Bauer Edition) that tip-toed first across the line …

Simon Willison's Blog2026-01-07

platform

A field guide to sandboxes for AI

This guide to the current sandboxing landscape by Luis Cardoso is comprehensive, dense and absolutely fantastic. He starts by differentiating between containers (which share the host kernel), microVMs (their own …

Simon Willison's Blog2026-01-06

tool

Oxide and Friends Predictions 2026, today at 4pm PT

I joined the Oxide and Friends podcast last year to predict the next 1, 3 and 6 years(!) of AI developments. With hindsight I did very badly, but they're inviting …

Simon Willison's Blog2026-01-05

podcast

The November 2025 inflection point

It genuinely feels to me like GPT-5.2 and Opus 4.5 in November represent an inflection point - one of those moments where the models get incrementally better in a way …

Simon Willison's Blog2026-01-04

platform

Helping people write code again

Something I like about our weird new LLM-assisted world is the number of people I know who are coding again, having mostly stopped as they moved into management roles or …

Simon Willison's Blog2026-01-04

tool

Quoting Jaana Dogan

I'm not joking and this isn't funny. We have been trying to build distributed agent orchestrators at Google since last year. There are various options, not everyone is aligned... I …

Simon Willison's Blog2026-01-04

platform