Simon Willison

Co-creator of Django. Prolific writer on Python, LLMs, and data tools.

simonwillison.net

Python · LLMs · data tools

435 essays · 622k words total

Claude Opus 4.8: “a modest but tangible improvement” May 28, 2026 633 words
I think Anthropic and OpenAI have found product-market fit May 27, 2026 2k words
Notes on Pope Leo XIV’s encyclical on AI May 25, 2026 2k words
Datasette Agent May 21, 2026 575 words
The last six months in LLMs in five minutes May 19, 2026 944 words
Gemini 3.5 Flash: more expensive, but Google plan to use it for everything May 19, 2026 522 words
Notes on the xAI/Anthropic data center deal May 7, 2026 518 words
Vibe coding and agentic engineering are getting closer than I’d like May 6, 2026 2k words
Live blog: Code w/ Claude 2026 May 6, 2026 1k words
LLM 0.32a0 is a major backwards-compatible refactor Apr 29, 2026 1k words
Tracking the history of the now-deceased OpenAI Microsoft AGI clause Apr 27, 2026 669 words
DeepSeek V4—almost on the frontier, a fraction of the price Apr 24, 2026 553 words
A pelican for GPT-5.5 via the semi-official Codex backdoor API Apr 23, 2026 764 words
Extract PDF text in your browser with LiteParse for the web Apr 23, 2026 2k words
Is Claude Code going to cost $100/month? Probably not—it’s all very confusing Apr 22, 2026 1k words
Where’s the raccoon with the ham radio? (ChatGPT Images 2.0) Apr 21, 2026 615 words
Changes in the system prompt between Claude Opus 4.6 and 4.7 Apr 18, 2026 976 words
Join us at PyCon US 2026 in Long Beach—we have new AI and security tracks this year Apr 17, 2026 562 words
Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7 Apr 16, 2026 374 words
Meta’s new model is Muse Spark, and meta.ai chat has some interesting tools Apr 8, 2026 2k words
Anthropic’s Project Glasswing—restricting Claude Mythos to security researchers—sounds necessary to me Apr 7, 2026 1k words
The Axios supply chain attack used individually targeted social engineering Apr 3, 2026 352 words
Highlights from my conversation about agentic engineering on Lenny’s Podcast Apr 2, 2026 3k words
Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer Mar 30, 2026 798 words
Vibe coding SwiftUI apps is a lot of fun Mar 27, 2026 925 words
Experimenting with Starlette 1.0 with Claude skills Mar 22, 2026 739 words
Profiling Hacker News users based on their comments Mar 21, 2026 964 words
Thoughts on OpenAI acquiring Astral and uv/ruff/ty Mar 19, 2026 1k words
GPT-5.4 mini and GPT-5.4 nano, which can describe 76,000 photos for $52 Mar 17, 2026 347 words
My fireside chat about agentic engineering at the Pragmatic Summit Mar 14, 2026 3k words
Perhaps not Boring Technology after all Mar 9, 2026 379 words
Can coding agents relicense open source through a “clean room” implementation of code? Mar 5, 2026 1k words
Something is afoot in the land of Qwen Mar 4, 2026 687 words
I vibe coded my dream macOS presentation app Feb 25, 2026 1k words
Writing about Agentic Engineering Patterns Feb 23, 2026 537 words
Adding TILs, releases, museums, tools and research to my blog Feb 20, 2026 519 words
Two new Showboat tools: Chartroom and datasette-showboat Feb 17, 2026 2k words
Deep Blue Feb 15, 2026 962 words
The evolution of OpenAI’s mission statement Feb 13, 2026 480 words
Introducing Showboat and Rodney, so agents can demo what they’ve built Feb 10, 2026 2k words
How StrongDM’s AI team build serious software without even looking at the code Feb 7, 2026 1k words
Running Pydantic’s Monty Rust sandboxed Python subset in WebAssembly Feb 6, 2026 664 words
Distributing Go binaries like sqlite-scanner through PyPI using go-to-wheel Feb 4, 2026 1k words
Moltbook is the most interesting place on the internet right now Jan 30, 2026 1k words
Adding dynamic features to an aggressively cached website Jan 28, 2026 870 words
ChatGPT Containers can now run bash, pip/npm install packages, and download files Jan 26, 2026 2k words
Wilson Lin on FastRender: a browser built by thousands of parallel agents Jan 23, 2026 2k words
First impressions of Claude Cowork, Anthropic’s general agent Jan 12, 2026 1k words
My answers to the questions I posed about porting open source code with LLMs Jan 11, 2026 1k words
Fly’s new Sprites.dev addresses both developer sandboxes and API sandboxes at the same time Jan 9, 2026 1k words
LLM predictions for 2026, shared with Oxide and Friends Jan 8, 2026 2k words
Introducing gisthost.github.io Jan 1, 2026 749 words
2025: The year in LLMs Dec 31, 2025 7k words
How Rob Pike got spammed with an AI slop “act of kindness” Dec 26, 2025 2k words
A new way to extract detailed transcripts from Claude Code Dec 25, 2025 930 words
Cooking with Claude Dec 23, 2025 1k words
Your job is to deliver code you have proven to work Dec 18, 2025 828 words
Gemini 3 Flash Dec 17, 2025 1k words
I ported JustHTML from Python to JavaScript with Codex CLI and GPT-5.2 in 4.5 hours Dec 15, 2025 2k words
JustHTML is a fascinating example of vibe engineering in action Dec 14, 2025 853 words
OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI Dec 12, 2025 708 words
GPT-5.2 Dec 11, 2025 667 words
Useful patterns for building HTML tools Dec 10, 2025 3k words
Under the hood of Canada Spends with Brendan Samek Dec 9, 2025 304 words
Highlights from my appearance on the Data Renegades podcast with CL Kao and Dori Wilson Nov 26, 2025 3k words
sqlite-utils 4.0a1 has several (minor) backwards incompatible changes Nov 24, 2025 983 words
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult Nov 24, 2025 936 words
Olmo 3 is a fully open LLM Nov 22, 2025 1k words
Nano Banana Pro aka gemini-3-pro-image-preview is the best available image generation model Nov 20, 2025 1k words
How I automate my Substack newsletter with content from my blog Nov 19, 2025 969 words
Trying out Gemini 3 Pro with audio transcription and a new pelican benchmark Nov 18, 2025 2k words
What happens if AI labs train for pelicans riding bicycles? Nov 13, 2025 294 words
Reverse engineering Codex CLI to get GPT-5-Codex-Mini to draw me a pelican Nov 9, 2025 2k words
Code research projects with async coding agents like Claude Code and Codex Nov 6, 2025 2k words
Video + notes on upgrading a Datasette plugin for the latest 1.0 alpha, with help from uv and OpenAI Codex CLI Nov 6, 2025 919 words
A new SQL-powered permissions system in Datasette 1.0a20 Nov 4, 2025 2k words
New prompt injection papers: Agents Rule of Two and The Attacker Moves Second Nov 2, 2025 1k words
Hacking the WiFi-enabled color screen GitHub Universe conference badge Oct 28, 2025 1k words
Video: Building a tool to copy-paste share terminal sessions using Claude Code for web Oct 23, 2025 1k words
Living dangerously with Claude Oct 22, 2025 1k words
Dane Stuckey (OpenAI CISO) on prompt injection risks for ChatGPT Atlas Oct 22, 2025 1k words
Getting DeepSeek-OCR working on an NVIDIA Spark via brute force using Claude Code Oct 20, 2025 2k words
Claude Code for web—a new asynchronous coding agent from Anthropic Oct 20, 2025 1k words
Claude Skills are awesome, maybe a bigger deal than MCP Oct 16, 2025 2k words
NVIDIA DGX Spark: great hardware, early days for the ecosystem Oct 14, 2025 2k words
Claude can write complete Datasette plugins now Oct 8, 2025 1k words
Vibe engineering Oct 7, 2025 1k words
OpenAI DevDay 2025 live blog Oct 6, 2025 1k words
Embracing the parallel coding agent lifestyle Oct 5, 2025 1k words
Designing agentic loops Sep 30, 2025 2k words
Claude Sonnet 4.5 is probably the “best coding model in the world” (at least for now) Sep 29, 2025 1k words
I think “agent” may finally have a widely enough agreed upon definition to be useful jargon now Sep 18, 2025 1k words
Recreating the Apollo AI adoption rate chart with GPT-5, Python and Pyodide Sep 9, 2025 2k words
My review of Claude’s new Code Interpreter, released under a very confusing name Sep 9, 2025 2k words
GPT-5 Thinking in ChatGPT (aka Research Goblin) is shockingly good at search Sep 6, 2025 2k words
V&A East Storehouse and Operation Mincemeat in London Aug 27, 2025 441 words
Open weight LLMs exhibit inconsistent performance across providers Aug 15, 2025 617 words
The Summer of Johann: prompt injections as far as the eye can see Aug 15, 2025 1k words
LLM 0.27, the annotated release notes: GPT-5 and improved tool calling Aug 11, 2025 1k words
Qwen3-4B-Thinking: “This is art—pelicans don’t ride bikes!” Aug 10, 2025 841 words
My Lethal Trifecta talk at the Bay Area AI Security Meetup Aug 9, 2025 2k words
The surprise deprecation of GPT-4o for ChatGPT consumers Aug 8, 2025 903 words
GPT-5: Key characteristics, pricing and model card Aug 7, 2025 2k words
OpenAI’s new open weight (Apache 2) models are really good Aug 5, 2025 3k words
ChatGPT agent’s user-agent Aug 4, 2025 991 words
The ChatGPT sharing dialog demonstrates how difficult it is to design privacy preferences Aug 3, 2025 856 words
Trying out Qwen3 Coder Flash using LM Studio and Open WebUI and LLM Jul 31, 2025 1k words
Reverse engineering some updates to Claude Jul 31, 2025 1k words
My 2.5 year old laptop can write Space Invaders in JavaScript now, using GLM-4.5 Air and MLX Jul 29, 2025 561 words
Using GitHub Spark to reverse engineer GitHub Spark Jul 24, 2025 3k words
Vibe scraping and vibe coding a schedule app for Open Sauce 2025 entirely on my phone Jul 17, 2025 2k words
Happy 20th birthday Django! Here’s my talk on Django Origins from Django’s 10th Jul 13, 2025 7k words
Grok: searching X for “from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)” Jul 11, 2025 871 words
Phoenix.new is Fly’s entry into the prompt-driven app development space Jun 23, 2025 1k words
Trying out the new Gemini 2.5 model family Jun 17, 2025 1k words
The lethal trifecta for AI agents: private data, untrusted content, and external communication Jun 16, 2025 1k words
An Introduction to Google’s Approach to AI Agent Security Jun 15, 2025 2k words
Design Patterns for Securing LLM Agents against Prompt Injections Jun 13, 2025 2k words
Comma v0.1 1T and 2T—7B LLMs trained on openly licensed text Jun 7, 2025 632 words
The last six months in LLMs, illustrated by pelicans on bicycles Jun 6, 2025 3k words
Tips on prompting ChatGPT for UK technology secretary Peter Kyle Jun 3, 2025 1k words
How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM May 31, 2025 2k words
Talking AI and jobs with Natasha Zouves for News Nation May 30, 2025 2k words
Large Language Models can run tools in your terminal with LLM 0.26 May 27, 2025 2k words
Highlights from the Claude 4 system prompt May 25, 2025 6k words
Live blog: Claude 4 launch at Code with Claude May 22, 2025 2k words
I really don’t like ChatGPT’s new memory dossier May 21, 2025 2k words
Building software on top of Large Language Models May 15, 2025 2k words
Trying out llama.cpp’s new vision support May 10, 2025 749 words
Saying “hi” to Microsoft’s Phi-4-reasoning May 6, 2025 1k words
Feed a video to a vision LLM as a sequence of JPEG frames on the CLI (also LLM 0.25) May 5, 2025 1k words
Two publishers and three authors fail to understand what “vibe coding” means May 1, 2025 823 words
Understanding the recent criticism of the Chatbot Arena Apr 30, 2025 1k words
Qwen 3 offers a case study in how to effectively release a model Apr 29, 2025 1k words
Watching o3 guess a photo’s location is surreal, dystopian and wildly entertaining Apr 26, 2025 1k words
Exploring Promptfoo via Dave Guarino’s SNAP evals Apr 24, 2025 585 words
AI assisted search-based research actually works now Apr 21, 2025 1k words
Maybe Meta’s Llama claims to be open source because of the EU AI act Apr 19, 2025 803 words
Image segmentation using Gemini 2.5 Apr 18, 2025 1k words
GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Apr 14, 2025 1k words
CaMeL offers a promising new direction for mitigating prompt injection attacks Apr 11, 2025 2k words
Model Context Protocol has prompt injection security problems Apr 9, 2025 1k words
Long context support in LLM 0.24 using fragments and template plugins Apr 7, 2025 2k words
Initial impressions of Llama 4 Apr 5, 2025 1k words
Putting Gemini 2.5 Pro through its paces Mar 25, 2025 2k words
Calling a wrap on my weeknotes Mar 20, 2025 280 words
New audio models from OpenAI, but how much can we rely on them? Mar 20, 2025 612 words
Not all AI-assisted programming is vibe coding (but vibe coding rocks) Mar 19, 2025 1k words
Adding AI-generated descriptions to my tools collection Mar 13, 2025 585 words
Notes on Google’s Gemma 3 Mar 12, 2025 710 words
Here’s how I use LLMs to help me write code Mar 11, 2025 5k words
What’s new in the world of LLMs, for NICAR 2025 Mar 8, 2025 2k words
I built an automaton called Squadron Mar 4, 2025 1k words
Hallucinations in code are the least dangerous form of LLM mistakes Mar 2, 2025 1k words
Notes from my Accessibility and Gen AI podcast appearance Mar 2, 2025 908 words
Structured data extraction from unstructured content using LLM schemas Feb 28, 2025 2k words
Initial impressions of GPT-4.5 Feb 27, 2025 649 words
Claude 3.7 Sonnet, extended thinking and long output, llm-anthropic 0.14 Feb 25, 2025 1k words
LLM 0.22, the annotated release notes Feb 17, 2025 1k words
Run LLMs on macOS using llm-mlx and Apple’s MLX framework Feb 15, 2025 1k words
URL-addressable Pyodide Python environments Feb 13, 2025 2k words
Using pip to install a Large Language Model that’s under 100MB Feb 7, 2025 1k words
OpenAI o3-mini, now available in LLM Jan 31, 2025 721 words
Anthropic’s new Citations API Jan 24, 2025 1k words
A selfish personal argument for releasing code as Open Source Jan 24, 2025 438 words
Six short video demos of LLM and Datasette projects Jan 22, 2025 896 words
DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B Jan 20, 2025 1k words
My AI/LLM predictions for the next 1, 3 and 6 years, for Oxide and Friends Jan 10, 2025 3k words
Weeknotes: Starting 2025 a little slow Jan 4, 2025 484 words
Ending a year long posting streak Jan 2, 2025 272 words
I still don’t think companies serve you ads based on spying through your microphone Jan 2, 2025 684 words
Things we learned about LLMs in 2024 Dec 31, 2024 7k words
Trying out QvQ—Qwen’s new visual reasoning model Dec 24, 2024 2k words
My approach to running a link blog Dec 22, 2024 1k words
December in LLMs has been a lot Dec 20, 2024 777 words
Live blog: the 12th day of OpenAI—“Early evals for OpenAI o3” Dec 20, 2024 739 words
Building Python tools with a one-shot prompt using uv run and Claude Projects Dec 19, 2024 794 words
Gemini 2.0 Flash “Thinking mode” Dec 19, 2024 447 words
Gemini 2.0 Flash: An outstanding multi-modal LLM with a sci-fi streaming mode Dec 11, 2024 1k words
ChatGPT Canvas can make API requests now, but it’s complicated Dec 10, 2024 1k words
I can now run a GPT-4 class model on my laptop Dec 9, 2024 2k words
Prompts.js Dec 7, 2024 938 words
First impressions of the new Amazon Nova LLMs (via a new llm-bedrock plugin) Dec 4, 2024 2k words
Storing times for human events Nov 27, 2024 2k words
Ask questions of SQLite databases and CSV/JSON files in your terminal Nov 25, 2024 693 words
Weeknotes: asynchronous LLMs, synchronous embeddings, and I kind of started a podcast Nov 22, 2024 756 words
Notes from Bing Chat—Our First Encounter With Manipulative AI Nov 19, 2024 364 words
Project: Civic Band—scraping and searching PDF meeting minutes from hundreds of municipalities Nov 16, 2024 638 words
Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac Nov 12, 2024 567 words
Visualizing local election results with Datasette, Observable and MapLibre GL Nov 9, 2024 2k words
Project: VERDAD—tracking misinformation in radio broadcasts using Gemini 1.5 Nov 7, 2024 806 words
Claude 3.5 Haiku Nov 4, 2024 448 words
W̶e̶e̶k̶n̶o̶t̶e̶s̶ Monthnotes for October Oct 30, 2024 680 words
You can now run prompts against images, audio and video in your terminal using LLM Oct 29, 2024 1k words
Run a prompt to generate and execute jq programs using llm-jq Oct 27, 2024 364 words
Notes on the new Claude analysis JavaScript code execution tool Oct 24, 2024 777 words
Initial explorations of Anthropic’s new Computer Use capability Oct 22, 2024 1k words
Everything I built with Claude Artifacts this week Oct 21, 2024 2k words
Running Llama 3.2 Vision and Phi-3.5 Vision on a Mac with mistral.rs Oct 19, 2024 878 words
Experimenting with audio input and output for the OpenAI Chat Completion API Oct 18, 2024 1k words
Video scraping: extracting JSON data from a 35 second screen capture for less than 1/10th of a cent Oct 17, 2024 1k words
ChatGPT will happily write you a thinly disguised horoscope Oct 15, 2024 1k words
OpenAI DevDay: Let’s build developer tools, not digital God Oct 2, 2024 2k words
OpenAI DevDay 2024 live blog Oct 1, 2024 6k words
Weeknotes: Three podcasts, two trips and a new plugin system Sep 30, 2024 556 words
NotebookLM’s automatically generated podcasts are surprisingly effective Sep 29, 2024 1k words
Themes from DjangoCon US 2024 Sep 27, 2024 1k words
DJP: A plugin system for Django Sep 25, 2024 1k words
Notes on using LLMs for code Sep 20, 2024 821 words
Things I’ve learned serving on the board of the Python Software Foundation Sep 18, 2024 3k words
Notes on OpenAI’s new o1 chain-of-thought models Sep 12, 2024 1k words
Notes from my appearance on the Software Misadventures Podcast Sep 10, 2024 2k words
Calling LLMs from client-side JavaScript, converting PDFs to HTML + weeknotes Sep 6, 2024 2k words
Building a tool showing how Gemini Pro can return bounding boxes for objects in images Aug 26, 2024 2k words
Claude’s API now supports CORS requests, enabling client-side applications Aug 23, 2024 459 words
Optimizing Datasette (and other weeknotes) Aug 22, 2024 2k words
django-http-debug, a new Django app mostly written by Claude Aug 8, 2024 2k words
Weeknotes: a staging environment, a Datasette alpha and a bunch of new LLMs Aug 6, 2024 1k words
Datasette 1.0a14: The annotated release notes Aug 5, 2024 1k words
Weeknotes: GPT-4o mini, LLM 0.15, sqlite-utils 3.37 and building a staging environment Jul 19, 2024 658 words
Imitation Intelligence, my keynote for PyCon US 2024 Jul 14, 2024 7k words
Give people something to link to so they can talk about your features and ideas Jul 13, 2024 661 words
Weeknotes: a livestream, a surprise keynote and progress on Datasette Cloud billing Jul 2, 2024 970 words
Open challenges for AI engineering Jun 27, 2024 3k words
Building search-based RAG using Claude, Datasette and Val Town Jun 21, 2024 2k words
Weeknotes: Datasette Studio and a whole lot of blogging Jun 19, 2024 639 words
Language models on the command-line Jun 17, 2024 2k words
A homepage redesign for my blog’s 22nd birthday Jun 12, 2024 248 words
Thoughts on the WWDC 2024 keynote on Apple Intelligence Jun 10, 2024 768 words
Accidental prompt injection against RAG applications Jun 6, 2024 407 words
Training is not the same as chatting: ChatGPT and other LLMs don’t remember everything you say May 29, 2024 2k words
Weeknotes: PyCon US 2024 May 28, 2024 419 words
ChatGPT in “4o” mode is not running the new features yet May 15, 2024 836 words
Slop is the new name for unwanted AI-generated content May 8, 2024 264 words
Weeknotes: more datasette-secrets, plus a mystery video project May 7, 2024 774 words
Weeknotes: Llama 3, AI for Data Journalism, llm-evals and datasette-secrets Apr 23, 2024 850 words
Options for accessing Llama 3 from the terminal using LLM Apr 22, 2024 2k words
AI for Data Journalism: demonstrating what we can do with this stuff right now Apr 17, 2024 4k words
Three major LLM releases in 24 hours (plus weeknotes) Apr 10, 2024 951 words
Building files-to-prompt entirely using Claude 3 Opus Apr 8, 2024 2k words
Running OCR against PDFs and images directly in your browser Mar 30, 2024 1k words
llm cmd undo last git commit—a new plugin for LLM Mar 26, 2024 680 words
Building and testing C extensions for SQLite with ChatGPT Code Interpreter Mar 23, 2024 3k words
Claude and ChatGPT for ad-hoc sidequests Mar 22, 2024 1k words
Weeknotes: the aftermath of NICAR Mar 16, 2024 1k words
The GPT-4 barrier has finally been broken Mar 8, 2024 688 words
Prompt injection and jailbreaking are not the same thing Mar 5, 2024 1k words
Interesting ideas in Observable Framework Mar 3, 2024 2k words
Weeknotes: Getting ready for NICAR Feb 27, 2024 1k words
The killer app of Gemini Pro 1.5 is video Feb 21, 2024 2k words
Weeknotes: a Datasette release, an LLM release and a bunch of new plugins Feb 9, 2024 911 words
Datasette 1.0a8: JavaScript plugins, new plugin hooks and plugin configuration in datasette.yaml Feb 7, 2024 2k words
LLM 0.13: The annotated release notes Jan 26, 2024 1k words
Weeknotes: datasette-test, datasette-build, PSF board retreat Jan 21, 2024 649 words
Talking about Open Source LLMs on Oxide and Friends Jan 17, 2024 2k words
Publish Python packages to PyPI with a python-lib cookiecutter template and GitHub Actions Jan 16, 2024 595 words
What I should have said about the term Artificial Intelligence Jan 9, 2024 372 words
It’s OK to call it Artificial Intelligence Jan 7, 2024 2k words
Weeknotes: Page caching and custom templates for Datasette Cloud Jan 7, 2024 882 words
Tom Scott, and the formidable power of escalating streaks Jan 2, 2024 1k words
Stuff we figured out about AI in 2023 Dec 31, 2023 3k words
Recommendations to help mitigate prompt injection: limit the blast radius Dec 20, 2023 521 words
Many options for running Mistral models in your terminal using LLM Dec 18, 2023 2k words
The AI trust crisis Dec 14, 2023 2k words
Weeknotes: datasette-enrichments, datasette-comments, sqlite-chronicle Dec 8, 2023 966 words
Datasette Enrichments: a new plugin framework for augmenting your data Dec 1, 2023 1k words
llamafile is the new best way to run an LLM on your own computer Nov 29, 2023 537 words
Prompt injection explained, November 2023 edition Nov 27, 2023 1k words
I’m on the Newsroom Robots podcast, with thoughts on the OpenAI board Nov 25, 2023 1k words
Deciphering clues in a news article to understand how it was reported Nov 22, 2023 1k words
Weeknotes: DevDay, GitHub Universe, OpenAI chaos Nov 22, 2023 666 words
Exploring GPTs: ChatGPT in a trench coat? Nov 15, 2023 5k words
Financial sustainability for open source projects at GitHub Universe Nov 10, 2023 2k words
ospeak: a CLI tool for speaking text in the terminal via OpenAI Nov 7, 2023 995 words
DALL-E 3, GPT4All, PMTiles, sqlite-migrate, datasette-edit-schema Oct 30, 2023 1k words
Execute Jina embeddings with a CLI using llm-embed-jina Oct 26, 2023 1k words
Now add a walrus: Prompt engineering in DALL‑E 3 Oct 26, 2023 3k words
Embeddings: What they are and why they matter Oct 23, 2023 5k words
Weeknotes: PyBay, AI Engineer Summit, Datasette metadata and JavaScript plugins Oct 22, 2023 533 words
Open questions for AI engineering Oct 17, 2023 5k words
Multi-modal prompt injection image attacks against GPT-4V Oct 14, 2023 743 words
Weeknotes: the Datasette Cloud API, a podcast appearance and more Oct 1, 2023 1k words
Things I’ve learned about building CLI tools in Python Sep 30, 2023 1k words
Talking Large Language Models with Rooftop Ruby Sep 29, 2023 15k words
Weeknotes: Embeddings, more embeddings and Datasette Cloud Sep 17, 2023 2k words
Build an image search engine with llm-clip, chat with models with llm chat Sep 12, 2023 1k words
LLM now provides tools for working with embeddings Sep 4, 2023 3k words
Datasette 1.0a4 and 1.0a5, plus weeknotes Aug 30, 2023 2k words
Making Large Language Models work for you Aug 27, 2023 10k words
Datasette Cloud, Datasette 1.0a3, llm-mlc and more Aug 16, 2023 1k words
How I make annotated presentations Aug 6, 2023 2k words
Weeknotes: Plugins for LLM, sqlite-utils and Datasette Aug 5, 2023 1k words
Catching up on the weird world of LLMs Aug 3, 2023 6k words
Run Llama 2 on your own Mac using LLM and Homebrew Aug 1, 2023 1k words
sqlite-utils now supports plugins Jul 24, 2023 1k words
Accessing Llama 2 from the command-line with the llm-replicate plugin Jul 18, 2023 1k words
Weeknotes: Self-hosted language models with LLM plugins, a new Datasette tutorial, a dozen package releases, a dozen TILs Jul 16, 2023 1k words
My LLM CLI tool now supports self-hosted language models via plugins Jul 12, 2023 1k words
Weeknotes: symbex, LLM prompt templates, a bit of a break Jun 27, 2023 969 words
Symbex: search Python code for functions and classes, then pipe them into a LLM Jun 18, 2023 869 words
Understanding GPT tokenizers Jun 8, 2023 1k words
It’s infuriatingly hard to understand how closed models train on their input Jun 4, 2023 1k words
Weeknotes: Parquet in Datasette Lite, various talks, more LLM hacking Jun 4, 2023 674 words
ChatGPT should include inline tips May 30, 2023 788 words
Lawyer cites fake cases invented by ChatGPT, judge is not amused May 27, 2023 2k words
llm, ttok and strip-tags—CLI tools for working with ChatGPT and other LLMs May 18, 2023 1k words
Delimiters won’t save you from prompt injection May 11, 2023 789 words
Weeknotes: sqlite-utils 3.31, download-esm, Python in a sandbox May 10, 2023 566 words
Big Opportunities in Small Data May 8, 2023 274 words
Midjourney 5.1 May 4, 2023 299 words
Leaked Google document: “We Have No Moat, And Neither Does OpenAI” May 4, 2023 993 words
download-esm: a tool for downloading ECMAScript modules May 2, 2023 1k words
Prompt injection explained, with video, slides, and a transcript May 2, 2023 2k words
Weeknotes: Miscellaneous research into Rye, ChatGPT Code Interpreter and openai-to-sqlite May 1, 2023 809 words
Let’s be bear or bunny May 1, 2023 493 words
Enriching data with GPT3.5 and SQLite SQL functions Apr 29, 2023 1k words
The Dual LLM pattern for building AI assistants that can resist prompt injection Apr 25, 2023 3k words
Weeknotes: Citus Con, PyCon and three new niche museums Apr 23, 2023 207 words
Data analysis with SQLite and Python for PyCon 2023 Apr 20, 2023 267 words
What’s in the RedPajama-Data-1T LLM training set Apr 17, 2023 777 words
Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it’s very impressive Apr 16, 2023 1k words
sqlite-history: tracking changes to SQLite tables using triggers (also weeknotes) Apr 15, 2023 1k words
Prompt injection: What’s the worst that can happen? Apr 14, 2023 2k words
Running Python micro-benchmarks using the ChatGPT Code Interpreter alpha Apr 12, 2023 2k words
Thoughts on AI safety in this era of increasingly powerful open source LLMs Apr 10, 2023 766 words
Working in public Apr 8, 2023 526 words
The Changelog podcast: LLMs break the internet Apr 8, 2023 422 words
We need to tell people ChatGPT will lie to them, not debate linguistics Apr 7, 2023 1k words
Semi-automating a Substack newsletter with an Observable notebook Apr 4, 2023 2k words
Weeknotes: A new llm CLI tool, plus automating my weeknotes and newsletter Apr 4, 2023 697 words
What AI can do for you on the Theory of Change podcast Apr 2, 2023 502 words
Think of language models like ChatGPT as a “calculator for words” Apr 2, 2023 1k words
AI-enhanced development makes me more ambitious with my projects Mar 27, 2023 2k words
I built a ChatGPT plugin to answer questions about data hosted in Datasette Mar 24, 2023 1k words
Don’t trust AI to talk accurately about itself: Bard wasn’t trained on Gmail Mar 22, 2023 2k words
Weeknotes: AI won’t slow down, a new newsletter and a huge Datasette refactor Mar 22, 2023 1k words
A conversation about prompt engineering with CBC Day 6 Mar 18, 2023 2k words
Could you train a ChatGPT-beating model for $85,000 and run it in a browser? Mar 17, 2023 2k words
Stanford Alpaca, and the acceleration of on-device large language model development Mar 13, 2023 2k words
Large language models are having their Stable Diffusion moment Mar 11, 2023 2k words
ChatGPT couldn’t access the internet, even though it really looked like it could Mar 10, 2023 974 words
Weeknotes: NICAR, and an appearance on KQED Forum Mar 7, 2023 2k words
Thoughts and impressions of AI-assisted search from Bing Feb 24, 2023 2k words
In defense of prompt engineering Feb 21, 2023 904 words
I talked about Bing and tried to explain language models on live TV! Feb 19, 2023 1k words
Analytics: Hacker News v.s. a tweet from Elon Musk Feb 17, 2023 639 words
Bing: “I will not harm you unless you harm me first” Feb 15, 2023 4k words
Weeknotes: A bunch of things I learned this week, plus datasette-explain Feb 9, 2023 1k words
datasette-scraper, Big Local News and other weeknotes Jan 30, 2023 1k words
Exploring MusicCaps, the evaluation data released to accompany Google’s MusicLM text-to-music model Jan 27, 2023 1k words
Weeknotes: AI hacking and a SpatiaLite tutorial Jan 15, 2023 357 words
How to implement Q&A against your documentation with GPT3, embeddings and Datasette Jan 13, 2023 3k words
Datasette 0.64, with a warning about SpatiaLite Jan 9, 2023 637 words
2022 in projects and blogging Dec 31, 2022 1k words
Weeknotes: Datasette 0.63.3, datasette-ripgrep Dec 20, 2022 681 words
Datasette 1.0a2: Upserts and finely grained permissions Dec 15, 2022 2k words
Over-engineering Secret Santa with Python cryptography and Datasette Dec 11, 2022 2k words
AI assisted learning: Learning Rust with ChatGPT, Copilot and Advent of Code Dec 5, 2022 2k words
Weeknotes: datasette-ephemeral-tables, datasette-export Dec 5, 2022 515 words
A new AI game: Give me ideas for crimes to do Dec 4, 2022 924 words
Datasette’s new JSON write API: The first alpha of Datasette 1.0 Dec 2, 2022 2k words
Coping strategies for the serial project hoarder Nov 26, 2022 3k words
Weeknotes: Implementing a write API, Mastodon distractions Nov 23, 2022 757 words
Tracking Mastodon user numbers over time with a bucket of tricks Nov 20, 2022 1k words
Datasette is 5 today: a call for birthday presents Nov 13, 2022 525 words
Designing a write API for Datasette Nov 9, 2022 1k words
Mastodon is just blogs Nov 8, 2022 2k words
What to blog about Nov 6, 2022 508 words
It looks like I’m moving to Mastodon Nov 5, 2022 1k words
The Perfect Commit Oct 29, 2022 2k words
Datasette 0.63: The annotated release notes Oct 27, 2022 1k words
Weeknotes: DjangoCon, SQLite in Django, datasette-gunicorn Oct 23, 2022 1k words
Measuring traffic during the Half Moon Bay Pumpkin Festival Oct 19, 2022 2k words
Automating screenshots for the Datasette documentation using shot-scraper Oct 14, 2022 1k words
Weeknotes: Publishing data using Datasette Cloud Oct 12, 2022 531 words
Is the AI spell-casting metaphor harmful or helpful? Oct 5, 2022 956 words
Software engineering practices Oct 1, 2022 2k words
A tool to run caption extraction against online videos using Whisper and GitHub Issues/Actions Sep 30, 2022 1k words
Weeknotes: Datasette Cloud preview invitations Sep 30, 2022 592 words
Exploring 10m scraped Shutterstock videos used to train Meta’s Make-A-Video text-to-video model Sep 29, 2022 801 words
You can’t solve AI security problems with more AI Sep 17, 2022 1k words
Weeknotes: Datasette Lite, s3-credentials, shot-scraper, datasette-edit-templates and more Sep 16, 2022 1k words
I don’t know how to solve prompt injection Sep 16, 2022 528 words
Prompt injection attacks against GPT-3 Sep 12, 2022 1k words
Exploring the training data behind Stable Diffusion Sep 5, 2022 3k words
Notes on the SQLite DuckDB paper Sep 1, 2022 999 words
Stable Diffusion is a really big deal Aug 29, 2022 1k words
Building a searchable archive for the San Francisco Microscopical Society Aug 25, 2022 1k words
Analyzing ScotRail audio announcements with Datasette—from prototype to production Aug 21, 2022 4k words
Plugin support for Datasette Lite Aug 17, 2022 670 words
Litestream backups for Datasette Cloud (and weeknotes) Aug 11, 2022 2k words
Weeknotes: Joining the board of the Python Software Foundation Jul 30, 2022 2k words
Weeknotes: Datasette, sqlite-utils, Datasette Desktop Jul 20, 2022 949 words
sqlite-comprehend: run AWS entity extraction against content in a SQLite database Jul 11, 2022 1k words
Using GPT-3 to explain how code works Jul 9, 2022 2k words
s3-ocr: Extract text from PDF files stored in an S3 bucket Jun 30, 2022 1k words
First impressions of DALL-E, generating images from text Jun 23, 2022 1k words
Joining CSV files in your browser using Datasette Lite Jun 20, 2022 446 words
Weeknotes: datasette-socrata, and the last 10%... Jun 19, 2022 1k words
A tiny web app to create images from OpenStreetMap maps Jun 12, 2022 696 words
Twenty years of my blog Jun 12, 2022 4k words
Weeknotes: Datasette Cloud ready to preview Jun 7, 2022 321 words
How to use the GPT-3 language model Jun 5, 2022 670 words
A Datasette tutorial written by GPT-3 May 31, 2022 1k words
Weeknotes: Building Datasette Cloud on Fly Machines, Furo for documentation May 26, 2022 912 words
Bundling binary tools in Python wheels May 23, 2022 818 words
Weeknotes: Camping, a road trip and two new museums May 16, 2022 598 words
Weeknotes: Datasette Lite, nogil Python, HYTRADBOI May 6, 2022 1k words
Datasette Lite: a server-side Python web application running in a browser May 4, 2022 4k words
Automatically opening issues when tracked file content changes Apr 28, 2022 1k words
Weeknotes: Parallel SQL queries for Datasette, plus some middleware tricks Apr 27, 2022 1k words
Useful tricks with pip install URL and GitHub Apr 24, 2022 776 words
Building a Covid sewage Twitter bot (and other weeknotes) Apr 18, 2022 922 words
Pillar Point Stewards, pypi-to-sqlite, improvements to shot-scraper and appreciating datasette-dashboards Apr 8, 2022 2k words
Weeknotes: datasette-auth0 Mar 28, 2022 798 words
Datasette 0.61: The annotated release notes Mar 24, 2022 1k words
SQLite Happy Hour—a Twitter Spaces conversation about three interesting projects building on SQLite Mar 23, 2022 2k words
Weeknotes: Tildes not dashes, and the big refactor Mar 19, 2022 1k words
Scraping web pages from the command line with shot-scraper Mar 14, 2022 955 words
Instantly create a GitHub repository to take screenshots of a web page Mar 14, 2022 934 words
Weeknotes: Distracted by Playwright Mar 12, 2022 736 words
shot-scraper: automated screenshots for documentation, built on Playwright Mar 10, 2022 2k words
Why I invented “dash encoding”, a new encoding scheme for URL paths Mar 5, 2022 1k words
Weeknotes: Datasette Tutorials Feb 27, 2022 421 words
Support open source that you use by paying the maintainers to talk to your team Feb 23, 2022 637 words
Google Drive to SQLite Feb 20, 2022 1k words
Using SQLite and Datasette with Fly Volumes Feb 15, 2022 1k words
Help scraping: track changes to CLI tools by recording their --help using Git Feb 2, 2022 922 words
Writing better release notes Jan 31, 2022 864 words
Weeknotes: python_requires, documentation SEO Jan 25, 2022 1k words
Weeknotes: s3-credentials prefix and Datasette 0.60 Jan 18, 2022 1k words
Datasette 0.60: The annotated release notes Jan 14, 2022 968 words
How I build a feature Jan 12, 2022 3k words
What’s new in sqlite-utils 3.20 and 3.21: --lines, --text, --convert Jan 11, 2022 2k words
Weeknotes: Taking a break in Moss Landing Jan 4, 2022 434 words