Autoresearch
Two weeks since Andrej Karpathy released Autoresearch, here are some noteworthy projects to keep an eye on.
Two weeks since Andrej Karpathy released Autoresearch, here are some noteworthy projects to keep an eye on.
I discovered a better way of converting PDFs to Markdowns, with all mathematical formulas converted to LaTeX, on Apple silicon.
Running Qwen 3.5 27B Q4_K_M on an RTX 4090 with llama-server and Hermes.
A reflection on a New York Times Magazine story about AI coding tools, software labor, and what future programmers may stop learning by hand.
I started to use Linear to track the tasks and their dependencies when I implemented new features with multiple agents in Codex.
A short answer to a student’s question about AI agents, hardware progress, and why software creativity still matters.
A short note on why Pages’ older, more colorful chrome still feels preferable, and why staying on macOS 14.8 is a useful guardrail.
I rechecked the Days codebase with GPT 5.4 xhigh and GPT 5.4 Pro, and the pair of models has found serious issues in one aspect that I asked it to focus on in the current implementation.
A few weeks ago, OpenAI posted a blog post on harness engineering. Yesterday, it also released a component of its workflow as open-source, called Symphony.
Prof. Donald Knuth, at age 88, said: “Shock! Shock! I learned yesterday that an open problem I’d been working on for several weeks had just been solved by Claude Opus 4.6 — Anthropic’s hybrid reasoning model that had been released three weeks earlier! It seems that I’ll have to revise my opinions about “generative AI” one of these days.”.
GPU-accelerated PDF-to-Markdown workflow with Marker that produces high-quality output quickly on an RTX 4090.
An email triage system for Fastmail that auto-sorts messages by priority and drafts replies for high-priority emails.
I tried Simon Willison’s prompt to build a linear walkthrough of Nextmini. Codex unsurprisingly launched several subagents as scouts to explore different parts of the codebase.
I have read Simon Willison’s Agentic Engineering Patterns, and red/green TDD, which I have not previously heard of, seems to be so effective that I must give it a try.
I wrote my own extension for the Pi coding agent to allow me to start multiple agents that collaborate with one another by sending and receiving messages.
More of us are replacing Netflix with Codex and spinning up a new agentic session before falling asleep.
I have been looking for a way to get Codex to draw figures reasonably well. I think I finally found a way.
It is surprisingly straightforward to migrate a website from Next.js to TanStack Start.
A handy AGENTS.md addition that makes sure that codex writes better plans and uses subagents proactively.
The iOS codex workflow has been streamlined again: now with the Moshi iOS app to ssh into my computer via the Tailscale network. Also, GPT 5.3 Codex Spark is super fast.
Electric’s Configurancy argues that when code is cheap, specs and oracle testing matter more than unit tests alone. And something big is happening.
A quick iOS Codex access tip with Agentboard, plus a strong Rust-over-Python essay for agentic programming.
I redesigned my personal website, featuring not only a simple, minimalist design, but also a streamlined process of writing and publishing new entries via CLI tools.
Use ↑/↓ to navigate results, Enter to open, Esc to close.
Type to search posts.
No matching posts found.