Autoresearch
Two weeks after Andrej Karpathy released Autoresearch, here are some noteworthy projects to keep an eye on.
I discovered a better way of converting PDFs to Markdown on Apple silicon, with all mathematical formulas converted to LaTeX.
Running Qwen 3.5 27B Q4_K_M on an RTX 4090 with llama-server and Hermes.
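For reference, a minimal sketch of querying such a llama-server instance from Python through its OpenAI-compatible endpoint; the port and model name below are placeholders, not the exact setup from the post:

```python
# Query a local llama-server instance through its OpenAI-compatible API.
# Assumes llama-server is already running on localhost:8080 with a Qwen
# GGUF loaded; llama-server typically ignores the model field and uses
# whatever model it has loaded.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",  # llama-server's default port
    api_key="not-needed",  # no API key required by default
)

response = client.chat.completions.create(
    model="qwen-local",  # placeholder name
    messages=[{"role": "user", "content": "Summarize llama.cpp in one sentence."}],
    temperature=0.7,
)
print(response.choices[0].message.content)
```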
A reflection on a New York Times Magazine story about AI coding tools, software labor, and what future programmers may stop learning by hand.
I started using Linear to track tasks and their dependencies when implementing new features with multiple agents in Codex.
A short answer to a student’s question about AI agents, hardware progress, and why software creativity still matters.
A short note on why Pages’ older, more colorful chrome still feels preferable, and why staying on macOS 14.8 is a useful guardrail.
I rechecked the Days codebase with GPT 5.4 xhigh and GPT 5.4 Pro, and the pair of models found serious issues in the one aspect of the current implementation that I had asked them to focus on.
A few weeks ago, OpenAI posted a blog post on harness engineering. Yesterday, it also released a component of its workflow, called Symphony, as open source.
Prof. Donald Knuth, at age 88, said: “Shock! Shock! I learned yesterday that an open problem I’d been working on for several weeks had just been solved by Claude Opus 4.6 — Anthropic’s hybrid reasoning model that had been released three weeks earlier! It seems that I’ll have to revise my opinions about ‘generative AI’ one of these days.”
GPU-accelerated PDF-to-Markdown workflow with Marker that produces high-quality output quickly on an RTX 4090.
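As a starting point, a hedged sketch of driving Marker from Python via its marker_single CLI entry point; the exact flags vary across marker-pdf versions, so treat the options below as assumptions and check `marker_single --help` for your install:

```python
# Convert a PDF to Markdown with Marker's CLI from Python.
# Assumes the marker-pdf package is installed (pip install marker-pdf),
# which provides the marker_single entry point. The input file name and
# the --output_dir flag are assumptions; flags differ between versions.
import subprocess

subprocess.run(
    ["marker_single", "paper.pdf", "--output_dir", "out"],
    check=True,  # raise if the conversion fails
)
```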
An email triage system for Fastmail that auto-sorts messages by priority and drafts replies for high-priority emails.
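The fetch side of such a system can be sketched against Fastmail's JMAP API. This is a minimal, illustrative version that only lists unread message ids; the priority scoring and reply drafting are omitted, and FASTMAIL_TOKEN is assumed to hold an API token:

```python
# Fetch recent unread messages from Fastmail over JMAP, the first step
# of a triage pipeline. The scoring and drafting steps are omitted.
import os
import requests

TOKEN = os.environ["FASTMAIL_TOKEN"]
HEADERS = {"Authorization": f"Bearer {TOKEN}"}

# The session object tells us the API endpoint and our mail account id.
session = requests.get(
    "https://api.fastmail.com/jmap/session", headers=HEADERS
).json()
account_id = session["primaryAccounts"]["urn:ietf:params:jmap:mail"]

# Ask for the ids of unread emails, newest first.
body = {
    "using": ["urn:ietf:params:jmap:core", "urn:ietf:params:jmap:mail"],
    "methodCalls": [
        ["Email/query", {
            "accountId": account_id,
            "filter": {"notKeyword": "$seen"},
            "sort": [{"property": "receivedAt", "isAscending": False}],
            "limit": 20,
        }, "0"],
    ],
}
result = requests.post(session["apiUrl"], headers=HEADERS, json=body).json()
print(result["methodResponses"][0][1]["ids"])
```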
I tried Simon Willison’s prompt to build a linear walkthrough of Nextmini. Codex unsurprisingly launched several subagents as scouts to explore different parts of the codebase.
I have read Simon Willison’s Agentic Engineering Patterns, and red/green TDD, which I had not previously heard of, seems so effective that I must give it a try.
I wrote my own extension for the Pi coding agent to allow me to start multiple agents that collaborate with one another by sending and receiving messages.
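Pi’s actual extension API is not reproduced here, but the core of the extension is just a message bus between agent tasks. A minimal sketch of the idea, with entirely hypothetical names:

```python
# A minimal message bus that lets several agent tasks exchange messages.
# This sketches the idea behind the extension; it does not use Pi's real
# extension API, and all names here are hypothetical.
import asyncio

class Mailroom:
    """Routes messages between named agents via per-agent queues."""
    def __init__(self):
        self.inboxes: dict[str, asyncio.Queue] = {}

    def register(self, name: str) -> None:
        self.inboxes[name] = asyncio.Queue()

    async def send(self, to: str, sender: str, text: str) -> None:
        await self.inboxes[to].put((sender, text))

async def agent(name: str, room: Mailroom, peer: str, kickoff: bool) -> None:
    inbox = room.inboxes[name]
    if kickoff:
        await room.send(peer, name, "plan drafted, please review")
    sender, text = await inbox.get()
    print(f"{name} received from {sender}: {text}")
    if not kickoff:
        await room.send(peer, name, "review done, looks good")

async def main() -> None:
    room = Mailroom()
    room.register("planner")
    room.register("reviewer")
    await asyncio.gather(
        agent("planner", room, "reviewer", kickoff=True),
        agent("reviewer", room, "planner", kickoff=False),
    )

asyncio.run(main())
```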
More of us are replacing Netflix with Codex and spinning up a new agentic session before falling asleep.
I have been looking for a way to get Codex to draw figures reasonably well. I think I finally found a way.
It is surprisingly straightforward to migrate a website from Next.js to TanStack Start.
A handy AGENTS.md addition that makes sure Codex writes better plans and uses subagents proactively.
The iOS Codex workflow has been streamlined again: now with the Moshi iOS app to SSH into my computer over the Tailscale network. Also, GPT 5.3 Codex Spark is super fast.
Electric’s Configurancy argues that when code is cheap, specs and oracle testing matter more than unit tests alone. And something big is happening.
A quick iOS Codex access tip with Agentboard, plus a strong Rust-over-Python essay for agentic programming.
I redesigned my personal website, featuring not only a simple, minimalist design, but also a streamlined process of writing and publishing new entries via CLI tools.
tiny-llm is exactly what I wished for. It also contains links to two existing PyTorch-related machine learning courses from Carnegie Mellon University.
Arc — My new browser of choice. I love the fact that bookmarks are organized on the side panel, rather than clustered at the top of the window.
Eleventy appears to be a pretty simple static website generator that is worth exploring. A competitor to Hugo.
How I use LLMs by Andrej Karpathy — A must watch.
Panasonic S1R II — With the Sigma 28-105 f/2.8, this would be my dream camera. It is just slightly heavier than my Panasonic S5 IIx (1.57 lb vs. 1.45 lb body only).
The Ultra-Scale Playbook: Training LLMs on GPU Clusters — Amazing, and finally we have a 100-page open-source online book on how models are trained with multiple GPUs.
Crafted — What a great looking set of open-source, hand-crafted UI templates based on shadcn/ui!
Better Auth — A new authentication library that is feature-complete and easy to use, in contrast to Lucia, which advocates a copy-and-paste approach.
Andriy Burkov’s minimalist implementation of GRPO from scratch — Rather than using a library such as Hugging Face’s TRL.
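The core of any from-scratch GRPO implementation is the group-relative advantage: sample a group of completions per prompt, then normalize each completion’s reward against its group. A minimal sketch of that formula (the standard DeepSeekMath formulation, not Burkov’s exact code):

```python
# Group-relative advantages as used in GRPO: each completion's reward is
# normalized against the other completions sampled for the same prompt.
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-4) -> torch.Tensor:
    """rewards: (num_prompts, group_size) scalar rewards per completion."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)  # eps guards against zero variance

# Example: 2 prompts, a group of 4 sampled completions each.
rewards = torch.tensor([[1.0, 0.0, 0.0, 1.0],
                        [0.0, 0.0, 0.0, 1.0]])
print(grpo_advantages(rewards))
```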
Transformer Lab — A free, open-source LLM workspace that prepares a custom dataset and fine-tunes a model using MLX on the Mac.
Lucia — The authentication library has adopted a copy-and-paste design, just like shadcn/ui, rather than shipping as a traditional library.
From 0 to Production — The Modern React Tutorial — Theo released it last year, and I always wanted to learn from this marathon tutorial.
Unsloth.ai’s GRPO — It seems that Unsloth’s implementation of GRPO uses less GPU memory, and it supports both QLoRA and LoRA.
DOGE: Make AI Conferences Great Again — Zeyuan (Allen) Zhu wrote a very interesting piece on using LLMs as arbitrators in the reviewer-author discussions.
Deep Dive into LLMs like ChatGPT — Andrej Karpathy continues his top-notch hours-long education on large language models with a new episode today.
GRPO will soon be added to Apple MLX — The PR now works, using about 32 GB of memory when training Qwen2.5-0.5B.
Another simple DeepSeek R1 reproduction — This reproduction of GRPO has one distinct feature: it is exceedingly simple and quite elegant.
Fourth attempt at reproducing DeepSeek R1’s GRPO on small models — The fourth time is the charm. I can successfully run this repo without activating vLLM.
Lambda Labs hosts DeepSeek R1 — the dashboard is simple, nice to look at, free to use, and pretty fast when generating tokens. Overall, an excellent user experience.
How to fine-tune open LLMs in 2025 with Hugging Face — Philipp Schmid, a Technical Lead at Hugging Face, posted this article on fine-tuning LLMs using Hugging Face.
On DeepSeek and Export Controls — Dario Amodei, Anthropic’s CEO, wrote a fairly long editorial on DeepSeek.
The Illustrated DeepSeek-R1 — Jay Alammar, the author of O’Reilly’s Hands-On Large Language Models, wrote a short piece explaining DeepSeek R1 at a high level.
Qwen 2.5 7B 1M — I have just tried Qwen’s latest local model, the 7B 1M, locally in LM Studio 0.3.8 (Build 4). I loaded an entire PhD thesis into the model, and LM Studio gleefully chose inject-full-content as its content injection strategy.
Although it’s quite long, The Short Case for Nvidia Stock is a fascinating read. Also, agents are not happening yet.
Open-R1 — Hugging Face started to reproduce DeepSeek R1 in the open, and discussed the R1 technical report in a recorded YouTube video.
This website is a space for storing — and sharing, if anyone cares — some of the websites, code repositories, and tweets that I have read.