Forestwalk: A Series

Testing the Untestable

October 31, 2024

The four phases of automated evals for LLM-powered features.

I gave a talk version of this article at the first Infer meetup earlier this month. Let’s say you want to build an LLM-powered app. With a modern model and common-sense prompting, it’s easy to get a demo going with reasonable results. Of course, before going live, you test various...

9 min read →

Starting Forestwalk

August 16, 2024

A wild startup appears.

Last month, I started full-time on a new startup. It’s early days, but we’re having a lot of fun. A startup, fundamentally, is a search for a repeatable, scalable business model. You rapidly try things, run experiments, learn, and iterate on your theories about how to build a useful product that...

2 min read →