Over the last 12 months I built my way into a conclusion I did not expect.

I started from the bottom up. I built a multi-GPU RTX 3090 box, trained and refined models, learned the operational reality of serving models locally, and then built harnesses that could actually do work. They could write code, run commands, touch files, call APIs, and mutate databases. It felt like the future.
It also felt like something was fundamentally missing from how we think about agent safety.
[Read More]




