Hiring someone full-time for this.
@steipete
-
Tweaks to Make AI Agents Do Right Thing
By
–
Making some tweaks to make it easier for agents to do the right thing!
-
Low error rate achieved in AI system review
By
–
I reviewed a few 100 and error rate is very very low.
-
Thread Optimization: Docker Testing Requirements Complete
By
–
oh I have a thread crunching, just needed more docker tests
-
GPT 5.5 emerges as most reliable coding model
By
–
pretty sure, maybe just not as reliable. GPT 5.5 is the most reliable coding model.
-
Claude CLI as AI model with terminology adjustments
By
–
claude-cli as model, yeah. you might need to rename a few words tho
-
Swap Codex with Claude: One-Line Code Change
By
–
you can swap codex with claude with one line change
-
GPT 5.5 Reviewed: Impressive Performance Across Hundred Tests
By
–
I reviewed a few 100 and they all seemed correct, GPT 5.5 is a goat.
-
README as Dashboard: Innovative Monitoring Approach
By
–
My favorite part: instead of a dashboard it just updates the README as it works. Readme is the new dashboard.