joyfl — v0.4: Python API and Package Forgot to post here about the previous release multiple weeks back! Working on hobby projects has changed a lot for me since coding agents. Prototypes happen faster, more code "written", reviewed, thrown away. Most time is spent on
@alexjc
-

joyfl v0.5: Types, Safety and Testing Framework
By
–
joyfl — v0.5: Types, Safety & Testing Major changes in this last release with a particular focus on types (ADT-lite), validation, and a test framework. The language is still dynamic, but I expect more & more checking will be done statically… Now 111 tests in the langspec!
-

Minor AI Misalignments Create Daily Workflow Friction
By
–
The misalignments in the little details are actually the most jarring in daily work, rather than the high-end failures on major problems. A thousand cuts like this one…
-
Timeout Bug: AI Resubmits Old Messages Instead New
By
–
100% confirming this one. If there's a timeout due to my internet or the cloud provider, and instead of clicking "Try Again" you just resubmit the prior answer with ENTER, then it responds to the old message. I caught it by sending it unique codes in each message, multiple times.
-

AI Code Generation Benchmarks Miss Human Review Bottleneck
By
–
These kinds of benchmarks are misleading without a joint metric showing much work was necessary by humans after the fact. How much time to clean up that 2h42m of code? Style and architecture need to make sense, not just passing tests. That's the bottleneck now: reviewing!
-
Off-by-One Errors in Long AI Model Conversations
By
–
Thanks! In longer chats I'm convinced models respond to messages 1 in the past, maybe due to timeout/revert earlier in the conversation. I have been sending them message codes they have to echo back, and sometimes comes back 1 delayed, response content is also off-by-one. Could
-
Software Complexity: The Case for Removing Features
By
–
Well the problem you have now is that there are too many features, they conflict together, files get mixed up with the various reviewing tools, options, now worktrees applying stuff back. I'd actually remove features, make only a core set of things that work reliably together.
-

LLM Performance Degradation at 100K Tokens Context
By
–
People working on basic code and reset their Agent chats every 4-5 replies I envy you. Having to work on deep context design work and at about 100k tokens, LLMs start to get lazy / confused. I resorted to giving them codes they have to echo back. They often seem to think
-
Cursor patch reviewing issues with Agent overlapping features
By
–
I will say that Cursor's handling of patch reviewing always feels on the edge of becoming non-functional; so many overlapping features shipped quickly — e.g. if you do things like CMD+K in a file that already has changes by an Agent then it can also blow up…
-
AI Tool Performance Issues and Server-Side Optimization Solutions
By
–
Shows signs of brilliance, faster — maybe it's a problem on the tool side or a simple fix on the serving.
