My first use of Claude this week, for code reviews only(since no LLM can be trusted to write a user story or test suite), had it gaslight me.
It marked down my code for using a specific practice to make some xml safer and easier to read.
When I tried things its way, it wanted me to change it back.
I’ve used Claude and Codex, and while both are based on untenable economics, I can at least attest that my use of Codex has yielded some productive results. Claude, so far, has delivered fuck all that’s useful to me.
I have found the opposite. Codex spits back mostly useless code that is twice the length it needs to be with a bunch of unessesary stuff and Claude is the only thing I get useful output from.
I can’t imagine paying for AI when the open source tools have made it so easy to set up a model locally.


