I Went All-In on AI. The MIT Study Is Right.

AutistoMephisto@lemmy.world · edit-2 6 months ago

I Went All-In on AI. The MIT Study Is Right.

Agent641@lemmy.world · 6 months ago

I cannot understand and debug code written by AI. But I also cannot understand and debug code written by me.

Let’s just call it even.

I Cast Fist@programming.dev · 6 months ago

At least you can blame yourself for your own shitty code, which hopefully will never attempt to “accidentally” erase the entire project

PoliteDudeInTheMood@lemmy.ca · 6 months ago

I don’t know how that happens, I regularly use Claude code and it’s constantly reminding me to push to git.

MangoCats@feddit.it · 6 months ago

As an experiment I asked Claude to manage my git commits, it wrote the messages, kept a log, archived excess documentation, and worked really well for about 2 weeks. Then, as the project got larger, the commit process was taking longer and longer to execute. I finally pulled the plug when the automated commit process - which had performed flawlessly for dozens of commits and archives, accidentally irretrievably lost a batch of work - messed up the archive process and deleted it without archiving it first, didn’t commit it either.

AI/LLM workflows are non-deterministic. This means: they make mistakes. If you want something reliable, scalable, repeatable, have the AI write you code to do it deterministically as a tool, not as a workflow. Of course, deterministic tools can’t do things like summarize the content of a commit.

PoliteDudeInTheMood@lemmy.ca · 6 months ago

The longer the project the more stupid Claude gets. I’ve seen it both in chat, and in Claude code, and Claude explains the situation quite well:

Increased cognitive load: Longer projects have more state to track - more files, more interconnected components, more conventions established earlier. Each decision I make needs to consider all of this, and the probability of overlooking something increases with complexity.

Git specifically: For git operations, the problem is even worse because git state is highly sequential - each operation depends on the exact current state of the repository. If I lose track of what branch we’re on, what’s been committed, or what files exist, I’ll give incorrect commands.

Anything I do with Claude. I will split into different chats, I won’t give it access to git but I will provide it an updated repository via Repomix. I get much better results because of that.

MangoCats@feddit.it · 6 months ago

I also cannot understand and debug code written by me.

So much this. I look back at stuff I wrote 10 years ago and shake my head, console myself that “we were on a really aggressive schedule.” At least in my mind I can do better, in practice the stuff has got to ship eventually and what ships is almost never what I would call perfect, or even ideal.