Using AI to write better code more slowly

(nolanlawson.com)

250 points | by signa11 4 hours ago

30 comments

bottlepalm 3 hours ago
I've hit this point with AI where it's not a simple process, but a long drawn out back and forth.
I'll use AI to design the implementation of a medium sized, cross cutting feature. Review all the details, maybe iterate on just that. Then implement with Claude 4.7 Max - which runs slower, but does a better job. Then review the implementation, then have Codex GPT 5.5 xhigh fast review it - which almost always finds corner cases. Have Claude fix those - Claude is better at writing intuitive maintainable code versus Codex overengineered/shortcut filled code. (Codex is better at finding/fixing bugs and doing reviews - it's annoyingly pedantic)
Then repeat with fresh Claude/Codex instances having them both review the current staged changes and getting feedback, handling the feedback. Then covering it in tests. I mean overall I still implement the feature faster than coding it manually, but I spend a majority of the time going back and forth with reviews, handling corner cases and at the finish end up with what I feel a really solid implementation of whatever feature I'm working on. The v1 feature feels more like a v3 given the amount of iteration it already went through.
[-]
- aomix 1 hour ago
  Talking the problem to death with the AI before implementation is a nice zone for me. I feel productive, get good results out of the AI, and still largely understand the code. That’s the part of the AI revolution that I feel has made me a better engineer because I argue about design and architecture all day with a robot.
  [-]
  - mikepurvis 49 minutes ago
    Despite the cynical sibling reply, I also feel like there's real value here. Contrary to the meme, I don't think Claude just tells me I'm brilliant, but really does push back on directions that are unproductive, identify when a part is overcomplicated or a dependency has become redundant, etc. Those are important things to have at least a sightline on before getting too deep into the code, even in a world where a lot of code can be created basically for free.
  - bottlepalm 8 minutes ago
    One strategy I use in the planning phase is even when I know how I'd implement the solution, I ask the Claude/Codex how they would solve the problem or implement the feature without giving them any clues - and then compare their solutions to my own. Often I am pleasantly surprised by alternative ways of doing things and ideas that we integrate into the final design.
  - qsera 1 hour ago
    >I argue about design and architecture all day with a robot.
    You will outgrow it at some point.
    [-]
    - bartread 45 minutes ago
      I think this is OK though. We can still micromanage[0] the code generation part for a useful productivity boost, I think.
      [0] At least, in my experience, "micromanaging" the AI is what gives me the best results. Iterating on the initial design, then iterating on the plan, then reviewing the proposed code changes (including tests), then getting an independent code review from another LLM, etc. If you give an LLM too much latitude that's when the really shitty code and ill-considered breaking changes/obliteration of existing functionality starts to creep in.
    - Terretta 48 minutes ago
      Or learn something at some point.
      https://en.wikipedia.org/wiki/Rubber_duck_debugging
    - busterarm 49 minutes ago
      nullsanity's comment is dead and downvoted to oblivion but also incredibly underrated.
      I was more annoyed than anything that I didn't hit this moment until my 40s.
      Except it's not just reddit (I quit reddit 15 years ago). It's the whole internet.
    - nullsanity 1 hour ago
      Its like that phase people go through where they argue with morons on reddit, and then one day grow up and realize that most of these people are unemployed/underemployed terminally online nobodies aren't ever going to learn anything, and even if they did it wouldn't impact the world since they were just some below average hobbyist anyway and aren't in charge of anything more important than a box of paperclips.
- scosman 3 hours ago
  yes exactly. Too many people ask AI to one-shot complex tasks, and wonder it behaves like a junior asked to rush something.
  I have my own skill: 5 rounds of research/planning/test-planning. Interactive with me in loop for all important decisions. Starts with high level shape, then details. Planning can take 2-3 days of my time, then the implementation agent can take many hours (Opus 4.7). It splits the implementation across many phases/commits, each with its own code-review fix loop. Deep code review at the end can take another hour or two. It opens a PR, Gemini reviews, it reads out and resolves those issues.
  Projects still take days or weeks, but 5x faster than doing it all myself.
  Edit: the skill - https://github.com/scosman/vibe-crafting
  [-]
  - dawnerd 1 hour ago
    Even fully planned it’s still no better than a junior dev. You’re leaving out how much back and forth you have the ai do on itself, which you’d have on a junior dev too. In the end does it matter if it’s giving you what you want? Guess not really. But let’s not act like it’s crazy good when you’re still doing a lot of rounds of revisions on something an experienced dev would know to do right the first time.
  - deadbabe 2 hours ago
    Does the 5x faster including shipping? Or just the work part?
    IMO if you are not shipping out faster then the faster work gains are meaningless.
    If you are shipping faster, you’re probably picking up more work and shipping everything too fast leading to burnout.
    [-]
    - mhluongo 2 hours ago
      If you're not shipping faster, it's meaningless, and if you are, it's also bad?
    - scosman 1 hour ago
      yup.
- boringstack 11 minutes ago
  You've essentially promoted yourself from coder to engineering manager, trading syntax fatigue for the mental marathon of refereeing specialized AI developers to ship v3-quality code on the first try.
  [-]
  - bottlepalm 3 minutes ago
    Indeed. AI is bumping everyone up to manager level, and having dealt with long PR feedback cycles with humans for years - I don't mind the promotion. Also shipping a v3 is so much nicer than shipping a v1 and dealing the all the corner cases in production.
    Before AI, myself and everyone else I knew was drowning in tech debt. And now with AI we are treading water.
- dawnerd 1 hour ago
  When I use ai to code this is pretty close to my workflow too but I find it ends up taking at best just as long as if I were to write the code myself. If m some cases I’ve thrown away what the ai has done and just done it myself. I think that’s just a skill people need to learn - at a certain point you have to cut your losses. I’ve seen some coworkers argue back and forth with an llm trying to get it to do something. Especially true on simpler changes.
- democracy 1 hour ago
  Similar approach, but I also go a step further with some basic manual architecture/high level contract/stubs setups, just to keep it consistent with other systems (and easier reading as well).
- chrisweekly 2 hours ago
  You helpfully cite Claude w/ Opus 4.7 max and Codex w/ GPT5.5 xhigh fast, but what "AI" do you use for the initial design?
  [-]
  - bottlepalm 2 hours ago
    Claude primarily, though will sometimes get a second opinion from Codex.
- rootnod3 3 hours ago
  And then Anthropic has an outage and you what...have a coffee break until then? All that time babysitting the AIs just to be a little faster but probably with less knowledge/control over what they did?
  [-]
  - afavour 3 hours ago
    I don’t think you’re quite getting what OP is describing. I work in a similar way… I am aware of all the code being written. If Claude had an outage I could write it myself. It would just take longer.
    You say “all that time” babysitting AIs but in my experience it isn’t that much time, if anything the back and forth at the planning stages is more productive than when I’m doing it by myself because I’m being asked questions and having to think things through from different angles.
    [-]
    - halfcat 36 minutes ago
      How do you stay aware of all code being written?
      Maybe it’s just me, but I’ve never understood how one understands from reading code. Yes you can understand what that code does, but not why it was done that way instead of a different way. In the end I only understand it deeply if I end up writing it. Chatting through it is helpful to me, but having AI crank out code loses all of that context pretty quickly.
      I’m not disagreeing. Just curious how you think about this, and if there are key parts of your process that help you stay contexted in.
  - efitz 2 hours ago
    If you only have one AI window open, you’re doing it wrong. You task swap to another window/agent, get it working on something, rinse and repeat. I can keep 4 busy most of the time. When I task swap I also check in on what the other agents are doing to make sure they’re on track, not blocked and not struggling.
    [-]
    - well_ackshually 2 hours ago
      congratulations on your soon to be coming burnout.
      Keeping that many tasks in parallel, running all the time will kill you.
      [-]
      - speff 1 hour ago
        I suppose it depends how hands-off the tasks are - I max out at 2 parallel sessions working on different parts and it's fairly exhausting once done. I can see the number of parallel work increasing if there's a good dev/test loop. But at $WORK, that's not usually an option.
        [-]
        rootnod3 1 hour ago
        So, hands-off meaning "just let the AI cook and don't check it"?
        Either you follow everything it does, revise the plans, do the code review, manual adjustments, etc, or you run sessions in parallel, not being that attentive and constantly context-switch (also resulting in less attention I guess).
        I fail to see the benefits honestly.
      - DonHopkins 1 hour ago
        It's great to work from home so you can take nice little micro naps while code's generating, reviewing, building, and deploying.
        A calm attentive alternative of vibe coding: restful coding.
        It's much easier to read and review code after a refreshing cat nap, especially with a real cat.
        Too bad that's not usually acceptable to do that in the office. It should be! Slacking off by sword fighting all day is too exhausting.
        https://xkcd.com/303/
  - jerezzprime 58 minutes ago
    Yes get a coffee. Being able to execute 5 things at once is amazing, but it's a recipe for burnout. We have to be more careful and explicit about how we spend our time, and that means more explicit time away. If this thing makes you 10x more effective (I truly believe it can), you can afford to spend 20% less time behind the desk and more time doing whatever it is that actually makes you happy. Hopefully your manager understands that calculus.
  - bottlepalm 3 hours ago
    As the AI is working, I am working - reviewing, regression testing, thinking about if the currently implementation is too complex and how to simplify it etc.. I totally review and understand everything the AI is generating and often push back, have it re-do something, or do it myself. In the end I feel like the quality of the work is at a v3 level in the time it took to do a v1. The productivity and quality increase is real.
  - gitaarik 45 minutes ago
    What do you do when your search engine goes down?
  - comradesmith 3 hours ago
    I’ll deal with that problem when it happens
  - refactor_master 2 hours ago
    We're already having coffee breaks when AWS and CloudFlare are down. What's another break in the mix? If anything, we might be lucky that they're down at the same time, so we can consolidate the breaks.
  - raven12345 1 hour ago
    You can have multiple tasks running
  - mohamedkoubaa 2 hours ago
    And then solar radiation permanently knocks out the electrical grid and you what... have coffee break until society finds a new equilibrium?
  - 8note 2 hours ago
    why not?
    then demand some lack-of-uptime compensation for a lack of uptime
  - glhaynes 2 hours ago
    "All that time babysitting the AIs just to be a little faster" doesn't seem like an accurate/unbiased portrayal of what they said: "The v1 feature feels more like a v3 given the amount of iteration it already went through."
  - busterarm 46 minutes ago
    Company I'm familiar with that went all in on Codex ran out of tokens for a week and wouldn't increase their spend.
    I pretty significant number of their engineers flat out refused to work. Like publicly said so. "Increase our plan or I'm taking the week off."
  - wahnfrieden 2 hours ago
    Codex has 99.98% uptime
  - soupspaces 2 hours ago
    In Soviet Russia, the AI babysits you https://en.wikipedia.org/wiki/In_Soviet_Russia
- sunsetSamurai 1 hour ago
  maybe it's dumb question, but how do you feed the results of one agent to another? do you copy and paste manually? or how do you do it programmatically?
  [-]
  - bottlepalm 44 minutes ago
    Yea I'll take the review feedback from one, validate it, and then copy/paste it into the other session saying like, "hey I got this feedback, what do you think?" So I'm not even telling the other AI the feedback is valid, I want it to independently validate it. Often the feedback is not like a bug, but a red flag, design consideration, or trade off.
    Often depending on how complex the feedback, I'll do it one at a time addressing each one individually. And after the feedback is addressed, I'll go back to the AI that generated the feedback and say like, "I handled 4/5 items you found, can you double check."
    It's similar to handling PR feedback, where you do it, validate it, but then still have to submit it for peer review.
  - kevinsync 1 hour ago
    When I pair Claude and Codex, I use claude-co-commands [0] to drive from Claude and talk to Codex via MCP. Lately I've found Codex has been far more consistent for my specific projects, so I've just been almost entirely inside Codex. YMMV
    [0] https://github.com/SnakeO/claude-co-commands
  - adrianN 1 hour ago
    Having the agents write their plans into text files and iterating on those works reasonably well.
  - DonHopkins 1 hour ago
    Just switch models whenever you want with the menu at the bottom of the chat window in Cursor.
    And maybe don't use tools that lock you into one model?
- newsicanuse 40 minutes ago
  At this point one might as well code by themselves
- vessenes 3 hours ago
  I have a very similar workflow, and experience similar temperaments from the agents. I also find anecdotally that they are moderately competitive - you get very different attention from them when you say "competitor X wrote this - please find all bugs" than when you say "you just wrote this - please find all bugs".
  [-]
  - bottlepalm 3 hours ago
    Hah yea I just told them I wrote it, or I reviewed it. I don't want to get the AI's in a pissing contest with each other because they will get distracted and try to show off.
- nomel 1 hour ago
  I've noticed the following really helps (most important at end):
  1. Have claude form the plan and converse with a simple "Note any concerns with this plan" type plan-critic agent.
  2. Let it run.
  3. After (with everything in context) have it make a future_recommendations.md.
  4. Have it make a plan.md to implement those future recommendations, conversing with the plan critic..
  5. Clear context. Repeat with 1. Do this loop a few times, with some feedback from actual review thrown in.
  But, most importantly, because Claude will aggressively try to maintain code "as is", and happily build on it's previous crap, while preferring to hand roll implementations of everything, add something like this to memories/directives:
  * When evaluating designs, default to "pull in the library" over "hand-roll it." Hand-rolling is much worse than a dependency.
  * "Precedent" / "matches house style" / "reuses existing pattern" / "consistent with what we already do" are not valid engineering arguments.
  * This project is still in the development stage with no real deployments. Mitigation costs and existing precedence are not a concern.
  With these, in the last week that I've started using them (after inspecting the insane justifications for leaving crap design decisions in the plans), Claude went from junior level slop that required more oversight than it was worth to something very reasonable, using standard libraries, requiring nudges for architecture rather than pure "wtf!?".
  I think they've fine tuned heavily towards "don't rewrite the codebase" tuning, which completely rational from multiple perspectives, but also not appropriate for new code.
  I do enjoy a considerable daily token allowance, so this may not apply to everyone.
- i_love_retros 1 hour ago
  This all sounds insane. If it requires so much back and forth with the AI why on earth wouldn't you just write the code yourself? At least then you build the mental model of the code and keep your brain healthy. Reading the comments in here about all the hoops people are having to jump through just to do the same thing they were doing a year ago without AI... and spending a fortune to do it! I think you've all got AI psychosis.
  [-]
  - bottlepalm 22 minutes ago
    I would never imagine this is where programming would be five years ago, but at the end of day having the AI write the code is easier, faster, and results in higher quality.
    The mental model is still in my head, my brain is overloaded, but only from the amount of code reviews - like I said, I'm building v3 of a feature in the time it takes to build v1, but I am in a way doing 3x the code reviews going back and forth. That's the fall out of the iteration speed enabled by AI.
    Between submitting PRs, getting feedback, iterating, re-submitting, repeat - there used to be breathing room. Now it's all compressed into an afternoon. Productivity is through the roof, but it can be draining.
    [-]
    - habinero 11 minutes ago
      You're not on v3 lol. You're on v1 that you had to redo three times.
      If the feature isn't released, it's not a new version.
  - democracy 1 hour ago
    You can be right but quite often it helps keeping focus on the forrest rather then getting lost in the trees - at least for me. Boilerplate steals a lot of attention, focus and can just be mentally exhausting.
  - habinero 14 minutes ago
    I honestly don't get it, either. Most of them just flat out can't code at all, but for the ones who can, the only explanation I got is it feels like productivity.
    I will say, it does help me get over procrastination lol. I get annoyed by the robot doing dumb shit and finish it myself.
- skydhash 2 hours ago
  That sounds too much like three weeks of work saving you three hours of planning.
  In my experience, software engineering is a matter of knowledge. Understanding it and then coming up with a solution. The latter is a flash of insight that comes mostly from experience. Then you gather more information to flesh it out, or brainstorm it with your colleagues.
  What you're describing sounds more like a ritual of doing busy work than anything practical. Because tasks vary so much. A feature may be huge, but you take care of it in a day with copy pasting because you already have all the building blocks in other files. And something may be twenty lines of code, but you spent the whole week sweating on it (concurrency stuff maybe). Those ritualistic workflows sounds more like someone imagining software development than actually doing it.
  [-]
  - bottlepalm 15 minutes ago
    A lot of people say you need to go through at least three versions of something before it is mature - and v3 is not something you can design upfront. You need to see v1 both in code, and at runtime. Use it, get the feedback, and iterate. This is where AI tightens that loop immensely.
    Lost you in the last paragraph - features are not "copy pasting because you already have all the building blocks" and "something may be twenty lines of code". Mid sized features often mean tearing up many layers of code across the stack to add in some sort of new capability. Tearing up existing code means there are all sorts of add-on considerations in addition to feature you are working on.
    [-]
    - habinero 7 minutes ago
      > Mid sized features often mean tearing up many layers of code across the stack to add in some sort of new capability
      What? No, it shouldn't. I've worked on a lot of codebases and if you have to do this, something is very, very wrong.
- DonHopkins 1 hour ago
  Low frequency defensive long drawn out back and forth bullet dodging vibe coding should be called "serpentine coding".
  The In-Laws (1979): Getting off the plane in Tijuara:
  https://www.youtube.com/watch?v=A2_w-QCWpS0
  [-]
  - bottlepalm 35 minutes ago
    Heh it feels like that in a way, and the more complex the feature, the more endless the back and forth reviews can be - there seems to be always some feedback, so you need to decide when to be done with it and commit. You can easily get into review paralysis.
crabmusket 3 hours ago
The linked article about getting LLMs to critique each others' code review[1], the magpie tool[2], and also this recent article from Cloudflare about their code review stack[3] are all quite compelling.
I'm fairly AI-skeptical not on grounds of "do they work" but "are they good for the world". I feel that getting AIs to do this kind of review work is a rare case that doesn't outsource thinking and deskill workers. It doesn't trigger the same alarm bells as having the AI write the code (including having the AI fix the issues it discovers). That's setting aside environmental and other ethical concerns, which are still significant to me.
I have been impressed by the recent quality of AI code reviews*, but the experience of interacting with 3 separate AI reviewers via GitHub PRs is pretty terrible. Having more local-oriented and jj/rebase-aware review rounds would be great.
*context: fairly large PHP/Laravel backend and Vue frontend
[1]: https://milvus.io/blog/ai-code-review-gets-better-when-model...
[2]: https://github.com/liliu-z/magpie
[3]: https://blog.cloudflare.com/ai-code-review/
justinlivi 3 hours ago
I find myself spending on average more time in LLM review/resolution loops than it would take for me to write the code by hand. Partially because once I'm in the flow I write very very quickly and the code pours out sometimes faster than I can write. But also because the LLM code on the first few tries is generally really really bad. What I find interesting though is that spending the time to personally review and direct the LLM through several iterations of review and revision on average results in higher quality code written in about the same time as I would have written it. This might be particular to me, but seeing several interations of someone else's code helps me better understand holistically my objective as opposed to whatever happens to come out of my flow-state consciousness.
[-]
- zik 2 hours ago
  If your AI is writing bad code then you need to change your AI. No current high-end AI should be producing bad code.
  [-]
  - singingtoday 29 minutes ago
    The tool is important but then so it's the way you use it. I've seen small LLMs produce good code and frontier LLMs produce poor quality code. Depending on context..
  - newsicanuse 39 minutes ago
    Source trust me bro.
  - th0ma5 1 hour ago
    [dead]
sreekanth850 4 minutes ago
100% agree after building a production ready platform ground up. it took 3-4 months but without AI i would never had been done with a team of 3. one thing to note that AI is weak at Front end. So, we did the entire front end without AI.
TACIXAT 2 hours ago
This article doesn't address writing code with AI, just code review. My issue with agentic coding is that I make numerous micro-architectural decisions while programming. I almost never have a full spec up front and develop one as I consider what I am writing.
When using Claude Code or Codex, that is all gone. Claude Code is extremely eager to reach the end goal to the point that it feels like a fever dream to write code with it. In the end, I have low confidence about edge cases and fit into the project's architectural and design goals.
On top of that, I enjoy programming, reverse engineering, etc. and I feel that the LLMs, while able to solve some problems or deliver some features, take that fun away. I'm trying really hard to find a workflow with them that I'm confident in, but I fear that workflow is just chat, search, and being a rubber duck for my thoughts.
smusamashah 3 hours ago
Title of this article suggested more depth and I was expecting actual code examples. But it is like other opinion pieces. It suggests a prompt (ask AI to find bugs) that works for the author advising everyone to do it that way.
I use these tools at both work and for personal side projects and I was expecting to watch and learn. But these opinion pieces without examples are way too many now.
[-]
- vessenes 3 hours ago
  Have you tried his suggested workflow? I think it's a useful workflow, and if I hadn't found a workflow like this already would appreciate the pointer.
  I guess he could write a code harness to do this, or gin one up really quickly, but that kind of tooling today seems like the purview of the practitioner -- you -- it's frankly faster for you to spec what you want to try this idea out if you want it automated than it would likely be to deal with his code.
vessenes 3 hours ago
One thing that's been interesting to me over the last few years is charting the edge of my coding laziness. As a coder, I'm lazy about boilerplate code -- I hate writing it, I hate maintaining it, etc. And so I design and architect (or used to) around that preference. Sometimes that's smart, sometimes that's not. But it was my preference, and I avoided something that was hard for me to do.
When LLMs started being somewhat useful for coding a few years ago, and I found they were in fact great at boilerplate, in fact pretty much only good at boilerplate ca 2023 or so, it got me thinking about all the accommodations we make in design and systems architecture that are sort of tacitly understanding who we're working with and their strengths and weaknesses.
The modern models have their own very different strengths and weaknesses compared to humans, and deploying them is a really interesting exercise of different architectural and engineering skills. I've enjoyed it, and hope I continue to.
[-]
- foobarbecue 1 hour ago
  The thing about boilerplate is that a good library or framework makes it optional, and / or automatically written.
  I'd much rather django-admin startproject, npm init, or meteor create and get deterministic output than prompt an LLM and get who knows what.
  In a mature web ecosystem, boilerplate is minimal. I worry now that we've given this task to LLMs, less development effort will go into startproject-esqe CLIs and good opinionated defaults.
  [-]
  - ehnto 26 minutes ago
    I wonder this in general, what's the impetus for writing new frameworks and such? Are we already seeing a slow down in that space? HN front page certainly paints that picture.
    You're better off plonking down an existing framework and getting all the structural boilerplate benefits the LLM can leverage.
    LLMs are far better at frameworks they have a lot of training data for, if have been around for a while. They write more idiomatic, ecosystem friendly code. Does that still matter?
themanmaran 58 minutes ago
As I read this, I'm also working through a pretty dense feature that took a fair bit of iteration. The end result is actually significantly less code than it was about halfway through. And I was wondering if the AI actually helped me at all, since surely I could have written the code in the same time it took to iterate
But! Because of AI I was able to rapidly hack out like 4 variants of this feature that I didn't like. And felt comfortable throwing them away just as quick.
[-]
- a_victorp 32 minutes ago
  This has been one the most significant improvements of using AI for me. Before I would have to really think through the plan of a new feature before committing to the implementation and would only catch incompatibilities with existing code after a good portion of the implementation was already written. Now I can ask AI for detailed implementation plans and find these nitty gritty detail problems in a few hours if not less
- halfcat 6 minutes ago
  So what’s the verdict? Was it worth it?
boringstack 16 minutes ago
Optimizing for code quality over raw output speed is a great approach. The time 'lost' writing it slowly is easily made up by the time saved on debugging and maintenance later.
Waterluvian 1 hour ago
I think my current conclusion is that AI makes <foo> more important than ever.
I’m not exactly sure what <foo> is but I feel it. I think it’s quality and authenticity and craftsmanship. That difference between an expensive tool and a cheap one that you can’t easily describe but you just know it.
Is there a word for this? I bet the Japanese or Germans have a word for this.
I use AI a lot now. But I also do it in small steps. It isn’t a craftsman, but it can help me be one.
[-]
- datadrivenangel 30 minutes ago
  Quality as Pirsig would say in Zen and the Art of Motorcycle maintenance.
- samrus 1 hour ago
  People use the word "taste" to describe that
  [-]
  - Waterluvian 1 hour ago
    Yeah, maybe that.
    I feel like AI promises a factory that can make Walmart quality tools. Which I think will make the well-crafted tools more important than ever.
kiba 3 hours ago
I used LLM as a tutor to tackle unfamiliar terrain. That is, I write code that I know very likely doesn't work but is the best code that I could have written. The LLM will happily tirelessly show me what I did wrong and what the correct code actually look like. Then, at the end of it, I got code that running. That's a tight feedback loop.
It's still very slow. It took me two hours to write code that generate JSON data and then to write a web page that displays a knowledge graph.
One thing you have to be aware is that the LLM will happily generate code for you and you have to discipline it from time to time. I notice that my reading comprehension begins to suffer if I don't write the code myself and have to understand what the LLM wrote for me as opposed to the LLM correcting where I went wrong.
One thing I would like to try with an LLM is understanding a large and complex existing codebase like OpenSCAD that doesn't leverage my existing skillset(high level programming languages with OpenSCAD as primary language in the past year). That has always been a barrier to contribution for me.
anuis258 6 minutes ago
hmmm
ElenaDaibunny 53 minutes ago
The bug-finding use case alone makes this worth it.
reactordev 1 hour ago
This is the approach I take, with many guardrails and nested CLAUDE.md's to keep things sane.
syntaxing 3 hours ago
Hot take, barring from special edge cases, I find using dumber models (like local Qwen 3.6) to be the best balance. Smart enough to do stuff but dumb enough where I don’t trust it and verify what it’s doing rather than letting it do the third whole code base refactoring of the day. Also forces me to know my code base and ask very descriptive tasks rather than go “something is wrong, fix it”.
ptlan_asnh 3 hours ago
How profound! Talking points are changing from "vibe coding delivers bug free software" to "slow down and enjoy the AI".
Great how the promoters are mirroring the current anti-AI sentiment. The next step is canceling all subscriptions and not using AI at all. Maybe your mind will work again.
[-]
- CuriouslyC 3 hours ago
  Not so much. People are just walking things back from the Gastown/Oh My Opencode/etc peak of trying to get 10 agents working simultaneously on a project unsupervised. They've collectively realized that you still have to understand and validate what the agents produce in some way if you want to build maintainable software.
tonymet 1 hour ago
Are we overcomplicating AI by approaching top down, so naturally there are trillions of variations and too many ways to fail? Supervising a component-level scope, with emphasis on quality control (regression, perf testing, benchmarking, etc), seems to produce great work.
EFLKumo 2 hours ago
https://news.ycombinator.com/item?id=48246232
This reminds me the article above. Now people have diverse ideas on agentic coding. Some suggest human-in-the-loop while others suggest giving a detailed specification and let the agent run freely; some suggest leveraging LLM's high productivity and here we get an opinion that LLM can actually slowly write good code.
It's happy to see opinions that are more practical and variant emerging, turning LLM into literally a tool instead of something to be hated or hyped.
In my own practice, I find LLMs (SOTA ones) good at medium-level tasks, those needed to reason and plan for a while. However, the design taste on architecture is unexpectedly disgusting. Sometimes writing interfaces myself and asking LLMs to fill in implementations, alongside context-completing tools like context7, deepwiki, docs.rs MCPs, etc. and giving a escape hatch (e.g. encouraging it to use the AskUser tool in Claude Code), may be considered my best practice.
knuckle 1 hour ago
Stop being reasonable! This is a hypecycle!
efitz 2 hours ago
Great article and right on point.
npollock 3 hours ago
learn by considering critique
slopinthebag 3 hours ago
I use cheaper models (Deepseek is king, but GLM and Kimi as well) and do the planning myself. I often start a task myself, write some code to get the LLM on the right track, and then have it complete parts of the implementation that are kind of boring or repetitive. LLM's are just next token predictors, I don't mean that in a demeaning way, but I've found if I can get the LLM started on the right track with my own code, it completes what I want. Having the LLM write code just from a spec ends up with poor quality slop in my experience.
I'm not 100x'ing my output like some people claim, but using it as a augmentation rather than delegating my work to it results in better code, and I don't lose context / control over my codebases. I really have read 100% of the code, because the LLM is generating smaller pieces around and inside my own written code. Works well enough for me, and open models are already both cheap enough and good enough for this workflow. This is why the big companies are so desperate to push full-on agentic hands-off workflows and developer replacement - that's the only way they won't go bankrupt.
[-]
- foo12bar 1 hour ago
  What app do you use with deepseek? I've been used claude code but pointing it at the deepseek api and it works ok, but I'm wondering if there are better options. (https://api-docs.deepseek.com/quick_start/agent_integrations...)
nibblecid 11 minutes ago
[flagged]
xuanlin314 23 minutes ago
[flagged]
chengyongru 1 hour ago
[flagged]
zhxiaoliang 2 hours ago
[dead]
jdw64 3 hours ago
[dead]
seblon 3 hours ago
I want to mention, Claude code has a command /code-review. I find it quiet useful.
ai_fry_ur_brain 2 hours ago
Just dont use it lol, it does nothing you cant do by yourself. You're nerfing parts of your brain by relying on it.
[-]
- ianm218 1 hour ago
  There is things you really can't do by yourself. I've been working on porting some large codebases to Rust lately to experiment with fixing memory safety bugs. There is just no way you can write 100k LOC in a week of production code with tons of tests etc. Even "10X" engineers just can't type that fast.
- ctrl-alt-zen 1 hour ago
  Yeah, agreed. “Cognitive surrender” is one way of describing that loss of personal faculty. I don’t think AI proponents are acknowledging second order effects of letting your mind interact less and less while requesting more and more complexity built for you without adequate verification.
- bachmeier 1 hour ago
  Where they are extremely powerful, and it's hard to debate this IMO, is adding comments to the code, writing complete documentation, and constantly updating the readme. The value in actually writing the code is still up for debate (I'm on the side that sees the value there too) but the mind-numbing, boring, make-you-hate-life parts of the codebase are without question better for the use of AI.
alasano 3 hours ago
Instead of using a skill and having the agent own the flow for this, I've been building an external orchestrator that handles the process.
By default it uses pi agent core + pi ai (from the excellent pi coding agent) as a multi model runtime but also supports a Claude Agent SDK runtime.
I can have an implementation and review process of an OpenSpec change run anywhere from 2 hours to 24+ hours going through review/fix/verification rounds automatically until the implementation matches the spec and any additional reviewers are done finding issues after the fix rounds.
it's going to be fully open sourced in the next two weeks and fully free to use
https://engine.build
[-]
- whattheheckheck 3 hours ago
  Maybe we can come up with an spec for aligning asci diagrams. Can't really build anything with confidence when the attention to detail is lacking in these agentic systems
  https://imgur.com/a/r4fhOwy
  [-]
  - tudelo 2 hours ago
    It's interesting... Opus seems horrible at keeping text aligned. Markdown it is I suppose
  - alasano 3 hours ago
    What's that from? OpenSpec docs?
    [-]
    - whattheheckheck 2 hours ago
      Yeah not trying to pick on any particular project because its quite the mark that the writer didn't proof read the documentation and its quite widespread