So previously the bottleneck was production. I'd wager now the bottleneck is willingness to test your hypotheses. The willingness to experience failure as soon as possible. To test and iterate.
As technology brings the cost of everything else to 0, psychological costs will predominate.
Reality testing is ultimately unavoidable, of course, but I'd guess most people still lean away from that rather than into it. (Our whole culture is set up that way, and most of us get like two decades of Pavlovian conditioning in that direction.)
Add on the compounding effect that "QA" or "test" in someone's job description was viewed as a synonym for "less-highly compensated" over the past few decades, and you have an entire generation of mid career devs with poorly adapted instincts regarding what is valuable in the process of shipping working product.
The post reads like written by someone who read too much about AI rather than tried to build a startup with the help of AI that they advocate so much. I'm still bounded by system design, UX, pricing and feature decisions, if not by the speed of code output, by the review time for sure. Yes, iterating is faster, but we're nowhere near agentic AI loops spitting out working products. Technically it's possible, but then you just spent that time planning and writing the spec up front, which you'd interleave with dev time otherwise. If the product is a simple CRUD database skin, then yeah, chances of success are lower I think, but this is not the type of startups the post seems to write about.
It's gotten to the point where, when I see yet another Thought Leadership article about software development, I search the page for the word "will". If I see unqualified predictions of the future (AI will change this and Agents will do that and developers will need to do thus), I think I can safely ignore the article. Who has the hubris to make such strong and unwavering statements about a future nobody can see?
A local "thought leader" wrote a ridiculous piece about software leadership in a post-LLM world, when clearly they'd never actually used an LLM to build anything, or actually led a team using LLMs. Lots of hand-waving but obviously no real experience.
As gp says, there's a big difference between theory and practice here, and a lot of the things we needed when we weren't using LLMs are still needed when we are, but it takes a bit of actual practice to work this out. It's still not at the stage where an Ideas Guy can make a real working product without someone on the team actually knowing how to develop software.
At least in my experience, so far. But the world is changing fast.
How come all the talking heads telling us AI has made whatever we've learned or built obsolete never point the lens back at themselves? I personally thing knowledge from entrepreneurs who learned their lessons in previous decades is valuable, but if what they're spouting is true it doesn't make any sense to listen to them, i.e. these thought leaders are desperate to tell you of their irrelevance. Same thing for every CTO who tells me AI replaces developers. That's a stretch, but if we could do that, don't you think it would be trivial to replace your job?
Someone is still needed to decide what the AI should do, and to harness and manage it. A CTO can do that with existing skills, and the resulting organization should converge quickly on near-zero need or want for human SWEs.
So goes the thinking, anyway. It's why my couple decades of experience and I still occasionally get to hear from rando cold recruiters desperate to sell someone a "pivot to AI," probably thinking they can lowball me by holding my mortgage over my head in order to screw three times the work out of me that they'd pay for.
Interfaces will definitely evolve. Visual graphs and representations are useful but speaking to an agent will become mainstream as its faster than typing. Also the ability for agents to code on the fly will open up different interfaces. For instance you could say "show me the impact that our marketing campaign x over this time period" and out comes graphs that were coded. Drawing might even make a comeback for instance when designing a website you just cross out things you don't want, draw boxes of where you want things, talk at the same time saying what you want in that box. Then some people are using virtual relativity. Not everything will become a chatbot but its definitely going to evolve with chatting being an integral part of the user interface.
You're describing some Hollywood version of SF, not the real world. Speaking is not faster than pressing a key or turning a knob (like try to operate a CAD or a DAW without keyboard and mouse). And for most report/infographic, you mostly need to design a few dashboard and almost never change them, because those are your core metrics that you need to monitor. And the ability to sketch even a simple wireframe relies on a lot of knowledge that most people don't want to burden themselves with.
In the real world a verbal description to start off a design from a template is _very much_ a competitive advantage.
I dabble in music production and having a DAW to help me guide some parts of the process would be extremely useful to get me out of certain creative ruts.
I usually find chatbot interfaces completely infuriating. I know the UX paradigm of click-reduction went the way of the dodo (and rightfully so, because it was based on bullshit research,) but I think it’s funny that completely removing the user’s agency and visibility into any process, and turning 3-click processes into 200 keystroke processes is the hot shit right now.
I'm glad this is the top comment. I'm ambivalent about a bunch of writing I've seen from Steve Blank - some of his stuff I've loved and some I thought was awful.
But this I just thought was vacuous. I agree with what you wrote, but more to the point, I didn't find any real advice about how a startup should actually change that passed my sniff test. I left the tech startup world about 2 years ago myself, and I'm glad I did, because I just think there are way fewer differentiable opportunities now. That is, even if I accept what Blank says is true, what are all these 2+ year old startups supposed to do - just create some model wrapper/RAG chatbot product like the million other startups out there?
Even in defense, like the article says, there are now a bajillion drone companies, and it looks like a race to the bottom. The most successful plan at this point just looks like the grifter plan, e.g. getting the current president to tweet out your stock ticker.
I'm honestly curious what folks think are good startup business plans these days. Even startups that looked they were "knock it out of the park" successes like Cursor and Lovable just seem like they have no moat to me - I see very few startups (particularly in the "We're AI for X!" that got a ton of funding in the past two years) with defensible positions.
I have a lot of respect for Steve Blank, but my heuristic by now is to ignore any breathless posts that state “teams are doing X with AI, if you are not doing the same you’re behind”.
The much more useful posts are “my team and I are doing X with AI”. Of course, the challenge there is that the ones who are truly getting a competitive edge through AI are usually going to be too busy building to blog about it.
I really enjoyed reading his articles a while back. He wrote an article about about silicon valley's roots in microchip development. I can't remember the details but I reached out to point out how important Autonetics was to chip development and that was based in Los Angeles. From my perspective what made Silicon Valley significant was it's connection to wall street. I wanted to engage on the topic of Venture Capital might be the real product of silicon Valley.
He could have ignored the email or engaged on the topic I introduced. Instead he sent me a wikilink to Autonetics. I was left with the feeling that he had no real interest in the topic he wrote about. It was really no big deal. He is a busy guy and doesn't need to engage with strangers. I never read anything by him again because I was left with the feeling he is just phoning these posts in.
I'm not, but this is not a great introduction. It's handwavy and makes the assumption that AI dev tools are much farther along than they are. I have seen this a lot lately; the farther up the management chain and farther away from putting hands on code, the more confident people seem to be in the power of AI tools.
For big complex real world problems, and big complex real worlde codebases, the AIs are helpful but not yet earth shattering. And that helpfulness seems to have plateaued as of late.
I will take a lot more hand waving from the 70-something year-old Stanford professor who co-created far-up the chain management paradigms that run a good chunk of the economy. That context kinda changes things but what do I know.
That thing he created says you should take your assumptions out into the real world and validate them, ya?
So hand-waving about how easy it is to have an MVP in days w/o actually experience in doing that seems ironic.
Now, maybe he's saying this based on companies he's funded who've had great success with what he's saying. But it's curious that the only concrete example of a company mentioned is one that's six years old and not operating like that.
And in fact, many of the ways he thinks that company went wrong seem completely unrelated to AI?
> Chris is now starting to raise his first large fundraising round. In looking at his investor deck I realized that while he’s been heads down, the world has changed around him – by a lot. The software moat he built with his 5-year investment in autonomy development is looking less unique every day. Autonomous drones and ground vehicles in Ukraine have spawned 10s, if not 100s, of companies with larger, better funded development teams working on the same problem.
> While Chris has been fighting for adoption for this niche market (one that is ripe for disruption, but the incumbents still control), the market for autonomy in an adjacent market – defense – has boomed. In the last five years VC Investment in defense startups has gone from zero to $20 billion/year. His product would be perfect for contested logistics and medical evacuation. But he had literally no clue these opportunities in the defense market had occurred.
> While there’s still a business to be had (Chris’s team has done amazing system integration with an existing airborne platform that makes his solution different from most), – it’s not the business he started.
"Being heads down without paying enough attention to the market for 6 (!!) years" doesn't seem like an AI-caused issue.
Meanwhile, the core suggestion doesn't seem to fix that, it seems almost completely perpendicular.
> You can now test multiple versions of the same business at once (or simultaneously be testing different businesses). While you can be simultaneously testing five pricing models, ten messages or twenty UX flows, the “user interface” may no longer be a screen at all. Testing might be to find prompt(s) to AI Agent(s) deliver needed outcomes.
Ok, but this person didn't even seem to be doing enough paying to the market of one version already?
And while this claim about parallel development being a huge unlock is the most interesting thing, it also sounds a bit glib. Getting your foot in the door is the hardest thing early on, now you're trying to run six versions of your company at once? Each time you get a foot in the door sales-wise, are you trying to make them use all 6 versions, or are you only gonna get feedback on 1? Would you want to pay money to be a beta tester of 6 different products simultaneously, with reason to believe that 5 of them will probably evaporate over night soon?
> the ultracapitalist dystopia the US has turned into
Seriously, where do ideas like this come from? An "ultracapitalist" country that has about as much redistributive social spending as other developed economies[0]? A "dystopia" that millions of people from all over the world clamor to get into every year?
So in other words... he doesn't actually use the tools he's firmly convinced will automate the building of software.
I don't agree with the parent; I think capitalism is doing a lot of great things for us and will continue to, even with AI. But man I'm tired of these hot takes from people with limited practical experience.
One year ago, the models were only slightly less competent than today. There were models writing entire apps 3 years ago. Competent function writing is basically a given on all models since GPT3.
Much of the progress in the past year has been around the harnesses, MCPs, and skills. The models themselves are not getting better exponentially, if anything the progress is slowing down significantly since the 2023-2024 releases.
>One year ago, the models were only slightly less competent than today.
That has not been my experience. This weekend I pointed Claude Code+Opus 4.6+effort=max at a PRD describing a Docusign-like software. The exact same document I gave to Claude Code+Opus 4.5+Ultrathink around 6 months ago.
The touch-ups I needed after it completed implementation was around a tenth that it took with 4.5. It is a pretty startling difference.
Agree with this. Opus 4.6 thinks of things I didn't even put in the spec, but absolutely need. It thinks around all the edge cases and gotchas. And I love the way modern AI UIs stop in their tracks and have you answer a bunch of questions about all the ambiguities you left in the spec.
They still do dumb shit from time-to-time, but it's getting rarer.
Yeah I've been able to get great Python functions out of everything since the ChatGPT 4 API in early-to-mid 2023.
It takes far less manual prompting to make it have consistent output, work well with other languages, etc. But if you watch the "thinking" logs it looks an awful lot like the "prompt engineering" you'd do by hand back then. And the output for tricky cases still sometimes goes sideways in obviously-naive-ways. The most telling thing in my experience is all the grepping, looping, refining - it's not "I loaded all twenty of these files into context and have such a perfect understanding of every line's place in the big picture that I can suggest a perfect-the-first-time maximally-elegant modification." It's targeted and tactical. Getting really good at its tactics for that stuff, though!
I can get more done now than a year ago because taking me out of the annoying part of that loop is very helpful.
But there's still a very curious gap that the tool that can quickly and easily recognize certain type of bugs if you ask them directly will also happily spit out those sorts of bugs while writing the code. "Making up fake functions" doesn't make it to the user much anymore, but "not going to be robust in production but technically satisfies the prompt" still does, despite it "knowing better" when you ask it about the code five seconds later.
One year before 1969 we had never been to the moon. In the 70s credible scientists and physicists predicted that large martian colonies would exist before the year 2000.
If a metric goes from 0 to 2 it doesn't mean it's on a long-lived exponential trajectory.
Even if there is a growth pattern that doesn't say how long it will continue. Some things can grow for a while and then hit a ceiling. Sometimes they are a fad that dies (when was the last time you bought a pet rock), sometimes everyone has one - you get a few sales of replacemets but no growth (you have a washing machine and won't buy another until the old wears out)
What if I fold you... that using 1960s technology it would be easier to just go to the moon than it would be to fake it? Have you SEEN a 1960s movie with SFX?
This comment is getting punished for the incorrect timeline (I would know, I've been harping on about AI getting good at coding for ~2 years now!) but I do think it is directionally correct. Just over 3 years ago, (publicly available) AI could not write code at all. Today it can write whole modules and project scaffoldings and even entire apps, not to mention all the other stuff agents can do today. Considering I didn't think I'd see this kind of stuff in my lifetime, this is a blink of an eye.
Even if a lot of the improvements we see today are due to things outside the models themselves -- tools, harnesses, agents, skills, availability of compute, better understanding of how to use AI, etc. -- things are changing very quickly overall. It would be a mistake to just focus on one or two things, like models or benchmarks, and ignore everything else that is changing in the ecosystem.
I agree it's directionally correct, but only in the ways that don't matter to this discussion. If 2026->2029 AI is as much of an improvement as 2023->2026 AI, is anything we learn about how to leverage it in 2026 going to stay relevant?
Seems extremely disingenuous to say that one year ago models could barely write a working function. In fact, there were plenty capable of writing a working function with the right context fed in, exactly as today.
Made me wonder if there's a live-streaming equivalent for blogging... some platform that both ensures the reader knows the blogger is a person, and promotes a parasocial relationship.
There's live-coding, so it's not totally a crazy idea.
Your comment immediately made me think about the extreme opposite: a Davy Force-like, Infochammel-style livestream of a never-ending AI generated Ted Talk, offering delectable morsels of tech startup wisdom, but is ultimately zero calorie.
As a way to get my feet wet vibe coding, I made https://seeitwritten.com with that idea. That by capturing how you write with all its fits and starts you can show that a human wrote it. So, sort of recorded live-streaming. But I'm thinking that a sufficiently cute agent could be prompted to write something and re-write something in a convincing manner. I'm not so sure about that, though, since their corpus is completed text rather than text in action.
In 2020, I wrote a book and livestreamed the writing of it. Felt more raw and real at the time but I guess is probably more human vs ai these days.
Not sure how to make that a platform, as when i wrote i explicitly put everything directly into a book unedited, whereas for many people, the editing is probably at least half if not more of the time they spend writing.
I think Blank's story is about a guy who missed a $20B market shift in defense VC, not a guy who failed to adopt AI.
He's not saying "add AI to your product" or "use AI or die" but more that AI has shifted institutional assumptions about tech stacks, defensibility and fundability. The bottleneck moved up the stack from engineering to judgment, insight and design.
Chris lost because he was heads down building while $20B in defense VC was flowing into his exact problem space and he didn't build the boat to capture that wave.
Well of course it is, most startups are dead on arrival.
The big pinch of salt I throw in with advice like this though is that startup failure rate hasn't dramatically shifted despite two decades of lean startup methodology, accelerators, and an entire cottage industry of startup advice. It's never the fault of the framework, mind you.
If anything the failure rate has probably increased with more capital and founders chasing after the same opportunities. Being a startup founder gained prestige and became the default thing to do after college for certain types of people who before the GFC would have ended up in finance.
isn't this a problem of being able to see whatever we want in the data? For example maybe more startups than ever are created in part due to access to this info. Volume increase but rate doesn't. Or maybe magnitude of success increases. Surely this is true but largely attributed to internet overall. But maybe methodologies are indeed a factor.
edit: Tobi from Shopify has an insight that relates. His north star metric is user churn. sounds crazy on its face. he's known for that. But increasing churn means you've increased top line exposure to more would-be entrepreneurs. Not all of them will succeed, but shopifys mission is to create more entrepreneurs. Grow the pie. A focus on increasing conversion tends to have a narrowing effect.
Sure, but if the advice works, you'd see failure rate drops over time. It would work like medicine - we have more people than ever before, but almost no one dies of polio anymore. That's not what we see though, failure rate is basically the same.
Interestingly, this was always true, it's just made more obvious when it takes less time to bring a product to a potential customer.
Launching a product was never the finish line; it was always the start line. But technical founders could trick themselves into thinking that building a product was building a business.
The same "Lean Startup" rules apply. Build something and get it in front of real people who will pay you for the thing. If they won't, back to the drawing board.
The only real "shift" I've seen is that most startups don't actually need VC at all. That's a great thing.
> Founders who started pre-2025 typically have built a technical stack optimized for a world where software development was bespoke and expensive.
Of all the things that AI has changed, tech stacks aren't one of them. The bots will gladly write Typescript, Java, Python, Rust, what have you. They could not give less of a shit.
Build vs. buy is an eternal question in enterprises. I remember many in-house data teams trying to build tools for "digital transformation" and cloud migration about 10 years ago. The challenge was, building those tools was more expensive than those enterprises could budget for (IT as cost center), so a startup like Snowflake would easily outcompete in-house solutions with their custom, cloud-based tech stack that was necessarily complex because it needed to serve the needs of thousands of customers.
If he's right, the build vs. buy equation has shifted more towards build, at least as far as enterprise software is concerned. IT is still a cost center, but in theory an internal team can now handle more requests for custom tools without looking to outside vendors. Essentially the cost of building in-house might be collapsing and therefore enterprise software startups will be serving fewer customers (who would all pay you more because if solving the problem was cheap they'd do it).
If you had to build a stack for dozens of customers paying huge amounts of money, how would that stack differ from the stack you'd build to serve thousands of customers? Certainly it wouldn't need to be as scalable! And that's probably what he's getting at. I think what you'd do instead, to capture those higher price point customers, is solve their problems more specifically, in a higher value manner.
Many companies already do this, investing far more in field engineers than they do in their tech stack, since customization is essential.
Note the mention of "systems of record" being unsuitable for the present level of AI. The real question is whether the costs of AI mistakes and hallucinations can be dumped on some external party who can't impose costs on you. If not, there's a problem.
"You better be doing something in AI" applies to Startups - businesses that are expected to spend every penny as quickly as possible, to meet the metrics needed to raise the next (bigger) round of funding.
It is also not the same as, "If you want to be a profitable company...". For that you need to somehow make more money than you are spending.
> For that you need to somehow make more money than you are spending.
I've had this idea of 'business as reducing entropy' floating around in my head for awhile. It's a neat way to think about the value a business offers to buyers; a washing machine manufacturer is selling reduced time to reduced entropy (clean cloths), spreadsheet software is selling reduced time to understanding (information from tabulated data), and so on.
From that perspective, a lot of AI-driven development is failing.
We're still in the phase of 'how do we get order out of semi-average chaos?' for LLMs. For ML we're largely past that point.
I've been using this framing as a means to guide me towards 'what is actually useful, what might someone actually buy'. I don't have my own business at this point, but its still fun to think about off and on.
I think this is application dependent. LLM's are quite good for brainstorming, even if they are not arguably creative, at least they draw from a lot of information that is already out there, which saves me time in researching and learning.
Just stop thinking of software products. There are a million businesses offering solid value propositions that need a lot of software to run, but software isn't the product.
> The bottleneck is no longer engineering. It’s moving up the stack to judgment, customer insight for desired outcomes and distribution.
My posit is this: engineering never was the bottleneck, or at least hasn't been for 10 years now. Frameworks and best practices are pretty well known at this point. AI is simply exposing this reality to engineers' faces.
Proof point - most publicly traded SaaS first businesses S&M equals their R&D spend, if not dwarfs it. You're going to see this even more lopsided going forward.
Building SaaS businesses has become a whole lot less capital intensive. Solo founders can go much further than they've been able to previously. New startups probably don't need funding anymore.
This article resonated with me, especially since I've noticed the fund raising hoopla's in my circle has dramatically dropped. Either investors are tightening the belt so founder-investor fit has crossed into the realm of disillusionment
"a pricing model based on seats, a product roadmap built around features rather than outcomes [is outdated]"
I disagree with this.
On pricing, I get that agents and tokens can scale in a way that's unrelated to # of users. But for much SaaS software, AI remains helpful to a human and the human remains the receiver of value. Seat-based pricing is easy to understand and you can always layer in token/agent costs thresholds.
On features vs. outcomes, the latter is hard to define and measure in many industries. In marketing SaaS, which I know well, you can't often tell what outcome to expect. You have to try a lot of ideas and some will hit. No way a SaaS vendor can guarantee that.
The argument against seat pricing is that companies will employ fewer people in general. So a 45 person company paying for 10 seats will become a 7 person company paying for one seat.
Not sure I agree with this train of thought, but a SaaS CEO made this exact argument to me last week.
I’d add that token pricing doesn’t work for anyone but the frontier models. Everything else will be commodified. So Opus can charge us top prices per token until a lower (or local) model hits parity and then price goes to zero.
The author didn't spend more than, maybe, 30 seconds thinking this through? Information I could've gotten in 3 seconds by opening a screen and looking at a line item, I now have to extract by writing a paragraph to an AI agent (and cross my fingers that nothing I said was ambiguous or misunderstood). And that's supposedly an upgrade?
I read it less as “every startup now needs an AI feature” and more as “your assumptions expire faster than they used to.” That part feels true even if the examples are a bit overstated imo...
It’s easier to view it in terms of DCF - the value of a cash flow generating asset = present value of expected cash flows discounted back at a risk discount adjusted rate. In other words what you’ve invested into your existing assets is irrelevant - the cash flows generated by them and the growth assets through future investment, is what matters.
One assumption that's also changed: small teams no longer need dedicated QA. Tools like Autonoma (open source, getautonoma.com) use AI agents to generate and run E2E tests in real browsers, and tests self-maintain as your app evolves. One of the things making the solo-founder path more viable.
Seems like the headline should have been "is now dead on arrival". As currently written, it fails to convey the temporal aspect that is the focus of the blog post.
It also fails to convey that he's actually only talking about startups that were created 2+ years ago, rather than the many AI startups founded in the last 2 years.
> Now, before you build a physical prototype, you can simulate more design variants, create digital twins, and stress-test assumptions earlier and much cheaper than before.
Hahahahahahahaha no you can't. The rise of LLMs has done little to nothing in this area because it's very much compute-limited. Digital-twins and other ML-based strategies predate ChatGPT by a long shot. There are definitely places in hardware design where LLMs and agentic workflows will help, but that's largely because the existing tooling is utter garbage, and now the industry has a fire under its ass to make things automatable so they can build their own agents.
I think this is a lot of fluff supported by a weak anecdote.
Chris' company's assumptions are no longer true, but that doesn't apply to everyone's startup. This is mostly a Chris problem.
No, not every product can just be a chat window like in the silly little screenshot.
If the author actually wrote software they'd realize that, no, AI isn't speeding up development by any more than a modest amount. It's great that we have it and it's removing tedium but it has replaced zero engineers at my company or at any other company of anyone else I know.
And no, your company laying off some people isn't because of AI, your startup idea not getting funding is not because of AI, it's because we've been in a regular old recession which is now a developing oil crisis. Interest rates aren't 0% so nobody wants to lend money to infinite startups.
depends on assumptions. thats the load bearing element. 99.99% of what was true then is still true. its mostly on the fractal churning edges where hyped change happens. things flip-flop too:
2021-2024: good time in US for EV startup
2025: terrible time in US for EV startup
2026 March/April: AWESOME time in world for EV startup
focus on fundamentals, not flakey ephemerals
2020: wise to have smart elite software engineers on your team
2021: ditto
2022: ditto
2023: ditto
2024: ditto (is this when ChatGPT launched? dont care. snore)
It is insane to me that anyone could look at the US Economy right now and think this was a good time to start a business. Between the war, AI, a pending economic recession (look at bond prices), all I see is failure. Maybe a funeral home, but that is all I can think of.
I find the story of a startup founder who entirely missed the developments of the last two years and did absolutely nothing with AI difficult to believe. If that actually happened it's the exception, not the rule. Most startup founders are way more in-tune with AI developments. This makes it sound like Chris (the mentioned founder) is behind marketing people who use LLM bots to post slop on LinkedIn.
In that case, yes their startup is most certainly DOA.
Look around, you're most definitely in a bubble. LLMs are bleeding edge by themselves. Using agentic anything is mega bleeding edge. Having something actually working reliably is a tiny sliver of the bleeding edge audience. We have barely entered early adoption phase. Most AI users out there are Q&A'ing it and they have no idea what agents, tool calling or context compaction are.
I don't follow. Tech startups are bleeding edge. You may be over-generalizing here. I talk a lot of my dev friends, they are all using AI for work. So if Joe Blow at some consulting company is using it, then a SV startup CEO should be too.
>Most AI users out there are Q&A'ing it and they have no idea what agents, tool calling or context compaction are.
Again, talking about a tech CEO not a random "AI user."
I know a 60-ish year old serial startup CEO who never coded. Now he's running teams of Claude Code agents which he used to build a healthcare platform by himself which he is now selling. He's already wealthy, just doing it for fun. This skill set is the bare minimum for a startup founder today.
Going to VCs with a 2 year outdated deck with no AI functionality or plans or tool use is unimaginable.
Yeah I get what you're saying. Sounds like this guy is driven, smart and savvy then? Definitely a bleeding edge minority who goes where the puck is.
As to my original point I'll go and say that 95/100 of business owners out there in the world don't have any idea about Claude Code, Codex or anything of that caliber. It's too early. By the time that group gets to it there'll be tools tailored to their needs, not a terminal-based coding agent messing up filesystem.
You keep returning to this which is a strawman of my argument. From the start I have been talking about technical startup founders, which is also the subject of the blog post.
You on the other hand keep assuming that startup founders, even in SV are technical enough to envision AI or can change the thesis of their business on a whim of technology change that has been introduced less than a year ago. It's not realistic. Businesses have systems, people, obligations, legacy, etc.
Did you read the blog post? It seems like you may be missing a lot of context here.
I'm not assuming anything, that is my argument. I think you could do a better job countering it than pointing at the status quo of every business in the world. lol.
Me: 'cutting edge startups should use cutting edge tools'
You: 'every business in the world isn't using cutting edge tools!'
It's not about the status quo. It's about the speed of change and focus. While I'm all in on building custom agentic loops and building extensions for my pi I have enough bandwidth and xp to see that it's quite easy for business people to be focused on the matters that are completely non-technical. That plus plain sunk cost fallacy and blind denial ofc.
A laundromat isn't a startup in that sense. There's no potential for exponential growth. A VC would never give you money to open a laundromat, you'd go to a bank and get a business loan for that.
Washio was an American on-demand laundry cleaning and delivery service. The company was founded in 2013 by Jordan Metzner, Bob Wall, and Juan Dulanto, and raised $17 million in funding.
As technology brings the cost of everything else to 0, psychological costs will predominate.
Reality testing is ultimately unavoidable, of course, but I'd guess most people still lean away from that rather than into it. (Our whole culture is set up that way, and most of us get like two decades of Pavlovian conditioning in that direction.)
Edit: Expanded here: https://nekolucifer.substack.com/p/willingness-to-fail-is-no...
The bottleneck was never coding...
If anything LLM chatbots & synthetic users will make the majority of founders evermore comfortable not testing reality.
As gp says, there's a big difference between theory and practice here, and a lot of the things we needed when we weren't using LLMs are still needed when we are, but it takes a bit of actual practice to work this out. It's still not at the stage where an Ideas Guy can make a real working product without someone on the team actually knowing how to develop software.
At least in my experience, so far. But the world is changing fast.
Some that came after might be worthy of the title, but those who claim it for themselves aren't.
So goes the thinking, anyway. It's why my couple decades of experience and I still occasionally get to hear from rando cold recruiters desperate to sell someone a "pivot to AI," probably thinking they can lowball me by holding my mortgage over my head in order to screw three times the work out of me that they'd pay for.
I was in this business too long.
You will!
LinkedIn influencers
I dabble in music production and having a DAW to help me guide some parts of the process would be extremely useful to get me out of certain creative ruts.
But this I just thought was vacuous. I agree with what you wrote, but more to the point, I didn't find any real advice about how a startup should actually change that passed my sniff test. I left the tech startup world about 2 years ago myself, and I'm glad I did, because I just think there are way fewer differentiable opportunities now. That is, even if I accept what Blank says is true, what are all these 2+ year old startups supposed to do - just create some model wrapper/RAG chatbot product like the million other startups out there?
Even in defense, like the article says, there are now a bajillion drone companies, and it looks like a race to the bottom. The most successful plan at this point just looks like the grifter plan, e.g. getting the current president to tweet out your stock ticker.
I'm honestly curious what folks think are good startup business plans these days. Even startups that looked they were "knock it out of the park" successes like Cursor and Lovable just seem like they have no moat to me - I see very few startups (particularly in the "We're AI for X!" that got a ton of funding in the past two years) with defensible positions.
The much more useful posts are “my team and I are doing X with AI”. Of course, the challenge there is that the ones who are truly getting a competitive edge through AI are usually going to be too busy building to blog about it.
He could have ignored the email or engaged on the topic I introduced. Instead he sent me a wikilink to Autonetics. I was left with the feeling that he had no real interest in the topic he wrote about. It was really no big deal. He is a busy guy and doesn't need to engage with strangers. I never read anything by him again because I was left with the feeling he is just phoning these posts in.
For big complex real world problems, and big complex real worlde codebases, the AIs are helpful but not yet earth shattering. And that helpfulness seems to have plateaued as of late.
I am extremely skeptical of posts like this.
These guys don't get to have it both ways.
So hand-waving about how easy it is to have an MVP in days w/o actually experience in doing that seems ironic.
Now, maybe he's saying this based on companies he's funded who've had great success with what he's saying. But it's curious that the only concrete example of a company mentioned is one that's six years old and not operating like that. And in fact, many of the ways he thinks that company went wrong seem completely unrelated to AI?
> Chris is now starting to raise his first large fundraising round. In looking at his investor deck I realized that while he’s been heads down, the world has changed around him – by a lot. The software moat he built with his 5-year investment in autonomy development is looking less unique every day. Autonomous drones and ground vehicles in Ukraine have spawned 10s, if not 100s, of companies with larger, better funded development teams working on the same problem.
> While Chris has been fighting for adoption for this niche market (one that is ripe for disruption, but the incumbents still control), the market for autonomy in an adjacent market – defense – has boomed. In the last five years VC Investment in defense startups has gone from zero to $20 billion/year. His product would be perfect for contested logistics and medical evacuation. But he had literally no clue these opportunities in the defense market had occurred.
> While there’s still a business to be had (Chris’s team has done amazing system integration with an existing airborne platform that makes his solution different from most), – it’s not the business he started.
"Being heads down without paying enough attention to the market for 6 (!!) years" doesn't seem like an AI-caused issue.
Meanwhile, the core suggestion doesn't seem to fix that, it seems almost completely perpendicular.
> You can now test multiple versions of the same business at once (or simultaneously be testing different businesses). While you can be simultaneously testing five pricing models, ten messages or twenty UX flows, the “user interface” may no longer be a screen at all. Testing might be to find prompt(s) to AI Agent(s) deliver needed outcomes.
Ok, but this person didn't even seem to be doing enough paying to the market of one version already?
And while this claim about parallel development being a huge unlock is the most interesting thing, it also sounds a bit glib. Getting your foot in the door is the hardest thing early on, now you're trying to run six versions of your company at once? Each time you get a foot in the door sales-wise, are you trying to make them use all 6 versions, or are you only gonna get feedback on 1? Would you want to pay money to be a beta tester of 6 different products simultaneously, with reason to believe that 5 of them will probably evaporate over night soon?
Seriously, where do ideas like this come from? An "ultracapitalist" country that has about as much redistributive social spending as other developed economies[0]? A "dystopia" that millions of people from all over the world clamor to get into every year?
[0]: https://www.piie.com/publications/policy-briefs/2016/true-le...
The man is an economist, not a crony operating at the federal level (one does not imply the other, and I know nothing of the man's background).
I don't agree with the parent; I think capitalism is doing a lot of great things for us and will continue to, even with AI. But man I'm tired of these hot takes from people with limited practical experience.
I hear you there. There's a reason I limit what I say online. I know very little about very little.
One year ago models could barely write a working function.
One year ago, the models were only slightly less competent than today. There were models writing entire apps 3 years ago. Competent function writing is basically a given on all models since GPT3.
Much of the progress in the past year has been around the harnesses, MCPs, and skills. The models themselves are not getting better exponentially, if anything the progress is slowing down significantly since the 2023-2024 releases.
That has not been my experience. This weekend I pointed Claude Code+Opus 4.6+effort=max at a PRD describing a Docusign-like software. The exact same document I gave to Claude Code+Opus 4.5+Ultrathink around 6 months ago.
The touch-ups I needed after it completed implementation was around a tenth that it took with 4.5. It is a pretty startling difference.
They still do dumb shit from time-to-time, but it's getting rarer.
It takes far less manual prompting to make it have consistent output, work well with other languages, etc. But if you watch the "thinking" logs it looks an awful lot like the "prompt engineering" you'd do by hand back then. And the output for tricky cases still sometimes goes sideways in obviously-naive-ways. The most telling thing in my experience is all the grepping, looping, refining - it's not "I loaded all twenty of these files into context and have such a perfect understanding of every line's place in the big picture that I can suggest a perfect-the-first-time maximally-elegant modification." It's targeted and tactical. Getting really good at its tactics for that stuff, though!
I can get more done now than a year ago because taking me out of the annoying part of that loop is very helpful.
But there's still a very curious gap that the tool that can quickly and easily recognize certain type of bugs if you ask them directly will also happily spit out those sorts of bugs while writing the code. "Making up fake functions" doesn't make it to the user much anymore, but "not going to be robust in production but technically satisfies the prompt" still does, despite it "knowing better" when you ask it about the code five seconds later.
If a metric goes from 0 to 2 it doesn't mean it's on a long-lived exponential trajectory.
Have you even seen 2001 A Space Odyssey? That was filmed by Kubrick also.
This is a false claim.
Claude Code was released over a year ago.
Models have improved a lot recently, but if you think 12 months ago they could barely write a working function you are mistaken.
Even if a lot of the improvements we see today are due to things outside the models themselves -- tools, harnesses, agents, skills, availability of compute, better understanding of how to use AI, etc. -- things are changing very quickly overall. It would be a mistake to just focus on one or two things, like models or benchmarks, and ignore everything else that is changing in the ecosystem.
We can’t say for sure yet which trajectory we are on.
90% of blog articles created in the last two years are probably dead on arrival
There's live-coding, so it's not totally a crazy idea.
Not sure how to make that a platform, as when i wrote i explicitly put everything directly into a book unedited, whereas for many people, the editing is probably at least half if not more of the time they spend writing.
Or we could just bring back Google Wave :-)
jimkleiber.com/project-35 if you’re curious.
> The bottleneck is no longer engineering. It’s moving up the stack to judgment, customer insight for desired outcomes and distribution.
He's not saying "add AI to your product" or "use AI or die" but more that AI has shifted institutional assumptions about tech stacks, defensibility and fundability. The bottleneck moved up the stack from engineering to judgment, insight and design.
Chris lost because he was heads down building while $20B in defense VC was flowing into his exact problem space and he didn't build the boat to capture that wave.
The big pinch of salt I throw in with advice like this though is that startup failure rate hasn't dramatically shifted despite two decades of lean startup methodology, accelerators, and an entire cottage industry of startup advice. It's never the fault of the framework, mind you.
edit: Tobi from Shopify has an insight that relates. His north star metric is user churn. sounds crazy on its face. he's known for that. But increasing churn means you've increased top line exposure to more would-be entrepreneurs. Not all of them will succeed, but shopifys mission is to create more entrepreneurs. Grow the pie. A focus on increasing conversion tends to have a narrowing effect.
If you have good product idea, the methodology to get there mostly affect profit marigins, not whether it will be success or total failure
What happens is that the original idea rarely matters at all. It is the people that implements the idea what matters.
The original idea is almost always terrible, but great people pivot or change the idea gradually while having contact with reality.
Launching a product was never the finish line; it was always the start line. But technical founders could trick themselves into thinking that building a product was building a business.
The same "Lean Startup" rules apply. Build something and get it in front of real people who will pay you for the thing. If they won't, back to the drawing board.
The only real "shift" I've seen is that most startups don't actually need VC at all. That's a great thing.
You’ve always needed to constantly learn and innovate to launch a successful business.
Startups are mostly all default dead. That's why they need VC money.
Of all the things that AI has changed, tech stacks aren't one of them. The bots will gladly write Typescript, Java, Python, Rust, what have you. They could not give less of a shit.
What is he getting at? How does the code and infra stack differ at all between a company that is using AI, vs one that is not?
Build vs. buy is an eternal question in enterprises. I remember many in-house data teams trying to build tools for "digital transformation" and cloud migration about 10 years ago. The challenge was, building those tools was more expensive than those enterprises could budget for (IT as cost center), so a startup like Snowflake would easily outcompete in-house solutions with their custom, cloud-based tech stack that was necessarily complex because it needed to serve the needs of thousands of customers.
If he's right, the build vs. buy equation has shifted more towards build, at least as far as enterprise software is concerned. IT is still a cost center, but in theory an internal team can now handle more requests for custom tools without looking to outside vendors. Essentially the cost of building in-house might be collapsing and therefore enterprise software startups will be serving fewer customers (who would all pay you more because if solving the problem was cheap they'd do it).
If you had to build a stack for dozens of customers paying huge amounts of money, how would that stack differ from the stack you'd build to serve thousands of customers? Certainly it wouldn't need to be as scalable! And that's probably what he's getting at. I think what you'd do instead, to capture those higher price point customers, is solve their problems more specifically, in a higher value manner.
Many companies already do this, investing far more in field engineers than they do in their tech stack, since customization is essential.
It is also not the same as, "If you want to be a profitable company...". For that you need to somehow make more money than you are spending.
I've had this idea of 'business as reducing entropy' floating around in my head for awhile. It's a neat way to think about the value a business offers to buyers; a washing machine manufacturer is selling reduced time to reduced entropy (clean cloths), spreadsheet software is selling reduced time to understanding (information from tabulated data), and so on.
From that perspective, a lot of AI-driven development is failing.
We're still in the phase of 'how do we get order out of semi-average chaos?' for LLMs. For ML we're largely past that point.
I've been using this framing as a means to guide me towards 'what is actually useful, what might someone actually buy'. I don't have my own business at this point, but its still fun to think about off and on.
You could generalize this to the purpose of life itself, probably.
My posit is this: engineering never was the bottleneck, or at least hasn't been for 10 years now. Frameworks and best practices are pretty well known at this point. AI is simply exposing this reality to engineers' faces.
Proof point - most publicly traded SaaS first businesses S&M equals their R&D spend, if not dwarfs it. You're going to see this even more lopsided going forward.
VC for conventional SaaS is dead.
I disagree with this.
On pricing, I get that agents and tokens can scale in a way that's unrelated to # of users. But for much SaaS software, AI remains helpful to a human and the human remains the receiver of value. Seat-based pricing is easy to understand and you can always layer in token/agent costs thresholds.
On features vs. outcomes, the latter is hard to define and measure in many industries. In marketing SaaS, which I know well, you can't often tell what outcome to expect. You have to try a lot of ideas and some will hit. No way a SaaS vendor can guarantee that.
Not sure I agree with this train of thought, but a SaaS CEO made this exact argument to me last week.
The author didn't spend more than, maybe, 30 seconds thinking this through? Information I could've gotten in 3 seconds by opening a screen and looking at a line item, I now have to extract by writing a paragraph to an AI agent (and cross my fingers that nothing I said was ambiguous or misunderstood). And that's supposedly an upgrade?
That said, if you believe universe exists, chances are not null that you are correct. But solipsism might actually be right.
In case of doubt, remember that your memory might be mere illusions.
he obviously generated large parts of this with claude or chatgpt or whatever.
i don't know what that means for the rest of us, but boy it's a big ole spike of signal
This trap has killed many startups, well before AI.
Now that code is cheaper to write, hopefully it becomes less of a problem?
In either case, founders should never fall in love with their solutions.
It also fails to convey that he's actually only talking about startups that were created 2+ years ago, rather than the many AI startups founded in the last 2 years.
Hahahahahahahaha no you can't. The rise of LLMs has done little to nothing in this area because it's very much compute-limited. Digital-twins and other ML-based strategies predate ChatGPT by a long shot. There are definitely places in hardware design where LLMs and agentic workflows will help, but that's largely because the existing tooling is utter garbage, and now the industry has a fire under its ass to make things automatable so they can build their own agents.
Chris' company's assumptions are no longer true, but that doesn't apply to everyone's startup. This is mostly a Chris problem.
No, not every product can just be a chat window like in the silly little screenshot.
If the author actually wrote software they'd realize that, no, AI isn't speeding up development by any more than a modest amount. It's great that we have it and it's removing tedium but it has replaced zero engineers at my company or at any other company of anyone else I know.
And no, your company laying off some people isn't because of AI, your startup idea not getting funding is not because of AI, it's because we've been in a regular old recession which is now a developing oil crisis. Interest rates aren't 0% so nobody wants to lend money to infinite startups.
Now with AI, it is likely going to be 98%.
After AI: 4900 of 5000 fail (98%), 100 succeed
Like this?
2021-2024: good time in US for EV startup
2025: terrible time in US for EV startup
2026 March/April: AWESOME time in world for EV startup
focus on fundamentals, not flakey ephemerals
2020: wise to have smart elite software engineers on your team
2021: ditto
2022: ditto
2023: ditto
2024: ditto (is this when ChatGPT launched? dont care. snore)
2025: ditto (what are YC/HN/VC hyping now? snore)
2026: ditto
2027+: ditto, likely
In that case, yes their startup is most certainly DOA.
>Most AI users out there are Q&A'ing it and they have no idea what agents, tool calling or context compaction are.
Again, talking about a tech CEO not a random "AI user."
- The AI jump happened in Q3-Q4 2025 with Opus 4.5 so it's been six months or so? Not long enough.
- Most developers out there use AI for their coding work, not for re-envisioning business models.
Going to VCs with a 2 year outdated deck with no AI functionality or plans or tool use is unimaginable.
As to my original point I'll go and say that 95/100 of business owners out there in the world don't have any idea about Claude Code, Codex or anything of that caliber. It's too early. By the time that group gets to it there'll be tools tailored to their needs, not a terminal-based coding agent messing up filesystem.
You keep returning to this which is a strawman of my argument. From the start I have been talking about technical startup founders, which is also the subject of the blog post.
I'm not assuming anything, that is my argument. I think you could do a better job countering it than pointing at the status quo of every business in the world. lol.
Me: 'cutting edge startups should use cutting edge tools'
You: 'every business in the world isn't using cutting edge tools!'
If your business is selling services at 40% margin that are entirely digitally based, then maybe you’ll need to cut some margin, sure.
https://en.wikipedia.org/wiki/Washio_(company)