Business - @stuartgh

The MVP Was the Easy Part

Steve Blank has been teaching Stanford’s Lean LaunchPad for sixteen years. This year, for the first time, teams walked in on day one with MVPs that looked finished, not rough prototypes, not wireframes, but finished-looking products. “I’ve been writing about how AI is going to change startups, but seeing 8 teams actually implementing it was mind blowing,” he admitted. What previously took weeks to build now appeared almost overnight. Blank’s conclusion? “After the class, as the instructors sat processing what just happened, we realized there’s no going back.” He’s right, but he may be understating what we’re going back from. The friction AI has now removed wasn’t just annoying; it was doing a job.

When building was hard, weak assumptions had somewhere to get caught. A team had to spend real time, real money, and real energy turning an idea into something testable. That process created natural points of resistance, those moments when someone might pause and ask whether the problem was actually real, whether users genuinely cared, whether the customer was clear. The irony is that AI has cleaned up the mess, but in doing so it may have removed the mechanism that made the mess useful.

Jim Hornthal, of the Berkeley-based Haas School of Business, commenting on Blank’s post, put it simply: “Product development is no longer the primary bottleneck.” Teams can produce functional MVPs in days. But the hard constraint, he notes, hasn’t changed at all, it’s still customer discovery, validation, and the search for a repeatable business model. The “AI that still matters most,” as he drily puts it, is “Actual Interviews”. In his reply Netherlands-based Joost Okkinga added a key observation that sharpens the whole problem: “AI makes teams feel further ahead than they are. The product artefact improves faster than the evidence base. So the discipline needs to move upstream: map assumptions first, then use AI to accelerate the right tests.”

A startup founder can now generate a landing page, a prototype, a pitch deck, a feature roadmap, a user onboarding flow, and a full set of customer personas before they have properly tested whether the key assumption is true. The results may look sophisticated; it impresses casual observers, and it may even impress the founder themselves. But underneath it the core thinking may still be fragile, and now thanks to AI it’s harder to see that, because the surface is so tidy and clean looking.

A line graph comic titled "THE MVP WAS THE EASY PART" plotting a founder's confidence and mood over time. The curve goes up during the "BUILDING (DOPAMINE)" phase, which features captions like "Oh, random idea at 1 AM" and "14h: launch tweet scheduled 🚀". At the peak, the line turns red and plummets into the "DISTRIBUTION (CHARACTER DEVELOPMENT)" phase. Captions on the downward slope track growing realization over 45 days, including "4 days: 17 visitors. 9 are me" and "21 days: becoming a linkedin thought leader against my will," ending with a character lying defeated on the floor with the text "discovering that distribution was the startup." Underneath the phases, text reads: "AI GOT OUR BACK" for building, and "GOOD LUCK 💀" for distribution.

Credit: Dmitry Trofimets (https://www.linkedin.com/in/dtrofimets/)

When Gaps Become Outputs

There is a subtler version of the same problem that most teams discover the hard way. When AI becomes the most frequent reader of your instructions, documents, and context, the assumptions you never thought to write down stop being harmless gaps and start becoming inputs. A human colleague fills in what is missing from shared experience, background knowledge, or through a quiet word across a desk. A model does not ask for a discussion. It works with what it is given, and where something is missing it may not pause; it proceeds. As a result the gap does not stay a gap; it is baked into the output. This is why working with AI does not just reward precision. It also exposes every step you assumed did not need saying.

Research-Looking Is Not Research

The same dynamic is playing out in scientific research, and in some ways it’s more alarming there. A recent paper on AI-generated research “AI for Auto-Research: Roadmap & User Guide” describes systems that can produce a complete paper including idea generation, literature search, code, experiments, charts, manuscript, simulated peer review, and rebuttal all for as little as $15. One system ran for 228 hours, used 11.4 billion tokens, and produced 100 papers. Another reportedly ran more than 20 GPU experiments overnight and improved a draft score from 5.0 to 7.5 through automated review and revision loops.

The striking thing isn’t the $15 figure. It is the paper’s deeper warning: a research paper can now carry a clean title, a polished abstract, organised sections, good-looking figures, citations, experiments, and a confident conclusion, while the science underneath remains fragile. The code may run while testing the wrong thing. The idea may sound original until someone tries to implement it. To quote the paper’s authors: “The core challenge is therefore no longer whether AI can produce the forms of research, but whether it can preserve the substance of research: evidence, judgment, provenance, and accountability.”

In both cases, from startups to research labs, it’s clear that AI is lowering the cost of producing something that looks complete while raising the cost of knowing whether it actually is fit for purpose. The bottleneck doesn’t disappear; thanks to AI it moves, from production upstream to validation, and it becomes harder to see because the downstream MVP or scientific paper no longer carries the obvious marks of imperfection that used to signal it.

Marketing Activity Is Not Evidence

The same confusion can show up in marketing, where AI agents can generate activity faster than teams can interpret whether that activity means anything commercially. In the crypto gaming sector, one of the first campaigns using AI agents has delivered results for Whale.io.”This MCP campaign featured over 15 AI agents that collectively generated more than $900k in volume,” reported igaming expert Adar Ziv this week. “While some use Claude and Cursor to find bugs in smart contracts, break bridges, and earn bug bounties, others have taken a more entertaining route,” he added.

And at first glance, that looks like evidence of success, that a well-designed incentive campaign can produce high transaction volume. But in this case, the key question is not “did the agents produce volume?”, the better question is: “what did that volume prove?” Did it reveal durable demand for agentic igaming, or did it show that agents can be directed to move capital rapidly inside a nicely designed incentive loop? Both are interesting, but they are not the same game. The danger is not that AI gets things wrong. It is that AI produces valid-looking evidence at speed, allowing teams to mistake accelerated activity for validated demand, or business value.

Mission-Ready Is Not Flight-Ready

Space technology is where this problem becomes most unforgiving, and where the stakes of misreading an AI-generated output are hardest to recover from. In ordinary software, a weak assumption may surface as churn, wasted roadmap time, or a failed feature — costly, but usually recoverable. In space, the same kind of hidden dependency can become a missed launch window, an expensive redesign, a failed payload, or a lost mission entirely. NASA’s own systems-engineering guidance warns that systems can “pass verification but fail validation,” and its modelling and simulation standards require assumptions, limits of operation, and uncertainty to be made explicit before decision-makers rely on simulation outputs. That is a formal acknowledgement of something engineers have long understood: a model can be internally consistent, well-documented, and technically impressive while still depending on assumptions that have not been properly tested.

If AI can help space tech teams generate cleaner mission plans, simulations, technical documentation, and design options earlier in the process, it can also make those plans look more compelling before the load-bearing engineering assumptions have been stress-tested. The simulation runs, the outputs look plausible, the documentation is polished, and funding is being sought. But coherence is not the same as correctness. A model can look rigorous precisely because it is self-consistent, while still being wrong in the way that matters. In space, discovering that distinction late is not a sprint retrospective. It can be an expensive failed mission. Surfacing assumptions before building on them is therefore not a methodology choice. In space, it is an engineering necessity. The first obligation is not to pick the right assumption to test, but to surface the assumptions behind the simulation, mission plan, or design case. You cannot rank risks you have not yet surfaced.

A Stitch in Time

This is where I believe the Needle Framework earns its keep, not as another productivity layer or a better prompt pack, but as a way of forcing the hidden logic into the open before AI accelerates everything built on top of it. Its value is not that it magically knows the right answer. It is that it asks the awkward prior questions: what does this plan depend on being true, what has been assumed rather than evidenced, and which dependency would hurt most if it failed?

AI does not automatically improve judgement. In fact, it can all too easily disguise the lack of it. It can produce a fluent explanation, a polished simulation, a confident-looking research paper, or a finished-looking product before any of the assumptions underneath have been properly tested. That is the real risk: not that AI gets things wrong, but that it makes wrong things look right, faster than ever before. When people are forced to explain their logic step by step, weak assumptions start to show themselves.

The Needle Framework is a discipline for making that happen earlier: before the MVP looks finished, before the paper sounds convincing, before the campaign dashboard fills up, and before the simulation becomes a source of false confidence. Because AI is increasingly good at giving us what we want. The harder task is working out what we actually need.

Before you speak to an investor or get feedback from a customer your business may already have been read, analysed — and judged.

Before an investor opens your deck, before a customer compares you to a competitor, before a potential partner decides whether a meeting is worth their time — something has already been there. It has read what you’ve published, compared your model to thousands of others, compressed your entire value proposition into a handful of signals, and formed a view. That view now shapes what comes next, almost always without your involvement, and almost always before you’ve said a word.

Most business owners, when they hear this, assume the risk is that AI might get things badly wrong — a hallucination, a fabricated fact, a botched summary. In practice, that’s not where the real danger lives.

The problem isn’t when AI gets it wrong. It’s when AI gets it almost right

AI doesn’t just read what’s there. It has to resolve what it sees into a coherent version of your business. If your materials leave gaps — things implied but not clearly stated, logic that isn’t fully connected, assumptions that are never made explicit — it doesn’t pause and ask for clarification. It fills those gaps itself, confidently, based on what similar-looking businesses usually look like.

The result gets passed along. It becomes the mental model investors carry into the room, the one you’re now working against rather than building on. And because it sounds right — organised, plausible, no obvious red flags — it’s surprisingly hard to correct once it’s formed. You’re no longer defending your business model. You’re defending it against a version of your business model that someone else finds more believable.

This is the shift that matters: an untested assumption used to be an internal risk. A gap that might surface during due diligence, in a hard conversation with a board member, or in the market eventually. Now it becomes an external perception risk almost immediately — because AI reads your business, draws the inference you left implicit, and distributes that reading to anyone who asks, before you’re ever in the room.

What this looks like when you’re sitting in the meeting

Take a scenario most founders will recognise. A company positions itself around customer retention as its core value driver — it’s in the copy, the investor updates, the product narrative. But no one has been fully explicit about what retention actually means here: what the signal is, what the benchmark is, what the underlying mechanism relies on.

Externally, an AI has to decide what retention means in this context, and it decides based on what retention usually looks like in comparable businesses. So you walk into a conversation where someone has already formed a view. They’re not hostile. They’re not confused. They just have a slightly wrong picture in their head, and neither of you knows it yet. You spend the first twenty minutes of a meeting you needed to go well quietly realising you’re not building on a foundation — you’re correcting one.

The iGaming industry knows this problem intimately. A casino operator spots what looks like a valuable VIP segment: strong spend, consistent behaviour, a pattern that looks like genuine loyalty. They build reward structures and marketing investment around it. But the signal was distorted from the start: multiple accounts that were, in reality, the same person. The dashboard still looked fine. All the numbers still looked actionable. But the strategy was built on a false assumption, which meant wasted spend, misplaced confidence, and the long grind of trying to improve performance using data that was never telling the whole truth. The gap wasn’t obvious from the inside — it never is. But it would have been visible immediately to anyone reading the business from the outside.

The same process that exposes you can protect you

Here’s where most people stop — at the risk. But the more important realisation is that the same mechanism works in your favour, if you use it first.

A few teams have already worked this out. VENDOR.Energy, a deep tech company navigating complex investor due diligence, didn’t leave their interpretation to chance. They built a custom evaluation prompt, a structured set of instructions that tells any AI analysing them what to read first, in what order, and what conceptual framework to apply before drawing conclusions. They’re not waiting to be misread, they’re briefing the reader before it reads.

You don’t need to be in deep tech for this approach to work. The principle is the same for any business: use AI to analyse your own materials before anyone else does. Watch where it makes assumptions. Notice where it fills in gaps you didn’t know you’d left open. Find the assumption it confidently makes that isn’t actually true — and then decide whether to close that gap in your logic, your communications, or both.

The value of doing this isn’t primarily about controlling the message. It’s about seeing your business the way everyone else already is. Founders are too close to what they’ve built to spot the assumptions that are load-bearing but untested. An AI reading your materials has no such familiarity. It just reads what’s there, infers what isn’t, and hands you back a version of your business that might be the most honest outside perspective you’ve ever received.

What actually changes the outcome

Most instincts here run toward a content fix — better copy, a cleaner whitepaper, tighter messaging. Those things matter at the margin. But the more fundamental question is: what does this business actually depend on?

Almost every strategy is held together by a small number of assumptions. Often one that matters more than the rest — the thing that, if wrong, makes the rest of the logic stop holding. The purpose of finding it isn’t to write a better pitch. It’s to stress-test whether the commercial logic is actually sound, without the over-familiarity that comes from having built the thing yourself.

Once it holds under your own scrutiny, communication becomes straightforward. And once it’s communicated clearly, the reading that circulates — through AI tools, through analysts, through anyone who encounters your business before they meet you — is far more likely to be one you’d recognise.

That’s the real opportunity. Not reputation management. Not better positioning. The chance to see your own business more clearly than you have before, fix what doesn’t hold, and walk into every room knowing that the version of you that arrived first is one you shaped.

The only question is who reads your business first

For a long time, controlling the narrative meant being good at telling your story. That still matters. But the version of that control that holds up now, in a world where AI is reading your business before most humans do, comes from having a logic that’s clear enough and tested enough that it doesn’t depend on your presence to be understood correctly.

Your business is already being read, compared, and broken down at speed, by tools being used by the people whose decisions matter most to you. The only question is whether you’ve reviewed your own business first, and whether what you found made you stronger or just more surprised.

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

@stuartgh

my thoughts on #crypto #blockchain #AI

Category Archives: Business

AI Is Giving Us What We Want, Not What We Need

Featured