Alchemy

Part VI · The Investment Thesis

VI.C — The Risk Surface

12 min read · 2,249 words

The thesis rests on three assumptions about the transition: that cognitive capability commoditizes, that actuation constraints persist, and that the shift unfolds gradually enough to allow repositioning. Each can be wrong. Beyond these, the thesis can fail to generate returns even if the assumptions hold—through infrastructure that does not emerge, capital cycles that punish correct theses with poor timing, or valuations that overshoot before reality confirms the framework.


Consider how the first assumption might fail.

A frontier lab releases a model that outperforms competitors on every benchmark. Within six months, three other labs match or exceed its performance. Prices collapse. The model layer commoditizes as predicted.

But the customers do not switch.

Why not? The switching costs are not in the model. They are in the evaluation data accumulated through deployment—data that tells the enterprise which edge cases the model handles well and which require human review. They are in the compliance certifications already obtained, the workflow integrations already built, the institutional trust already established with procurement, legal, and security teams. Usage generates evaluation data; evaluation data improves reliability; reliability enables compliance deployment; compliance deployment generates more usage. The loop compounds faster than the capability diffuses.

If one or two frontier providers achieve escape velocity through these loops, the margin pool stays upstream. The contrarian bet on actuation underperforms—possibly permanently. The thesis was correct about capability commoditization but wrong about where value accrues.

The tell is customer behavior when superior alternatives appear at comparable price. Switching indicates commoditization proceeds. Retention despite superior alternatives indicates something else generates lock-in. Enterprise renewal rates, price-performance spreads between frontier and open-weight alternatives, and whether frontier labs pursue vertical integration into compliance and liability layers all provide signal. If OpenAI and Anthropic build their own actuation infrastructure, they concede that the model layer alone does not capture sufficient value—a concession that validates the thesis.


The second assumption—that actuation bottlenecks remain durable—might loosen faster than expected. Robotics could advance discontinuously. Regulators could converge on permissive standards. Liability frameworks could evolve to accommodate agent participation. If institutions adapt at the speed of software rather than the speed of law, actuation assets become bets on temporary constraints.

The question is speed relative to capital commitment.

A power generation asset takes three to five years to construct and operates for thirty to fifty. If the energy bottleneck persists for two decades, the investment compounds. If fusion economics arrive in five years, or if solar-plus-storage undercuts grid pricing faster than expected, the asset enters a market that no longer resembles the one that justified the commitment. Capital is locked. The lesson arrives after the decision is irreversible.

The same calculus applies to every long-duration actuation investment: fabrication facilities, logistics networks, sensor infrastructure, regulatory licenses. Each commits capital to assumptions about bottleneck durability. The longer the time-to-build, the greater the exposure to regime change during construction.

What signals would indicate the analysis is wrong? Permitting timelines shortening rather than lengthening—ERCOT’s interconnection queue has surged past 100 GW, with projects spending years in study phases, and compression of these timelines would signal the physical bottleneck is clearing. Licensing frameworks converging on permissive standards across jurisdictions. Early liability rulings establishing precedents that enable scaling. Insurance products for agent operations reaching mainstream adoption with reasonable pricing rather than bespoke, capacity-constrained underwriting. Each can be tracked before committing to twenty-year assumptions.


The third risk differs in kind from the first two.

Wrong about commoditization: integrated platform companies outperform expectations. The error is recoverable for liquid capital. Wrong about bottleneck durability: actuation assets underperform. Also recoverable. Positions adjust. Lessons apply forward.

Wrong about transition speed in the direction of acceleration: a breakthrough creates agents that bypass institutional constraints on identity, permissions, settlement, and liability. The assets that seemed durable turn out vulnerable. The positioning framework becomes obsolete overnight. This is not a recoverable error from which lessons transfer forward; it is a discontinuity that invalidates the framework itself.

The signal is the gap between demonstrated and deployed capability. GPT-4 became available via API in March 2023; broad enterprise deployment with compliance certification required roughly a year. If deployment lag for subsequent frontier models compresses below six months, or if agentic deployment expands to high-stakes domains (financial transactions, procurement, legal filings) faster than institutional constraints would predict, the transition is accelerating beyond the speed at which capital can reposition.

Long-duration build assets face the hardest calculus. Capital cannot be redeployed. Weak balance sheets force sales at distress prices. Strong sponsors survive the dislocation and acquire from those who cannot.

The only partial hedge is diversification across the assumption set: some exposure to frontier capability, some to actuation assets, some to liquidity.


Beyond the load-bearing assumptions, three parameters shape the risk surface for any specific position.

Time-to-build measures how long it takes to create the asset. A software product can be built in months. A data center takes two to four years. A chip fabrication facility takes five to seven years. A nuclear power plant takes a decade or more, assuming permits are granted at all. Long time-to-build creates both risk and opportunity: the world may change before the asset is complete, but competitors cannot replicate quickly, and scarcity persists because supply cannot respond to demand signals in real time. Short time-to-build assets face rapid competition but allow quick repositioning. Long time-to-build assets face less competition but commit capital to assumptions that may not hold.

Regulatory convexity measures how regulatory outcomes affect asset value. Some assets have symmetric exposure; others have convex exposure where favorable regulation helps disproportionately, or vice versa. Power generation has convex upside: favorable policy—accelerated permitting, grid investment—dramatically increases value, while unfavorable policy diminishes but does not destroy it. The asymmetry favors the long position. Autonomous systems have convex downside: favorable regulation enables rapid scaling, but unfavorable regulation—strict liability, state-by-state fragmentation—constrains deployment indefinitely. The asymmetry favors waiting for clarity.

Balance-sheet capacity measures the ability to absorb losses through adversity. Infrastructure assets require sponsors with substantial capacity: utilities, sovereign wealth funds, large private equity. Cognitive applications can be funded by venture capital with smaller commitments and faster feedback. When assets require more capacity than current owners possess, distress sales occur. When sponsors have more capacity than opportunities require, they bid up prices. The thesis creates both dynamics: cognitive assets may face pressure as margins compress, while actuation assets attract premium valuations as capital rotates.


For any specific investment, the three parameters yield a set of questions.

What is the time-to-build relative to the expected transition timeline? If the asset takes longer to build than the window of scarcity, the position is exposed.

What is the regulatory convexity? Does favorable regulation help more than unfavorable regulation hurts, or vice versa? The asymmetry determines whether to enter now or wait for clarity.

Does the sponsor have balance-sheet capacity to sustain the position through adversity? Long time-to-build assets in the hands of weak sponsors become forced sellers. Strong sponsors can acquire at distress prices.

What would have to be true for this position to fail? If the answer is “the thesis is wrong,” the position is undiversified. If the answer is “specific execution failures by this management team,” the position has idiosyncratic risk that can be underwritten or hedged.


Even if the three assumptions hold, the thesis can fail to generate returns.

The Bitcoin yield curve—the benchmark rate that would collapse O(N²) bilateral negotiations to O(N) benchmark-referenced contracts—may not emerge. The infrastructure may not be built. Liquidity may not concentrate. Legal and regulatory environments may prevent the benchmark from achieving the status necessary for widespread adoption. If this happens, agent-mediated markets remain shallow. Multi-period contracts continue on bilateral terms, or they do not happen at all. The positions that would capture benchmark-rate infrastructure capture nothing because the infrastructure does not exist.

Or the thesis becomes consensus before playing out. Capital floods into the theme. Valuations rise to levels that discount decades of future value creation. Reality disappoints—even slightly—and valuations collapse. The thesis was correct; the trade lost money. The language of “picks and shovels,” “infrastructure,” and “actuation bottlenecks” already circulates. If rotation into actuation assets is too aggressive, the underlying thesis may prove correct while investments underperform.

The hedge against the first is monitoring infrastructure emergence: liquidity concentration, benchmark adoption, legal precedent. The hedge against the second is valuation discipline. If data center REITs trade at capitalization rates implying 15%+ annual demand growth for a decade, the narrative has overshot; if they trade at rates implying 5-8% growth with reasonable terminal assumptions, margin of safety remains. The thesis identifies where value flows. It does not justify any price.


A related risk operates at the level of capital cycles rather than individual valuations.

Carlota Perez’s framework describes technological transitions in two phases: installation, when capital floods toward the new paradigm faster than deployment can absorb it, and deployment, when infrastructure matures and returns normalize. The transition typically involves a crisis—installation-phase valuations colliding with deployment-phase realities. The dot-com crash is the canonical example: the thesis was correct; the timing was catastrophic.

The Factor Prime transition is susceptible to this pattern. Installation-phase dynamics are visible: capital flooding toward AI infrastructure, extraordinary valuations for frontier labs, discourse dominated by capability rather than deployment. The deployment phase has not arrived. Investors who commit at installation-phase valuations may hold positions requiring patience they do not possess.

The risk compounds through feedback. Rising prices attract capital; the cycle continues until disappointment triggers reversal; the reversal overshoots. Assets overvalued at peak become undervalued at trough—often precisely when deployment begins and long-term returns are most attractive. Buying at installation peak and selling at deployment trough captures the thesis’s worst possible expression.

The hedge is calibrating exposure to the cycle. Early installation: smaller positions, optionality, readiness to add at crisis. Late installation: caution despite momentum. Post-crisis deployment: conviction despite pessimism, recognition that the trough is where compounding begins.


The three assumptions above concern what might go wrong. A separate question is which elements of the framework rest on durable foundations and which rest on fragile conjectures.

Durable physics: Landauer’s limit sets a floor on computational energy cost that no engineering can breach. Thermodynamic depth accumulates through irreversible computation and cannot be faked. Lead times for infrastructure are measured in years because the physical constraints are real: concrete cures at a fixed rate, transformers require specialized manufacturing, permitting involves sequential review. Geology determines where energy endowments exist. These claims rest on physics. Policy shifts and market dynamics cannot alter them.

Durable incentives: The hurdle rate created by Bitcoin mining persists as long as proof-of-work operates at scale. Selection gradients distinguish productive computation from waste—a structural feature of markets, not a choice. The O(N²) coordination problem for bilateral credit negotiations is mathematics; a common benchmark collapses it to O(N) regardless of which asset serves as the benchmark. These claims persist across institutional variation.

Fragile conjectures: The speed at which the Bitcoin term structure emerges depends on coordination that may or may not occur. Liability frameworks may evolve permissively, restrictively, or not at all. Regulatory adaptation may accelerate through crisis or stall through gridlock. The competitive dynamics among frontier labs may produce commoditization or durable concentration; the outcome depends on execution, not structure. These claims rest on institutional evolution that cannot be derived from first principles.

Positions built on durable physics can be held with long time horizons and high conviction. Positions built on durable incentives require monitoring for structural changes but can tolerate institutional noise. Positions built on fragile conjectures require optionality, hedging, and readiness to reverse.

The indicators that distinguish installation from deployment are observable. Valuation basis: installation rests on revenue multiples and TAM projections; deployment rests on cash flow multiples and demonstrated unit economics. When the conversation shifts from “how big could this be” to “what does this earn,” the phase is turning. Dominant investor type: installation capital comes from venture and growth equity with short fund lives; deployment capital comes from infrastructure funds and pension systems with long horizons. When pension funds displace venture capital as the marginal buyer, the phase is turning. Regulatory posture: installation is permissive and exploratory; deployment is prescriptive and liability-bearing. When the first major liability ruling holds deployment stacks accountable, the phase is turning. Labor market: installation exhibits talent scarcity premiums; deployment exhibits capability abundance and wage normalization. When AI engineering becomes a commodity skill, the phase is turning. Public narrative: installation emphasizes transformation; deployment emphasizes utility. When the discourse shifts from wonder to implementation, the phase is turning.

Current readings suggest early-to-mid installation: venture dominance, revenue multiple valuations, exploratory regulation, talent scarcity, and transformation narrative. The deployment phase has not begun. Capital committed at current valuations must survive the installation-to-deployment transition—including whatever crisis marks the turning point.


The risk surface is asymmetric.

Recoverable errors: cognitive moats persist longer than expected; bottlenecks loosen faster than expected. Liquid capital adjusts. Long-duration assets face harder arithmetic—the lesson arrives after commitment is irreversible—but the framework survives.

Non-recoverable error: transition accelerates discontinuously. Positioning strategies built on gradual adjustment become invalid. This is not an error within the framework; it is a failure of the framework itself.

Full hedges against regime change do not exist. Partial responses: liquid reserves, diversified exposures, information advantages that provide early warning, and the capacity to update beliefs as evidence accumulates.