The deployment. How the AI labs verticallyintegrated into the serviceslayer — the Palantir modelat scale.

📊 Full opportunity report: The deployment. How the AI labs verticallyintegrated into the serviceslayer — the Palantir modelat scale. on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

In early May 2026, Anthropic and OpenAI announced large-scale investments to embed engineers directly into client operations, adopting Palantir’s model. This move aims to control the deployment process, deepen enterprise lock-in, and capture more value from AI adoption.

In early May 2026, Anthropic and OpenAI announced major initiatives to embed their engineers directly into client companies’ operations, adopting a model inspired by Palantir’s forward-deployed engineer approach. This shift marks a strategic move by the AI labs to control not only the models but also the deployment and integration process, aiming to accelerate enterprise AI adoption and capture a larger share of the associated revenue.

Within 72 hours, Anthropic revealed a $1.5 billion enterprise-services partnership with firms including Blackstone, Hellman & Friedman, and Goldman Sachs to embed Claude into mid-market companies. Similarly, OpenAI announced its $4 billion Deployment Company, ‘DeployCo,’ with 19 investment partners and an immediate acquisition of consulting firm Tomoro, bringing 150 engineers into client environments from day one.

Both labs are adopting Palantir’s proven model of deploying engineers who sit with clients, understand workflows, and build operational systems around AI models. This approach transforms deployment from a consulting service into an embedded, product-like operation that generates recurring revenue through ongoing work and token-based economies. The move reflects a recognition that the bottleneck in enterprise AI is no longer model performance but rather integration, security, and process redesign, which are labor-intensive and require close client collaboration.

The strategy aims to create operational dependency and switching costs, making clients reliant on the embedded engineers and the AI systems they develop. This deepens enterprise lock-in and expands the revenue potential, as the embedded engineer model scales with the work done, not just the software licenses sold. However, the labor-intensive nature of this approach raises questions about margins and scalability, with some experts questioning whether it resembles ongoing consulting or can evolve into a standardized product.

The Deployment — Thorsten Meyer AI
DEPLOY
● DISPATCH / MAY 2026
THORSTEN MEYER AI · ENTERPRISE REORG · § 03
ENTERPRISE REORG · 03
FDE / DEPLOY
Essay · Deployment-Architecture Forensic · 2026-05-29

The deployment.
How the AI labs vertically
integrated into the services
layer — the Palantir model
at scale.

In seventy-two hours, the two largest labs made the same move: embed engineers inside companies, the way Palantir does — because the model isn’t the bottleneck, deployment is.
Anthropic launched a $1.5B venture with Blackstone, H&F, and Goldman; hours later OpenAI launched its $4B Deployment Company (19 partners, $10B pre-money) and bought Tomoro for 150 forward-deployed engineers. The structure is copied from Palantir “almost line for line” — the engineer flies to the client, learns the workflow, ships software that wraps a model around the problem, and stays until production works. The reason is a ratio: for every $1 on software, companies spend $6 on services. The labs sold the software dollar; the services dollar is six times larger. The structural argument: the labs are vertically integrating into the services layer because the model commoditizes, the services layer is six times larger, and the FDE is not a consulting arm but a product-formation mechanism that converts deployment into uncapped, token-metered, operationally-locked revenue. The risk: the FDE resembles consulting more than software — and whether it scales is the open Palantir question they have all inherited.
72 hrs
Between the two labs making
the identical structural move
$1 : $6
Software dollar vs services dollar ·
the labs had the smaller half
~70%
Anthropic inference margin (from 38%) ·
why the embedded customer is rational
18-20%
Palantir services as % of revenue ·
the unresolved scalability question
THE DEPLOYMENT· ANTHROPIC $1.5B JV · BLACKSTONE / H&F / GOLDMAN· OPENAI DEPLOYCO $4B · $10B PRE-MONEY · 19 PARTNERS· TOMORO ACQUI-HIRE · 150 FDEs DAY ONE· COPIED FROM PALANTIR ALMOST LINE FOR LINE· $1 SOFTWARE : $6 SERVICES· THE MODEL IS NOT THE BOTTLENECK · DEPLOYMENT IS· 95% OF GENAI PILOTS FAIL TO LEAVE PILOT· FDE JOB POSTINGS +800% IN 2025· FDE = PRODUCT FORMATION, NOT SERVICES ARM· OPERATIONAL DEPENDENCY, NOT CONTRACTUAL LOCK-IN· SEAT PRICING → TOKEN PRICING · UNCAPPED CEILING· TOKENS ARE THE NEW COAL · PALANTIR IS THE TRAIN· BULL · PRODUCT FORMATION AT SOFTWARE MARGINS· BEAR · LABOR-BOUND SERVICES AT CONSULTING MARGINS· BECOMING THE CONSULTANTS THEY COMPRESS· THE DEPLOYMENT· ANTHROPIC $1.5B JV · BLACKSTONE / H&F / GOLDMAN· OPENAI DEPLOYCO $4B · $10B PRE-MONEY · 19 PARTNERS· TOMORO ACQUI-HIRE · 150 FDEs DAY ONE· COPIED FROM PALANTIR ALMOST LINE FOR LINE· $1 SOFTWARE : $6 SERVICES· THE MODEL IS NOT THE BOTTLENECK · DEPLOYMENT IS· 95% OF GENAI PILOTS FAIL TO LEAVE PILOT· FDE JOB POSTINGS +800% IN 2025· FDE = PRODUCT FORMATION, NOT SERVICES ARM· OPERATIONAL DEPENDENCY, NOT CONTRACTUAL LOCK-IN· SEAT PRICING → TOKEN PRICING · UNCAPPED CEILING· TOKENS ARE THE NEW COAL · PALANTIR IS THE TRAIN· BULL · PRODUCT FORMATION AT SOFTWARE MARGINS· BEAR · LABOR-BOUND SERVICES AT CONSULTING MARGINS· BECOMING THE CONSULTANTS THEY COMPRESS·
FIG. 01 — THE SIMULTANEOUS MOVE · TWO LABS, ONE STRUCTURE, 72 HOURS
When the two fiercest competitors make the identical move in three days, it is not a bet — it is a recognition
Both read the same constraint and reached the same answer: the model is not enough
Anthropic · May 4
PE-portfolio distribution
$1.5B
  • Blackstone, H&F, Goldman ($300M / $300M / $150M)
  • Apollo, General Atlantic, Leonard Green, GIC, Sequoia
  • Embed Claude in PE portfolio companies — hundreds of mid-market firms
  • Aligned with ~80% enterprise mix
OpenAI · May 11
Acqui-hire and scale
$4B
  • $10B pre-money · 19 partners (TPG, Bain, Advent, Brookfield)
  • Bought Tomoro — 150 FDEs day one (Tesco, Virgin Atlantic, Red Bull)
  • Builds the enterprise depth it lacked
  • ~2.7x the capital of Anthropic’s vehicle
OpenAI did not build the FDE org from scratch — it bought one (Tomoro) to start with 150 engineers already operating, a statement that the deployment work matters enough that building it organically was too slow. When competitors converge this precisely — standalone services entity, embedded engineers, investor-network distribution, FDE model — the move is not a differentiated bet; it is both companies concluding there is only one answer. Both labs are now, in addition to model companies, deployment companies — and they became so in the same week.
FIG. 02 — THE SIX-TO-ONE RATIO · WHY THE SERVICES LAYER IS THE PRIZE
The labs had been competing for one-seventh of the value their own technology unlocks
For every dollar on software, companies spend six on services
$1
Software
(the labs sold this)
$6
Services — implementation, integration, change management
(the deployment move claims this)
The ratio exists because making software work inside a real organization is harder than building it. For enterprise AI, the labs say model performance is no longer the bottleneck — integration, security review, evaluation harnesses, and workflow redesign are. MIT: 95% of GenAI pilots fail to leave the experimental phase. The scarce input is the engineer who understands both the technology and the business — FDE job postings rose 800% in 2025. The labs are reaching past the software dollar they own toward the services dollar they did not, by fielding the engineers who earn it.
FIG. 03 — THE PALANTIR MODEL · THE FDE IS PRODUCT FORMATION, NOT A SERVICES ARM
The most misread point — and the whole bet rests on it
Consultants operate downstream of the contract; FDEs operate upstream of the roadmap
The consultant
Delivers a recommendation — a deck, downstream of the contract. Accountable for the advice, not the outcome.
vs
recommend

build &
own
The forward-deployed engineer
Builds the production system, upstream of the roadmap. Accountable for whether it works. The bespoke build becomes the product.
The FDE is not a revenue-generating services business — it is the product-discovery and product-formation engine. The bespoke systems built inside clients become the patterns generalized into the product. Treating early deployment cost as a permanent margin drag rather than a product-formation investment is the systematic misread that has fooled Palantir’s investors for years. The dependency it creates is operational, not contractual — the system becomes woven into the institution’s operating fabric, a deeper lock than a license. Palantir’s answer to scale: the boot camp (12-18 month sales cycle → 5 days, >75% conversion, >$1M initial deal).
FIG. 04 — THE TOKEN ECONOMICS · WHY THE EMBEDDED CUSTOMER IS UNCAPPED
The FDE acquires an uncapped, token-metered annuity — which is why the high-touch cost is rational
A seat-based customer is capped by headcount; a token-based customer is bounded only by the work the AI does
The old unit · seat-based
Capped by headcount
A developer = a $20/month subscription. Revenue ceiling fixed by the number of seats. The deployment cost could never be justified against it.
The new unit · token-based
Bounded only by the work
That same developer = hundreds-to-thousands/month in tokens, scaling with the value the AI generates. The FDE’s job is to put the AI on more of the work.
Front-loaded deployment cost buys a recurring, expanding, uncapped token annuity — and with Anthropic’s inference margins reported at ~70% (up from 38% a year earlier), a high-margin one. That is what makes the high-touch acquisition cost rational: the labs are not buying a seat-capped subscription; they are buying an uncapped consumption stream and paying an engineer to maximize it. Palantir’s Shyam Sankar: “Tokens are the new coal. Palantir is the train.” The FDE is infrastructure for the token economy.
FIG. 05 — THE SCALABILITY QUESTION · WHAT DECIDES WHETHER IT WORKS
The whole vertically-integrated structure rests on whether the FDE scales — and that is genuinely unresolved
The FDE resembles consulting more than software · Palantir runs services at 18-20% of revenue after years
The bull case
The bear case
Product formation that scales. Token economics + boot-camp standardization make the FDE acquire uncapped, high-margin annuities; margins expand as the platform matures.
Labor-bound services that drag. Standardization lags the customer base; each new client needs proportional FDE hours; margins compress as it scales.
The labs capture the six-to-one services dollar at software margins — becoming something larger than software companies.
The labs run large, capital-intensive services operations at consulting margins — having become the consultants they set out to compress.
The token-economy tailwind (uncapped consumption, ~70% inference margins) genuinely differentiates the labs’ FDE from Palantir’s per-seat-era version — but it offsets the labor-cost question, by an amount not yet measured. Palantir, after years, runs services at 18-20% of revenue and a 50% adjusted operating margin — neither pure software nor pure services. The labs inherit that exact ambiguity, at larger scale and with less operating history. The bet is that the FDE is product formation that scales. The risk is that they have rebuilt consulting and called it product.
The labs have concluded the model is not the product — the deployment is — and moved, in the same week, to own the layer where the model meets the operation. Whether that makes them something larger than software companies or merely rebuilds a labor-bound consulting business at consulting margins is the Palantir question they have all inherited.
Thorsten Meyer · The Deployment · Enterprise Reorg 03

Implications of Embedding Engineers in Enterprise AI Deployment

This development signals a fundamental shift in how AI companies are monetizing their technology. By embedding engineers into client operations, the labs move beyond selling models to owning the entire deployment process, creating operational dependencies that can lead to recurring, scalable revenue streams. This approach also increases client lock-in, making switching costs higher and potentially transforming the AI deployment landscape into a new form of enterprise service industry.

It also raises strategic questions about margins and scalability, as the labor-intensive deployment process resembles consulting more than software licensing. The success of this model will determine whether the AI labs can sustain high margins and grow their enterprise revenue, or if the approach will become a permanent drag on profitability.

Your AI Survival Guide: Scraped Knees, Bruised Elbows, and Lessons Learned from Real-World AI Deployments

Your AI Survival Guide: Scraped Knees, Bruised Elbows, and Lessons Learned from Real-World AI Deployments

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on the Shift to Integrated Deployment Models

Historically, AI labs focused on model development and licensing, with deployment handled by clients or third-party consultants. The move toward embedded engineering reflects a recognition that the real challenge in enterprise AI is not the model itself but integrating it into complex workflows, ensuring security, and redesigning business processes around AI capabilities.

Palantir pioneered the forward-deployed engineer model in defense and intelligence sectors, where engineers work closely with operators to build operational systems. Recently, AI labs have adopted this approach to address the high failure rate of AI pilots—95% according to MIT research—that fail to move beyond experimentation. By owning deployment, labs aim to improve success rates, increase revenue, and deepen client relationships.

This shift also coincides with the commoditization of models, making the deployment and integration layer the primary battleground for enterprise AI leadership.

“The labs’ move to embed engineers directly into client operations is a strategic shift designed to control the deployment process, deepen enterprise lock-in, and capture more value from AI adoption.”

— Thorsten Meyer

The AI Composer's Workstation: From Prompt to Production: A Hybrid Music Logbook for Suno, Udio & DAW Creators

The AI Composer's Workstation: From Prompt to Production: A Hybrid Music Logbook for Suno, Udio & DAW Creators

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unclear Outcomes of the Embedded Engineer Strategy

It is still uncertain whether the embedded engineer model will achieve sustainable margins comparable to software licensing or if it will remain a labor-intensive process akin to consulting, which could limit scalability. The long-term viability of this approach depends on whether deployment can be standardized and scaled efficiently or if it will continue to require proportional labor for each client.

Additionally, it remains to be seen how this strategy will impact the competitive landscape and whether other firms will adopt similar models or develop alternative approaches to enterprise AI deployment.

C++26 Mastery for Modern Developers: A Practical Guide to High-Performance Programming, AI Integration, and Next-Generation Software Engineering (FutureTech Professional Series)

C++26 Mastery for Modern Developers: A Practical Guide to High-Performance Programming, AI Integration, and Next-Generation Software Engineering (FutureTech Professional Series)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps in AI Deployment and Industry Adoption

Over the coming months, industry observers will monitor how effectively the labs can standardize deployment processes and whether margins improve as they scale. The success of this model will influence broader enterprise AI strategies, potentially prompting other firms to adopt or counter this embedded engineering approach.

Further developments may include new partnerships, technological innovations to automate parts of deployment, or shifts in client adoption patterns. The ongoing evaluation of the economic and operational impact of this strategy will shape the future of enterprise AI integration.

Computer Science for Curious Kids: An Illustrated Introduction to Software Programming, Artificial Intelligence, Cyber-Security―and More!

Computer Science for Curious Kids: An Illustrated Introduction to Software Programming, Artificial Intelligence, Cyber-Security―and More!

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why are AI labs embedding engineers into client companies?

They aim to control the deployment process, improve integration success rates, deepen client lock-in, and capture ongoing revenue streams beyond model licensing.

How does this approach differ from traditional consulting?

Unlike traditional consulting that recommends solutions, embedded engineers build and implement operational AI systems, creating ongoing dependencies and revenue streams similar to product-based models.

What are the risks of this embedded engineer model?

The approach is labor-intensive, resembling consulting, which raises concerns about margins and scalability. Its long-term success depends on whether deployment can be standardized and automated.

Will this strategy change the competitive landscape?

Potentially, as firms that master scalable deployment could gain significant advantages, while others may develop alternative approaches or attempt to automate parts of the process.

Source: ThorstenMeyerAI.com

You May Also Like

GentleOS – Classic operating system with a lovely retro GUI

GentleOS is a hobby OS for 32-bit PCs featuring a retro GUI, supporting minimal hardware, with plans for bugfixes and new apps.

The bridge. Why the AI buildout runs on a nuclear story and a gas reality.

Exploring how AI data centers rely on gas for immediate power despite nuclear deals promising long-term clean energy, highlighting a timeline mismatch.

The policy menu. There’s no single answer. There’s a menu — and choosing is a values choice in disguise.

Analyzing the diverse policy options for managing AI-driven economic shifts, emphasizing values over technical correctness and highlighting uncertainties.

The Laptop Spec Trap That Makes Premium Machines Feel Slow

Discover why high-end laptops can feel sluggish despite top specs. Learn hidden bottlenecks and how to fix performance traps in your premium machine.