MoreRSS

site iconTomasz TunguzModify

I’m a venture capitalist since 2008. I was a PM on the Ads team at Google and worked at Appian before.
Please copy the RSS to your reader, or quickly subscribe to:

Inoreader Feedly Follow Feedbin Local Reader

Rss preview of Blog of Tomasz Tunguz

The Unsustainable Subsidy

2026-05-20 08:00:00

Google’s AI triples in price each year.

Google Gemini: Flash and Pro Pricing

OpenAI’s flagship model was seemingly subsidized for a while, before rising again.

OpenAI API Prices: Flagship Falling, Then Rising Again

Anthropic’s AI has been the same price for a little bit & decreased for the most powerful models.

Anthropic API Prices: Opus, Sonnet, Haiku

Those are three very different pricing strategies. If we compare the absolutes, the data completes the picture.

Vendor Model Input ($/1M) Output ($/1M)
Google Gemini 3.1 Pro $2.00 $12.00
Anthropic Claude Opus 4.7 $5.00 $25.00
OpenAI GPT-5.5 $5.00 $30.00

Google remains the low-cost player, increasing the price on all its models but still less than half of the competition. Anthropic had maintained a luxe pricing until late last year.

The pricing changes indicate changes in strategy : cuts when cash is plentiful & share matters. Increases when cash is tight & margins matter. The latter is the case for all three vendors now when capex spending continues to set records.

Observations on Writing with AI

2026-05-18 08:00:00

As I was paging through Good Writing, Anne Lamott’s new book, I wondered what AI would say about twisting cliches & finding hidden metaphors (chapters 18 & 19).

Over the last 16 years of writing, I’ve read books about writing, hired an editor, & used AI. I’ve fine-tuned models to mimic my voice, tested more than 10 AI systems, & written many post with AI, with some Hindenburgs I’ve kept public as proof despite my embarrassment.

Writing is hard for AI. First, AI has its own voice : Gemini beams sunshine ; Claude’s is languid but sharp ; & OpenAI Codex is the most dispassionate. Writing in another voice is hard for people & AI.

So writing with a single model doesn’t work. What about an AI editorial council? The concept shines with code review. Why not blog review?

At my fourth draft, I asked Gemini, Claude, & OpenAI Codex to edit my work with each other. The result wasn’t an elegant mosaic but a fingerpaint disaster. Each model had its own voice. Like three editors with three visions of the piece, the AI models couldn’t agree on a consistent tone or style.

And each is willing to deliver it directly, casually cruel in the name of being an editor.

Eg, this post:

Verdict. This is a three-beer conversation mistaken for a finished essay. Pick one angle : the Lamott meditation, or the AI choir experiment, or the vinyl-flare theory. Develop it with specifics, quotes, images. As written, it’s 500 words of intelligent observation without a single indelible sentence.

…imagine that daily farrago in triplicate.

AI’s ability to synthesize images, video, text means anything can be created. What’s authentic? Imperfection.

The pops of a vinyl record, the solar flare on Kodachrome film, the imperfect analogy and the punctuation peccadilloes (lovers of ampersands, unite!), stand out.

AI may generate digital reams of manuals & documentation, & may one day parrot the way we write authentically. But the imperfections of writing are what make it good writing.

The First Derivative of Inference

2026-05-15 08:00:00

The fastest-growing companies in AI & software are either selling AI directly or reselling inference. At worst, they are the first derivative of inference.

Inference is the largest & fastest growing market in technology today, surpassing the database market & projected to be three times the size within seven years at $250 billion.1, 2 By selling inference or indexing a business to it, they grow at spectacular rates.

Anthropic has booked $9b & $10b in consecutive months.3 Google Cloud is growing 63% at an $80 billion run rate.4 Most businesses selling inference are exploding.

For public software & infrastructure companies that predate AI, there are two standouts so far : Twilio & Datadog.

TWLO & DDOG stock performance YTD 2026 indexed to 100

Both of these companies are benefiting as the first derivatives of inference. They don’t sell inference primarily, but anyone building AI systems needs to understand how they perform, & agentic companies with voice use Twilio.

“The number of spans sent to our LLM Observability product nearly tripled quarter-over-quarter.” — Olivier Pomel, CEO, Datadog Q1 2026 Earnings Call

As a result of AI growing so spectacularly, there are huge power law dynamics.

“We now have over 6,500 customers sending data for one or more of our AI integrations. Though this is only 20% of total customers, they represent about 80% of our ARR.” — Olivier Pomel, CEO, Datadog Q1 2026 Earnings Call 5

This is also true for another element of core infrastructure, voice & SMS via telephone.

“Voice reimagined through the lens of AI is increasingly an entry point to the Twilio platform for AI natives & enterprises alike.” — Khozema Shipchandler, CEO, Twilio Q1 2026 Earnings Call 6

A few customers can drive tremendous gains. This level of concentration is characteristic of the current cycle.7

For any pre-AI company, the key question must be put at the board level : how do we either resell inference or benefit from our customers buying huge volumes of it?

That’s the only way out of the Saaspocalypse.

What Would AI Email Cost?

2026-05-14 08:00:00

In yesterday’s post (which an agent pushed in raw outline form via email!), I wrote about the future of AI email. What does that future cost?

chart_monthly_cost_by_model

If you are using state-of-the-art model ranging, it costs between $22 to $130 per month. Would you pay for that? At work, I imagine, many would. Let’s take the middle case of $26/month raw cost.

A software company seeking 75% gross margin would charge about $350 per year for that product excluding hosting & serving costs. So let’s call it a $500 per year list with a 15% discount at scale.

A Google Enterprise plan is $11-18/month. A fully agentic solution would then cost about twice as much.

chart_all_models_comparison

Smaller models help. They cut cost by 10 to 20x, but we can do better.

By running the models locally, when the cost plummets to zero : users’ GPU does the work.

It’s this type of cost optimization that I have done crudely here that I think will define the next 12 to 24 months of AI software : determining which components can be executed deterministically, like the email filters, which are just rules. And the next is matching the model to the workload.

With some basic heuristics and techniques we can drop the overall cost by 100x. Given the tremendous shortage of GPUs, this segmentation of inference is inevitable.

The 6 Messages That Actually Matter

2026-05-13 08:00:00

Nobody will open Gmail five times a day in five years.

The average knowledge worker receives 121 emails per day.1 That’s one every four minutes during working hours.

The inbox is a conveyor belt that keeps accelerating. You open Gmail. You read. You decide. You respond. One at a time. But the belt doesn’t wait. It just moves faster.

AI Email Architecture diagram showing how 121+ daily emails flow through AI processing into verbatim and processed streams, feeding a personal context layer

Today’s triage is generic : “This is from your boss. I need to work on that today. Next. Spam. Archive. Spam. Archive. Newsletter, read & archive.” Tomorrow’s is personal. User-defined skills & rules. Programming in English that encodes your priorities, your relationships, your workflow.

A receipt arrives & forwards itself to the expense platform before you see it. An inbound lead hits the CRM, gets scored, & a draft proposal waits in your outbox. The workflow starts the moment the email lands.

Then there’s the archive. Years of context about every relationship, commitment, & decision you’ve made. That history becomes a personal context layer that informs how your AI handles the next message. On-device models process sensitive messages privately.2

The inbox disappears. What remains are the 6 messages that actually matter.



  1. Radicati Group 2025 : average office worker receives 121 emails per day; executives receive 150-200+ ↩︎

  2. Microsoft Work Trend Index, June 2025 : 40% of employees check email before 6 a.m., driving demand for automated triage ↩︎

2026 Theory GTM Survey

2026-05-12 08:00:00

It’s time for the 2026 Annual Theory Go-to-Market Survey. This is a brief 25-question survey.

Our goal is to understand how startups have evolved their sales, marketing, customer success, and cash management over the last several years by comparing these results to our surveys from 2022 through 2025.

We will publish these results and answer questions about them at upcoming Office Hours.

This year, we’re focused on five key hypotheses. Each is designed to be rigorously testable with the survey data:

Augmented reps outperform both autonomous AI and unaugmented humans. As AI tools mature, companies face a choice: deploy AI alongside SDR teams, replace them with autonomous tools, or forego AI entirely. We expect augmented teams, humans plus AI, will show the best conversion rates and productivity gains.

AI is widening the performance gap between top and bottom quartile GTM teams. The companies investing most heavily in AI may be pulling ahead, widening the efficiency, growth, and conversion gap between quartile-one and quartile-four sellers.

Buyer-side AI adoption is the bigger GTM disruptor than seller-side AI. While most companies focus AI investment on their own GTM teams, an emerging dynamic is buyers using AI themselves: automated RFPs, AI-assisted evaluations, and even AI negotiation. We expect this shift will lengthen sales cycles and create new objections.

AI efficiency gains are being captured as headcount reduction, not revenue growth. When companies report “AI productivity” gains, the primary result may be flatter SDR hiring, not faster pipeline or higher revenue. The headcount flattening signal, not cost savings, is the real story.

Founder expectations on AI have reset downward as reality caught up. In 2024, the most optimistic respondents expected 500% efficiency gains from AI and recorded 0%. We expect this perceived-measured gap has narrowed as companies recalibrate what AI can realistically deliver.

With this data, we should be able to draw broader conclusions about the continued shift from growth to efficiency, measure the real impact of AI on GTM teams, and understand the emerging dynamics of AI-to-AI selling.

If you complete the survey, I will share with you the anonymized raw data so you can perform your own analyses. If you have questions, just message me on Twitter or send me an email.