410 points
• 1 day ago
• Article
Link
目前,所有主要 AI 提供商在企业订阅上都在亏损,而且这是刻意为之。 OpenAI 、 Anthropic 、 Google 等公司在推行一场前所未有的行业性亏本策略,以远低于真实服务成本的价格出售强大 AI 能力。企业为这些订阅支付的费用与实际交付成本之间不是小幅差异,而是巨大的裂口;凡是把关键工作流程建立在这些补贴价格之上的组织,都站在悬崖边上。
数据很直白。 Claude Pro 每月 20 美元,但一个重度使用它做文档分析、撰写报告和处理数据的知识工作者,每周就可能消耗数百万 token 。按真实 API 费用计算,同样工作量每个席位每月要花 200 到 400 美元。据报道,微软在 GitHub Copilot 上每位用户每月亏损超过 20 美元,重度用户在 10 美元订阅下的实际成本可达 80 美元。有分析发现,Anthropic 用户每赚取 1 美元订阅收入,需消耗高达 8 美元的算力成本。 ChatGPT Plus 三年一直维持每月 20 美元,尽管模型能力和功能大幅提升,价格却未调整;那些在此期间锁定价格的企业买家拿到的是无法长期维持的便宜票。
所有主要厂商玩的都是同一套。 Google 把 Gemini Advanced 按消费级价格捆绑进 Google One,但对开发者却按真实 API 价格收费。 Meta 免费放出 Llama,完全靠广告收入补贴数亿次查询。 xAI 的 Grok 把 API 价压到每百万输入 token 0.20 美元,明显是以亏损换取市场份额的策略。行业普遍模式是:先用低价吸引采用、锁定企业、让 AI 成为日常工作负载,再慢慢处理账单。对企业而言,"以后"正在到来。据称 OpenAI 正从消费者业务向企业业务倾斜,因为企业端的单位经济稍好一些,而在冲刺 IPO 的过程中公司也错过了关键的营收和用户目标。
向智能体(agentic)AI 的转变,把原本就不合理的补贴算术变成了灾难性的账目。聊天机器人时代,token 消耗较可预测,一次对话可能只消耗几千 token;但智能体会自主长时间运行,token 消耗远超对话场景。有用户反映,在不到 90 分钟内就耗尽了五小时速率额度。 GitHub 决定在 2026 年 6 月 1 日改为按使用计费,正是因为扁平订阅在智能体负载下崩溃。当多个 AI 代理并行处理同一项目时,token 消耗不是对话使用的简单倍增,而是呈数量级增长,而相应席位的订阅价却没变。
大多数企业尚未做好准备。过去两年里,成千上万家公司已将 AI 订阅深度嵌入营销、工程、研发、客户成功和财务等业务流程。这些已不再是试验,而是支撑业务运转的核心流程,大部分公司仍按当前订阅价做预算。一个 50 人团队用 Claude Pro 每月只要 1000 美元,在损益表上只是个小数目;但按等量 API 使用计费,那支团队每月要花 1.5 万到 4 万美元。价格一旦调整,那些把 20 美元 / 月的 AI 视为永久廉价投入的公司,将面临未预算的巨额账单,而此时相关工作流程已深度嵌入、难以拆除。 KPMG 发现,美国企业预计未来 12 个月平均 AI 支出为 2.07 亿美元,几乎是上一年的两倍;高盛的调查也显示,许多大公司已经大幅超支其 AI 预算。
推动重新定价的机制已在运转。 OpenAI 和 Anthropic 都在为 IPO 做准备。据报道,Anthropic 年化收入已超过 300 亿美元,高于 2025 年底的 90 亿美元;OpenAI 的收入轨迹约为 250 亿美元。但成本端则更为严峻。 OpenAI 预计到 2029 年累计现金消耗为 1150 亿美元,并承诺到 2030 年投入 6650 亿美元的算力支出。 Oracle 在一个财年内举债 430 亿美元为 OpenAI 建数据中心。公司一旦上市,缩小订阅价与实际成本之间差距就成了生死问题:公开市场要利润、分析师要合理的单位经济、投资者要不依赖无尽融资的盈利路径。要最快弥合差距,最直接的办法就是涨价、设限或转为按用量计费。
信号已经很清楚。 GitHub 将自 6 月 1 日起改为按使用量计费,用基于 token 的 AI Credits 取代固定费用的高级请求额度。微软在四年内已两次上调 Microsoft 365 价格,最新一轮直接与 AI 基础设施成本挂钩。 OpenAI 推出了 100 美元的 Pro 层,定位为重度用户的"真实"价格;Anthropic 每月 200 美元的 Max 层也预示着补贴结束后真实使用成本的可能水平。正如一位行业高管所言,AI 领域的圈地竞争规模巨大,主导这一新世界的代价同样巨大。将这些服务货币化并回收投资,将迫使商业模式和定价快速发生重大变化。
企业领导者现在就必须行动:审计各团队的实际 token 消耗,而不仅仅统计席位数;建立情景模型,测算在当前价格的 2 倍、 5 倍或 10 倍下 AI 成本的走向;在技术栈中构建供应商可选性,避免单一提供商的定价变动一夜之间毁掉预算;并在账单到来前与财务团队进行坦诚对话。如今企业为 AI 支付的价格与 18 个月后将要支付的价格之间的差距,很可能成为多数公司历史上最具破坏性的成本跳增之一。补贴时代正在走向终结,倒计时已经开始,而大多数企业甚至还没开始这场对话。
Every major AI provider is currently losing money on enterprise subscriptions, and they are doing it deliberately. OpenAI, Anthropic, Google, and others are running an unprecedented industry-wide loss-leader program, selling powerful AI capabilities at prices that bear no relation to the actual cost of serving them. The gap between what companies pay for AI subscriptions and what those subscriptions truly cost to deliver is not a minor discrepancy. It is a massive gulf, and every organization that has built critical workflows on top of these subsidized prices is standing right on the edge of it.
The math is stark. Claude Pro costs $20 a month, but a knowledge worker using it heavily for document analysis, report drafting, and data work can easily burn through several million tokens per week. At actual API rates, that same workload would run between $200 and $400 per seat per month. Microsoft was reportedly losing over $20 per user per month on GitHub Copilot, with power users costing $80 a month on a $10 subscription. One analysis found Anthropic users consuming upwards of $8 in compute for every $1 of subscription revenue. ChatGPT Plus has been $20 a month for three years, even as the models became dramatically more capable and feature-rich. The price never moved, and enterprise buyers who locked in rates during this window got a deal that cannot last.
Every major provider is playing the same game. Google bundles Gemini Advanced into Google One at consumer prices while charging developers real API rates for the same models. Meta gives away Llama for free, subsidizing hundreds of millions of queries entirely through ad revenue. xAI's Grok undercuts everyone on API pricing at $0.20 per million input tokens, a number that only makes sense as a market-share grab funded by losses. The pattern across the board is identical. Price for adoption, lock organizations in, make AI a load-bearing part of daily workflows, and worry about the bill later. For enterprises, "later" is arriving. OpenAI is reportedly pivoting away from consumer bets toward enterprise, where the unit economics are slightly less ruinous, and the company missed key revenue and user targets in its sprint toward an IPO.
The shift to agentic AI has turned bad subsidy math into catastrophic math. When AI was a chatbot, token consumption was relatively predictable. A conversation might run a few thousand tokens. But agentic sessions run autonomously for extended periods, burning through tokens at rates that dwarf conversational usage. Users have reported exhausting five-hour rate limit windows in under 90 minutes. GitHub is moving to usage-based billing on June 1, 2026, specifically because the flat-fee model collapsed under agentic workloads. When multiple AI agents work in parallel on a single project, the token burn is not a multiple of chat usage. It is an order of magnitude more, while the subscription price on that seat has not changed.
Most enterprises are not prepared for what is coming. Over the past two years, thousands of companies have woven AI subscriptions deep into operations across marketing, engineering, research, customer success, and finance. These are not experiments anymore. They are load-bearing workflows, and most companies are still budgeting at current subscription prices. A team of 50 on Claude Pro costs $1,000 a month, a rounding error on the P&L. But the equivalent API usage for that same team would run between $15,000 and $40,000 a month. When prices adjust, the companies that treated $20-a-month AI as a permanently cheap input will get hit with bills they never budgeted for, at a time when the workflows are too embedded to rip out. KPMG found U.S. organizations projecting average AI spending of $207 million over the next 12 months, nearly double the previous year, while a Goldman Sachs survey found many large companies already overrunning their AI budgets by orders of magnitude.
The specific mechanism forcing repricing is already in motion. Both OpenAI and Anthropic are preparing for IPOs. Anthropic has reportedly surpassed $30 billion in annualized revenue, up from $9 billion at the end of 2025, while OpenAI is on pace for roughly $25 billion. But the cost side tells a different story. OpenAI projects $115 billion in cumulative cash burn through 2029 and has committed to $665 billion in compute spending by 2030. Oracle took on $43 billion in debt in a single fiscal year to build data centers for OpenAI. When these companies go public, the pressure to close the gap between subscription price and actual cost becomes existential. Public markets demand margins, analysts demand unit economics, and investors demand a path to profitability that does not depend on infinite fundraising. The fastest way to close that gap is to raise prices, impose usage caps, or shift to consumption-based billing.
The signals are already visible. GitHub is moving to usage-based billing on June 1, replacing flat-rate premium requests with token-based AI Credits. Microsoft has raised Microsoft 365 prices twice in four years, with the latest round tied directly to AI infrastructure costs. OpenAI has introduced a $100 Pro tier positioned as the new real price for heavy users. Anthropic's Max tier at $200 a month provides a preview of what committed usage will actually cost when subsidies end. As one industry executive put it, the AI land-grab is on a colossal scale, and the price tag for dominating this new world is equally colossal. Monetizing these services and recouping investment is going to force significant changes in business models and pricing, and those changes are likely to happen fast.
Enterprise leaders need to act now. That means auditing actual token consumption across teams, not just counting seats. It means modeling what AI costs look like at two, five, or ten times current prices. It means building vendor optionality into the stack so no single provider's pricing change can blow up the budget overnight. And it means having an honest conversation with the finance team before the bill arrives. The gap between what organizations pay for AI today and what they will pay in 18 months is going to be one of the most disruptive line-item increases most companies have ever absorbed. The subsidy era is ending, the clock is running, and most enterprises have not even started the conversation.
396 comments • Comments Link
• 关于"AI 订阅是定时炸弹"的核心论点在多方面被质疑。评论者指出,本地运行最前沿的模型需要极高的硬件配置(例如 1.5–6 Ti 的显存),在可预见的未来,云端托管在成本效率上仍优于本地部署;此外,本地模型普遍落后领先模型 6–18 个月,尽管计算效率可能提升,但硬件成本的下限仍然很高。
• 对 AI 公司靠代币销售是否盈利存在重大争议。有人引用 Brad Gerstner 的话说代币并非亏本出售,但反对者指出这忽略了庞大的研发、训练与基础设施开销。有证据表明,高达 70% 的算力支出用于研发,像 Anthropic 这样的公司尽管估值高企,仍在不断烧钱。
• 对商业模式的批评主要集中在补贴获客:AI 实验室用低于成本的价格锁定企业客户,期望日后提价以收回成本。但也有人认为,这与其说是对企业的"定时炸弹",不如说是投资者承担的风险——若市场无法整合或实现盈利,投资者可能永远收不回数万亿美元的投入。
• 企业的付费模式使"订阅"论述更复杂。许多公司通过按使用量计费的 API(如经由 Azure 或 Bedrock)结算,而非固定订阅费。订阅在中小企业或影子 IT 中更常见,但大型企业通常谈判基于使用量的合同。真正的风险在于那些把补贴性 AI 深度嵌入核心工作流、却没有为未来成本变化做规划的组织。
• 开源和中国的模型(如 GLM 、 Kimi 、 DeepSeek)被视为潜在竞争压力,但在西方企业中的采用受限于地缘政治、法律和信任问题。即便在技术上可比,这类模型仍因数据主权和监管风险而难以被广泛接受,造成可负担且值得信赖的替代方案缺口。
• 模型架构效率的提升(例如更小的激活参数、更好的量化方法)有望逐步降低成本。像 Qwen 27B A3B 这种性能接近更大模型的例子表明,性价比会提升,可能推动更多本地或边缘部署,进而减少对集中式提供商的依赖。
• 讨论中反映出对 AI 炒作的普遍怀疑。多人将原文斥为"AI 废话"——戏剧化、论证薄弱,甚至可能是 AI 生成,批评点包括措辞重复、缺乏证据,以及未能区分消费者订阅与企业计费模式。
• 有观察者将此与历史科技周期类比:先以低价抢占市场,再转为按量计费,类似云计算的发展路径。也有人警告,如果当前以股权驱动、持续融资为特征的模式在实现可持续收入前崩溃,可能引发更广泛的债务或资本危机。
• 讨论中还含有文化层面的批评:AI 生成的语言(如"load-bearing"、"the unlock")在企业场景常被视为表演性信号,领导层鼓励使用但技术人员往往嗤之以鼻,反映了关于真实性、技能贬值和沟通商品化的紧张关系。
• 尽管对成本与可持续性有所担忧,许多人承认 AI 工具确实能带来显著价值——尤其是像 Claude Code 或 Codex 这样的编码助手——即便这些工具在一定程度上被补贴。企业面临的挑战是如何在不陷入对不透明且可能波动的定价模式的不可逆依赖下,战略性地利用这些工具。
• 总体来看,讨论显示出对原文耸人听闻框架的强烈怀疑,参与者强调 AI 经济的复杂性、消费者与企业计费的差异,以及模型效率的持续演进。尽管担忧供应商提价与锁定风险是合理的,很多人认为真正的风险更多落在投资者一方,尤其是在开源替代方案和硬件改进持续重塑格局的背景下。对话还反映出人们对 AI 生成内容和企业术语日益增长的厌倦,呼唤更实证、务实的讨论。 • The core argument that AI subscriptions are a "ticking time bomb" is challenged on multiple fronts. Several commenters point out that running frontier-quality models locally requires extraordinary hardware (e.g., 1.5–6 Ti of VRAM), making cloud hosting far more cost-efficient for the foreseeable future. Others note that local models currently lag behind frontier models by 6–18 months, and while efficiency improvements are expected, the hardware cost floor remains high.
• There is significant skepticism about whether AI companies are currently profitable on token sales. While one commenter cites Brad Gerstner's claim that tokens aren't sold at a loss, others counter that this ignores massive R&D, training, and infrastructure costs. Evidence suggests that up to 70% of compute spending goes to R&D, and companies like Anthropic are still burning cash despite high valuations.
• The business model critique centers on subsidized adoption: AI labs are pricing below cost to lock in enterprise users, with the expectation of raising prices once dependence is established. However, some argue this is less a "time bomb" for enterprises and more a risk for investors, who may never recoup trillions in capital if the market doesn't consolidate or monetize effectively.
• Enterprise usage patterns complicate the subscription narrative. Many companies already use metered API billing (e.g., via Azure or Bedrock), not flat-rate subscriptions. Subscriptions are more common among SMEs or via shadow IT, but large enterprises typically negotiate usage-based contracts. The real risk lies in organizations that deeply integrate subsidized AI into core workflows without planning for future cost shifts.
• Open-source and Chinese models (e.g., GLM, Kimi, DeepSeek) are seen as competitive pressures, but adoption in Western enterprises is limited by geopolitical, legal, and trust concerns. Even when technically comparable, these models face resistance due to data sovereignty and regulatory risks, leaving a gap in affordable, trusted alternatives.
• Efficiency gains in model architecture (e.g., smaller active parameters, better quantization) are expected to reduce costs over time. Examples like Qwen 27B A3B performing nearly as well as much larger models suggest that capability-per-dollar will improve, potentially enabling more on-premise or edge deployment and reducing reliance on centralized providers.
• The discussion reflects broader skepticism about AI hype, with multiple commenters dismissing the original article as "AI slop"—overly dramatic, poorly argued, and possibly generated by AI itself. Criticisms include repetitive phrasing, lack of evidence, and failure to distinguish between consumer subscriptions and enterprise billing models.
• Some observers draw parallels to historical tech cycles: land-grab pricing followed by metered billing, similar to cloud computing's evolution. Others warn of a potential debt crisis if current funding models (equity-fueled, circular investment) collapse before sustainable revenue is achieved.
• There's a cultural critique embedded in the discussion: the use of AI-generated language (e.g., "load-bearing," "the unlock") is seen as performative signaling within corporate environments, often encouraged by leadership but viewed with disdain by technical staff. This reflects tensions around authenticity, skill depreciation, and the commodification of communication.
• Despite concerns about cost and sustainability, there's acknowledgment that AI tools provide significant value—especially in coding assistants like Claude Code or Codex—even if heavily subsidized. The challenge for enterprises is to leverage these tools strategically without becoming irreversibly dependent on opaque, potentially volatile pricing models.
The discussion reveals deep skepticism toward the original article's alarmist framing, with participants emphasizing the complexity of AI economics, the distinction between consumer and enterprise billing, and the ongoing evolution of model efficiency. While concerns about future price increases and vendor lock-in are valid, many argue that the real risks lie with investors rather than end users, especially as open-source alternatives and hardware improvements continue to reshape the landscape. The conversation also highlights growing fatigue with AI-generated content and corporate jargon, underscoring a desire for more substantive, evidence-based discourse.