Key Takeaways

- Google told Meta in March it could not meet full Gemini capacity demands, disrupting Meta's AI projects
- Meta is now pushing staff to use AI tokens more efficiently due to the restrictions
- Google Cloud's compute constraints cost it growth despite hitting $20 billion in Q1 revenue
Google has restricted Meta's access to its Gemini AI models after Meta requested more computing capacity than Google could deliver, according to a Financial Times report. The limits, communicated around March 2025, have delayed some of Meta's internal AI projects and forced the company to ration its AI usage.
The shortage affects other Google Cloud clients too, though Meta felt the impact hardest because of its unusually high demand for Gemini capacity. In response, Meta has instructed employees to be more efficient with AI tokens, the units that measure AI model usage.
Why is Google limiting Gemini access?
The answer is simple: demand outstrips supply. Even trillion-dollar companies cannot build AI infrastructure fast enough. Google Cloud posted $20 billion in Q1 2025 revenue, but CEO Sundar Pichai acknowledged that compute constraints prevented higher growth and nearly doubled the cloud unit's backlog quarter over quarter.
Google is spending $75 billion on capital expenditures in 2025, primarily for AI data centers. Meta has committed to spending $37 billion or more this year on similar infrastructure. Yet both remain capacity-constrained. The GPU shortage that began in 2023 has evolved into a broader compute crunch affecting the entire AI supply chain.
The strange bedfellows of AI competition
Meta and Google compete directly in online advertising, their primary revenue source. They also compete in AI, with Meta's Llama models challenging Google's Gemini. Yet Meta has been exploring partnerships with Google Cloud since at least September 2025, looking to fine-tune Gemini and the open-source Gemma models with its own advertising data.
The Information reported that Meta was also considering working with OpenAI to improve features in Meta AI, its chatbot, and across its social apps. The company has been hedging its bets, building massive internal AI infrastructure while simultaneously shopping for external capacity.
Meta operates one of the world's largest GPU clusters, reportedly exceeding 600,000 H100 chips. That it still needs Google's capacity speaks to the scale of AI compute demands in 2025.
What this means for enterprise AI projects
If Google cannot fully serve Meta, a company with virtually unlimited resources, smaller enterprises should expect their own capacity requests to face scrutiny. Cloud providers are prioritizing workloads, and customers without leverage may find themselves waiting.
The token efficiency push at Meta offers a preview of what other organizations will face. Companies that optimize their AI workflows now, reducing unnecessary model calls, caching responses, and choosing appropriately sized models for each task, will fare better in a supply-constrained environment.
Logicity's Take
This dispute reveals that even Big Tech cannot buy its way out of the AI compute shortage. For enterprise buyers evaluating cloud AI services, the lesson is clear: capacity guarantees matter as much as pricing. Google Cloud, AWS Bedrock, and Azure AI all offer Gemini or competing models, but their SLAs differ significantly. Organizations running production AI workloads should negotiate committed capacity pools rather than relying on on-demand access. The alternative, as Meta discovered, is watching projects stall when the provider cannot deliver.
Will the shortage ease in 2026?
Probably not. NVIDIA's next-generation Blackwell chips are ramping production, but demand continues to outpace supply. New data centers take 18 to 24 months to build. The hyperscalers are all expanding, but so is the appetite for AI compute.
Google's backlog nearly doubling in a single quarter suggests the gap is widening, not closing. Companies planning major AI initiatives in 2026 should secure capacity commitments now rather than assume availability later.
Frequently Asked Questions
Why did Google limit Meta's Gemini AI access?
Google could not meet Meta's full capacity request due to compute constraints affecting its cloud infrastructure. The company has a growing backlog of AI demand it cannot immediately serve.
How is Meta responding to the Gemini restrictions?
Meta has asked employees to use AI tokens more efficiently. The company is also exploring partnerships with Google, OpenAI, and others to supplement its internal AI infrastructure.
Are other Google Cloud customers affected?
Yes, several other clients face similar constraints, though to a lesser extent than Meta due to Meta's exceptionally high demand.
Why would Meta use Google's AI when they compete?
Meta was exploring Gemini to improve advertising effectiveness and power features in Meta AI. Despite competing, large tech companies often use each other's infrastructure where it offers advantages.
Samsung's massive investment includes chip manufacturing capacity that could eventually ease AI compute constraints
Need Help Implementing This?
If your organization is navigating AI cloud capacity planning or evaluating multi-provider strategies, reach out to the Logicity team for guidance on enterprise AI infrastructure decisions.
Source: mint
Manaal Khan
Tech & Innovation Writer
Produced with AI assistance and reviewed by the Logicity editorial team. Learn more in our Editorial Policy.
Related Articles
Browse all
AI Revolution: How Tech is Transforming the World, One Industry at a Time
From desalination plants in Iran to AI-powered manufacturing, the tech world is abuzz with innovation. Discover how AI is changing the game for small entrepreneurs and what it means for the future of industry. Explore the latest developments in cybersecurity, robotics, and more.

Revolutionizing AI: The Game-Changing Tech That's Making Agents Smarter
A new technology is set to revolutionize the way AI agents learn and adapt, enabling them to accumulate wisdom and apply it to new situations. This innovation has the potential to significantly boost the reliability of AI agents, especially in complex tasks. By converting raw agent trajectories into reusable guidelines, this tech is poised to transform the AI landscape.

The Dark Side of AI: How Bots Are Fueling a Monetized Abuse Ecosystem
A recent analysis of 2.8 million Telegram messages reveals a shocking truth: AI-powered bots are being used to create and sell non-consensual intimate images. These bots can turn ordinary photos into synthetic nude images, and the abuse is being monetized through affiliate programs and subscription-based archives. The researchers behind the study are calling for stricter regulations to combat this growing problem.

AI's Secret Sauce: How Journalism Became the Unlikely Ingredient
A recent study reveals that AI chatbots rely heavily on journalistic sources for their quotes, with one in four coming from news outlets. This shocking discovery has significant implications for the media industry and our understanding of AI's information gathering processes. As AI technology continues to evolve, it's essential to consider the role of journalism in shaping its responses.


