OpenAI's Jalapeno chip takes aim at Nvidia dependency

Huma ShaziaJuly 4, 2026 at 2:47 PM5 min read

Key Takeaways

OpenAI's Jalapeno chip, designed with Broadcom, is its first custom silicon for AI inference workloads
The chip reportedly matches Nvidia Blackwell and Google TPU performance, with deployment planned by end of 2025
Engineers completed the design in nine months, using AI to accelerate parts of the chip development process

OpenAI has unveiled Jalapeno, its first custom AI chip, developed in partnership with Broadcom. The chip handles inference workloads, the computationally intensive process of answering user queries in ChatGPT and similar applications. OpenAI plans to deploy it by the end of 2025, marking the company's first serious move to reduce its dependence on Nvidia's GPUs.

The announcement signals a broader industry shift. Google, Amazon, and Meta have all invested heavily in custom silicon for AI workloads. OpenAI, which reportedly spends billions annually on Nvidia hardware, now joins that race.

What does Jalapeno actually do?

Jalapeno is an inference chip, not a training chip. That distinction matters. Training large language models requires massive computational resources over weeks or months. Inference is the ongoing cost: every time someone asks ChatGPT a question, the model runs inference to generate a response.

With over 300 million weekly ChatGPT users, inference costs dwarf training costs at OpenAI's scale. A chip optimized specifically for inference on OpenAI's model architectures could dramatically reduce per-query costs.

"It will be performant on, we think, all kind of future iterations of LLMs," OpenAI hardware chief Richard Ho told Reuters. The company already has samples running in its labs on the GPT-5.3-Codex-Spark AI model, hitting target power and performance specifications.

How does it compare to Nvidia and Google silicon?

Broadcom CEO Hock Tan made a bold claim: Jalapeno matches the performance of Nvidia's Blackwell chips and Google's tensor processing units. That's a high bar. Nvidia's B200 Blackwell chips represent the current state of the art for AI inference, and Google's TPUs have years of refinement behind them.

The comparison comes with caveats. OpenAI hasn't published benchmarks. "As good as" can mean many things, performance varies across workloads, and Tan has an obvious interest in promoting Broadcom's design capabilities to potential customers.

Still, even parity would be significant. Nvidia commands 80% of the AI chip market and charges accordingly. Any credible alternative gives OpenAI negotiating leverage and supply chain insurance.

Nine months from design to silicon

OpenAI's engineers completed the chip design in roughly nine months before sending it to TSMC for manufacturing. That's fast. Traditional chip design cycles run 18 to 24 months or longer.

The company credits AI tools for accelerating specific aspects of the process. Chip design increasingly uses machine learning for tasks like placement and routing, where algorithms can explore more design variations than human engineers. The irony is obvious: OpenAI used AI to build hardware that will run more AI.

Canadian electronics manufacturer Celestica will build the server systems that house Jalapeno. Both the chips and servers will be exclusive to OpenAI, not sold externally.

Why Broadcom, not in-house?

Building a chip design team from scratch takes years. Broadcom provides design services and intellectual property that would be difficult to replicate internally. Meta, Amazon, and Google have all used Broadcom or Marvell for similar partnerships.

The tradeoff: Broadcom's profit margins on custom AI chips are lower than on its other products, like networking switches. AI chips require large amounts of high-bandwidth memory from suppliers like SK Hynix and Samsung, which squeezes Broadcom's margins. That cost pressure eventually flows to customers like OpenAI.

Reuters first reported OpenAI was exploring custom chips back in 2023. The company's rival Anthropic is now weighing a similar move, according to Reuters sources from April 2025.

The compute shortage driving this decision

AI labs face a severe compute shortage. Nvidia's latest GPUs ship with waitlists extending months. OpenAI, Anthropic, and others compete for limited production capacity while demand for AI services keeps growing.

OpenAI's $500 billion Stargate infrastructure project, announced earlier this year, reflects the scale of compute investment required. Custom chips are one piece of that strategy. Owning the silicon design means OpenAI can optimize for its specific model architectures rather than using general-purpose GPUs.

Jalapeno is explicitly the "first step in a multi-generation chip development plan." Expect future chips to expand into training workloads if inference proves successful.

ℹ️

Logicity's Take

OpenAI's chip ambitions make strategic sense but face execution risk. Google spent years refining TPUs before they became competitive with Nvidia. Amazon's Trainium chips are only now gaining adoption. First-generation custom silicon rarely delivers promised performance. The real test comes when Jalapeno runs production workloads at scale, not samples in a lab. For CTOs watching this space, the lesson is less about OpenAI specifically and more about the broader unbundling of AI infrastructure. Nvidia's dominance created a single point of failure for the entire industry. Expect every major AI company to diversify their chip supply, whether through Broadcom partnerships, Marvell designs, or internal teams.

Frequently Asked Questions

When will OpenAI's Jalapeno chip be available?

OpenAI plans to deploy Jalapeno by the end of 2025. The chip will be used exclusively by OpenAI and won't be sold to other companies.

What is the difference between AI training and inference chips?

Training chips process massive datasets over weeks to create AI models. Inference chips run those trained models to answer user queries in real time. Inference happens constantly and represents the larger ongoing cost at scale.

Will OpenAI stop using Nvidia GPUs?

Not immediately. Jalapeno handles inference only. OpenAI still needs Nvidia GPUs for training and likely will for years. Custom chips reduce dependence but don't eliminate it.

Who manufactures the Jalapeno chip?

TSMC in Taiwan manufactures the silicon. Celestica builds the server systems. Broadcom provided chip design services and intellectual property.

Is Anthropic also building custom AI chips?

Reuters reported in April 2025 that Anthropic is weighing building its own AI chip. No public announcement has been made.

Need Help Implementing This?

If your team is evaluating AI infrastructure decisions or building internal AI tooling, Logicity offers consulting and implementation support. Contact us to discuss your specific requirements.

Source: Tech-Economic Times / ET

A new technology is set to revolutionize the way AI agents learn and adapt, enabling them to accumulate wisdom and apply it to new situations. This innovation has the potential to significantly boost the reliability of AI agents, especially in complex tasks. By converting raw agent trajectories into reusable guidelines, this tech is poised to transform the AI landscape.

9 Apr 2026

Trending Tech·10 min

The Dark Side of AI: How Bots Are Fueling a Monetized Abuse Ecosystem

A recent analysis of 2.8 million Telegram messages reveals a shocking truth: AI-powered bots are being used to create and sell non-consensual intimate images. These bots can turn ordinary photos into synthetic nude images, and the abuse is being monetized through affiliate programs and subscription-based archives. The researchers behind the study are calling for stricter regulations to combat this growing problem.

9 Apr 2026

Trending Tech·8 min

AI's Secret Sauce: How Journalism Became the Unlikely Ingredient

A recent study reveals that AI chatbots rely heavily on journalistic sources for their quotes, with one in four coming from news outlets. This shocking discovery has significant implications for the media industry and our understanding of AI's information gathering processes. As AI technology continues to evolve, it's essential to consider the role of journalism in shaping its responses.

9 Apr 2026