Hyperscalers and AI companies have been turning toward specialized processors to run inference workloads in the cloud.
Arm Holdings' chip design architectures have gained immense popularity among those seeking to design efficient AI inference processors.
The company's share of the AI server processor market is poised to more than triple in just four years.
The artificial intelligence (AI) megatrend has been the biggest catalyst moving the stock market since ChatGPT debuted in November 2022. Ever since, hyperscalers, AI specialists, and governments have been spending boatloads of money and using huge amounts of data to train powerful large language models (LLMs).
However, Nvidia CEO Jensen Huang recently pointed out that AI is now at an inflection point.
Will AI create the world's first trillionaire? Our team just released a report on the one little-known company, called an "Indispensable Monopoly" providing the critical technology Nvidia and Intel both need. Continue »
Whereas previously, a large volume of AI processing power was dedicated to model training, inference -- the actual use of those models -- is poised to become the bigger driver in this space. The time to shift the focus from model training to putting those models to work in the real world seems to have finally arrived after years of huge infrastructure investments.
Inference-based agentic AI tools are gaining popularity because they can automate processes, perform tasks independently, improve productivity, and reduce costs. And the mainstream use of physical AI tools, such as humanoid robots, appears to be moving closer.
According to a forecast from market research company S&S Insider, the AI inference market is poised to quadruple in size from $87 billion in 2024 to $350 billion in 2032.
There are multiple semiconductor stocks one could buy to capitalize on this terrific opportunity. But one seems to me to be the ultimate pick-and-shovel play on the inference economy.
Image source: Getty Images.
AI inference doesn't need as much processing power as model training. This explains why hyperscalers and AI companies have been shifting toward specialized processors, such as application-specific integrated circuits (ASICs) and central processing units (CPUs), which are sufficient for some inference tasks.
Anthropic, for instance, has extended its partnership with Alphabet's Google and Broadcom to deploy 3.5 gigawatts of the search engine giant's custom tensor processing units (TPUs). Google and Broadcom have been working together to design TPUs for the past decade, and these chips are now playing a central role in AI inference due to their performance and cost advantages.
Google announced the availability of its seventh-generation TPUs, known as Ironwood, in November 2025, promising significant performance gains over its previous-generation chips. At the same time, it announced its Axion general-purpose CPU, based on Arm Holdings' (NASDAQ: ARM) architecture.
Google claims that the Arm-based Axion server CPUs are up to two times better than comparable x86 processors from Intel and AMD in price-to-performance terms. This may explain why Anthropic has chosen Google's data centers to run its AI workloads. And Google and Broadcom aren't the only ones developing Arm-based server CPUs.
Nvidia's next-generation Vera server CPU, which the company claims is purpose-built for agentic AI, is based on the Armv9.2 architecture. The chipmaker has decided to offer its Vera server CPU on a standalone basis to customers. This is a first for management because it offered its prior-generation Grace server CPUs in rack-scale systems integrated with its Hopper graphics processing units (GPUs).
Nvidia estimates that the Vera CPU could become a multibillion-dollar product. That's great news for Arm, which generates revenue by licensing its chip designs to customers, and earning royalties on each chip made using its intellectual property (IP).
Marvell Technology, the second-largest custom AI processor company, relies on Arm's IP to design its ASICs. As such, it is easy to see why Arm is expected to become the kingpin of the AI inference economy in the next few years by becoming the dominant player in custom ASICs.
Market research firm Counterpoint Research estimates that the share of Arm-based server CPUs in custom AI processor servers could jump to 90% by 2029. That would be a huge increase over last year's share of just 25%. The server CPU market is currently dominated by Intel and AMD, which manufacture chips based on the x86 architecture.
However, Counterpoint says that the number of companies developing custom in-house chips using Arm's IP will increase rapidly. This should ideally lead to healthy growth in Arm's revenue and earnings, opening new streams of licensing fees and royalties. Arm's AI-focused Armv9 architecture commands double the royalty rate of the previous-generation Armv8 architecture.
So, the continued adoption of Armv9 should produce a bigger spike in the company's earnings in the long run. Analysts are expecting a 21% increase in Arm's earnings per share (EPS) in its fiscal 2027 (which began April 1) to $2.14, followed by a 35% jump in the next fiscal year.

ARM EPS Estimates for Current Fiscal Year, data by YCharts.
This all suggests that Arm's earnings acceleration could continue through the end of the decade, paving the way for impressive gains in this AI stock over the long run.
Before you buy stock in Arm Holdings, consider this:
The Motley Fool Stock Advisor analyst team just identified what they believe are the 10 best stocks for investors to buy now… and Arm Holdings wasn’t one of them. The 10 stocks that made the cut could produce monster returns in the coming years.
Consider when Netflix made this list on December 17, 2004... if you invested $1,000 at the time of our recommendation, you’d have $556,335!* Or when Nvidia made this list on April 15, 2005... if you invested $1,000 at the time of our recommendation, you’d have $1,160,572!*
Now, it’s worth noting Stock Advisor’s total average return is 975% — a market-crushing outperformance compared to 193% for the S&P 500. Don't miss the latest top 10 list, available with Stock Advisor, and join an investing community built by individual investors for individual investors.
See the 10 stocks »
*Stock Advisor returns as of April 14, 2026.
Harsh Chauhan has no position in any of the stocks mentioned. The Motley Fool has positions in and recommends Advanced Micro Devices, Alphabet, Intel, Marvell Technology, and Nvidia. The Motley Fool recommends Broadcom. The Motley Fool has a disclosure policy.