Beyond GPUs: The Broadening of AI Infrastructure

Contributors

Stay Up To Date

Something has gone wrong, check that all fields have been filled in correctly. If you have adblock, disable it.

The form was sent successfully

May 27, 2026

Since the release of ChatGPT in late 2022, the dominant narrative around artificial intelligence has centered on compute. More specifically, on GPUs. As increasingly powerful models required exponentially greater training capacity, accelerator chips became the foundational scarce resource of the AI era and the primary beneficiaries of the initial infrastructure buildout.

Yet technological shifts rarely remain concentrated in a single layer of the stack for long. As AI systems evolve from passive prediction engines into autonomous agents capable of reasoning, planning, and executing complex tasks, the nature of infrastructure demand is beginning to change. Increasingly, the limiting factor is no longer compute alone, but the infrastructure required to coordinate, feed, connect, and contextualize that compute.

In many ways, this marks a transition from the first phase of AI infrastructure toward the second. The first phase required immense computational power to train large models. The next phase increasingly depends on persistent memory, orchestration, and high-speed networking capable of supporting millions of continuously operating, context-aware AI agents. As a result, bottlenecks across the AI stack are beginning to migrate outward from accelerators alone and toward the systems that enable those accelerators to scale efficiently.

This shift is already becoming visible in the economics of the semiconductor ecosystem. While NVIDIA and Broadcom defined the first chapter of the AI era through their exposure to accelerator compute, leadership is broadening toward a wider set of infrastructure providers across memory, CPUs, and networking. Historically, many of these markets were viewed as cyclical and commoditized, with demand tied largely to enterprise hardware or telecom spending cycles. Today, however, AI appears to be transforming several of these industries into strategic bottlenecks within a rapidly expanding ecosystem.

From Compute Scaling to Context Scaling

The emergence of reasoning and agentic AI systems may represent one of the most important architectural shifts in modern computing. Traditional AI systems largely responded to prompts. Agentic systems increasingly operate autonomously, capable of managing workflows, retrieving information, interacting with software tools, and completing tasks end-to-end with limited human intervention.

The effectiveness of these systems depends heavily on context. The more of the relevant world an AI agent can actively access, including codebases, documents, prior decisions, workflows, and user preferences, the more capable and economically valuable it becomes. In practice, context increasingly functions as a form of working memory for AI systems, allowing them to retain information, maintain continuity, and operate with greater autonomy over time.

This dynamic is especially powerful in coding workloads, where the economic returns are already compelling. An AI coding agent that saves even modest amounts of time for highly compensated engineers can generate extraordinary productivity gains. But the difference between a basic assistant and something closer to a true software engineer increasingly comes down to memory and context depth. Agents capable of maintaining awareness across repositories, testing environments, style guides, and architectural history can perform substantially more valuable work than systems operating with fragmented information.

As AI systems become more persistent and autonomous, memory increasingly shifts from a supporting component to a gating factor in system capability. If the value of an AI agent scales with the amount of active context it can maintain, then demand for memory scales not only with the number of deployed agents, but also with the depth of information each agent continuously accesses.

At the same time, the supply side of the memory industry is becoming structurally more constrained. Advanced forms of memory such as High Bandwidth Memory (HBM) and Dynamic Random-Access Memory (DRAM) are becoming increasingly difficult and capital intensive to manufacture, while the lead times required to build cutting-edge fabrication capacity continue to lengthen. The transition toward HBM, which remains essential to advanced AI accelerators, further tightens supply because it requires materially greater manufacturing complexity and capital investment than conventional memory technologies.

The result is a market where memory demand may accelerate materially faster than supply can realistically respond. In our view, this dynamic may support a more durable period of favorable industry economics than has historically characterized the memory sector.

The Return of the CPU

While GPUs remain central to AI computation, the rise of autonomous systems is also driving a meaningful resurgence in the importance of CPUs.

Traditional AI workloads were heavily concentrated within the model inference process itself, with GPUs performing the overwhelming majority of economically valuable work. Increasingly, however, orchestration is becoming part of the workload. Autonomous AI systems require continuous scheduling, memory management, retrieval, planning, tool usage, and coordination across increasingly complex workflows.

Many of these functions occur outside the core inference layer and are significantly more CPU intensive. As a result, CPUs are becoming more deeply integrated into AI inference architectures than many initially expected.

This transition is already visible in changing deployment patterns across data centers. Traditional AI systems often paired one CPU with multiple GPUs, as CPUs primarily acted as lightweight coordinators for accelerator clusters. Emerging AI architectures require materially greater CPU resources per accelerator as orchestration and workflow management workloads expand.

In many ways, this reflects a broader shift in AI infrastructure. Historically, compute itself was the primary scarce resource. Increasingly, however, the scarce resource may become the ability to coordinate increasingly complex systems efficiently. That shift has important implications not only for CPUs, but for the broader architecture of AI infrastructure.

Networking as the Next Bottleneck

If the first AI bottleneck was compute and the second may be memory, the next increasingly appears to be networking.

As AI clusters scale, moving data efficiently between accelerators is becoming as important as the accelerators themselves. Modern AI systems rely heavily on parallel computation, requiring continuous communication between thousands of chips operating simultaneously across increasingly large data center environments.

This creates an important dynamic: networking demand scales faster than compute. Every additional accelerator added to a cluster must communicate with an increasing number of other accelerators across the system, driving exponential growth in bandwidth and interconnect requirements.

Historically, networking infrastructure was often viewed as a secondary consideration relative to compute. Increasingly, however, accelerator utilization rates themselves are becoming constrained by communication overhead and data movement limitations. In effect, AI data centers are transitioning from compute-bound architectures toward networking-bound architectures.

This shift is reshaping the economics of the broader networking ecosystem. Optical interconnects, advanced circuit boards, lasers, optical engines, and testing equipment are becoming increasingly strategic components of AI infrastructure. Technologies such as co-packaged optics, which integrate optical networking closer to compute, may ultimately become essential to scaling future AI systems efficiently.

Importantly, the significance of these technologies extends beyond simple hardware demand. As bottlenecks migrate outward through the stack, the systems responsible for enabling compute increasingly become as strategically important as compute itself.

A Broader Infrastructure Cycle

Taken together, these developments suggest the AI investment cycle may be entering a new phase. The first chapter of AI infrastructure was overwhelmingly defined by accelerators and training compute. The next chapter increasingly appears likely to be defined by the broader ecosystem required to support persistent, context-rich, autonomous systems operating at scale.

That does not imply all infrastructure markets will evolve equally. Many of these industries remain cyclical and historically prone to periods of oversupply. In our view, industry structure matters as much as end-market growth. The most attractive opportunities are likely to emerge in segments where accelerating demand intersects with structurally constrained supply, high barriers to entry, and increasingly consolidated competitive dynamics.

Memory may represent one such example. Connectivity may represent another, though portions of the optical ecosystem could ultimately prove more vulnerable to aggressive future capacity expansion. More broadly, however, we believe the key takeaway is that AI infrastructure is no longer scaling linearly with compute alone. Increasingly, it scales with memory, orchestration, and data movement.

Ultimately, the rise of autonomous AI systems represents more than simply another increase in computing demand. It reflects the emergence of a fundamentally different compute architecture, one that requires a broader, more interconnected, and more sophisticated infrastructure stack than the first phase of the AI era. In our view, that shift may create a durable and expanding opportunity set across select areas of the semiconductor and data center ecosystem in the years ahead.

Disclosures:

The views expressed are the opinion of Sands Capital and are not intended as a forecast, a guarantee of future results, investment recommendations or an offer to buy or sell any securities. The views expressed were current as of the date indicated and are subject to change. This material may contain forward-looking statements, which are subject to uncertainty and contingencies outside of Sands Capital’s control. Readers should not place undue reliance upon these forward-looking statements. There is no guarantee that Sands Capital will meet its stated goals.

All investments are subject to market risk, including the possible loss of principal. Recent tariff announcements may add to this risk, creating additional economic uncertainty and potentially affecting the value of certain investments. Tariffs can impact various sectors differently, leading to changes in market dynamics and investment performance. Companies that manufacture AI infrastructure are subject to risks including rapid technological change, competitive pressures, supply chain disruptions, regulatory developments, customer concentration, and fluctuations in demand for AI-related products and services, which may adversely affect financial performance and valuations.

As of May 15, 2026, NVIDIA and Broadcom were held across Sands Capital strategies. The specific securities identified and described do not represent all of the securities purchased, sold, or recommended for advisory clients. There is no assurance that any securities discussed will remain in the portfolio or that securities sold have not been repurchased. You should not assume that any investment is or will be profitable. The companies were chosen based on an objective criterion and represent the two primary providers of AI semiconductor chip design, as measured by market share.

Further Disclosures

References to the “firm”, “we” or “our” are references to Sands Capital. Sands Capital refers to the combination of Sands Capital Management, LLC, Sands Capital Alternatives, LLC and Sands Capital Horizons, LLC. All three firms are registered investment advisers with the United States Securities and Exchange Commission in accordance with the Investment Advisers Act of 1940. The three registered investment advisers share certain personnel, office space, and other resources.

This communication is for informational purposes only and does not constitute an offer, invitation, or recommendation to buy, sell, subscribe for, or issue any securities. The material is based on information that we consider correct, and any estimates, opinions, conclusions, or recommendations contained in this communication are reasonably held or made at the time of compilation. However, no warranty is made as to the accuracy or reliability of any estimates, opinions, conclusions, or recommendations. It should not be construed as investment, legal, or tax advice and may not be reproduced or distributed to any person.

Investment Strategy

The Stack That China Built

China’s AI future requires more than models. It rests on the companies supplying the equipment, memory, cloud capacity, and data movement needed to make domestic AI work at scale.

Learn more

Watch the video

Listen here

Investment Strategy

The New Power Economy

Electricity demand is rising as AI, electrification, and advanced manufacturing make the economy more power intensive, creating opportunities for businesses tied to generation, transmission, grid equipment, and on-site power.

Learn more

Watch the video

Listen here

Investment Strategy

The AI Supply Shock

AI adoption may be moving from experimentation to deployment faster than infrastructure can expand, creating potential opportunities for companies positioned at key bottlenecks across the AI value chain.

Learn more

Watch the video

Listen here

Investment Strategy

Korea in Demand

Korea’s opportunity set has broadened as global demand rises for trusted suppliers across AI infrastructure, semiconductors, power equipment, shipbuilding, nuclear, robotics, and beauty.

Learn more

Watch the video

Listen here

Investment Strategy

From Umami to Automation: How Japan’s Reinvention Is Creating New Investment Frontiers

After many research trips to Japan, we moved to increase some of our exposure to businesses supported by key trends, such as digitalization and advanced computing, industrial innovation, unique consumer preferences, and an aging domestic and global population.

Learn more

Watch the video

Listen here

Sands Capital is an active, long-term investor in leading innovative growth businesses, globally. Our approach combines analytical rigor and creative thinking to identify high-quality growth businesses that are creating the future. Through an integrated investment platform spanning alternatives, growth equity and public equity, we provide growth capital solutions to institutions and fund sponsors in more than 30 countries.

An independent, staff-owned firm founded in 1992 and headquartered in the Washington, D.C. area with offices in London, New York, San Francisco and Singapore, Sands Capital managed $50B in client assets as of June 30, 2026.

For institutional and professional investors only. Investment advisory fees are described in Part II of the adviser’s Form ADV.

Our Culture

Our People

News & Events

Careers

Korea in Demand

Building Resilience in an AI-Driven World

Obvious. Difficult. Repeatable.

FPT Harnesses Education to Transform Talent Landscape

Philosophy & Approach

Stewardship

Public Equity

Venture and Growth Equity

Beyond GPUs: The Broadening of AI Infrastructure

Contributors

Daniel C. Fu

Michael F. Rallo, CFA

Stay Up To Date

The form was sent successfully

From Compute Scaling to Context Scaling

The Return of the CPU

Networking as the Next Bottleneck

A Broader Infrastructure Cycle

Related Articles

Important Notice

The form was sent successfully

Our Culture

Our People

News & Events

Careers

Korea in Demand

Building Resilience in an AI-Driven World

Obvious. Difficult. Repeatable.

FPT Harnesses Education to Transform Talent Landscape

Philosophy & Approach

Stewardship

Public Equity

Venture and Growth Equity

Beyond GPUs: The Broadening of AI Infrastructure

Contributors

Daniel C. Fu

Michael F. Rallo, CFA

Related Articles

Stay Up To Date

The form was sent successfully

From Compute Scaling to Context Scaling

The Return of the CPU

Networking as the Next Bottleneck

A Broader Infrastructure Cycle

Related Articles

Important Notice​

The form was sent successfully

Important Notice