Gilad Shainer

Profile

The Architect Behind the World's Fastest Supercomputer Networks

Somewhere inside the machines running the most advanced AI models ever built, packets are racing through switches at 800 gigabits per second. Gilad Shainer helped design the protocols that make that speed usable.

At NVIDIA, Shainer holds the title of Senior Vice President of Networking - but the scope of his work runs considerably deeper than the org chart suggests. When NVIDIA CEO Jensen Huang talks about "AI factories," those factories need wiring. Shainer is the person deciding what that wiring looks like, what speeds it runs at, and what software stack orchestrates the billions of messages that pass through it every second.

More than half the world's top 500 supercomputers run on NVIDIA's Quantum InfiniBand networking - a fact Shainer cited live at ISC 2025 in Hamburg. That is no accident of market share. It reflects more than two decades of architectural decisions, protocol design, and ecosystem building, dating back to Shainer's start as a design engineer at Mellanox Technologies in 2001.

Mellanox - the Israeli networking company that pioneered InfiniBand as a serious high-performance interconnect - was acquired by NVIDIA for $6.9 billion in 2020. The acquisition was, in many ways, an acknowledgment that AI training at scale has the same demands as supercomputing: low latency, high bandwidth, zero jitter, and the ability to synchronize thousands of processors without losing a single clock cycle. Shainer had spent 19 years proving that was possible. He joined NVIDIA with the architecture already built.

"Today, the scale of the data center has shifted. In the era of AI, the data center itself has become the unit of computing. Instead of asking 'How many CPUs can I buy?' the question is now, 'How do I design a data center capable of running my workloads at maximum efficiency?'"

- Gilad Shainer, NVIDIA SVP Networking

Origin Story

From an Artificial Pancreas to 800Gb/s Switches

Shainer completed both his B.Sc. and M.Sc. in Electrical Engineering at the Technion - Israel Institute of Technology, graduating Cum Laude both times. Technion is to Haifa roughly what MIT is to Cambridge, Massachusetts: it sits near the top of global engineering rankings, and its graduates account for an outsized share of Israel's technology exports.

During his master's research, Shainer worked on a project that had nothing to do with networking. The Artificial Pancreas initiative was an early biomedical engineering project exploring closed-loop insulin delivery systems. It was interdisciplinary work with real stakes - the kind of research that trains an engineer to think in terms of systems under load, feedback loops, and latency consequences. That instinct would translate, in unexpected ways, to the world of high-performance interconnects.

After completing his M.Sc. in 2001, Shainer joined Mellanox Technologies - at the time a small Israeli startup in Yokneam that was betting the company on a nascent interconnect standard called InfiniBand. The bet looked questionable at first. InfiniBand faced rival standards, skeptical hyperscalers, and an industry that largely preferred Ethernet because it was familiar. Mellanox - and Shainer - spent the better part of a decade proving the skeptics wrong, one supercomputer cluster at a time.

By the time Shainer rose to VP of Marketing in 2013, InfiniBand was the dominant fabric for HPC. The TOP500 list - the ranking of the world's most powerful supercomputers - was increasingly populated by machines wired with Mellanox's technology. The technical advantages were real: RDMA (Remote Direct Memory Access) let one node read and write another node's memory directly, bypassing the operating system and CPU on the data path and slashing latency. Adaptive routing prevented congestion hotspots. And a series of innovations that Shainer helped shepherd - including CORE-Direct collective offload and the SHARP protocol for in-network aggregation - moved computation itself into the network, reducing the amount of data that had to travel across it at all.

The Oscars of Innovation

R&D 100 Award — 2015

CORE-Direct

In-network collective offload technology. Moves MPI collective operations off the host CPU and into the network hardware itself, reducing AI/HPC training time.

R&D 100 Award — 2019

UCX Framework

Unified Communication X - open-source, hardware-native communication framework connecting MPI, SHMEM, and emerging AI workloads to the hardware layer.

"InfiniBand was designed from the ground up for synchronous, high-performance computing - with features like RDMA to bypass CPU jitter, adaptive routing, and congestion control. It's the gold standard for AI training at scale."

- Gilad Shainer

Community

400 Organizations. One Mission. Built from Scratch.

In 2008, Shainer founded the HPC-AI Advisory Council. At the time, it was an idea: a community where HPC practitioners from industry, academia, and government could exchange knowledge, benchmark tools, and push the field forward. There was no membership-fee model. No venture backing. Just a conviction that the HPC community was undersupported and that information was siloed in ways that hurt everyone.

Seventeen years later, the council has more than 400 member organizations. It runs global workshops, produces benchmarking resources, and has become one of the primary networking venues for the people who build and operate the world's most demanding compute infrastructure. When AI workloads started displacing traditional HPC simulations on the same hardware, the council adapted - adding AI track programming before most organizations had realized the convergence was real.

Around the same time Shainer was building the council, he co-founded the ISC Student Cluster Competition - a global competition that puts student teams in front of real HPC cluster hardware and challenges them to achieve the highest performance on a mix of real-world scientific workloads. The competition is now in its 13th year. Shainer still shows up. He is on the advisory board of the Winter Classic Invitational, a similar student competition, and runs the APAC HPC-AI University Competition annually. The pattern is not accidental. He genuinely believes that the future of the field runs through the people coming up now, and acts accordingly.

The HPC-AI Advisory Council in numbers: Founded 2008 with zero members. Now spans 400+ organizations including national labs, Fortune 500 companies, universities, and government agencies. Runs workshops on every continent.

The AI Factory Era

When the Data Center Becomes the Computer

There is a moment in every technology transition when the unit of analysis shifts. For decades, it was the processor. Then the server. Then the rack. Now, in Shainer's framing - which is increasingly also Jensen Huang's framing - the unit is the data center itself.

AI training at scale does not distribute workloads the way traditional cloud computing does. A GPT-scale training run does not tolerate stragglers. If one GPU in a cluster of ten thousand slows down, every other GPU waits. This is why the network is not infrastructure in the traditional sense - it is a performance-critical component, as important as the GPU itself. A cluster connected by a slow or jittery fabric can train models more slowly than a smaller cluster connected by a fast one. Shainer was making this argument before it was fashionable, and the AI industry has since arrived at his position.
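The straggler effect described above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not a measurement of any real cluster: each worker's step takes a fixed base time plus a random delay, and a synchronous step ends only when the slowest worker finishes. The function names, the 100 ms base time, and the uniform jitter distribution are all illustrative choices.

```python
import random


def sync_step_time(worker_times):
    """A synchronous step finishes only when the slowest worker does."""
    return max(worker_times)


def simulate(num_workers, base_ms=100.0, jitter_ms=0.0, steps=200, seed=0):
    """Average step time when each worker takes base_ms plus a random
    delay drawn uniformly from [0, jitter_ms] (an assumed distribution)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(steps):
        times = [base_ms + rng.uniform(0.0, jitter_ms) for _ in range(num_workers)]
        total += sync_step_time(times)
    return total / steps


if __name__ == "__main__":
    for jitter in (0.0, 5.0, 20.0):
        for workers in (8, 1024):
            avg = simulate(workers, jitter_ms=jitter)
            print(f"workers={workers:5d} jitter={jitter:4.1f} ms -> avg step {avg:6.2f} ms")
```

In this model the average step time drifts toward the worst-case delay as the worker count grows, because the chance that at least one worker is slow on any given step approaches certainty. That is the intuition behind treating jitter, not just bandwidth, as a first-order property of the fabric.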

NVIDIA's response is a portfolio that now spans two networking paradigms. Quantum InfiniBand - built specifically for synchronous, tightly-coupled HPC and AI training workloads - powers the majority of the TOP500 list and the largest AI training clusters. Spectrum-X, announced in 2023, is NVIDIA's answer to customers who require Ethernet for operational or economic reasons: it is the first Ethernet fabric designed specifically for AI workloads, with hardware-level congestion management and adaptive routing that bring InfiniBand-competitive performance to an Ethernet architecture.

In 2025, Shainer announced Spectrum-XGS - a cross-domain networking capability that connects physically separate data centers into a single logical AI fabric. The concept is a "virtual mega-datacenter": rather than forcing all workloads into one physical building, Spectrum-XGS allows operators to pool compute across sites, connected at 800Gb/s and treated by the software stack as a unified resource. The implications for the economics of AI infrastructure are significant.

"In the coming generation you will see the ability to actually build those remote datacenters together and form a large, virtual, single datacenter." - Gilad Shainer, 2025

Key Technologies

INTERCONNECT

NVIDIA Quantum InfiniBand

World's first 800Gb/s InfiniBand platform. Powers 271 of the TOP500 supercomputers. RDMA, adaptive routing, hardware-level congestion control.

AI-NATIVE ETHERNET

NVIDIA Spectrum-X

Billed by NVIDIA as the first Ethernet fabric designed specifically for AI workloads. Brings InfiniBand-competitive performance to organizations requiring standard Ethernet.

IN-NETWORK COMPUTE

SHARP Protocol

Performs data reductions inside the network switch itself. Eases the all-reduce bottleneck in distributed AI training by aggregating data in the fabric instead of on the GPUs.

OPEN SOURCE

UCX Framework

Unified Communication X. Hardware-native, lightweight communication layer supporting MPI, SHMEM, and AI frameworks. 2019 R&D 100 winner.

SCALE-ACROSS

Spectrum-XGS

Cross-domain networking that connects distributed data centers into a unified AI super-factory. Announced 2025. The "virtual mega-datacenter."

DPU

BlueField DPU

Data Processing Units that offload networking, storage, and security from the main CPU/GPU path. Critical for performance at AI-cluster scale.

"The oldest generation will determine the performance of the newest generation."

- Gilad Shainer, on the risks of mixing hardware generations in an AI cluster

Research

An Executive Who Still Publishes in IEEE Micro

Most technology company SVPs stop publishing academic papers around the time they stop writing code. Shainer has not. His name appears on research published in IEEE Micro, ISC High Performance, IEEE Hot Interconnects, EuroMPI, and Springer venues - a publication record that is unusual for someone managing a multi-billion-dollar product portfolio.

His most recent IEEE Micro paper, published in 2025, covers Unified Collective Communication - a unified library for CPU, GPU, and DPU collectives that addresses one of the central performance bottlenecks in distributed AI training. Earlier work on SHARP was published in Supercomputing Frontiers and Innovations in 2017. The co-design architecture for exascale systems appeared in a Springer journal in 2013. There are 19 papers in IEEE Xplore alone, spanning computer networks, hardware architecture, and information systems.

The through-line in his research is the same as the through-line in his products: how do you move data between thousands of processors fast enough that the processors never have to wait? The answers turn out to involve doing more computation inside the network itself, reducing the volume of data that ever has to leave the switch.
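The idea of reducing data inside the network can be illustrated with a toy model. The sketch below is not the SHARP protocol or any NVIDIA API. It assumes a simple two-level tree in which hosts hang off leaf switches, and it counts how many gradient-sized vectors must cross the leaf uplinks when a sum is computed at a host versus when each leaf switch adds up its own hosts' vectors first. All names (`reduce_at_host`, `reduce_in_network`, `make_leaves`) are hypothetical.

```python
def vector_sum(vectors):
    """Element-wise sum of equal-length vectors."""
    return [sum(column) for column in zip(*vectors)]


def make_leaves(num_leaves, hosts_per_leaf, length):
    """Build illustrative integer vectors: leaves[leaf][host] is one host's data."""
    return [
        [[leaf * hosts_per_leaf + host + i for i in range(length)]
         for host in range(hosts_per_leaf)]
        for leaf in range(num_leaves)
    ]


def reduce_at_host(leaves):
    """Baseline: every host vector crosses its leaf uplink unreduced,
    and a single root host computes the sum."""
    uplink_vectors = sum(len(hosts) for hosts in leaves)
    all_vectors = [vec for hosts in leaves for vec in hosts]
    return vector_sum(all_vectors), uplink_vectors


def reduce_in_network(leaves):
    """In-network aggregation: each leaf switch sums its hosts' vectors
    and forwards a single partial sum upward."""
    partials = [vector_sum(hosts) for hosts in leaves]
    uplink_vectors = len(partials)
    return vector_sum(partials), uplink_vectors


if __name__ == "__main__":
    leaves = make_leaves(num_leaves=4, hosts_per_leaf=8, length=5)
    host_result, host_traffic = reduce_at_host(leaves)
    net_result, net_traffic = reduce_in_network(leaves)
    print("results match:", host_result == net_result)
    print("vectors crossing leaf uplinks:", host_traffic, "vs", net_traffic)
```

With 4 leaf switches of 8 hosts each, the baseline pushes 32 vectors over the uplinks while the in-network version pushes 4, and both arrive at the same sum. Real fabrics add many complications this sketch ignores, such as floating-point ordering, multi-level trees, and broadcasting the result back down.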


Career Timeline

25 Years, One Throughline

1998

B.Sc. Electrical Engineering, Cum Laude - Technion Israel Institute of Technology

2001

M.Sc. Electrical Engineering, Cum Laude - Technion. Joins Mellanox Technologies as Design Engineer. Thesis work includes the Artificial Pancreas biomedical project.

2005-2012

Senior Marketing Management at Mellanox - drives InfiniBand adoption in HPC clusters, supercomputers, and financial services infrastructure

2008

Founds HPC-AI Advisory Council - zero members at launch. Now 400+ organizations from industry, academia, and government worldwide

2012-2013

Promoted to VP Marketing Development, then VP Marketing at Mellanox. Co-founds the ISC Student Cluster Competition

2015

R&D 100 Award for CORE-Direct in-network collective offload technology - the "Oscars of Innovation"

2019

Second R&D 100 Award for UCX (Unified Communication X) - open-source hardware-native communication framework

2020

NVIDIA acquires Mellanox for $6.9 billion. Shainer joins NVIDIA as Senior Vice President of Networking

2022-2024

Leads NVIDIA Quantum-2 and Quantum-X800 InfiniBand platforms - world's first 400Gb/s, then 800Gb/s InfiniBand. Announces Spectrum-X AI-native Ethernet.

2025

NVIDIA InfiniBand powers 271 of TOP500 supercomputers. Spectrum-XGS cross-domain networking announced. IEEE Hot Interconnects 2026 keynote confirmed.

Notable Facts

Seven Things Worth Knowing

During his M.Sc. at Technion, Shainer worked on an artificial pancreas biomedical project - closed-loop insulin delivery research that had nothing to do with network switches. The experience of designing low-latency feedback systems for biological processes has some resonance with designing low-latency feedback systems for distributed computing.
He has won the R&D 100 Award - often called the "Oscars of Innovation" - twice, for two completely different technologies, four years apart. Most researchers consider one lifetime win notable.
The HPC-AI Advisory Council, which he founded in 2008 with essentially no resources, now has 400+ member organizations. He did not spin it out of a corporate program. He built it as a community from scratch.
NVIDIA's InfiniBand networking now connects more than half the world's 500 most powerful supercomputers - a market position built over two decades of incremental technical advantage.
His title changes depending on context. Conference materials and press releases call him "SVP Networking." LinkedIn lists him as "SVP Marketing." Both are accurate. His scope spans deep technical strategy and go-to-market leadership simultaneously.
He co-founded the ISC Student Cluster Competition more than a decade ago and remains personally involved in mentoring student HPC teams. The competition runs annually and has launched careers in HPC across multiple continents.
Shainer graduated Cum Laude at both B.Sc. and M.Sc. from Technion - one of the world's most rigorous technical universities. Both his undergraduate and graduate degrees were completed with honors.

Industry Leadership