Oasis: Pooling PCIe Devices Over CXL to Boost Utilization

Use CXL shared memory as a message channel to pool PCIe devices (NICs, SSDs) across hosts in a CXL pod


Venue: SOSP 2025
Link: ACM DL

Topic: PCIe devices (NICs, SSDs) are underutilized in cloud platforms because they are allocated conservatively for peak demand. Oasis enables pooling PCIe devices across hosts in a CXL pod using CXL shared memory as the communication channel.


Summary

PCIe devices are frequently underutilized because cloud platforms allocate them conservatively to satisfy each host’s peak demand. PCIe device pools let multiple hosts share devices, but existing solutions (PCIe switches, RDMA-based disaggregation) are expensive, inflexible, or cannot handle all device types. Oasis instead uses the CXL memory pool as shared memory across the hosts in a CXL pod. Because it reuses existing CXL pod designs, it enables PCIe device pooling at near-zero extra cost.


Background

PCIe underutilization
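
The gap between per-host peak provisioning and pooled provisioning can be illustrated with a little arithmetic. The sketch below uses made-up demand numbers (they are not measurements from the paper) and assumes hosts rarely peak at the same moment. It compares the capacity needed when every host is provisioned for its own peak against the capacity a shared pool would need to cover the peak of the combined demand.

```python
# Hypothetical per-host NIC bandwidth demand (Gbit/s) over five time steps.
# These numbers are illustrative only and do not come from the Oasis paper.
demand = {
    "host0": [10, 80, 20, 10, 15],
    "host1": [70, 10, 15, 20, 10],
    "host2": [15, 20, 90, 10, 20],
}

# Per-host provisioning: every host gets enough capacity for its own peak.
sum_of_peaks = sum(max(series) for series in demand.values())

# Pooled provisioning: the pool only has to cover the peak of the total demand.
aggregate = [sum(step) for step in zip(*demand.values())]
peak_of_sum = max(aggregate)

print("per-host provisioning:", sum_of_peaks, "Gbit/s")
print("pooled provisioning:  ", peak_of_sum, "Gbit/s")
```

The peak of the sum can never exceed the sum of the peaks, so pooling never needs more capacity than per-host provisioning under this simple model. How much it saves depends on how weakly the hosts' peaks are correlated.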

Existing solutions and limitations


Key Idea

CXL memory pool as shared message channel
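
To make the idea of a message channel in shared memory concrete, here is a minimal sketch of a single-producer, single-consumer ring buffer laid out in one byte region. It is not the Oasis protocol: the class name `SpscRing`, the header layout, and the slot format are all assumptions made for illustration, and a local `bytearray` stands in for the CXL memory pool. A real cross-host channel would also have to deal with cache coherence and memory ordering, which this sketch ignores.

```python
import struct

# Hypothetical layout: [head: u64][tail: u64][slot 0][slot 1]...
# head = next slot to read, tail = next slot to write.
HEADER = struct.Struct("<QQ")
SLOT_LEN = struct.Struct("<I")  # each slot starts with its payload length


class SpscRing:
    """Single-producer/single-consumer message ring over a writable buffer."""

    def __init__(self, buf, slots, slot_size):
        self.buf = memoryview(buf)
        self.slots = slots
        self.slot_size = slot_size
        if len(self.buf) < HEADER.size + slots * slot_size:
            raise ValueError("buffer too small for the requested ring")

    def _slot_offset(self, index):
        return HEADER.size + (index % self.slots) * self.slot_size

    def send(self, payload):
        """Copy payload into the next free slot; return False if the ring is full."""
        if len(payload) > self.slot_size - SLOT_LEN.size:
            raise ValueError("payload does not fit in one slot")
        head, tail = HEADER.unpack_from(self.buf, 0)
        if tail - head == self.slots:
            return False
        off = self._slot_offset(tail)
        SLOT_LEN.pack_into(self.buf, off, len(payload))
        start = off + SLOT_LEN.size
        self.buf[start:start + len(payload)] = payload
        # Publish the message by advancing tail only after the payload is written.
        struct.pack_into("<Q", self.buf, 8, tail + 1)
        return True

    def recv(self):
        """Return the oldest message as bytes, or None if the ring is empty."""
        head, tail = HEADER.unpack_from(self.buf, 0)
        if head == tail:
            return None
        off = self._slot_offset(head)
        (length,) = SLOT_LEN.unpack_from(self.buf, off)
        start = off + SLOT_LEN.size
        payload = bytes(self.buf[start:start + length])
        # Free the slot by advancing head after the payload is copied out.
        struct.pack_into("<Q", self.buf, 0, head + 1)
        return payload


# A zero-filled bytearray stands in for a region of the shared CXL memory pool.
region = bytearray(HEADER.size + 4 * 64)
host_a = SpscRing(region, slots=4, slot_size=64)  # producer view
host_b = SpscRing(region, slots=4, slot_size=64)  # consumer view
host_a.send(b"read block 7")
print(host_b.recv())
```

Both views operate on the same bytes, so whatever one side writes becomes visible to the other without any network transfer. That property is what lets shared memory act as the channel between a host that owns a device and a host that wants to use it.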

Challenges


Design

Datapath

Control plane

Implementation


Evaluation


Limitations

  1. CXL failures: CXL link and cable faults are the most common failure type, so pooling depends on resilient CXL pod designs.
  2. CXL bandwidth saturation: if CXL bandwidth is exhausted, traffic may need to be rebalanced (e.g., with Intel RDT).

Meeting Notes