
Solidigm, MinIO and Intel Put Object Storage Scaling to the Test
That’s why Solidigm, MinIO, and Intel partnered to design and benchmark S3-compatible object storage, with a focus on performance and concurrency scaling. This effort leverages Solidigm 122TB QLC NVMe drives, MinIO AIStor software, and Intel® Xeon® 6 processors, showcasing how a tightly integrated stack can deliver the throughput and efficiency required for modern AI workloads at scale.
AI infrastructure is at an inflection point as models grow from billions to trillions of parameters, as inference workload multiply through agentic systems, and as enterprises race to build their own AI factories. What is the one bottleneck that links all of these trends? Storage. Not just storage capacity, but the ability to serve massive datasets at the throughput modern GPU clusters demand without blowing up the data center power budget or floor space in the process.
Inference pipelines, especially those serving retrieval-augmented generation (RAG) and agentic workflows, require low-latency random reads across billions of objects. The traditional answer has been to throw more servers at the problem. But that approach collides with hard physical constraints: power capacity, rack space, cooling, and operational complexity. When your storage tier requires hundreds of nodes to hold your AI model, you’ve introduced a sprawl problem that undermines the economics of your entire AI factory.
What the industry needs is density without compromise:
The cluster we tested is extremely dense. Each storage node delivers approximately 3PB of raw capacity in a single 2U chassis. Our test was set up to measure S3 object storage GET & PUT throughput as client concurrency increases against the full 8-node storage cluster (~24PB). The baseline was a single client targeting all 8 storage nodes (~24PB), and we scaled up to 8 clients to saturate the cluster.
The client nodes were further over-provisioned with 1TB of DRAM, to ensure the load generator was never the bottleneck.
The storage servers are the fundamental building block of MinIO’s ExaPOD reference architecture, which scales linearly from petabytes to exabytes (EB) using this same dense node design. Where ExaPOD defines the blueprint for exascale, this benchmark validates the performance reality at the single-pod level.
Reaching these numbers required deliberate co-tuning of the full stack, storage software, host OS and network stack, NIC offloads, CPU and IRQ topology, and switch fabric. Each solution brings its own tuning challenges for optimal performance, as evidenced by these performance-tuning results, which achieve 3x performance over hardware and software defaults. The types of adjustments made are listed at the end of this paper.
We ran the MinIO Warp benchmark to measure how S3 throughput scales with increasing client concurrency against a fixed 8-node storage cluster (~24PB). Each run lasted 15 minutes per phase (PUT and GET separately). Object size was set to 256 MiB with 32 concurrent connections per client.
.png)
These results aren’t academic. They map directly to the MinIO ExaPOD reference architecture, which scales to 1EB usable across 640 servers in 32 racks, delivering up to 19.2 TB/s aggregate throughput. The current ~24PB, 8-node deployment provides a representative small-scale instantiation of the ExaPOD design, demonstrating the same architectural principles, density, and core performance characteristics in a more compact footprint.
For organizations planning their AI infrastructure, this benchmark answers a critical question. Can you achieve exascale economics at petascale entry points? The answer is yes. The same hardware, the same software, the same operational model
The compute announcements across the industry are impressive. But compute without storage is a GPU waiting on data. As you plan your AI factory, consider the storage math:
Solidigm and MinIO are proving that the densest storage doesn’t have to be the slowest. It can be the foundation of your entire AI data platform.
At the heart of this density story is the Solidigm™ D5-P5336,a hyper-scalable, cost-effective solution for AI and data-intensive workloads. With industry-leading capacity up to 122.88TB, the D5-P5336 is architected to efficiently accelerate and scale with the increasingly massive datasets found in widely-deployed, modern read-intensive workloads.
This testing was conducted in the Solidigm AI Central Lab, a purpose-built facility that brings together storage and AI capabilities to perform cutting-edge research and improve bottom-line results. The lab features high-performance GPUs, 800 Gbps Ethernet networking, and extensive Solidigm SSD infrastructure, all designed on reference architectures that mirror what hyperscalers and enterprises are deploying in data centers worldwide, making the findings broadly applicable to customer environments.
The AI Central Lab hosts what Solidigm believes to be the densest storage test cluster ever built: 192 Solidigm D5-P5336 SSDs packing 23.4PB into just 16U of rack space. The lab also has the capability to collect telemetry, creating a detailed picture of how system resources are used and where bottlenecks exist, allowing Solidigm and its collaborators to recommend optimizations to improve performance and power efficiency.


