AWS recently announced Amazon S3 Tables, a feature designed to address the unique challenges of storing and querying tabular data in the cloud. While this may sound revolutionary, a closer inspection reveals that the limitations AWS is trying to overcome are intrinsic to their own infrastructure, not universal to object storage. Let’s break it down and explain why AIStor users don’t need to worry about “special buckets” for their data lakehouse.
S3 Tables introduce a new type of S3 bucket—a table bucket—specifically optimized for Apache Iceberg based analytics workloads. Key features include:
However, these optimizations come with notable trade-offs: added expenses, potential Glue dependencies, and limited flexibility for non-AWS environments. For example, if you’re already using a tool for compaction like Lambdas, Spark, Athena or other service of software you could be paying extra for compute that you don’t need.
The primary driver for S3 Tables is to address performance bottlenecks in analytics workloads. Standard S3 buckets hit their transaction limits quickly when used with Apache Iceberg for data lakehouses, causing hotspots and degraded performance. By introducing table buckets, AWS can now offer significantly higher request rates for these speciality workloads.
But here’s the kicker: these bottlenecks are unique to AWS’s architecture. They arise because of the way S3 is built and because of the request limits initially imposed by AWS, not because of Iceberg or object storage in general. For MinIO users, these issues simply don’t exist.
It goes without saying that AWS is a major part of the S3 ecosystem. AWS S3 Tables only optimize Iceberg Tables, leaving users of Hudi and Delta Lake to create and manage their own buckets. This choice by AWS to throw their weight around investing and promoting Iceberg over the other open table formats will be very impactful. This action further continues the course started by Databrick’s acquisition of Tabular and Snowflake’s open-sourcing of Polaris.
More importantly, all these investments, convergences, and contributions all support the rising supremacy of open-table format data lakehouses built on object storage. The era of object storage as primary storage has arrived.
AIStor subscribers have always been able to store Iceberg tables in any bucket without worrying about request limits. Of course, as AIStor is your storage layer, you’ve always needed a compute layer like Spark, Dremio or Starburst to create, manage and retrieve your open table format data. AIStor is uniquely capable in this partnership for the following reasons;:
AWS markets S3 Tables as delivering up to 10x higher transaction rates for Iceberg tables. But with AIStor, you’re not constrained by predefined limits. Instead, you size your cluster based on workload demands, achieving the performance you need without additional cost or complexity.
Moreover, AIStor’s object storage is built to deliver consistent, high performance for both analytics and transactional workloads. Meaning that unlike AWS’s table buckets, you’re not forced to segregate storage types to achieve acceptable performance and you’re not limited to a single open-table format.
S3 Tables introduce complexity and additional expense:
By contrast, MinIO offers a simpler, more cost-effective solution. There’s no “table bucket” tax, and you can use open table formats like Iceberg without artificial restrictions.
Amazon S3 Tables address limitations in AWS’s own infrastructure but add complexity, cost, and lock-in for users. AIstor, on the other hand, empowers users to run high-performance queries over Iceberg tables without special buckets or cloud dependencies.
The takeaway? If you’re already using AIStor, you’re ahead of the game. And if you’re considering S3 Tables, take a closer look at whether they’re solving real problems—or just the ones created by AWS.
If you have any further questions on how AIStor delivers scalable, high-performance object storage for AI and advanced analytics in data lakehouses, please reach out to us at hello@min.io or on our Slack channel.