Data Lakes & Analytics

MinIO Blog Posts

The Bank of the East - Replacing Hadoop with MinIO and Dremio
arrow
Financial institution replacing Hadoop with MinIO and Dremio for analytics
Case Studies & Solutions
Apache Ecosystem
Data Lakes & Analytics
PostgreSQL Meets Object Storage: Access External Data in MinIO
arrow
PostgreSQL pg_lakehouse extension queries object storage data as external tables
Integrations & Partners
Data Lakes & Analytics
The Architect's Guide to the New Private Cloud
arrow
Private cloud architecture guide for on-prem infrastructure replacing public cloud workloads
Data Lakes & Analytics
Architecture & Design Patterns
Kubernetes & Containers
Cloud Infrastructure
Architecting a Modern Data Lake
arrow
Modern data lakehouse reference architecture patterns for AI/ML infrastructure
Data Lakes & Analytics
Earn your RAG-ing rights with MinIO
arrow
Building RAG applications with MinIO vector embeddings and retrieval augmented generation
AI/ML
Data Lakes & Analytics
Architecture & Design Patterns
Deploy MinIO and Trino with Kubernetes
arrow
Deploy MinIO with Trino on Kubernetes—distributed SQL queries on object storage for fast lakehouse analytics
AI/ML
Data Lakes & Analytics
Integrations & Partners
Optimizing AI Data Processing with MinIO Weaviate and Langchain in Retrieval Augmented Generation (RAG) Pipelines
arrow
Integrate MinIO with Weaviate & LangChain for RAG pipelines—optimizing AI data processing & retrieval workflows
AI/ML
Data Lakes & Analytics
Integrations & Partners
Architecture & Design Patterns
Storage & Infrastructure
Query Iceberg Tables on MinIO with Dremio
arrow
Tutorial: Set up Dremio on Kubernetes to query Apache Iceberg tables stored on MinIO object storage
Integrations & Partners
Data Lakes & Analytics
The Architect’s Guide: A Modern Datalake Reference Architecture
arrow
Modern datalake reference architecture—unified data warehouse & lake using object storage with Iceberg/Hudi/Delta formats
Data Lakes & Analytics
Building Modern Data Architectures with Iceberg, Tabular and MinIO
arrow
Modern data architecture with Iceberg & Tabular on MinIO—eliminate vendor lock-in with universal storage platform
Data Lakes & Analytics
Apache Ecosystem
SQL Server 2022 Machine Learning Services Unlock the Value of Your Data
arrow
SQL Server 2022 external tables and machine learning services tutorial with MinIO
Integrations & Partners
AI/ML
Data Lakes & Analytics
Operations
MinIO and Apache Tika: A Pattern for Text Extraction
arrow
MinIO & Apache Tika text extraction—automate document processing with bucket notifications for ML/AI pipelines
AI/ML
Data Lakes & Analytics
Integrations & Partners
Event-Driven Architecture: MinIO Event Notification Webhooks using Flask
arrow
Webhook event notifications trigger automated workflows for bucket operations
Integrations & Partners
Kubernetes & Containers
Data Lakes & Analytics
Operations
Backing Up SQL Server 2022 Databases to MinIO
arrow
Complete guide to backing up SQL Server 2022 databases directly to MinIO using S3-compatible storage
Operations
Storage & Infrastructure
Data Lakes & Analytics
Integrations & Partners
Streamlining Data Events with MinIO and PostgreSQL
arrow
Streamline data events with MinIO & PostgreSQL using Docker—configure bucket notifications for event-driven apps
Kubernetes & Containers
Integrations & Partners
Data Lakes & Analytics
LanceDB: Your Trusted Steed in the Joust Against Data Complexity
arrow
LanceDB vector database with Lance columnar format for fast ML embeddings and semantic search
AI/ML
Data Lakes & Analytics
Apache Ecosystem
Architecture & Design Patterns
A Guide to Decoupled Storage with StarRocks and MinIO
arrow
StarRocks OLAP database with decoupled compute-storage architecture using MinIO
Apache Ecosystem
Architecture & Design Patterns
Kubernetes & Containers
Cloud Infrastructure
Data Lakes & Analytics
Smooth Sailing from Docker to Localhost
arrow
Resolving Docker localhost networking issues for MinIO event notifications with Flask
Kubernetes & Containers
Data Lakes & Analytics
Operations
Integrations & Partners
Data Lake Mysteries Unveiled: Nessie, Dremio, and MinIO Make Waves
arrow
GraphRAG architecture with Nessie catalog, Dremio SQL, and Iceberg for data lakehouses
Integrations & Partners
Apache Ecosystem
Architecture & Design Patterns
Data Lakes & Analytics
The Architects Guide to the Modern Data Stack
arrow
Modern data stack guide—explore storage, transformation, observability, discovery & ML in cloud-native architecture
Architecture & Design Patterns
Data Lakes & Analytics
Operations
Kubernetes & Containers
Cloud Infrastructure
Build a Streaming CDC Pipeline with MinIO and Redpanda into Snowflake
arrow
Build streaming CDC pipeline with Redpanda, Debezium & MinIO into Snowflake—simplify data migration & analytics
Apache Ecosystem
Architecture & Design Patterns
Integrations & Partners
Data Lakes & Analytics
Strict Consistency is a Hard Requirement for Primary Storage
arrow
Read-after-write consistency requirements for databases and AI workloads on primary storage
Architecture & Design Patterns
Storage & Infrastructure
Data Lakes & Analytics
Kubernetes & Containers
Cloud Infrastructure
Snapshot Backups for MongoDB Using MinIO
arrow
Using MinIO as target for MongoDB backup and restore operations
Data Lakes & Analytics
Integrations & Partners
Operations
Storage & Infrastructure
Apache Ecosystem
DuckDB and MinIO for a Modern Data Stack
arrow
DuckDB & MinIO for modern data stack—in-process OLAP database queries Parquet/CSV directly from object storage
Data Lakes & Analytics
Kubernetes & Containers