Data Lakes & Analytics

MinIO Blog Posts

Architecting a Modern Data Lake
arrow
Data Lakes & Analytics
Earn your RAG-ing rights with MinIO
arrow
In this blog, we will demonstrate how to use MinIO to build a Retrieval Augmented Generation(RAG) based chat application using commodity hardware.
AI/ML
Data Lakes & Analytics
Architecture & Design Patterns
Deploy MinIO and Trino with Kubernetes
arrow
In this tutorial, we'll deploy a cohesive system that allows distributed SQL querying across large datasets stored in Minio, with Trino leveraging metadata from Hive Metastore and table schemas from Redis.
AI/ML
Data Lakes & Analytics
Integrations & Partners
Optimizing AI Data Processing with MinIO Weaviate and Langchain in Retrieval Augmented Generation (RAG) Pipelines
arrow
Delve into AI’s next frontier with MinIO S3 Object-Store and SDK, enhancing a Weaviate Retreival Augmented Generation (RAG) Pipeline for robust data management. Discover how to elevate efficiency in AI systems using LangChain, unlocking new dimensions in scalable AI solutions.
AI/ML
Data Lakes & Analytics
Integrations & Partners
Architecture & Design Patterns
Storage & Infrastructure
Query Iceberg Tables on MinIO with Dremio
arrow
Integrations & Partners
Data Lakes & Analytics
The Architect’s Guide: A Modern Datalake Reference Architecture
arrow
Data Lakes & Analytics
Building Modern Data Architectures with Iceberg, Tabular and MinIO
arrow
Explore modern data architecture with Iceberg, Tabular, and MinIO. Learn to seamlessly integrate structured and unstructured data, optimize AI/ML workloads, and build a high-performance, cloud-native data lake.
Data Lakes & Analytics
Apache Ecosystem
SQL Server 2022 Machine Learning Services Unlock the Value of Your Data
arrow
Learn how to run Python stored procedures on SQL Server 2022.
Integrations & Partners
AI/ML
Data Lakes & Analytics
Operations
MinIO and Apache Tika: A Pattern for Text Extraction
arrow
AI/ML
Data Lakes & Analytics
Integrations & Partners
Event-Driven Architecture: MinIO Event Notification Webhooks using Flask
arrow
Explore deploying MinIO and Flask with Docker-compose for event-driven architecture. Master MinIO bucket events and Flask webhooks for efficient data workflows and robust applications. Dive into the synergy of cloud technologies.
Integrations & Partners
Kubernetes & Containers
Data Lakes & Analytics
Operations
Backing Up SQL Server 2022 Databases to MinIO
arrow
Learn how to back up SQL Server 2022 to MinIO on-premise.
Operations
Storage & Infrastructure
Data Lakes & Analytics
Integrations & Partners
Streamlining Data Events with MinIO and PostgreSQL
arrow
Explore 'Streamlining Data Events with MinIO and PostgreSQL,' a guide for developers using Docker, MinIO, and PostgreSQL. Learn about using Docker Compose for real-time data events, enhancing data analytics, and developing robust, event-driven applications.
Kubernetes & Containers
Integrations & Partners
Data Lakes & Analytics
LanceDB: Your Trusted Steed in the Joust Against Data Complexity
arrow
Joust against data complexity with LanceDB, a lightning-fast vector database optimized for AI/ML on the open-source Lance format. Teaming up with MinIO, it scales seamlessly, offering high-performance, cloud-native storage. Dive into the tutorial for a swift deployment.
AI/ML
Data Lakes & Analytics
Apache Ecosystem
Architecture & Design Patterns
A Guide to Decoupled Storage with StarRocks and MinIO
arrow
The combination of StarRocks and MinIO offers a cloud-native, flexible, and efficient data architecture for modern enterprises, enabling independent scaling and optimized resource utilization. Read the full tutorial for insights into cloud-native analytics with StarRocks and MinIO
Apache Ecosystem
Architecture & Design Patterns
Kubernetes & Containers
Cloud Infrastructure
Data Lakes & Analytics
Smooth Sailing from Docker to Localhost
arrow
Explore the integration of Dockerized MinIO with localhost Flask apps. This guide addresses Docker networking challenges, ensuring seamless MinIO and Flask communication for a development environment that closely mirrors production. Dive into practical solutions for robust workflows.
Kubernetes & Containers
Data Lakes & Analytics
Operations
Integrations & Partners
Data Lake Mysteries Unveiled: Nessie, Dremio, and MinIO Make Waves
arrow
Unleash data collaboration and quality with Nessie! Learn to manage branches, commits, and merges effortlessly. This guide walks you through deploying Dremio, MinIO, and Nessie, transforming your data engineering with collaborative precision. Dive in to revolutionize your workflows!
Integrations & Partners
Apache Ecosystem
Architecture & Design Patterns
Data Lakes & Analytics
The Architects Guide to the Modern Data Stack
arrow
This is your symphony for data excellence. Explore the components of this modern data stack, including storage, data integration, transformation, data observability, data discovery, data visualization, data analytics, and machine learning.
Architecture & Design Patterns
Data Lakes & Analytics
Operations
Kubernetes & Containers
Cloud Infrastructure
Build a Streaming CDC Pipeline with MinIO and Redpanda into Snowflake
arrow
Build a streaming Change Data Capture (CDC) pipeline with Redpanda and MinIO into Snowflake. This solution simplifies data migration and analytics, with Redpanda offering scalability, MinIO as efficient storage, and Snowflake as a cloud-native analytics engine.
Apache Ecosystem
Architecture & Design Patterns
Integrations & Partners
Data Lakes & Analytics
Strict Consistency is a Hard Requirement for Primary Storage
arrow
Architecture & Design Patterns
Storage & Infrastructure
Data Lakes & Analytics
Kubernetes & Containers
Cloud Infrastructure
Snapshot Backups for MongoDB Using MinIO
arrow
Explore how MongoDB's Ops Manager pairs with MinIO's high-performance object storage, creating a robust backup strategy for safeguarding MongoDB data. Discover the power of this combination and how it can transform your data management strategy.
Data Lakes & Analytics
Integrations & Partners
Operations
Storage & Infrastructure
Apache Ecosystem
DuckDB and MinIO for a Modern Data Stack
arrow
Dive into the world of efficient data exploration with DuckDB and MinIO, without the burden of moving your data, and discover the limitless possibilities of this powerful combination.
Data Lakes & Analytics
Kubernetes & Containers
Building a Data Lakehouse using Apache Iceberg and MinIO
arrow
AI/ML
Integrations & Partners
Data Lakes & Analytics
Dremio and MinIO on Kubernetes for Fast Scalable Analytics
arrow
Learn how to get started with Dremio and MinIO on Kubernetes for fast, scalable analytics.
Data Lakes & Analytics
Kubernetes & Containers
Integrations & Partners
Apache Ecosystem
The Architect’s Guide to Data and File Formats
arrow
You must have heard of different data formats like Parquet, ORC, Avro, Arrow, Protobuf, Thrift and MessagePack. What are they and how to choose the right one?
Architecture & Design Patterns
Data Lakes & Analytics
Apache Ecosystem