BergLoom is a high-performance Iceberg table compaction runtime built on Rust, optimized specifically for large-scale data lakes. The project name, which means "iceberg weaver," symbolizes its core function of weaving scattered Iceberg data files into an efficient and orderly data structure.
- Rust-Native Performance: Low-latency, high-throughput compaction with memory safety guarantees
- DataFusion Engine: Leverages Apache DataFusion for query planning and vectorized execution
- Iceberg Native Support: Full compliance with Iceberg table formats via iceberg-rs
- Multi-Cloud Ready: Currently supports AWS S3, with plans for Azure Blob Storage and GCP Cloud Storage
- Full Compaction: Merges all data files in an Iceberg table and removes old files
- Deletion Support:
- Positional deletions (POS_DELETE)
- Equality deletions (EQ_DELETE)
- Incremental compaction support
- Merge-on-read performance optimization
- Standalone scheduler component
- Partition evolution support
- Schema evolution support
- Azure Blob Storage integration
- GCP Cloud Storage integration