← Explore
TOPIC

#datalake

Open source repositories tagged with #datalake, ranked by health score.

StarRocks
StarRocks/starrocks
Java
88
health

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

11.7k
trinodb
trinodb/trino
Java
86
health

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

12.8k
apache
apache/hudi
Java
80
health

Upserts, Deletes And Incremental Processing on Big Data.

6.2k