TOPIC

#big-data

Open source repositories tagged with #big-data, ranked by health score.

apache/flink

Java

health

Apache Flink

★ 26.2k

hortonworks/cloudbreak

Java

health

CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.

★ 361

reductstore/reductstore

Rust

health

High Performance Data Backbone for Robotics and Industrial IoT

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

The AI search platform

Apache Ignite

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

★ 9.0k