← Explore
TOPIC

#data-science

Open source repositories tagged with #data-science, ranked by health score.

opengeos
opengeos/leafmap
Python
89
health

A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment

3.7k
christopherkarani
christopherkarani/Wax
Swift
89
health

Single-file memory layer for AI agents, sub mili-second RAG on Apple Silicon. Metal Optimized On-Device. No Server. No API. One File. Pure Swift

738
supabase
supabase/supabase-py
Python
88
health

Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.

2.5k
catboost
catboost/catboost
C++
87
health

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

9.0k
trinodb
trinodb/trino
Java
86
health

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

12.8k
elixir-explorer
elixir-explorer/explorer
Elixir
83
health

Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir

1.3k
im-anishraj
im-anishraj/arnio
Python
83
health

C++ accelerated data quality toolkit for Python: CSV parsing, cleaning, schema validation, profiling, and pandas integration.

72
PavanMudigonda
PavanMudigonda/zero-to-ai
Jupyter Notebook
80
health

Free AI/ML course with 950+ Jupyter notebooks — Python, deep learning, LLMs, RAG, agents, prompt engineering, fine-tuning, MLOps

20
zaina-ml
zaina-ml/ml_forge
Python
70
health

A visual-based graph node editor for training computer vision models.

412
devAmoghS
devAmoghS/Machine-Learning-with-Python
Python
65
health

Small scale machine learning projects to understand the core concepts . Give a Star 🌟If it helps you. BONUS: Interview Bank coming up..!

1.3k
harmonydata
harmonydata/harmony
Python
63
health

The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.

56
panagiotisanagnostou
panagiotisanagnostou/HiPart
Python
59
health

Hierarchical divisive clustering algorithm execution, visualization and Interactive visualization.

52