← Explore
TOPIC

#distributed-training

Open source repositories tagged with #distributed-training, ranked by health score.

PaddlePaddle
PaddlePaddle/Paddle
C++
89
health

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

23.9k
skypilot-org
skypilot-org/skypilot
Python
88
health

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

10.0k