← Back to Discover
NVIDIA

NVIDIA/TransformerEngine

PythonApache-2.0active
88Health

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

Stars3.4k
Forks738
Open Issues357
Contributors738
Last Push0d ago

Health Breakdown

Activity
25
Community
25
Maintenance
13
Popularity
25
#cuda#deep-learning#fp4#fp8#gpu#jax#machine-learning#python#pytorch
View on GitHub ↗Issues (357) ↗Pull Requests ↗

Should you contribute to NVIDIA/TransformerEngine?

NVIDIA/TransformerEngine has a FoundDev health score of 88/100, which puts it in the active-and-maintained tier. The maintainer team is shipping recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

Last push was 0 days ago — that signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

Licensed under Apache-2.0, a standard OSI-approved license — safe to contribute to under normal employer IP policies.

Community

NVIDIA
NVIDIA/TransformerEngine
PythonApache 2.0
88

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

active
3.4k738 contributors357 issues
0d ago

More Python repos

openstates
openstates/openstates-scrapers
source for Open States scrapers
90295
openstack
openstack/nova
OpenStack Compute (Nova). Mirror of code maintained at opendev.org.
3.2k92
jupyter
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
8.4k92