NVIDIA repositories

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwel…

python machine-learning deep-learning

python machine-learning deep-learning gpu cuda pytorch jax fp8 fp4

Python

•

Apache License 2.0

•702•3.3k•230•124•Updated

Apr 22, 2026

aicr

Public

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

config kubernetes manifest

config kubernetes manifest ai runtime gpu helm argocd

Go

•

Apache License 2.0

•33•270•26•8•Updated

Apr 22, 2026

nccl

Public

Optimized primitives for collective multi-GPU communication

deep-learning cpp gpu

deep-learning cpp gpu cuda nvidia communications

C++

•

Other

•1.2k•4.6k•235•89•Updated

Apr 22, 2026

IsaacTeleop

Public

The unified framework for sim & real robot teleoperation

Python

•

Apache License 2.0

•15•150•42•21•Updated

Apr 22, 2026

Megatron-LM

Public

Ongoing research training transformer models at scale

transformers model-para large-language-models

Python

•

Other

•3.9k•16k•353•366•Updated

Apr 22, 2026

ncx-infra-controller-core

Public

NCX Infra Controller - Hardware Lifecycle Management and multitenant networking

Rust

•

Apache License 2.0

•83•127•168•64•Updated

Apr 21, 2026

TensorRT-LLM

Public

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inferen…

cuda pytorch moe

cuda pytorch moe blackwell llm-serving

Python

•

Other

•2.3k•13k•582•768•Updated

Apr 21, 2026

multi-storage-client

Public

Unified high-performance Python client for object and file stores.

Python

•

Apache License 2.0

•13•65•2•1•Updated

Apr 21, 2026

Isaac-GR00T

Public

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python

•

Apache License 2.0

•1.1k•6.8k•186•77•Updated

Apr 21, 2026

aistore

Public

AIStore: scalable storage for AI applications

kubernetes high-performance distributed-storage

kubernetes high-performance distributed-storage high-availability object-storage multi-cloud batch-jobs s3-compatible multipart-upload ml-training

Go

•

MIT License

•246•1.8k•2•1•Updated

Apr 21, 2026

topograph

Public

A toolkit for discovering cluster network topology.

Go

•

Apache License 2.0

•20•117•6•3•Updated

Apr 21, 2026

cccl

Public

CUDA Core Compute Libraries

cpp hpc gpu

cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing

C++

•

Other

•379•2.3k•1.3k•256•Updated

Apr 21, 2026

NVSentinel

Public

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go

•

Apache License 2.0

•72•260•32•15•Updated

Apr 21, 2026

NeMo-Retriever

Public

NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVI…

Python

•

Apache License 2.0

•315•2.9k•126•71•Updated

Apr 21, 2026

cudaqx

Public

Accelerated libraries for quantum-classical computing built on CUDA-Q.

C++

•

Other

•58•96•33•23•Updated

Apr 21, 2026

cuCollections

Public

datastructures cpp gpu

datastructures cpp gpu cuda hashmap cpp17 hashset hashtable

C++

•

Apache License 2.0

•107•636•54•14•Updated

Apr 21, 2026

cuEquivariance

Public

cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and …

Python

•29•387•17•2•Updated

Apr 21, 2026

NVFlare

Public

NVIDIA Federated Learning Application Runtime Environment

python decentralized pet

python decentralized pet privacy-protection federated-learning federated-analytics federated-computing

Python

•

Apache License 2.0

•249•922•15•17•Updated

Apr 21, 2026

trt-samples-for-hackathon-cn

Public

Simple samples for TensorRT programming

Python

•

Apache License 2.0

•349•1.7k•65•2•Updated

Apr 21, 2026

srt-slurm

Public

NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across specific AI use cases across…

Python

•

Other

•20•15•4•8•Updated

Apr 21, 2026

tilus

Public

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

tile programming kernel

tile programming kernel cuda

Python

•

Apache License 2.0

•24•477•5•1•Updated

Apr 21, 2026

ncx-infra-controller-rest

Public

NCX Infra Controller - Hardware Lifecycle Management (REST API)

Go

•

Apache License 2.0

•32•34•30•13•Updated

Apr 21, 2026

nvidia-resiliency-ext

Public

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time…

Python

•

Other

•50•284•4•19•Updated

Apr 21, 2026

fleet-intelligence-agent

Public

NVIDIA Fleet Intelligence Agent - Host agent for GPU telemetry collection and attestation

Go

•

Apache License 2.0

•1•14•0•3•Updated

Apr 21, 2026

makani

Public

Massively parallel training of machine-learning based weather and climate models

Python

•

Other

•72•373•5•3•Updated

Apr 21, 2026

OSMO

Public

The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge device…

TypeScript

•

Apache License 2.0

•34•146•61•16•Updated

Apr 21, 2026

physicsnemo

Public

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

machine-learning deep-learning physics

machine-learning deep-learning physics pytorch nvidia-gpu nvidia-warp

Python

•

Apache License 2.0

•644•2.7k•21•38•Updated

Apr 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVIDIA Corporation

All

All

710 repositories

NemoClaw

context-aware-rag

cuda-quantum

TransformerEngine

aicr

nccl

IsaacTeleop

Megatron-LM

ncx-infra-controller-core

TensorRT-LLM

multi-storage-client

Isaac-GR00T

aistore

topograph

cccl

NVSentinel

NeMo-Retriever

cudaqx

cuCollections

cuEquivariance

NVFlare

trt-samples-for-hackathon-cn

srt-slurm

tilus

ncx-infra-controller-rest

nvidia-resiliency-ext

fleet-intelligence-agent

makani

OSMO

physicsnemo

All

All

Repositories list

710 repositories