Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 750 143

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 419 71

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.8k 1.6k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.8k 242

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4.1k 492

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.8k 1k

Repositories

Showing 10 of 693 repositories
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    NVIDIA/OSMO’s past year of commit activity
    TypeScript 107 Apache-2.0 19 65 4 Updated Mar 15, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 13,099 2,182 534 576 Updated Mar 15, 2026
  • k8s-nim-operator Public

    An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.

    NVIDIA/k8s-nim-operator’s past year of commit activity
    Go 151 Apache-2.0 42 8 12 Updated Mar 15, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 2,157 Apache-2.0 295 72 105 Updated Mar 15, 2026
  • bare-metal-manager-core Public

    NVIDIA Bare Metal Manager - Hardware Lifecycle Management and multitenant networking

    NVIDIA/bare-metal-manager-core’s past year of commit activity
    Rust 94 Apache-2.0 63 91 (3 issues need help) 38 Updated Mar 15, 2026
  • NV-Kernels Public

    Ubuntu kernels which are optimized for NVIDIA server systems

    NVIDIA/NV-Kernels’s past year of commit activity
    93 58 0 15 Updated Mar 15, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 15,656 3,693 321 (1 issue needs help) 333 Updated Mar 15, 2026
  • torch-harmonics Public

    Differentiable signal processing on the sphere for PyTorch

    NVIDIA/torch-harmonics’s past year of commit activity
    Jupyter Notebook 650 65 5 8 Updated Mar 15, 2026
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
    C++ 2,211 358 1,275 (6 issues need help) 212 Updated Mar 15, 2026
  • edk2-platforms Public

    NVIDIA fork of tianocore/edk2-platforms

    NVIDIA/edk2-platforms’s past year of commit activity
    C 13 4 0 0 Updated Mar 15, 2026