Skip to content
Change the repository type filter

All

    Repositories list

    • Primus-SaFE(Stability and Fault Endurance)
      Go
      Other
      05606Updated Apr 21, 2026Apr 21, 2026
    • GEAK

      Public
      Generating Efficient AI-Centric Kernels
      Python
      MIT License
      16891230Updated Apr 21, 2026Apr 21, 2026
    • TraceLens

      Public
      Automating analysis from trace files
      Python
      MIT License
      11668314Updated Apr 21, 2026Apr 21, 2026
    • Primus-Turbo

      Public
      Python
      Other
      14651118Updated Apr 21, 2026Apr 21, 2026
    • torchtitan-amd

      Public
      A PyTorch native platform for training generative AI models
      Python
      BSD 3-Clause "New" or "Revised" License
      7921601Updated Apr 21, 2026Apr 21, 2026
    • Primus

      Public
      Python
      Other
      3189534Updated Apr 21, 2026Apr 21, 2026
    • Primus-DLRM

      Public
      DLRM implementation for Primus
      Python
      MIT License
      0001Updated Apr 21, 2026Apr 21, 2026
    • Toolkit for launching and observing MaxText training on Slurm-managed GPU clusters
      Shell
      MIT License
      22702Updated Apr 20, 2026Apr 20, 2026
    • AgentKernelArena

      Public
      AgentKernelArena provides an end-to-end siloed-benchmarking environment where different LLM-powered agents—such as Cursor Agent, Claude Code, Codex, SWE-agent, …
      Python
      Apache License 2.0
      31351Updated Apr 20, 2026Apr 20, 2026
    • Magpie

      Public
      A lightweight, general-purpose framework for evaluating GPU kernel correctness and performance.
      Python
      MIT License
      55111Updated Apr 20, 2026Apr 20, 2026
    • Apex

      Public
      Agents, and RL environment, for optimizing GPU kernels on AMD ROCm using LLM agents. Benchmarks LLM serving workloads end-to-end, profiles bottleneck kernels, o…
      Python
      MIT License
      96410Updated Apr 19, 2026Apr 19, 2026
    • FLy

      Public
      Python
      MIT License
      0110Updated Apr 1, 2026Apr 1, 2026
    • Reference implementations of MLPerf® inference benchmarks
      Python
      Apache License 2.0
      621000Updated Mar 16, 2026Mar 16, 2026
    • PARD

      Public
      PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation (ICLR 26)
      Python
      MIT License
      11910Updated Mar 13, 2026Mar 13, 2026
    • DUET-VLM

      Public
      DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
      Python
      Apache License 2.0
      12311Updated Mar 5, 2026Mar 5, 2026
    • nixl

      Public
      NVIDIA Inference Xfer Library (NIXL)
      C++
      Other
      295100Updated Feb 24, 2026Feb 24, 2026
    • dynamo

      Public
      A Datacenter Scale Distributed Inference Serving Framework
      Rust
      Other
      1k000Updated Feb 24, 2026Feb 24, 2026
    • Nitro-E

      Public
      Python
      MIT License
      1011921Updated Feb 24, 2026Feb 24, 2026
    • Repository for Showcasing DLRM v2 Functionality on AMD
      Python
      MIT License
      0000Updated Feb 16, 2026Feb 16, 2026
    • HummingbirdXT

      Public
      Python
      Apache License 2.0
      01500Updated Feb 10, 2026Feb 10, 2026
    • For world model code developing and releasing.
      Python
      Other
      55000Updated Feb 6, 2026Feb 6, 2026
    • TraceLens-inference

      Public archive
      Automating analysis from trace files
      Python
      MIT License
      11000Updated Feb 5, 2026Feb 5, 2026
    • axlearn

      Public
      An Extensible Deep Learning Library
      Python
      Apache License 2.0
      404100Updated Jan 29, 2026Jan 29, 2026
    • Neurips2025-GPU-kernels-Tutorial

      Public
      Repo containing artifacts for Neurips 2025 tutorial- How to Build Agents to Generate Kernels for Faster LLMs (and Other Models!)
      Jupyter Notebook
      MIT License
      21500Updated Jan 23, 2026Jan 23, 2026
    • Python
      Other
      0710Updated Jan 22, 2026Jan 22, 2026
    • Hummingbird

      Public
      AMD 0.9B efficient text to video diffusion model
      Python
      Other
      64511Updated Jan 12, 2026Jan 12, 2026
    • This is a short course covering GPU optimization techniques for LLM inference
      Python
      MIT License
      1100Updated Jan 11, 2026Jan 11, 2026
    • awesome-rocm-autodrive

      Public
      Examples of training autodrive models in ROCm
      Python
      Other
      1400Updated Jan 9, 2026Jan 9, 2026
    • GEAK-eval

      Public
      Python
      61390Updated Dec 24, 2025Dec 24, 2025
    • sand-pipeline

      Public
      Synthetic data generation pipeline, finetuning and evaluation scripts.
      Python
      Other
      1210Updated Dec 24, 2025Dec 24, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.