    Repositories list

    • vllm-fork

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      18k88031 Updated Jun 5, 2026
    • Python
      Apache License 2.0
      4816131 Updated Jun 4, 2026
    • C++
      Apache License 2.0
      71812 Updated May 6, 2026
    • slurm

      Public
      Slurm: A Highly Scalable Workload Manager
      C
      Other
      849300 Updated May 4, 2026
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      273604 Updated Apr 29, 2026
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      28k405 Updated Apr 22, 2026
    • Exporter that exposes Gaudi metrics for Prometheus
      Rust
      GNU General Public License v2.0
      0000 Updated Apr 16, 2026
    • Gaudi Feature Discovery for Kubernetes is a software component that allows you to automatically generate labels for the set of Gaudi accelerators available on a…
      Go
      Apache License 2.0
      0010 Updated Apr 16, 2026
    • Gaudi BMC metrics exporter for Prometheus
      Go
      Apache License 2.0
      0000 Updated Apr 16, 2026
    • Gaudi device plugin for Kubernetes is a DaemonSet that allows you to automatically expose the number of Gaudi devices on each node of your cluster, keep track …
      Go
      GNU General Public License v2.0
      0000 Updated Apr 16, 2026
    • Intel Gaudi Base Operator for Kubernetes automates the management of all necessary Intel Gaudi software components on a Kubernetes cluster.
      Go
      Apache License 2.0
      0000 Updated Apr 16, 2026
    • Gaudi aware container runtime, compatible with the Open Containers Initiative (OCI) specification used by Docker, CRI-O, and other popular container technologie…
      Go
      Apache License 2.0
      0000 Updated Apr 16, 2026
    • Setup and installation instructions for Habana binaries and Docker image creation
      Python
      Apache License 2.0
      182867 Updated Apr 14, 2026
    • Ongoing research training transformer models at scale
      Python
      Other
      4k600 Updated Apr 14, 2026
    • The friendly PIL fork
      Python
      Other
      2.5k000 Updated Feb 16, 2026
    • Apptainer: Application containers for Linux
      Go
      Other
      181000 Updated Feb 5, 2026
    • NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU
      C
      Other
      2109 Updated Feb 3, 2026
    • Container images with driver installer script
      Shell
      Apache License 2.0
      0000 Updated Jan 14, 2026
    • Model-References

      Public archive
      Reference models for Intel(R) Gaudi(R) AI Accelerator
      Python
      9117113 Updated Jan 8, 2026
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.9k1403 Updated Jan 8, 2026
    • C
      Other
      0200 Updated Dec 1, 2025
    • gohlml

      Public
      HABANA Management Library bindings for Go
      Go
      GNU General Public License v2.0
      4301 Updated Nov 24, 2025
    • Shell
      3200 Updated Oct 22, 2025
    • hccl_demo

      Public
      C++
      Apache License 2.0
      192402 Updated Oct 9, 2025
    • Gaudi-tutorials

      Public archive
      Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/
      Jupyter Notebook
      506566 Updated Sep 18, 2025
    • perftest

      Public
      Gaudi RDMA Performance Test
      Python
      2200 Updated Sep 16, 2025
    • C++
      BSD 3-Clause "New" or "Revised" License
      1200 Updated Sep 4, 2025
    • AutoGPTQ

      Public
      An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
      Python
      MIT License
      543002 Updated Sep 4, 2025
    • SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      6.4k000 Updated Sep 4, 2025
    • HCL

      Public
      C++
      7900 Updated Jul 31, 2025