gpu

Star

Here are 862 public repositories matching this topic...

pytorch / pytorch

Star

Tensors and Dynamic neural networks in Python with strong GPU acceleration

python machine-learning deep-learning neural-network gpu numpy autograd tensor

Updated Apr 28, 2025
Python

deepspeedai / DeepSpeed

Star

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

machine-learning compression deep-learning gpu inference pytorch zero data-parallelism model-parallelism mixture-of-experts pipeline-parallelism billion-parameters trillion-parameters

Updated Apr 28, 2025
Python

plasma-umass / scalene

Sponsor

Star

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

python cpu profiler gpu performance-analysis memory-allocation profiling cpu-profiling memory-consumption gpu-programming python-profilers scalene profiles-memory performance-cpu

Updated Apr 18, 2025
Python

apache / tvm

Star

Open deep learning compiler stack for cpu, gpu and specialized accelerators

javascript machine-learning performance deep-learning metal compiler gpu vulkan opencl tensor spirv rocm tvm

Updated Apr 28, 2025
Python

cupy / cupy

Sponsor

Star

NumPy & SciPy for GPU

python gpu numpy cuda cublas scipy tensor cudnn rocm cupy cusolver nccl curand cusparse nvrtc cutensor nvtx cusparselt

Updated Apr 24, 2025
Python

triton-inference-server / server

Star

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

machine-learning cloud deep-learning gpu inference edge datacenter

Updated Apr 28, 2025
Python

OlafenwaMoses / ImageAI

Sponsor

Star

A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

python machine-learning algorithm video gpu detection prediction python3 artificial-intelligence artificial-neural-networks image-recognition densenet object-detection squeezenet inceptionv3 offline-capable image-prediction imageai ai-practice-recommendations

Updated Aug 3, 2024
Python

MVIG-SJTU / AlphaPose

Star

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Updated May 13, 2024
Python

skypilot-org / skypilot

Star

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Updated Apr 28, 2025
Python

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

gpu transformers pytorch llm

Updated Apr 28, 2025
Python

chainer / chainer

Star

A flexible framework of neural networks for deep learning

python machine-learning deep-learning neural-network chainer gpu numpy cuda neural-networks cudnn cupy

Updated Aug 28, 2023
Python

XuehaiPan / nvitop

Star

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

console monitoring gpu grafana cuda prometheus nvidia prometheus-exporter curses nvml top command-line-tool htop grafana-dashboard nvidia-smi monitoring-tool process-monitoring gpu-monitoring resource-monitor

Updated Apr 25, 2025
Python

google / tf-quant-finance

Star

High-performance TensorFlow library for quantitative finance.

python finance tensorflow gpu high-performance quantlib high-performance-computing gpu-computing quantitative-finance numerical-methods numerical-optimization numerical-integration

Updated Mar 21, 2025
Python

sktime / pytorch-forecasting

Sponsor

Star

Time series forecasting with PyTorch

python data-science machine-learning ai timeseries deep-learning gpu pandas pytorch uncertainty neural-networks forecasting temporal artifical-intelligense timeseries-forecasting pytorch-lightning

Updated Apr 18, 2025
Python

wookayin / gpustat

Star

📊 A simple command-line utility for querying and monitoring GPU status

python monitoring command-line gpu nvidia-smi gpustat

Updated Apr 13, 2025
Python

tlkh / asitop

Star

Perf monitoring CLI tool for Apple Silicon

macos cli cpu gpu m1 apple-silicon

Updated Apr 18, 2024
Python

Jittor / jittor

Star

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

python deep-learning gpu cuda jittor

Updated Apr 22, 2025
Python

pytorch / executorch

Star

On-device AI across mobile, embedded and edge for PyTorch

machine-learning mobile embedded deep-learning neural-network gpu tensor

Updated Apr 28, 2025
Python

leptonai / leptonai

Star

A Pythonic framework to simplify AI service building

python machine-learning cloud deep-learning gpu artificial-intelligence

Updated Apr 17, 2025
Python

NVIDIA / TransformerEngine

Star

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

python machine-learning deep-learning gpu cuda pytorch jax fp8

Updated Apr 25, 2025
Python

Improve this page

Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpu

Here are 862 public repositories matching this topic...

pytorch / pytorch

deepspeedai / DeepSpeed

plasma-umass / scalene

apache / tvm

cupy / cupy

triton-inference-server / server

OlafenwaMoses / ImageAI

MVIG-SJTU / AlphaPose

skypilot-org / skypilot

intel / ipex-llm

chainer / chainer

XuehaiPan / nvitop

google / tf-quant-finance

sktime / pytorch-forecasting

wookayin / gpustat

tlkh / asitop

Jittor / jittor

pytorch / executorch

leptonai / leptonai

NVIDIA / TransformerEngine

Improve this page

Add this topic to your repo