Which NVIDIA and AMD GPUs are available at RunPod?

💡 Answer

As of April 9, 2026, RunPod provides access to the following GPU models:

B300, B200, H200, H100 SXM, H100 PCIe, H100 NVL, MI300X, A100 SXM, A100 PCIe, RTX 5090, RTX PRO 6000, L40S, L40, RTX 6000 Ada, RTX 5000 Ada, RTX A6000, RTX A5000, RTX 4090, RTX 4080 SUPER, RTX 4080, RTX 4070 Ti, RTX 3090 Ti, RTX 3090, RTX 3080 Ti, RTX 3080, RTX 3070, A40, A30, A2, L4

For memory-intensive workloads such as large language model training, RunPod's highest-VRAM option is 288 GB (the B300). Multi-GPU instances scale to 8 GPUs with NVLink interconnect for efficient parallel computation.
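Once a pod is running, you can sanity-check the allocation from inside the container. Here is a minimal sketch using PyTorch, assuming the pod image ships a CUDA-enabled PyTorch; note that `torch.cuda.can_device_access_peer` reports whether peer-to-peer access is possible, not whether the link is NVLink specifically:

```python
import torch

# Enumerate the GPUs visible to this pod and report per-device VRAM.
if not torch.cuda.is_available():
    raise SystemExit("No CUDA devices visible; check the pod's GPU configuration.")

for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    print(f"GPU {i}: {props.name}, {props.total_memory / 1024**3:.0f} GB VRAM")

# On multi-GPU pods, probe peer-to-peer access between the first two devices.
# This confirms P2P is enabled, though not the physical link type.
if torch.cuda.device_count() > 1:
    print("P2P between GPU 0 and GPU 1:", torch.cuda.can_device_access_peer(0, 1))
```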

See which GPU models are currently in stock on RunPod's official website.

Guides Where RunPod Is Featured

These guides compare RunPod with other cloud GPU providers across hardware, pricing, features, and infrastructure.

RunPod vs Massed Compute vs Latitude.sh - GPU Provider Comparison (April 2026)

Side-by-side comparison of RunPod vs Massed Compute vs Latitude.sh. Quickly scan GPU models, VRAM, pricing, billing granularity, free credits, storage costs, regions, uptime SLAs, developer tooling, and compliance to narrow down your cloud GPU shortlist. Data updated April 2026.

| | RunPod | Massed Compute | Latitude.sh |
| --- | --- | --- | --- |
| Tagline | The cloud built for AI: deploy and scale GPU workloads from serverless inference to instant multi-node clusters on demand. | GPU cloud with direct engineer support | Bare metal GPU cloud across 23 global locations |
| **Overview** | | | |
| Trustpilot Rating | 3.8 | Not rated | 3.7 |
| Headquarters | United States | United States | Brazil |
| Provider Type | GPU-focused | GPU-focused | Bare metal |
| Best For | AI training, inference, fine-tuning, Stable Diffusion, batch processing, rendering, research, LLM serving, generative AI | AI training, inference, VFX rendering, generative AI, fine-tuning, HPC, Stable Diffusion, research | AI training, inference, bare metal GPU, fine-tuning, research, dedicated workloads, generative AI |
| **GPU Hardware** | | | |
| GPU Models | B300, B200, H200, H100 SXM, H100 PCIe, H100 NVL, MI300X, A100 SXM, A100 PCIe, RTX 5090, RTX PRO 6000, L40S, L40, RTX 6000 Ada, RTX 5000 Ada, RTX A6000, RTX A5000, RTX 4090, RTX 4080 SUPER, RTX 4080, RTX 4070 Ti, RTX 3090 Ti, RTX 3090, RTX 3080 Ti, RTX 3080, RTX 3070, A40, A30, A2, L4 | A30, RTX A5000, RTX A6000, L40S, A100 SXM, H100 PCIe, H100 SXM, H100 NVL, RTX PRO 6000, H200 NVL | A30, RTX A5000, RTX A6000, L40S, RTX 6000 Ada, A100 SXM, H100 SXM, GH200, RTX PRO 6000 |
| Max VRAM (GB) | 288 | 141 | 96 |
| Max GPUs/Instance | 8 | 8 | 8 |
| Interconnect | NVLink | NVLink | NVLink |
| **Pricing** | | | |
| Starting Price ($/hr) | $0.06 | $0.35 | $0.35 |
| Billing Granularity | Per-second | Per-minute | Per-hour |
| Spot/Preemptible | Yes | No | No |
| Reserved Discounts | 15-29% (1-month to 1-year plans) | N/A | N/A |
| Free Credits | $5-$500 bonus after first $10 spend | None | $200 via referral program |
| Egress Fees | None | None | None |
| Storage | Container/Volume $0.10/GB/mo, Idle Volume $0.20/GB/mo, Network Storage $0.07/GB/mo (under 1 TB) | Local NVMe included with instances | Local NVMe included (up to 4x 3.8 TB), Block Storage $0.10/GB/mo, Filesystem Storage $0.05/GB/mo |
| **Infrastructure** | | | |
| Regions | 31 global regions | United States (Tier III data centers) | 23 locations: US (8 cities), LATAM (5), Europe (5), APAC (4), Mexico City; GPU instances in Dallas, Frankfurt, Sydney, Tokyo |
| Uptime SLA | 99.99% | Tier III (99.98% design) | 99.9% |
| **Developer Experience** | | | |
| Frameworks | PyTorch, TensorFlow, JAX, ONNX, CUDA | PyTorch, TensorFlow, CUDA, cuDNN, ComfyUI, pre-configured ML templates | ML-optimized images, PyTorch, TensorFlow (user-installed), CUDA |
| Docker Support | Yes | Yes | Yes |
| SSH Access | Yes | Yes | Yes |
| Jupyter Notebooks | Yes | No | No |
| API / CLI | Yes | Yes | Yes |
| Setup Time | Instant | Minutes | Seconds |
| Kubernetes Support | No | No | No |
| **Business Terms** | | | |
| Min Commitment | None | None | None |
| Compliance | SOC 2 Type II | SOC 2 Type II, HIPAA | Single-tenant isolation, DPA available |
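Billing granularity matters most for short jobs. Below is a back-of-envelope sketch using the illustrative starting prices from the table above; actual per-GPU rates vary by model and region, so treat the numbers as placeholders:

```python
import math

def job_cost(rate_per_hour: float, job_seconds: int, granularity_seconds: int) -> float:
    """Bill in whole units of `granularity_seconds`, rounding up."""
    billed_units = math.ceil(job_seconds / granularity_seconds)
    return rate_per_hour * billed_units * granularity_seconds / 3600

job = 10 * 60 + 30  # a 10.5-minute job

# Illustrative rates only; real per-GPU pricing differs per provider.
print(f"Per-second @ $0.06/hr: ${job_cost(0.06, job, 1):.4f}")     # ~$0.0105
print(f"Per-minute @ $0.35/hr: ${job_cost(0.35, job, 60):.4f}")    # ~$0.0642
print(f"Per-hour   @ $0.35/hr: ${job_cost(0.35, job, 3600):.4f}")  # $0.3500
```

The point of the sketch: coarser granularity rounds a 10.5-minute job up to a full hour, so per-hour billing charges roughly 5x what per-minute billing does at the same hourly rate.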