Vast.ai

GPU Marketplace Headquartered in United States Founded in 2018

Updated March 14, 2026

Vast.ai is a two-sided GPU compute marketplace that connects developers with 1,400+ independent hosts offering underutilized GPU hardware across 500+ locations worldwide. By aggregating supply from independent data centers and individual providers, Vast.ai delivers GPU compute at 40–80% less than traditional hyperscalers like AWS, Azure, and GCP.

The platform supports three deployment models: GPU Cloud (individual instances), Serverless (autoscaling inference endpoints), and Clusters (multi-node training). Users can spin up high-performance instances in seconds using Docker containers, choosing from 35+ GPU types ranging from consumer RTX cards to enterprise B200s. Pricing is set dynamically by the marketplace based on supply and demand.

Founded in 2018 by Jake Cannell, the company manages 20,000+ GPUs and achieved 310% growth in 2024. It holds SOC 2 Type 2 certification and is particularly popular among AI researchers, ML engineers, and indie developers who need affordable compute without long-term commitments.

Compare FAQs

4.4

★★★★☆

Based on 197 reviews Visit Website ↗

Starting Price $0.06/hr Per hour

Max VRAM 192 GB Per GPU

Max GPUs 8 Per instance

Billing Per-second Granularity

Hardware Pricing Infrastructure Developer Tools Business Terms Comparison Feedback

GPU Hardware

GPU Models	B200 H200 H100 SXM H100 NVL A100 SXM A100 PCIe RTX 5090 RTX 5080 RTX 5070 Ti RTX 6000 Pro RTX 6000 Ada RTX 4500 Ada RTX A6000 RTX A5000 RTX A4000 L40S L40 A40 A10 RTX 4090 RTX 4080 RTX 4070 Ti RTX 4070 RTX 4060 Ti RTX 4060 RTX 3090 Ti RTX 3090 RTX 3080 Ti RTX 3080 RTX 3070 Ti RTX 3070 Tesla V100 Tesla T4 A2 GTX 1080
Max VRAM	192 GB
Max GPUs per Instance	8
Interconnect	NVLink, InfiniBand
Multi-Node Training	Yes

Pricing

Starting Price	$0.06/hr
Billing Granularity	Per-second
Spot/Preemptible	Yes
Reserved Discounts	Up to 50% (1-6 month reserved)
Free Credits	Small test credit on signup
Egress Fees	Varies by host ($/TB)
Storage	Varies by host ($/GB/hr, charged while instance exists)

Marketplace-driven pricing: hosts set their own rates based on supply and demand. Three tiers available — On-Demand (guaranteed uptime), Interruptible (50%+ cheaper via bidding), and Reserved (1/3/6-month terms). Budget: RTX 4060 from $0.06/hr. Mid-range: RTX 4090 from $0.29/hr, A100 from $0.67/hr. High-end: H100 from $1.55/hr, H200 from $1.97/hr, B200 from $2.67/hr. Note: storage is charged even when instances are stopped, and bandwidth fees apply per TB. $5 minimum deposit to start.

Infrastructure

Regions	500+ locations, 40+ data centers
Uptime SLA	No formal SLA (host reliability scores visible)
Serverless / Autoscaling	Yes
Private Networking / VPC	Yes

Developer Experience

Pre-installed Frameworks	PyTorch TensorFlow CUDA vLLM ComfyUI
Docker Support	Yes
SSH Access	Yes
Jupyter Notebooks	Yes
API / CLI	Yes
Setup Time	Seconds
Kubernetes Support	No
Custom Images / Templates	Yes
Persistent Storage	Yes

Business Terms

Min Commitment	None
Compliance	SOC 2 Type 2 HIPAA GDPR CCPA
Best For	Training Inference Fine-tuning Batch Processing AI Research LLM Serving
Support Channels	Live Chat (24/7) Discord Email Documentation
Payment Methods	Credit Card Crypto (Coinbase Crypto.com)

How does it compare?

Compare Vast.ai against other cloud GPU providers.

Vast.ai

Latitude.sh

Massed Compute

RunPod

Google Cloud

Genesis Cloud

FluidStack

User Feedback

There are no public trader reviews for this firm yet. If you have traded with them, be the first to leave a short, honest review and help other traders.

Vast.ai

GPU Hardware

Pricing

Infrastructure

Developer Experience

Business Terms

How does it compare?

User Feedback

Share Your Experience

Cancel reply