Vast.ai

GPU Marketplace Headquartered in United States Founded in 2018
Updated March 14, 2026

Vast.ai is a two-sided GPU compute marketplace that connects developers with 1,400+ independent hosts offering underutilized GPU hardware across 500+ locations worldwide. By aggregating supply from independent data centers and individual providers, Vast.ai delivers GPU compute at 40–80% less than traditional hyperscalers like AWS, Azure, and GCP.

The platform supports three deployment models: GPU Cloud (individual instances), Serverless (autoscaling inference endpoints), and Clusters (multi-node training). Users can spin up high-performance instances in seconds using Docker containers, choosing from 35+ GPU types ranging from consumer RTX cards to enterprise B200s. Pricing is set dynamically by the marketplace based on supply and demand.

Founded in 2018 by Jake Cannell, the company manages 20,000+ GPUs and achieved 310% growth in 2024. It holds SOC 2 Type 2 certification and is particularly popular among AI researchers, ML engineers, and indie developers who need affordable compute without long-term commitments.

Starting Price $0.06/hr Per hour
Max VRAM 192 GB Per GPU
Max GPUs 8 Per instance
Billing Per-second Granularity

GPU Hardware

GPU Models B200 H200 H100 SXM H100 NVL A100 SXM A100 PCIe RTX 5090 RTX 5080 RTX 5070 Ti RTX 6000 Pro RTX 6000 Ada RTX 4500 Ada RTX A6000 RTX A5000 RTX A4000 L40S L40 A40 A10 RTX 4090 RTX 4080 RTX 4070 Ti RTX 4070 RTX 4060 Ti RTX 4060 RTX 3090 Ti RTX 3090 RTX 3080 Ti RTX 3080 RTX 3070 Ti RTX 3070 Tesla V100 Tesla T4 A2 GTX 1080
Max VRAM 192 GB
Max GPUs per Instance 8
Interconnect NVLink, InfiniBand
Multi-Node Training Yes

Pricing

Starting Price $0.06/hr
Billing Granularity Per-second
Spot/Preemptible Yes
Reserved Discounts Up to 50% (1-6 month reserved)
Free Credits Small test credit on signup
Egress Fees Varies by host ($/TB)
Storage Varies by host ($/GB/hr, charged while instance exists)

Marketplace-driven pricing: hosts set their own rates based on supply and demand. Three tiers available — On-Demand (guaranteed uptime), Interruptible (50%+ cheaper via bidding), and Reserved (1/3/6-month terms). Budget: RTX 4060 from $0.06/hr. Mid-range: RTX 4090 from $0.29/hr, A100 from $0.67/hr. High-end: H100 from $1.55/hr, H200 from $1.97/hr, B200 from $2.67/hr. Note: storage is charged even when instances are stopped, and bandwidth fees apply per TB. $5 minimum deposit to start.

Infrastructure

Regions 500+ locations, 40+ data centers
Uptime SLA No formal SLA (host reliability scores visible)
Serverless / Autoscaling Yes
Private Networking / VPC Yes

Developer Experience

Pre-installed Frameworks PyTorch TensorFlow CUDA vLLM ComfyUI
Docker Support Yes
SSH Access Yes
Jupyter Notebooks Yes
API / CLI Yes
Setup Time Seconds
Kubernetes Support No
Custom Images / Templates Yes
Persistent Storage Yes

Business Terms

Min Commitment None
Compliance SOC 2 Type 2 HIPAA GDPR CCPA
Best For Training Inference Fine-tuning Batch Processing AI Research LLM Serving
Support Channels Live Chat (24/7) Discord Email Documentation
Payment Methods Credit Card Crypto (Coinbase Crypto.com)
VS

How does it compare?

Compare Vast.ai against other cloud GPU providers.

User Feedback

There are no public trader reviews for this firm yet. If you have traded with them, be the first to leave a short, honest review and help other traders.

Share Your Experience

Short, honest feedback helps other prop traders understand what it is really like to work with this firm.

By sending feedback you agree that your comment can be published on this page. Personal details such as email are never shown publicly.

Security check