Vast.ai
Vast.ai is a two-sided GPU compute marketplace that connects developers with 1,400+ independent hosts offering underutilized GPU hardware across 500+ locations worldwide. By aggregating supply from independent data centers and individual providers, Vast.ai delivers GPU compute at 40–80% less than traditional hyperscalers like AWS, Azure, and GCP.
The platform supports three deployment models: GPU Cloud (individual instances), Serverless (autoscaling inference endpoints), and Clusters (multi-node training). Users can spin up high-performance instances in seconds using Docker containers, choosing from 35+ GPU types ranging from consumer RTX cards to enterprise B200s. Pricing is set dynamically by the marketplace based on supply and demand.
Founded in 2018 by Jake Cannell, the company manages 20,000+ GPUs and achieved 310% growth in 2024. It holds SOC 2 Type 2 certification and is particularly popular among AI researchers, ML engineers, and indie developers who need affordable compute without long-term commitments.
GPU Hardware
| GPU Models | B200 H200 H100 SXM H100 NVL A100 SXM A100 PCIe RTX 5090 RTX 5080 RTX 5070 Ti RTX 6000 Pro RTX 6000 Ada RTX 4500 Ada RTX A6000 RTX A5000 RTX A4000 L40S L40 A40 A10 RTX 4090 RTX 4080 RTX 4070 Ti RTX 4070 RTX 4060 Ti RTX 4060 RTX 3090 Ti RTX 3090 RTX 3080 Ti RTX 3080 RTX 3070 Ti RTX 3070 Tesla V100 Tesla T4 A2 GTX 1080 |
| Max VRAM | 192 GB |
| Max GPUs per Instance | 8 |
| Interconnect | NVLink, InfiniBand |
| Multi-Node Training | Yes |
Pricing
| Starting Price | $0.06/hr |
| Billing Granularity | Per-second |
| Spot/Preemptible | Yes |
| Reserved Discounts | Up to 50% (1-6 month reserved) |
| Free Credits | Small test credit on signup |
| Egress Fees | Varies by host ($/TB) |
| Storage | Varies by host ($/GB/hr, charged while instance exists) |
Marketplace-driven pricing: hosts set their own rates based on supply and demand. Three tiers available — On-Demand (guaranteed uptime), Interruptible (50%+ cheaper via bidding), and Reserved (1/3/6-month terms). Budget: RTX 4060 from $0.06/hr. Mid-range: RTX 4090 from $0.29/hr, A100 from $0.67/hr. High-end: H100 from $1.55/hr, H200 from $1.97/hr, B200 from $2.67/hr. Note: storage is charged even when instances are stopped, and bandwidth fees apply per TB. $5 minimum deposit to start.
Infrastructure
| Regions | 500+ locations, 40+ data centers |
| Uptime SLA | No formal SLA (host reliability scores visible) |
| Serverless / Autoscaling | Yes |
| Private Networking / VPC | Yes |
Developer Experience
| Pre-installed Frameworks | PyTorch TensorFlow CUDA vLLM ComfyUI |
| Docker Support | Yes |
| SSH Access | Yes |
| Jupyter Notebooks | Yes |
| API / CLI | Yes |
| Setup Time | Seconds |
| Kubernetes Support | No |
| Custom Images / Templates | Yes |
| Persistent Storage | Yes |
Business Terms
| Min Commitment | None |
| Compliance | SOC 2 Type 2 HIPAA GDPR CCPA |
| Best For | Training Inference Fine-tuning Batch Processing AI Research LLM Serving |
| Support Channels | Live Chat (24/7) Discord Email Documentation |
| Payment Methods | Credit Card Crypto (Coinbase Crypto.com) |
How does it compare?
Compare Vast.ai against other cloud GPU providers.
User Feedback
There are no public trader reviews for this firm yet. If you have traded with them, be the first to leave a short, honest review and help other traders.
Share Your Experience
Short, honest feedback helps other prop traders understand what it is really like to work with this firm.