What is the most affordable hourly rate for NVIDIA A16?
💡 Answer
The best-priced cloud NVIDIA A16 rate we track is $0.47 per hour on Vultr. That rate reflects on-demand billing; providers that support spot/preemptible instances bring the floor down to per hour — a saving of around up to 60% for workloads that can tolerate interruption.
If you can commit to reserved capacity, expect a further up to 40% off. For ad-hoc experiments we recommend on-demand; for multi-day pre-training budget a mix of spot and on-demand. Hourly quotes can shift weekly as providers compete, so check the live table before committing a long run.
Vultr offers the cheapest NVIDIA A16 rate at $0.47/hr; Cherry Servers is also available for comparison.
More FAQs about NVIDIA A16
Vultr vs Cherry Servers - GPU Provider Comparison (April 2026)
Head-to-head comparison of Vultr and Cherry Servers. Compare GPU models, hourly pricing, billing granularity, spot instances, VRAM, infrastructure, developer tools, Kubernetes support, and compliance before choosing a provider. Data refreshed April 2026.
|
Vultr
High-performance cloud GPU across 32 global regions
|
Cherry Servers
Bare metal GPU servers with 24 years of hosting experience and full hardware-level control.
|
|
|---|---|---|
| Overview | ||
| Trustpilot Rating | 1.8 | 4.6 |
| Headquarters | United States | Lithuania |
| Provider Type | Multi-Cloud | N/A |
| Best For | AI training inference video rendering HPC Stable Diffusion game development generative AI fine-tuning research | AI training inference fine-tuning rendering research HPC generative AI deep learning |
| GPU Hardware | ||
| GPU Models | A16 A40 L40S A100 PCIe GH200 A100 SXM H100 SXM B200 B300 MI300X MI325X MI355X | A100 A40 A16 A10 A2 Tesla P4 |
| Max VRAM (GB) | 288 | 80 |
| Max GPUs/Instance | 16 | 2 |
| Interconnect | NVLink | PCIe |
| Pricing | ||
| Starting Price ($/hr) | $0.47/hr | $0.16/hr |
| Billing Granularity | Per-hour | Per-hour |
| Spot/Preemptible | Yes | No |
| Reserved Discounts | N/A | N/A |
| Free Credits | Up to $300 free credit for 30 days | None |
| Egress Fees | Standard (varies by plan) | N/A |
| Storage | 350 GB - 61 TB NVMe (included), Block Storage at $0.10/GB/mo, S3-compatible Object Storage | NVMe SSD, Elastic Block Storage ($0.071/GB/mo) |
| Infrastructure | ||
| Regions | 32 regions across 6 continents (Americas, Europe, Asia, Australia, Africa) | Lithuania, Netherlands, Germany, Sweden, US, Singapore (6 locations) |
| Uptime SLA | 100% | 99.97% |
| Developer Experience | ||
| Frameworks | PyTorch TensorFlow CUDA cuDNN ROCm Hugging Face NVIDIA NGC | PyTorch TensorFlow CUDA (bare metal — full stack control) |
| Docker Support | Yes | Yes |
| SSH Access | Yes | Yes |
| Jupyter Notebooks | Yes | No |
| API / CLI | Yes | Yes |
| Setup Time | Minutes | Minutes |
| Kubernetes Support | Yes | Yes |
| Business Terms | ||
| Min Commitment | None | None |
| Compliance | SOC 2+ (HIPAA) PCI ISO 27001 ISO 27017 ISO 27018 ISO 20000-1 CSA STAR Level 1 | ISO 27001 ISO 20000-1 GDPR PCI DSS |
Vultr
Cherry Servers