NVIDIA A16 full datasheet — the specs that matter for deep learning
💡 Answer
NVIDIA A16 is a 2021-generation Ampere card with 64 GB of GDDR6 memory and 800 GB/s bandwidth. Compute peaks at 72 FP16 TFLOPS and 18 FP32 TFLOPS; TDP sits at 250W.
The VRAM/bandwidth pairing is the defining feature for machine learning work — it determines what model sizes are accessible and how hard the card can be pushed during production inference. Power draw and cooling requirements mean most NVIDIA A16 deployments live in data centres rather than workstations, which is why most NVIDIA A16 access in practice comes via the cloud.
Two tracked cloud providers currently offer NVIDIA A16: Vultr and Cherry Servers. Vultr has the cheaper rate at $0.47/hr.
More FAQs about NVIDIA A16
Vultr vs Cherry Servers - GPU Provider Comparison (April 2026)
Head-to-head comparison of Vultr and Cherry Servers. Compare GPU models, hourly pricing, billing granularity, spot instances, VRAM, infrastructure, developer tools, Kubernetes support, and compliance before choosing a provider. Data refreshed April 2026.
|
Vultr
High-performance cloud GPU across 32 global regions
|
Cherry Servers
Bare metal GPU servers with 24 years of hosting experience and full hardware-level control.
|
|
|---|---|---|
| Overview | ||
| Trustpilot Rating | 1.8 | 4.6 |
| Headquarters | United States | Lithuania |
| Provider Type | Multi-Cloud | N/A |
| Best For | AI training inference video rendering HPC Stable Diffusion game development generative AI fine-tuning research | AI training inference fine-tuning rendering research HPC generative AI deep learning |
| GPU Hardware | ||
| GPU Models | A16 A40 L40S A100 PCIe GH200 A100 SXM H100 SXM B200 B300 MI300X MI325X MI355X | A100 A40 A16 A10 A2 Tesla P4 |
| Max VRAM (GB) | 288 | 80 |
| Max GPUs/Instance | 16 | 2 |
| Interconnect | NVLink | PCIe |
| Pricing | ||
| Starting Price ($/hr) | $0.47/hr | $0.16/hr |
| Billing Granularity | Per-hour | Per-hour |
| Spot/Preemptible | Yes | No |
| Reserved Discounts | N/A | N/A |
| Free Credits | Up to $300 free credit for 30 days | None |
| Egress Fees | Standard (varies by plan) | N/A |
| Storage | 350 GB - 61 TB NVMe (included), Block Storage at $0.10/GB/mo, S3-compatible Object Storage | NVMe SSD, Elastic Block Storage ($0.071/GB/mo) |
| Infrastructure | ||
| Regions | 32 regions across 6 continents (Americas, Europe, Asia, Australia, Africa) | Lithuania, Netherlands, Germany, Sweden, US, Singapore (6 locations) |
| Uptime SLA | 100% | 99.97% |
| Developer Experience | ||
| Frameworks | PyTorch TensorFlow CUDA cuDNN ROCm Hugging Face NVIDIA NGC | PyTorch TensorFlow CUDA (bare metal — full stack control) |
| Docker Support | Yes | Yes |
| SSH Access | Yes | Yes |
| Jupyter Notebooks | Yes | No |
| API / CLI | Yes | Yes |
| Setup Time | Minutes | Minutes |
| Kubernetes Support | Yes | Yes |
| Business Terms | ||
| Min Commitment | None | None |
| Compliance | SOC 2+ (HIPAA) PCI ISO 27001 ISO 27017 ISO 27018 ISO 20000-1 CSA STAR Level 1 | ISO 27001 ISO 20000-1 GDPR PCI DSS |
Vultr
Cherry Servers