NVIDIA A16 full datasheet — the specs that matter for deep learning

💡 Answer

NVIDIA A16 is a 2021-generation Ampere card with 64 GB of GDDR6 memory and 800 GB/s bandwidth. Compute peaks at 72 FP16 TFLOPS and 18 FP32 TFLOPS; TDP sits at 250W.

The VRAM/bandwidth pairing is the defining feature for machine learning work — it determines what model sizes are accessible and how hard the card can be pushed during production inference. Power draw and cooling requirements mean most NVIDIA A16 deployments live in data centres rather than workstations, which is why most NVIDIA A16 access in practice comes via the cloud.

Two tracked cloud providers currently offer NVIDIA A16: Vultr and Cherry Servers. Vultr has the cheaper rate at $0.47/hr.

More FAQs about NVIDIA A16

Vultr vs Cherry Servers - GPU Provider Comparison (April 2026)

Head-to-head comparison of Vultr and Cherry Servers. Compare GPU models, hourly pricing, billing granularity, spot instances, VRAM, infrastructure, developer tools, Kubernetes support, and compliance before choosing a provider. Data refreshed April 2026.

Vultr vs Cherry Servers - GPU Provider Comparison (April 2026)
	Vultr High-performance cloud GPU across 32 global regions Visit Vultr	Cherry Servers Bare metal GPU servers with 24 years of hosting experience and full hardware-level control. Visit Cherry Servers
Overview
Trustpilot Rating	1.8	4.6
Headquarters	United States	Lithuania
Provider Type	Multi-Cloud	N/A
Best For	AI training inference video rendering HPC Stable Diffusion game development generative AI fine-tuning research	AI training inference fine-tuning rendering research HPC generative AI deep learning
GPU Hardware
GPU Models	A16 A40 L40S A100 PCIe GH200 A100 SXM H100 SXM B200 B300 MI300X MI325X MI355X	A100 A40 A16 A10 A2 Tesla P4
Max VRAM (GB)	288	80
Max GPUs/Instance	16	2
Interconnect	NVLink	PCIe
Pricing
Starting Price ($/hr)	$0.47/hr	$0.16/hr
Billing Granularity	Per-hour	Per-hour
Spot/Preemptible	Yes	No
Reserved Discounts	N/A	N/A
Free Credits	Up to $300 free credit for 30 days	None
Egress Fees	Standard (varies by plan)	N/A
Storage	350 GB - 61 TB NVMe (included), Block Storage at $0.10/GB/mo, S3-compatible Object Storage	NVMe SSD, Elastic Block Storage ($0.071/GB/mo)
Infrastructure
Regions	32 regions across 6 continents (Americas, Europe, Asia, Australia, Africa)	Lithuania, Netherlands, Germany, Sweden, US, Singapore (6 locations)
Uptime SLA	100%	99.97%
Developer Experience
Frameworks	PyTorch TensorFlow CUDA cuDNN ROCm Hugging Face NVIDIA NGC	PyTorch TensorFlow CUDA (bare metal — full stack control)
Docker Support	Yes	Yes
SSH Access	Yes	Yes
Jupyter Notebooks	Yes	No
API / CLI	Yes	Yes
Setup Time	Minutes	Minutes
Kubernetes Support	Yes	Yes
Business Terms
Min Commitment	None	None
Compliance	SOC 2+ (HIPAA) PCI ISO 27001 ISO 27017 ISO 27018 ISO 20000-1 CSA STAR Level 1	ISO 27001 ISO 20000-1 GDPR PCI DSS

Vultr

Cherry Servers

💡 Answer

More FAQs about NVIDIA A16

Vultr vs Cherry Servers - GPU Provider Comparison (April 2026)

Explore NVIDIA A16