What's the peak FP16 performance of NVIDIA GeForce RTX 3070?

Antwoord

Released in 2020, NVIDIA GeForce RTX 3070 is an Ampere-class accelerator with 8 GB of GDDR6, 448 GB/s of memory bandwidth, and 20.3 FP16 TFLOPS of compute. FP32 peaks at 10.2 TFLOPS and the card draws up to 220W.

In practical terms: enough VRAM to load models into the ~8B-parameter range in FP16 (larger with quantisation), enough bandwidth to avoid memory-starving attention layers, and enough compute to train transformers at batch sizes that saturate modern optimisers.

See the NVIDIA GeForce RTX 3070 page for the full spec sheet and comparisons to related GPUs.

Meer FAQs over NVIDIA GeForce RTX 3070

Verken NVIDIA GeForce RTX 3070