P100 benchmark results

GPU: Tesla P100-PCIE  |  VRAM: 16 GB  |  Architecture: Pascal (sm_60)

Prompt: "a turtle and a bird together in a forest"  |  Resolution: 512 × 512  |  Replicates: 5 timed runs

Results

Load time and generation time are each the median of 5 timed replicates, in seconds. Load time is measured from a warm cache (model already downloaded). Peak GPU VRAM and peak system RAM are reported in GB. OOM = out of memory during load or inference.
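As an illustration of the "median of 5 timed replicates" measurement described above, here is a minimal sketch using only the Python standard library. The function name `median_time` and its parameters are hypothetical and are not taken from the benchmark's own code, which is not shown on this page.

```python
import statistics
import time


def median_time(fn, replicates=5):
    """Call fn() `replicates` times and return the median wall-clock time in seconds.

    Hypothetical helper for illustration; not the benchmark's actual harness.
    """
    durations = []
    for _ in range(replicates):
        start = time.perf_counter()  # monotonic, high-resolution timer
        fn()
        durations.append(time.perf_counter() - start)
    # The median is less sensitive than the mean to a single slow outlier run.
    return statistics.median(durations)
```

In a real run, `fn` would wrap either the model load or a single image generation. For GPU work, the device would also need to be synchronized before each timer read so that asynchronous kernels are included in the measurement.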

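Model              | Mode               | Load time (s) | Gen time (s) | Peak GPU VRAM (GB) | Peak system RAM (GB)
------------------ | ------------------ | ------------- | ------------ | ------------------ | --------------------
SD 1.4             | GPU only           | 5.4           | 8.6          | 3.44               | 3.2
SD 1.4             | Model offload      | 5.4           | 10.4         | 2.64               | 5.4
SD 1.4             | Sequential offload | 5.4           | 37.8         | 1.43               | 6.9
SD 2.1 Base        | GPU only           | 5.2           | 7.8          | 3.28               | 7.3
SD 2.1 Base        | Model offload      | 5.2           | 9.5          | 2.09               | 7.5
SD 2.1 Base        | Sequential offload | 4.8           | 38.8         | 0.86               | 8.8
SDXL               | GPU only           | 6.3           | 14.1         | 7.48               | 11.9
SDXL               | Model offload      | 6.3           | 18.2         | 5.34               | 13.7
SDXL               | Sequential offload | 5.4           | 74.9         | 0.76               | 18.6
SDXL Turbo         | GPU only           | 4.6           | 1.3          | 7.48               | 19.1
SDXL Turbo         | Model offload      | 4.6           | 5.5          | 5.32               | 19.1
SDXL Turbo         | Sequential offload | 5.3           | 10.5         | 0.76               | 20.2
SD 3.5 Medium      | GPU only           | OOM           | OOM          | OOM                |
SD 3.5 Medium      | Model offload      | 6.5           | 41.1         | 12.06              | 34.5
SD 3.5 Medium      | Sequential offload | 5.0           | 66.1         | 1.29               | 48.5
SD 3.5 Large Turbo | GPU only           | OOM           | OOM          | OOM                |
SD 3.5 Large Turbo | Model offload      | 7.0           | OOM          | OOM                |
SD 3.5 Large Turbo | Sequential offload | 6.8           | 33.6         | 1.04               | 72.1
Kandinsky 2.2      | GPU only           | 4.6           | 8.5          | 10.00              | 54.2
Kandinsky 2.2      | Model offload      | 4.6           | 14.8         | 5.37               | 29.6
Kandinsky 2.2      | Sequential offload | 4.6           | 44.8         | 2.74               | 33.8
PixArt XL 512      | GPU only           | 3.7           | 9.0          | 12.91              | 32.6
PixArt XL 512      | Model offload      | 3.7           | 21.2         | 10.78              | 37.9
PixArt XL 512      | Sequential offload | 2.5           | 22.4         | 0.87               | 62.6
FLUX.1 Schnell     | GPU only           | OOM           | OOM          | OOM                |
FLUX.1 Schnell     | Model offload      | 7.9           | OOM          | OOM                |
FLUX.1 Schnell     | Sequential offload | 7.6           | 38.2         | 0.86               | 87.6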
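
This page does not say how the three modes were implemented. Assuming the benchmark used the Hugging Face diffusers library (an assumption, not stated in the source), the modes would plausibly map onto the pipeline methods `to("cuda")`, `enable_model_cpu_offload()`, and `enable_sequential_cpu_offload()`. The sketch below shows that mapping; the function name `apply_memory_mode` is hypothetical.

```python
def apply_memory_mode(pipe, mode):
    """Configure a diffusers-style pipeline for one of the three table modes.

    Assumption: `pipe` behaves like a diffusers DiffusionPipeline and exposes
    `to`, `enable_model_cpu_offload`, and `enable_sequential_cpu_offload`.
    The mode strings mirror the "Mode" column of the results table.
    """
    if mode == "GPU only":
        # Keep every sub-model resident on the GPU for the whole run.
        pipe.to("cuda")
    elif mode == "Model offload":
        # Move one whole sub-model at a time to the GPU, keeping the rest on CPU.
        pipe.enable_model_cpu_offload()
    elif mode == "Sequential offload":
        # Move individual submodules to the GPU only while they execute.
        pipe.enable_sequential_cpu_offload()
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    return pipe
```

Under that assumption, the finer-grained transfers of sequential offload would be consistent with the pattern in the table: the lowest peak VRAM in every row group, paired with generation times well above the GPU-only runs.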

Image gallery

Five replicates per model.

SD 1.4

SD 2.1 Base

SDXL

SDXL Turbo

SD 3.5 Medium

SD 3.5 Large Turbo

Kandinsky 2.2

PixArt XL 512

FLUX.1 Schnell


Raw data: benchmark_results.json