Energy Analysis
The Hidden Cost: GPU Electricity Expense Per Training Run
An H100 draws 700W at peak. An 8-GPU cluster running 24/7 costs $4,900/year in electricity alone. But nobody tracks this cost — it doesn't appear on your cloud bill, and most on-premise teams can't attribute it to specific workloads.
GPU Power Draw: The Numbers
Every GPU has a Thermal Design Power (TDP) — the maximum sustained power draw. But real power consumption depends on workload intensity, memory access patterns, and thermal conditions. Here are the actual TDP ratings for popular AI GPUs:
| GPU | TDP (W) | Idle (~15%) | FP16 TFLOPS | Memory | Architecture |
|---|---|---|---|---|---|
| H100 SXM | 700W | 105W | 989 | 80 GB | Hopper |
| H200 SXM | 700W | 105W | 989 | 141 GB | Hopper |
| A100 SXM 80GB | 400W | 60W | 312 | 80 GB | Ampere |
| A100 SXM 40GB | 400W | 60W | 312 | 40 GB | Ampere |
| RTX 4090 | 450W | 68W | 165 | 24 GB | Ada Lovelace |
| L40S | 350W | 53W | 362 | 48 GB | Ada Lovelace |
| AMD MI300X | 750W | 113W | 1307 | 192 GB | CDNA3 |
| AMD MI250X | 500W | 75W | 383 | 128 GB | CDNA2 |
| T4 | 70W | 11W | 65 | 16 GB | Turing |
Key insight: an idle GPU still draws 15% of TDP. An idle H100 consumes ~105W — costing $0.013/hr in electricity even when doing nothing. Across an 8-GPU cluster sitting idle for a month, that's $73 in electricity for zero compute.
Electricity Cost Per Training Run
We modeled energy consumption for common ML workloads using real TDP data and typical utilization profiles. All costs use the US average electricity rate of $0.12/kWh (European rates at $0.25/kWh shown for comparison).
| Workload | GPUs | Duration | kWh | US Cost | EU Cost | CO₂ (kg) |
|---|---|---|---|---|---|---|
Fine-tune Llama 3 8B (LoRA) LoRA fine-tune on custom dataset, 3 epochs | 1x RTX 4090 | 4h | 1 | $0.16 | $0.34 | 0.5 |
Fine-tune Llama 3 70B (QLoRA) QLoRA 4-bit, distributed across 4 GPUs | 4x A100 SXM 80GB | 12h | 13 | $1.54 | $3.22 | 5.0 |
Pre-train 7B model from scratch Full pre-training, 1 week on 8-GPU node | 8x H100 SXM | 7d | 776 | $93.14 | $194.04 | 298.8 |
Pre-train 70B model Full pre-training, ~1 month, 8-node cluster | 64x H100 SXM | 30d | 27,689 | $3322.63 | $6922.14 | 10660.1 |
Inference server (vLLM, 24/7) Serving a 70B model, typical production load | 2x A100 SXM 80GB | 1 mo | 128 | $15.42 | $32.12 | 49.5 |
Idle cluster (95th percentile reality) 8 GPUs sitting idle — the industry average | 8x H100 SXM | 1 mo | 593 | $71.13 | $148.19 | 228.2 |
The Idle Cost Problem
The last row in the table above tells the real story. Industry research consistently shows average GPU utilization of just 5%. An 8-GPU H100 cluster sitting idle for a month wastes over $73 in electricity alone — and over $23,000/month in cloud compute costs for GPUs doing essentially nothing.
The electricity cost of idling is small compared to the cloud rental cost, but it's a useful signal: if you can measure power draw, you can measure utilization, and if you can measure utilization, you can eliminate waste.
The Formula
GPU energy cost per hour follows a simple formula:
Energy Cost ($/hr) = (TDP_W × Utilization × GPU_Count) / 1000 × Electricity_Rate
Example: 8x H100 at 90% utilization, $0.12/kWh
= (700 × 0.90 × 8) / 1000 × $0.12 = $0.605/hr = $441/mo
Electricity vs. Cloud Compute: The Ratio
For cloud deployments, electricity is already baked into the instance price — you pay it indirectly. But for on-premise or colocation deployments, electricity is a direct, measurable cost:
- H100 SXM: Electricity is ~2% of total cloud cost, ~20% of on-prem TCO
- RTX 4090: Electricity is ~6% of total cloud cost (lower cloud price, high TDP)
- T4: Electricity is ~2% of total cloud cost (low TDP, low price)
For cloud users, the bigger win is eliminating idle compute time. For on-prem teams, both idle compute and direct electricity savings matter.
Carbon Footprint
Every kWh of electricity produces CO₂ emissions depending on the local grid mix. Using the US average of 385g CO₂/kWh:
- Fine-tuning Llama 3 8B: ~1 kg CO₂ (equivalent to driving 2.5 miles)
- Pre-training a 7B model: ~1,700 kg CO₂ (equivalent to a cross-country flight)
- Pre-training a 70B model: ~100,000+ kg CO₂ (equivalent to 15 cars for a year)
The EU AI Act (Article 13) requires GPAI model providers to report energy consumption during training. NemulAI provides automated energy metering and compliance-ready reports. Read our EU AI Act compliance guide →
How to Measure Real Energy Costs
The estimates above use TDP as a proxy, but real power draw varies with workload. The only way to know your actual energy cost is to measure it:
# Install the agent (works on NVIDIA, AMD, Intel, Apple Silicon)
pip install nemulai
# Start measuring
nemulai
# See energy cost per job
nemulai report --format csv --with-carbon
Try the Calculator
Model your own GPU electricity costs with our free GPU Cost Calculator. Includes compute costs, electricity, and CO₂ footprint across 9 providers.
Measure real energy costs, not estimates
NemulAI measures actual GPU power draw per job with 5-second resolution across 6 hardware platforms.