πŸ‡ΊπŸ‡Έ USA Β· RunPod

Status: 🟩 COMPLETE 🟦 LIVING Last updated: 2026-06-26 Plain-English tagline: GPU cloud + serverless inference. Famous for cheap spot pods, community templates, easy deployment of any Docker container with GPU. Popular with indie AI developers + ML hobbyists.


Front-matter facts

FieldValue
VendorRunPod Inc (USA)
Country / originπŸ‡ΊπŸ‡Έ USA
Recommended for Australian users?βœ… Yes β€” accessible from AUS
Privacy summaryStandard cloud provider; your workloads on rented hardware
Free tierNone for GPUs; some free credits on sign-up
Paid tiersPer-hour rentals + serverless per-second; spot pods for cheaper variable-availability
First released2022
Last reviewed2026-06-26
Official sitehttps://runpod.io

What it is

RunPod is a GPU cloud focused on flexibility + cost + ease of use. Two main products:

Pods (rented GPUs)

  • Rent GPUs by the hour or second
  • Spot Pods β€” cheaper, can be reclaimed (~50% off On-Demand)
  • On-Demand Pods β€” guaranteed availability
  • Community templates (one-click deploy of popular ML tools: Stable Diffusion WebUI, Ollama, Whisper, etc.)
  • Multi-GPU support

Serverless

  • Pay-per-second functions on GPU
  • Cold-start delays (~5-30s)
  • Good for variable inference loads
  • Auto-scaling

Why RunPod is popular:

  • Spot pods are very cheap β€” among the cheapest H100 access
  • Easy template deployments β€” Stable Diffusion / Ollama / fine-tuning workflows pre-configured
  • Community / hobbyist friendly β€” Discord support, lots of tutorials
  • Multiple regions / hardware tiers β€” A100, H100, H200, B200 availability

What you’d use it for

  • Indie AI developer projects β€” cheap GPU access for prototypes
  • Stable Diffusion workflows at scale without local GPU
  • Fine-tuning open-weight models β€” cheap H100 spot pods
  • Hobbyist ML experimentation
  • Serverless inference for variable-load AI products
  • Quick model deployment via community templates

When NOT to use:

  • For production-grade SLAs (RunPod is reliable but not enterprise-class)
  • For AUS data residency
  • For broader cloud needs (compute-focused)
  • For tightly-coupled enterprise compliance (less mature than hyperscalers)

How to use from Australia

  1. Sign up at runpod.io
  2. AUS card accepted
  3. Pick a pod template (Stable Diffusion / Ollama / Custom Docker)
  4. Spin up β€” typically 30 seconds for warm pods
  5. SSH or web UI access
  6. Shut down when done

What it costs

On-Demand Pods (verify current)

  • H100 80GB SXM: ~US$2.99/hour
  • A100 80GB: ~US$1.74/hour
  • RTX 4090: ~US$0.34/hour
  • L40S: ~US$1.30/hour

Spot Pods

  • ~50% cheaper than On-Demand
  • Can be reclaimed; design workflows accordingly

Serverless

  • Per-second pricing
  • Cold start latency
  • Auto-scaling

How it compares to alternatives

AspectRunPodLambda LabsModalCoreWeave
Spot pricingBest (cheapest)GoodN/A (serverless)Limited
Hobbyist friendlyBest (templates + community)ResearcherDeveloperEnterprise
Serverless optionYesAPI (Inference)Yes (best)No
Multi-regionYesUSUSMulti
AUS data residencyNoNoNoNo
Best forIndie / hobbyist / spot pricingCost-conscious researchersPythonic serverlessEnterprise GPU clusters

For cheapest GPU access with community templates, RunPod is hard to beat.


Privacy / data handling

  • Standard cloud β€” your workloads on rented hardware
  • US-based; no AUS residency
  • Spot pod data persists per template settings

Recent changes

  • 2025-26: Blackwell B200 availability; serverless improvements
  • 2024: Spot pod pricing increasingly competitive

Gotchas

  • Spot pods can be reclaimed β€” design for that
  • US-only infrastructure β€” high latency for AUS interactive use
  • Cold starts on serverless β€” 5-30s typical
  • For production-grade SLAs, hyperscaler GPU (AWS / Azure / GCP) better
  • Discord support culture β€” friendly but not enterprise-style support

See also


Sources