WebGPU Local Inference
DevDiff v1.0.1 supports WebGPU-accelerated local inference, so lightweight models (e.g., Llama 3.2 3B, Gemma 2 2B) run directly on your own GPU with no installation and no cloud costs.
Hardware Requirements
- GPU: A WebGPU-compatible GPU (e.g., Apple M-series chips, NVIDIA RTX GPUs, AMD Radeon GPUs).
- Drivers: Up-to-date GPU drivers that expose a graphics backend WebGPU can target (typically Metal, Direct3D 12, or Vulkan).
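
Whether a machine meets these requirements can be checked at runtime. The following is a minimal sketch, not DevDiff's actual detection code: the function names `hasWebGPUEntryPoint` and `isWebGPUUsable` and the `NavigatorLike` type are hypothetical, introduced here only for illustration. It relies on the standard WebGPU entry point `navigator.gpu` and its `requestAdapter()` method, which resolves to `null` when no suitable adapter exists.

```typescript
// Minimal structural types so the sketch compiles without extra type packages.
// (Hypothetical helper types, not part of DevDiff.)
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Cheap synchronous check: does the runtime expose the WebGPU entry point at all?
function hasWebGPUEntryPoint(nav: NavigatorLike | undefined): boolean {
  return nav !== undefined && nav.gpu !== undefined;
}

// Fuller check: the entry point can exist while no usable adapter is available
// (for example, an unsupported GPU or driver), so request an adapter as well.
async function isWebGPUUsable(nav: NavigatorLike | undefined): Promise<boolean> {
  if (nav === undefined || nav.gpu === undefined) return false;
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter !== null;
  } catch {
    // Treat any failure while probing as "not usable".
    return false;
  }
}
```

In a browser, these would be called with the global `navigator` object (cast to the helper type if needed). The two-step design reflects that the presence of `navigator.gpu` alone does not guarantee a working adapter.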
Fallback Chain
If WebGPU is unavailable, DevDiff automatically falls back to:
- WebAssembly (WASM)
- Native CPU
- Local Ollama instance (download instructions are provided)
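
The fallback behavior above can be sketched as a simple priority search. This is an illustration under stated assumptions, not DevDiff's implementation: the `Backend` identifiers, `FALLBACK_ORDER`, and `selectBackend` are hypothetical names, and the sketch assumes the fallbacks are tried in the order they are listed.

```typescript
// Backend names mirror the fallback list above; the identifiers are illustrative.
type Backend = "webgpu" | "wasm" | "cpu" | "ollama";

// Assumed priority: WebGPU first, then the fallbacks in the order listed.
const FALLBACK_ORDER: readonly Backend[] = ["webgpu", "wasm", "cpu", "ollama"];

// Return the first backend in priority order that is reported available,
// or null when none is.
function selectBackend(available: ReadonlySet<Backend>): Backend | null {
  for (const backend of FALLBACK_ORDER) {
    if (available.has(backend)) return backend;
  }
  return null;
}

// Example: WebGPU and WASM are unavailable, so native CPU is chosen.
const chosen = selectBackend(new Set<Backend>(["cpu", "ollama"]));
console.log(chosen); // prints "cpu"
```

Keeping the priority list as data rather than nested conditionals makes the order easy to audit and change. How each backend's availability is actually probed is left out of the sketch.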