This page runs inference in the visitor's browser using transformers.js and WebGPU.
Meta-Planner currently uses OpenForecaster-8B, which needs an ONNX/WebGPU-compatible model repo to run fully in the browser.
Quantization mapping in browser mode: IQ2_XXS→q4, IQ3_XXS→q8, IQ4_XS→q4f16, BF16→fp16.
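
The mapping above could be expressed as a small lookup table. This is only a sketch: the names `QUANT_TO_DTYPE` and `toBrowserDtype` are hypothetical and not taken from the page's actual code, and the choice to throw on an unknown label is an assumption.

```typescript
// Hypothetical helper: translate the quant labels listed above into the
// dtype strings used in browser mode. All names here are illustrative.
type QuantLabel = "IQ2_XXS" | "IQ3_XXS" | "IQ4_XS" | "BF16";
type BrowserDtype = "q4" | "q8" | "q4f16" | "fp16";

const QUANT_TO_DTYPE: Record<QuantLabel, BrowserDtype> = {
  IQ2_XXS: "q4",
  IQ3_XXS: "q8",
  IQ4_XS: "q4f16",
  BF16: "fp16",
};

function toBrowserDtype(quant: string): BrowserDtype {
  // Reject labels outside the table instead of silently picking a default
  // (assumed behavior; the real page may fall back differently).
  if (!Object.prototype.hasOwnProperty.call(QUANT_TO_DTYPE, quant)) {
    throw new Error(`No browser dtype mapped for quant "${quant}"`);
  }
  return QUANT_TO_DTYPE[quant as QuantLabel];
}
```

The returned string would presumably be passed as the `dtype` option, together with `device: "webgpu"`, when loading the model through transformers.js.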