1. Phrase fingerprinting
A small dictionary of AI hedges — moment, second, quick, instantly, let me think — plus explicit estimates like “5 minutes”. Each phrase resolves to a category with its own prior.
Sandbox-only research artifact
A calibration layer that maps LLM time-claims — “a moment,” “quick check,” “let me think” — to the wall-clock seconds they tend to consume. Trained on completed-task logs you keep locally. Gated by accuracy. Honest about its priors.
Paste real AI output. The Lens finds every time-claim, highlights it, and stacks them into one wall-clock distribution.
Annotated message will appear here.
Add one row each time an AI promised a duration and you timed the reality. The model recalibrates per category.
Each phrase family has its own prior and its own gate. Below the gate, output is marked illustrative.
Plain-English methodology — no dark patterns, no hidden state.
A small dictionary of AI hedges — moment, second, quick, instantly, let me think — plus explicit estimates like “5 minutes”. Each phrase resolves to a category with its own prior.
Default priors are log-normal, derived from public anecdotal ranges. As you log outcomes, observed samples take over via a weighted mix — the prior fades as n grows.
A category is “calibrated” only when you have at least 5 samples and the log-standard-deviation falls under 0.9. Below that, output is rendered with an illustrative badge.
The Lens treats each detected phrase as an independent latency draw, then sums their medians and propagates variance to estimate the cumulative range. Sequential, not parallel.
Everything you log stays in this browser via localStorage. No network calls, no analytics, no accounts. Hit Reset data any time.
Not a performance benchmark. Not a model evaluation. Not advice you should act on. A pedagogical instrument for thinking about AI time-talk — nothing more.