Same-prompt comparison

Pick a published prompt; see every model's score side-by-side. Append ?prompt_id=<id> to share a specific prompt.

Select a prompt above (or pass ?prompt_id=) to compare model outputs.