Same-prompt comparison
Pick a published prompt; see every model's score side-by-side. Append ?prompt_id=<id> to share a specific prompt.
Select a prompt above (or pass
?prompt_id=) to compare model outputs.