Writes software, dreams.
Run the same prompt through two or more models side by side and score the results.