Any chance to see GPT-5 gets evaluated on this beautiful benchmark? 👀
Any chance to see GPT-5 gets evaluated on this beautiful benchmark? 👀