RiddleBench: A New Generative Reasoning Benchmark for LLMs Paper โข 2510.24932 โข Published Oct 28, 2025 โข 8