ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models • Paper • 2509.24239 • Published Sep 29