May 03

PGN2FEN: A Benchmark for Evaluating LLM Chess Reasoning
Introducing PGN2FEN, a benchmark for evaluating how well language models understand and transcribe chess move sequences.