LLMs - Aidan Cooper

PGN2FEN: A Benchmark for Evaluating LLM Chess Reasoning

Introducing PGN2FEN, a benchmark for evaluating language models' ability to understand and transcribe chess game move sequences.

May 3, 2025

6 min read

Open and private models are becoming more similar than they are different

Aug 12, 2024

6 min read

Building your AI applications around open source models can make them better, cheaper, and faster

Apr 26, 2024

14 min read