© 2025 Scale AI. All rights reserved.
EnigmaEval
Puzzle Solving
Last updated: April 3, 2025
Performance Comparison
| Rank | Model | Score |
|---|---|---|
| 1 | | 13.09±1.92 |
| 1 | | 11.91±1.85 |
| 2 | | 9.21±1.65 |
| 3 | | 6.81±0.83 |
| 4 | | 6.14±1.37 |
| 4 | o1 (December 2024) | 5.65±1.32 |
| 5 | | 4.23±1.17 |
| 7 | | 4.14±1.16 |
| 7 | | 3.18±1.02 |
| 7 | | 2.36±0.87 |
| 7 | | 2.26±0.86 |
| 8 | | 2.17±0.84 |
| 10 | Gemini 2.0 Flash Thinking (January 2025) | 1.10±0.60 |
| 11 | Claude 3.5 Sonnet (October 2024) | 0.91±0.55 |
| 12 | Pixtral Large (November 2024) | 0.84±0.53 |
| 13 | Claude 3 Opus | 0.82±0.45 |
| 13 | GPT-4o (November 2024) | 0.80±0.44 |
| 13 | | 0.69±0.48 |
| 13 | | 0.63±0.45 |
| 13 | | 0.58±0.43 |
| 13 | Llama 3.2 90B Vision Instruct | 0.38±0.35 |