BIG-bench
Evaluation · infrastructure · open · #666 of 944 · -2 · Surging
65.4
Low
High confidence
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Pillar Breakdown
Adoption (35%): 67.3
Maintenance (30%): 59.8
Friction (20%): 98.3
Ecosystem (15%): 44.4
Momentum: 0.83 (Surging)
7d change -0.03
High confidence
In Evaluation: ranked #32 of 57