BB

BIG-bench

Evaluation·infrastructure·open·#666 of 944·-2·Surging

65.4

Low

High confidence

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Pillar Breakdown

Adoption

35%

67.3

Maintenance

30%

59.8

Friction

20%

98.3

Ecosystem

15%

44.4

Momentum

0.83Surging
7d change -0.03
High confidence

In Evaluation

Ranked #32 of 57

Similar Tools