LE

lm-evaluation-harness

Evaluation·infrastructure·open·#393 of 944·+14·Surging

73.5

Moderate

High confidence

A framework for few-shot evaluation of language models.

Pillar Breakdown

Adoption

35%

82.4

Maintenance

30%

65.3

Friction

20%

99.7

Ecosystem

15%

49.7

Momentum

0.77Surging
7d change +0.42
High confidence

In Evaluation

Ranked #18 of 57

Similar Tools