promptbench

Evaluation · infrastructure · open · #790 of 884 · +102 · Rising

58.4

Low

High confidence

A unified evaluation framework for large language models

Pillar Breakdown

Pillar        Weight   Score
Adoption      35%      44.4
Maintenance   30%      67.5
Friction      20%      97.6
Ecosystem     15%      35.1

Momentum

0.46 Rising
7d change -0.26
High confidence

In Evaluation

Ranked #46 of 57

Similar Tools