languagebench / models.json

Commit History

Upload from GitHub Actions: Add auto-translated datasets
68a93b5
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge pull request #18 from datenlabor-bmz/pr-17
a0d1624
verified

davidpomerenke commited on

Upload from GitHub Actions: Add auto-translated datasets
c790fdb
verified

davidpomerenke commited on

Upload from GitHub Actions: Update evaluation results
f88768f
verified

davidpomerenke commited on

Upload from GitHub Actions: Update evaluation results
95c4e14
verified

davidpomerenke commited on

Upload from GitHub Actions: ran full evaluation locally
088f96f
verified

davidpomerenke commited on

Upload from GitHub Actions: restored model.json
d380f79
verified

davidpomerenke commited on

Upload from GitHub Actions: updated and cleaned up scripts for new eval runs
963cb78
verified

davidpomerenke commited on

Upload from GitHub Actions: Update models.py, models.json, and results.json with latest evaluation data and model additions
8eebb41
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev
7c06aef
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge pull request #7 from datenlabor-bmz/jn-dev
6878a71
verified

davidpomerenke commited on

Upload from GitHub Actions: Get more results, compute average based on all tasks
98c6811
verified

davidpomerenke commited on

Upload from GitHub Actions: Correlation plot
b0aa389
verified

davidpomerenke commited on

Upload from GitHub Actions: Evaluate Google Translate
338dc9b
verified

davidpomerenke commited on

Upload from GitHub Actions: More models and languages
a73f888
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge remote changes and apply terminology updates: Commercial->closed-source, Open->open-source
ebaf279
verified

davidpomerenke commited on

Upload from nightly evaluation run
c3be561
verified

davidpomerenke commited on

Upload from GitHub Actions: More results
52abc5b
verified

davidpomerenke commited on

Upload from GitHub Actions: Update model ranking fetching
f840423
verified

davidpomerenke commited on

Upload from GitHub Actions: Use FLORES+ via Huggingface
913253a
verified

davidpomerenke commited on

Upload from nightly evaluation run
7fce0be
verified

davidpomerenke commited on

Upload from nightly evaluation run
7e8d13c
verified

davidpomerenke commited on

Upload from nightly evaluation run
9ee89ef
verified

davidpomerenke commited on

Upload from nightly evaluation run
8a4050a
verified

davidpomerenke commited on

Upload from nightly evaluation run
1d4c8a4
verified

davidpomerenke commited on

Upload from GitHub Actions: New results
b311dd5
verified

davidpomerenke commited on

Upload from nightly evaluation run
47bcf10
verified

davidpomerenke commited on

Upload from nightly evaluation run
dcb356d
verified

davidpomerenke commited on

Block gemini-2.5-pro-exp-03-25
092c06a

David Pomerenke commited on

Only run tasks for which there is no result yet
2f9dee1

David Pomerenke commited on