You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 27, 2023. It is now read-only.
a "number of games" parameter (needs looking into: are we randomly pitting "players" against each other? Are we rather going through all possible games?
And returns a dictionary whose keys are model names and values are ELO ratings.
This part on the wiki page also seems relevant for implementation:
An example may help to clarify: Suppose player A has a rating of 1613...
The text was updated successfully, but these errors were encountered:
Suggestion: Test using multiple small models: distilgpt2, gpt2, gpt2-medium, for example. Actually, it should be possibly to simply send in a list of e.g. three identical model names, too, right?
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
We should have a function which receives as arguments:
And returns a dictionary whose keys are model names and values are ELO ratings.
This part on the wiki page also seems relevant for implementation:
The text was updated successfully, but these errors were encountered: