paulbricman / DebateGPT

Implementation of initial ArgRank and DebateGPT prototypes.
https://paulbricman.com/defensibility/
MIT License

Implement Elo rating function #4

Open paulbricman opened 1 year ago

paulbricman commented 1 year ago

We should have a function which receives as arguments:

This part on the wiki page also seems relevant for implementation:

An example may help to clarify: Suppose player A has a rating of 1613...
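
For reference, here is a minimal sketch of the standard Elo update that an example like this walks through. It is not the repo's implementation: the function names `expected_score` and `update_rating` and the default K-factor of 32 are assumptions chosen for illustration.

```python
from typing import Sequence


def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_rating(
    rating: float,
    opponent_ratings: Sequence[float],
    scores: Sequence[float],
    k: float = 32.0,  # assumed K-factor, not taken from the issue
) -> float:
    """Return the new rating after a batch of games.

    `scores` holds one entry per opponent: 1.0 for a win, 0.5 for a draw,
    0.0 for a loss.
    """
    if len(opponent_ratings) != len(scores):
        raise ValueError("need exactly one score per opponent")
    expected = sum(expected_score(rating, r) for r in opponent_ratings)
    actual = sum(scores)
    return rating + k * (actual - expected)
```

With equal ratings the expected score is 0.5, so a single win at K = 32 moves the winner up by 16 points.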

paulbricman commented 1 year ago

Suggestion: test using multiple small models, for example distilgpt2, gpt2, and gpt2-medium. Actually, it should be possible to simply send in a list of, e.g., three identical model names too, right?
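
One way a list with repeated model names could work is to key ratings by position in the list rather than by name. The sketch below assumes a hypothetical `rate_players` helper, an initial rating of 1000, a K-factor of 32, and game results given as `(i, j, score_i)` tuples. None of these come from the repo.

```python
from typing import List, Sequence, Tuple


def rate_players(
    names: Sequence[str],
    results: Sequence[Tuple[int, int, float]],
    initial: float = 1000.0,  # assumed starting rating
    k: float = 32.0,          # assumed K-factor
) -> List[float]:
    """Return one rating per list position, so duplicate names stay distinct.

    Each result is (i, j, score_i): players at positions i and j met, and
    score_i is 1.0 if i won, 0.5 for a draw, 0.0 if i lost.
    """
    ratings = [initial] * len(names)
    for i, j, score_i in results:
        # Standard Elo expected score for player i against player j.
        expected_i = 1.0 / (1.0 + 10 ** ((ratings[j] - ratings[i]) / 400.0))
        delta = k * (score_i - expected_i)
        # Zero-sum update: what i gains, j loses.
        ratings[i] += delta
        ratings[j] -= delta
    return ratings


# Three identical names are still three separate players.
ratings = rate_players(["gpt2", "gpt2", "gpt2"], [(0, 1, 1.0)])
```

Here the first copy beats the second, so the ratings become 1016, 984, and 1000, even though all three entries share a name.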