Releases: haesleinhuepf/human-eval-bia

2024.07.04

04 Jul 14:13
846b504

What's Changed

Full Changelog: 2024.04.25...2024.07.04

2024.04.25

25 Apr 12:23
b451bdd

What's Changed

  • Added samplers and samples from 9 recent open source models. by @jkh1 in #62
  • Figures and text modifications #63

New Contributors

  • @jkh1 made their first contribution in #62

Full Changelog: 2024.04.19...2024.04.25

2024.04.19

19 Apr 15:14
fcb37fd

This version of the benchmark was submitted as a preprint. A link will be added to the README once the preprint is out.

Most important changes

Changes to list of tested models

  • adding gpt-4-turbo-2024-04-09 to tested models by @haesleinhuepf in #18
  • The Mistral models tested via the blablador infrastructure were temporarily removed from the list of tested models due to technical difficulties. See #55 for details.

New test-cases

Other changes

New Contributors

Full Changelog: 2024.04.07...2024.04.19

2024.04.07

07 Apr 16:31
a27a290
Pre-release

What's Changed

Full Changelog: https://github.com/haesleinhuepf/human-eval-bia/commits/2024.04.07