2024.04.19

@haesleinhuepf haesleinhuepf released this 19 Apr 15:14
fcb37fd

This version of the benchmark was submitted as a preprint. A link will be added to the readme once it is published.

Most important changes

Changes to list of tested models

  • Added gpt-4-turbo-2024-04-09 to the tested models by @haesleinhuepf in #18
  • The Mistral models tested via the blablador infrastructure were temporarily removed from the list of tested models due to technical difficulties. See #55 for details.

New test-cases

Other changes

New Contributors

Full Changelog: 2024.04.07...2024.04.19