pfnet-research / japanese-lm-fin-harness

Japanese Language Model Financial Evaluation Harness
MIT License
51 stars 4 forks source link

Add claude 2.1 as new model #1

Closed tetsuoh0103 closed 2 months ago

tetsuoh0103 commented 4 months ago

Could you consider to add claude2.1 by Anthropic in this benchmark?

masanorihirano commented 4 months ago

I'll consider it. Thank you for your suggestion.

masanorihirano commented 2 months ago

claude3 models are now available on our leaderboard.