OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Support printing and saving results in Markdown table format, making it easier to view table directly in VSCode or other IDEs. Without BC-Breaking, results will be printed and saved as a Markdown table.
Support printing and saving results in Markdown table format, making it easier to view table directly in VSCode or other IDEs. Without BC-Breaking, results will be printed and saved as a Markdown table.
running log as follows: