-
Thanks for this work!
I also use the evaluation script from the https://github.com/FranxYao/chain-of-thought-hub/MMLU/run_mmlu_open_source.py
I got the same result with original repo【Falcon40b (…
-
0.0.0.0 1lesbiantube.com
0.0.0.0 3danimalsex.com
0.0.0.0 3dhentai.club
0.0.0.0 3dporn.co.uk
0.0.0.0 3xmedia.hu
0.0.0.0 3xnight.hu
0.0.0.0 5valleycatrescue.co.uk
0.0.0.0 6.xporno.online
0.0.0.0…
-
I'm opening this issue, with permission, to raise a concern about the proposed Code of Conduct update and articulate a discussion that so far has been mostly off the written record. I think the concer…
-
## What is your proposed change?
To create a new codelist for field `se_category` within the vulnerability component. This field appears in DDH_v0.2.xlsx.
[Link to spreadsheet](https://docs.google…
-
Hello,
I don't reproduce the llama7b scores https://github.com/OpenNMT/OpenNMT-py/blob/master/eval_llm/MMLU/llama7b-onmt.txt
With this config:
```yaml
# transforms
transforms: [sentencepiece]
…
-
In your MMLU evaluation, the accuracy of Vicuna is only 24.9%, which is the same as a random guess. This is obviously wrong.
Did you directly use our delta weights (https://huggingface.co/lmsys/vicun…
-
Summary of request: Add a new organization to ROR
Name of organization: Laboratoire d'études de genre et de sexualité
Website: https://legs.cnrs.fr
Link to publications: https://hal.parisnanterre…
-
(In followup to Issue #4 )
In clarification and, for the avoidance of any doubt, the read-me and associated documentation, should indicate if mature, explicit or NSFW content can or cannot be genera…
-
I came across your mod looking for glow ink related mobs, and will probably add it to my mod pack for me and my friends, since I think these mobs could do with some more color. However, I'm just a bo…
-
Back in https://github.com/freelawproject/courtlistener/issues/3536#issuecomment-1992789591, we talked about and did a lot of work on performance enhancements, and then we concluded:
> 1. Highlight…