-
Error log: /mnt/tet/OpenChatKit-main/training
--model-name /mnt/tet/OpenChatKit-main/training/../pretrained/Pythia-6.9B-deduped/EleutherAI_pythia-6.9b-deduped/ --tokenizer-name /mnt/tet/OpenChatKit-…
-
Hi,
Thanks for the great work on evaluating LMs.
I am trying to evaluate GPT2 on the squad_v2 dataset, but I got the following error.
```
Traceback (most recent call last):
File "/nfs/projects…
-
I have evaluated LLaMA (7B, 13B, and 30B) on most of the tasks available in this library, and the results are bad for some tasks. I will give some examples with the 7B model. I haven't checked all the r…
-
We've observed that some tasks show no learning at all over the first 100B tokens of pretraining. One suspicion we have is that some of these tasks are bugged.
The current task list we run:
```python
arc_c…
-
- [x] arc_challenge | acc
- [x] arc_challenge | acc_norm
- [x] arc_easy | acc
- [x] arc_easy | acc_norm
- [x] boolq | acc
- [x] copa | acc
- [x] headqa | acc
- [x] headqa | acc_norm
- [x] hell…
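Since the checklist tracks both `acc` and `acc_norm` for several tasks, it may help to recall how the two differ: `acc` picks the multiple-choice option with the highest total log-likelihood, while `acc_norm` normalizes each option's log-likelihood by its byte length, so longer completions are not penalized just for being longer. The sketch below illustrates the idea with made-up log-likelihood values; the function name and numbers are illustrative, not the library's actual implementation.

```python
def pick_answer(loglikelihoods, completions, normalize=False):
    """Select the index of the best-scoring answer option.

    acc     : argmax of the raw total log-likelihood.
    acc_norm: argmax of log-likelihood divided by completion byte length.
    """
    scores = []
    for ll, text in zip(loglikelihoods, completions):
        if normalize:
            scores.append(ll / len(text.encode("utf-8")))
        else:
            scores.append(ll)
    return max(range(len(scores)), key=scores.__getitem__)

# Toy example (hypothetical log-likelihoods): the longer completion has a
# lower *total* log-likelihood but a better *per-byte* score, so the two
# metrics disagree on which option "wins".
lls = [-12.0, -20.0]
opts = ["dog", "a very large golden retriever"]
print(pick_answer(lls, opts, normalize=False))  # raw acc picks index 0
print(pick_answer(lls, opts, normalize=True))   # acc_norm picks index 1
```

A task where the correct answers are systematically longer than the distractors can therefore look much worse under `acc` than under `acc_norm`, which is one thing to check before concluding a task is bugged.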
-
It seems that the released checkpoints for MathQA and SVAMP have significantly worse performance than reported in the paper.
-
- [x] arc_challenge
- [x] arc_easy
- [x] boolq
- [x] copa
- [x] headqa (This is Spanish. WTF?)
- [x] hellaswag
- [ ] lambada
- [x] logiqa
- [x] mathqa
- [x] mc_taco
- [x] mrpc
-…
-
Consider removing eval dataset contamination.
We would like to make sure that no downstream eval data can be memorized by the model because it appears in the sampled OSCAR files.
We would want to remov…
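One common way to do this is n-gram overlap filtering: build an index of every n-gram that occurs in any eval document, then drop (or flag) training documents that share an n-gram with it. The sketch below is a minimal version of that idea; the n-gram size, whitespace tokenization, and function names are assumptions, not a prescribed pipeline.

```python
def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as a set of tuples."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def build_eval_index(eval_docs, n=13):
    """Collect every n-gram that appears in any eval document."""
    index = set()
    for doc in eval_docs:
        index |= ngrams(doc.lower().split(), n)
    return index

def is_contaminated(train_doc, eval_index, n=13):
    """Flag a training document sharing any n-gram with the eval set."""
    return bool(ngrams(train_doc.lower().split(), n) & eval_index)

# Toy usage with n=5 so the example stays short.
eval_index = build_eval_index(
    ["the quick brown fox jumps over the lazy dog"], n=5)
print(is_contaminated("he said the quick brown fox jumps high",
                      eval_index, n=5))  # True: shares a 5-gram
print(is_contaminated("completely unrelated training text here",
                      eval_index, n=5))  # False
```

For a corpus the size of OSCAR one would hash the n-grams (or use a Bloom filter) rather than keep raw tuples in memory, but the filtering logic is the same.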
-
We would like to find or create an eval dataset that tests mathematical knowledge. The GRE exams may be a good source of material.
- [ ] Data processing code implemented
- [ ] Evaluation implemen…
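To make the two checklist items concrete, here is one possible shape for the processed data and the evaluation step: a JSONL file with question/answer pairs, scored by normalized exact match. The record schema, field names, and scoring rule are assumptions for illustration, not a decided format.

```python
import json

# Hypothetical record format: one JSON object per line.
SAMPLE_JSONL = """\
{"question": "If 3x + 2 = 11, what is x?", "answer": "3"}
{"question": "What is 15% of 200?", "answer": "30"}
"""

def load_examples(jsonl_text):
    """Parse a JSONL string into a list of question/answer dicts."""
    return [json.loads(line) for line in jsonl_text.splitlines() if line.strip()]

def exact_match(prediction, answer):
    """Compare answers after light normalization (strip whitespace, lowercase)."""
    return prediction.strip().lower() == answer.strip().lower()

def evaluate(examples, predict_fn):
    """Fraction of examples where the model's answer exactly matches."""
    correct = sum(exact_match(predict_fn(ex["question"]), ex["answer"])
                  for ex in examples)
    return correct / len(examples)

# Toy "model" that always answers "3": scores 1/2 on the sample set.
examples = load_examples(SAMPLE_JSONL)
print(evaluate(examples, lambda q: "3"))  # 0.5
```

Exact match is a reasonable first metric for short numeric answers; free-form GRE-style responses would need more tolerant normalization (e.g. stripping units or equivalent fractions).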
-
It seems there are a bunch of errors when running MathQA on my side.
Can you share your environment details?
I think the allennlp version matters in particular; I use the latest allennlp version as mentio…