issues
search
JHU-CLSP
/
turking-bench
Web-grounded natural language instructions
https://turkingbench.github.io
Apache License 2.0
11
stars
6
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Hiding error prints while oracle is running comprehensive tests and hotfix model zero outputs
#82
klxu03
closed
11 months ago
0
hotfix on comprehensive tests divide by zero
#81
klxu03
closed
11 months ago
0
Comprehensive tests for Turk-Instructions
#80
klxu03
closed
11 months ago
2
Start of comprehensive test, hotfix on clear text
#79
klxu03
closed
11 months ago
2
Passive voice Parents 1st-2nd Person Persuasiveness Comparison
#78
yeganehkordi
closed
11 months ago
0
BiSECT Multilingual Evaluation
#77
yeganehkordi
closed
11 months ago
0
Update Commonsense Misinformation Tracking Pilot [cancer data setup] 10
#76
yeganehkordi
closed
11 months ago
0
HTER - no longer sentences
#75
klxu03
closed
11 months ago
0
Update 4_run_evaluation.py
#74
yeganehkordi
closed
11 months ago
0
Change the logic of eval to skip the tasks that fail
#73
danyaljj
closed
11 months ago
0
Added Author In-Group to list of Tasks, Fixed Checkbox tasks, Updated Test Behavior
#72
klxu03
closed
12 months ago
0
in `modify_select` need to skip `nan` values
#71
danyaljj
closed
12 months ago
0
Author In-Group Analysis Phrase Classification 2 Resolved
#70
klxu03
closed
12 months ago
1
Run_single script allowing easy testing of a single task
#69
klxu03
closed
12 months ago
0
Got project working locally on my Mac
#68
klxu03
closed
12 months ago
1
`Visual Comet Multiple Choice Test Verify` images
#67
danyaljj
closed
7 months ago
0
`HTER - longer sentences -27 Sep 1129` inputs are not loaded properly
#66
danyaljj
closed
11 months ago
0
When evaluating "range" inputs, we use MSE. We need to normalize it by the maximum possible value of the range.
#65
danyaljj
closed
9 months ago
0
`Author In-Group Analysis Phrase Classification 2` batch.csv file headers say `.on` which messes up our code
#64
danyaljj
closed
12 months ago
3
Redoing the baselines and batch evaluation
#63
danyaljj
closed
1 year ago
0
[WIP] Unit testing, baseline refactoring, ...
#62
danyaljj
closed
1 year ago
0
Refactoring code and more tests
#61
danyaljj
closed
1 year ago
0
Website inspiration
#60
danyaljj
closed
10 months ago
0
`Compile list of area chairs`
#59
danyaljj
closed
10 months ago
0
`Scalar Adjectives Identification` seems like we're counting the number radio buttons in our stats.
#58
danyaljj
closed
10 months ago
3
CI: Check for missing images/files
#57
danyaljj
opened
1 year ago
0
[WIP]
#56
danyaljj
closed
1 year ago
0
Update CI so that it can run the tasks/turkle
#55
danyaljj
closed
1 year ago
0
HUSE-WebText-GPT-L-Unconditional-Pure Sampling
#54
danyaljj
closed
1 year ago
1
'ATOMIC - NL Rephrase 16'
#53
danyaljj
closed
1 year ago
1
Various improvements
#52
danyaljj
closed
1 year ago
0
Add token distance stats
#51
yeganehkordi
closed
1 year ago
0
Update encoding
#50
yeganehkordi
closed
1 year ago
0
<task_dir>
#49
danyaljj
closed
1 year ago
0
Delete redundant files
#48
yeganehkordi
closed
1 year ago
1
Transfer videos/images/URLS to our own bucket
#47
danyaljj
closed
1 year ago
0
Transfer videos/images/URLS to our own bucket [WIP]
#46
danyaljj
closed
1 year ago
1
Data review
#45
yizhongw
closed
1 year ago
0
Data review
#44
abyerly2
closed
1 year ago
1
Data review
#43
yeganehkordi
closed
1 year ago
1
`VisualCOMET Selection test` images are missing
#42
yeganehkordi
closed
10 months ago
0
`Video MC Eval lsmdc_generative_finetune_mc_neg3` videos
#41
danyaljj
closed
1 year ago
0
`MCN - Multiple Choice testset` images are missing
#40
danyaljj
closed
1 year ago
0
`advice-qual-feb-13-2020 v0` input.csv is empty?
#39
danyaljj
closed
10 months ago
0
`video sentence evaluation pos 1` requires videos
#38
danyaljj
closed
1 year ago
0
`Commonsense Morality-Text Label Validate-Collect-Extended `
#37
danyaljj
closed
10 months ago
0
`Relative CommonsenseQA Explanation Pairwise Judgements Collection 3`
#36
danyaljj
closed
10 months ago
0
Cleaned data
#35
katesanders9
closed
1 year ago
2
update data
#34
yeganehkordi
closed
1 year ago
0
"Reading comprehension" URLS
#33
danyaljj
opened
1 year ago
1
Previous
Next