issues
search
EleutherAI
/
the-pile
MIT License
1.44k
stars
122
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Set up webpage
#69
leogao2
closed
3 years ago
1
License as MIT
#68
leogao2
closed
3 years ago
0
ImportError: cannot import name 'train_chars' from 'the_pile.pile'
#67
viashimat
closed
3 years ago
5
Royal Society Publishing
#66
StellaAthena
closed
3 years ago
2
Israeli Legal Databases
#65
StellaAthena
closed
3 years ago
0
European Patent Office
#64
StellaAthena
closed
3 years ago
1
Southern African Legal Datasets
#63
StellaAthena
closed
3 years ago
1
Updating the V2 branch with recent master changes.
#62
StellaAthena
closed
3 years ago
0
Multilingual Wikipedia
#61
StellaAthena
closed
3 years ago
2
Enhance downloading
#60
researcher2
closed
3 years ago
0
Dataset download and size changes.
#59
researcher2
closed
3 years ago
0
The `--using` flag doesn't actually do anything
#58
StellaAthena
closed
3 years ago
0
case.law
#57
hendrycks
closed
3 years ago
3
Debate notes
#56
Hellisotherpeople
closed
3 years ago
5
Update requirements-dev.txt
#55
mgrankin
closed
3 years ago
0
Filter Github for copyright headers
#54
leogao2
closed
3 years ago
2
Dataset books3.tar.gz is hosted
#53
VonChair
closed
3 years ago
5
Packaging for PyPi
#52
researcher2
closed
3 years ago
0
The Eye
#51
Robbie-chew
closed
3 years ago
4
BookCorpus download not working
#50
aveni
closed
3 years ago
1
Screenplays (Subtitles don't contain info about who says & does what)
#49
christophschuhmann
closed
3 years ago
14
add YoutubeSubtitleDataset
#48
sdtblck
closed
3 years ago
0
RePEc
#47
cfoster0
closed
3 years ago
3
African Journals Online
#46
StellaAthena
closed
3 years ago
3
Paperity
#45
leogao2
closed
3 years ago
0
Coastal Zone Information Center Collection
#44
thoppe
closed
3 years ago
4
PhilPapers
#43
cfoster0
closed
3 years ago
3
Biodiversity Heritage Library
#42
cfoster0
closed
3 years ago
1
Move processing code to this repo
#41
StellaAthena
closed
3 years ago
5
Separate functions for downloading pre-processed and datasets and downloading & processing
#40
sdtblck
closed
3 years ago
1
United Nations Speeches
#39
cfoster0
closed
3 years ago
0
United Nations Publications
#38
cfoster0
closed
3 years ago
7
Revert "Add database code for pubmed"
#37
StellaAthena
closed
3 years ago
0
Add database code for pubmed
#36
thoppe
closed
3 years ago
0
Congressional Records
#35
thoppe
closed
3 years ago
1
NIH Abstract text for awarded grants
#34
thoppe
closed
3 years ago
8
FreeLaw Project
#33
thoppe
closed
3 years ago
5
Literotica (replication)
#32
leogao2
closed
3 years ago
4
PMC
#31
leogao2
closed
3 years ago
0
AO3 (replication)
#30
leogao2
closed
3 years ago
0
PUBMED (biomedical abstracts)
#29
thoppe
closed
3 years ago
14
Stackexchange dataset
#28
sdtblck
closed
3 years ago
8
Caselaw Access Project
#27
StellaAthena
closed
3 years ago
1
USPTO Patent
#26
cfoster0
closed
3 years ago
5
Europarl
#25
StellaAthena
closed
3 years ago
5
bioRxiv
#24
StellaAthena
closed
3 years ago
2
Literotica
#23
StellaAthena
closed
3 years ago
1
Bibliotik
#22
StellaAthena
closed
3 years ago
0
arXiv
#21
StellaAthena
closed
3 years ago
1
Small Flag
#20
anishthite
closed
3 years ago
8
Previous
Next