datalad
/
datalad-crawler
DataLad extension for tracking web resources as datasets
http://datalad.org
Other
5
stars
16
forks
issues
TMP travis stall
#37
yarikoptic
closed
5 years ago
2
RF: strip "enforced" direct mode in the tests
#36
yarikoptic
closed
5 years ago
3
Rel 0.3
#35
yarikoptic
closed
5 years ago
3
using the datalad crawler for LORIS
#34
thomasbeaudry
opened
5 years ago
3
TST: Make test robust against older and newer datalad versions
#33
mih
closed
5 years ago
2
BF: CRCNS - Skip (but warn if relevant) records without xml
#32
yarikoptic
closed
5 years ago
3
RF: Adjust test for a change that is coming in -core
#31
mih
closed
5 years ago
0
RF: Import crawler-specific helper from -core
#30
mih
closed
5 years ago
2
ENH: issue warning if incoming_pipeline has Annexificator but no annex= is given
#29
yarikoptic
closed
5 years ago
5
Crawl github organizations for subdatasets
#28
yarikoptic
closed
4 years ago
2
Support pure .gz (not .tar.gz) files by exposing a new template argument archives_re
#27
yarikoptic
closed
5 years ago
1
RF: Adjust for GitRepo.get_gitattribute() API changes
#26
yarikoptic
closed
5 years ago
2
ENH drop immediately etc
#25
yarikoptic
closed
4 years ago
1
Support pure .gz (not .tar.gz) files in simple_with_archives
#24
yarikoptic
closed
5 years ago
0
TST: Drop stale known_failure_v6's
#23
kyleam
closed
5 years ago
8
Generic framework for crawling data providers with versions
#22
yarikoptic
opened
5 years ago
3
"Reproducible" crawled datasets
#21
yarikoptic
opened
5 years ago
0
figshare crawler
#20
yarikoptic
opened
5 years ago
0
RF: move crawl-init into crawl making crawl accept additional params to be passed into pipeline
#19
mih
opened
5 years ago
0
Describe basic structure of crawler pipelines
#77
mih
opened
5 years ago
1
crawling sample stanford dataset failed - they have incomplete .tar
#18
yarikoptic
opened
5 years ago
8
ENH: crawl stanford lib initial crawler
#17
yarikoptic
closed
5 years ago
4
Physionet crawler
#16
yarikoptic
closed
3 years ago
1
CRCNS metadata fetching needs fixups
#15
yarikoptic
closed
5 years ago
1
Running datalad crawl import module error
#14
um4r12
opened
5 years ago
7
Add LORIS crawler
#13
driusan
closed
2 years ago
3
Running crawl-init in a non-dataset brings confusing and irrelevant error message
#12
yarikoptic
opened
5 years ago
2
Crawling of stanford dataspace, and simple indexes
#11
yarikoptic
closed
3 years ago
3
Small bug fixes
#10
yarikoptic
closed
5 years ago
4
XNAT and NITRC support
#9
chaselgrove
closed
4 years ago
23
Needs conda recipe/package
#8
yarikoptic
opened
6 years ago
0
Needs Debian package
#7
yarikoptic
opened
6 years ago
0
BF: use state of master as the starting point for any new branch in Annexificator
#6
yarikoptic
closed
6 years ago
2
Pipelines are broken due to merge conflict with .gitattributes
#5
mih
closed
6 years ago
2
RF: Minor cleanup while familiarizing myself with the code
#4
mih
closed
6 years ago
1
Confusing `crawl-init` error
#3
mih
opened
6 years ago
0
Provide S3 credentials
#2
mih
opened
6 years ago
0
Import docs from datalad-core
#1
mih
closed
6 years ago
1
crawl http index(es) helper
#76
yarikoptic
opened
6 years ago
0
openfmri: crawl derivatives into submodule(s)
#75
yarikoptic
opened
7 years ago
6
crawler pipeline for 'indexes' (ftp/http) with specs for where to break into submodules
#78
yarikoptic
opened
8 years ago
2
compose a helper script to touch S3 bucket files needing ETag recomputation
#97
yarikoptic
closed
3 years ago
2