datalad / datalad-crawler

DataLad extension for tracking web resources as datasets
http://datalad.org

datalad crawl: Changing behaviour between HCP900/1200 #48

Open TobiasKadelka opened 5 years ago

TobiasKadelka commented 5 years ago

At the moment I am trying datalad-crawler on a single subject. At first I used "HCP" as the prefix (for HCP_500), ran `datalad crawl`, and saved. After that I changed the prefix value in crawl.cfg to HCP_900, ran `datalad crawl` again, and it worked. But when I now change the prefix to HCP_1200, `datalad crawl` fails with an error. (Also, when I switch back and forth between 900 and 1200 and rerun `datalad crawl`, the error message changes with it.)
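For reference, a sketch of that sequence (the `sed` one-liners are just a stand-in for however you edit the prefix in crawl.cfg by hand; the dataset is assumed to already be set up with the simple_s3 template shown below):

```sh
# Reproduction sketch: switch the crawl prefix and re-run the crawler.
sed -i 's|^_prefix = HCP/|_prefix = HCP_900/|' .datalad/crawl/crawl.cfg
datalad crawl    # works
sed -i 's|^_prefix = HCP_900/|_prefix = HCP_1200/|' .datalad/crawl/crawl.cfg
datalad crawl    # fails with the UnboundLocalError shown below
```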

crawl.cfg:

```
(datalad) tkadelka@brainb02 in ~/hcp_test/123420 on git:master ❱ cat .datalad/crawl/crawl.cfg
[crawl:pipeline]
template = simple_s3
_prefix = HCP_1200/123420/
_bucket = hcp-openaccess
_to_http = False
_skip_problematic = False
```
`datalad --dbg crawl` for HCP_900:

```
(datalad) tkadelka@brainb02 in ~/hcp_test/123420 on git:master ❱ datalad --dbg crawl
[INFO ] Loading pipeline specification from ./.datalad/crawl/crawl.cfg
[INFO ] Creating a pipeline for the hcp-openaccess bucket
[INFO ] Running pipeline [, switch(default=None, key='datalad_action', mapping=<<{'commit': >, re=False)]
[INFO ] S3 session: Connecting to the bucket hcp-openaccess with authentication
[INFO ] Finished running pipeline: skipped: 16446
[INFO ] Total stats: skipped: 16446, Datasets crawled: 1
Exception ignored in:
```
`datalad --dbg crawl` for HCP_1200:

```
(datalad) tkadelka@brainb02 in ~/hcp_test/123420 on git:master ❱ datalad --dbg crawl
[INFO ] Loading pipeline specification from ./.datalad/crawl/crawl.cfg
[INFO ] Creating a pipeline for the hcp-openaccess bucket
[INFO ] Running pipeline [, switch(default=None, key='datalad_action', mapping=<<{'commit': >, re=False)]
[INFO ] S3 session: Connecting to the bucket hcp-openaccess with authentication
Traceback (most recent call last):
  File "/home/tkadelka/env/datalad/bin/datalad", line 8, in <module>
    main()
  File "/home/tkadelka/env/datalad/datalad/datalad/cmdline/main.py", line 500, in main
    ret = cmdlineargs.func(cmdlineargs)
  File "/home/tkadelka/env/datalad/datalad/datalad/interface/base.py", line 643, in call_from_parser
    ret = cls.__call__(**kwargs)
  File "/home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/crawl.py", line 130, in __call__
    output = run_pipeline(pipeline, stats=stats)
  File "/home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/pipeline.py", line 114, in run_pipeline
    output = list(xrun_pipeline(*args, **kwargs))
  File "/home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/pipeline.py", line 194, in xrun_pipeline
    for idata_out, data_out in enumerate(xrun_pipeline_steps(pipeline, data_in, output=output_sub)):
  File "/home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/pipeline.py", line 270, in xrun_pipeline_steps
    for data_ in data_in_to_loop:
  File "/home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/nodes/s3.py", line 187, in __call__
    versions_sorted = versions_sorted[start:]
UnboundLocalError: local variable 'start' referenced before assignment
> /home/tkadelka/env/datalad/datalad-crawler/datalad_crawler/nodes/s3.py(187)__call__()
-> versions_sorted = versions_sorted[start:]
(Pdb)
```
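The crash is at s3.py line 187, where `start` appears to be bound only when the previously crawled version is found in the listing. A minimal illustrative sketch of that failure pattern (not the actual datalad-crawler source; the function and attribute names here are hypothetical):

```python
# Hypothetical reduction of the pattern behind the UnboundLocalError;
# the real logic lives in datalad_crawler/nodes/s3.py.
def resume_after(versions_sorted, last_crawled_version_id):
    for i, version in enumerate(versions_sorted):
        if version.version_id == last_crawled_version_id:
            start = i + 1  # 'start' only gets bound when a match is found
            break
    # After switching _prefix (e.g. HCP_900 -> HCP_1200), the version id
    # recorded under the old prefix never matches any key under the new
    # one, the loop falls through without assigning 'start', and this
    # line raises UnboundLocalError -- as in the traceback above.
    return versions_sorted[start:]
```

If that reading is right, it would also explain why the error follows the prefix: whichever prefix was crawled last leaves behind a recorded version that the other prefix's listing cannot match.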
yarikoptic commented 5 years ago

Script it and try again, but run `git rm -rf .datalad/crawl/versions && git commit -m "killing the version history"` between switches. That would be the right thing to do, though it might lead to some other issues. Otherwise you might miss some files: e.g., if there are changes to HCP/ AFTER the initial change to HCP_900 for that subject, then your crawl of HCP_900 will pick up only from the date when the changes to HCP/ happened, and thus might completely miss files added/changed in HCP_900 before that date (that is why I was thinking about doing it all via branches).
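A sketch of that suggested reset between prefix switches (assuming you are at the top of the crawled dataset and are fine with discarding the recorded version history):

```sh
# Drop the crawler's recorded S3 version state before re-crawling under
# a new prefix, so the pipeline starts fresh instead of resuming from
# versions recorded for the old prefix.
git rm -rf .datalad/crawl/versions
git commit -m "killing the version history"
# then point crawl.cfg at the new prefix and re-run:
datalad crawl
```

This trades the stale resume point for a full re-crawl; as noted above, it sidesteps the mismatch but may surface other issues.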