aws / studio-lab-examples

Example notebooks for working with SageMaker Studio Lab. Sign up for an account at the link below!
https://studiolab.sagemaker.aws
Apache License 2.0
644 stars 188 forks source link

missing netbase prevents certain data download methods #9

Closed adswa closed 2 years ago

adswa commented 2 years ago

Apologies in advance if this repository is not the right place for a request like this, and many thanks for Sagemaker Studio Lab! I usually retrieve data via git-annex, which is a very convenient way to install datasets and retrieve portions of it on demand. It allows me to install huge datasets, often many TB large, often directly by cloning a GitHub repository, but only retrieve individual files or drop data that I have processed already. I use it as part of the datalad package, which allows me to do the data retrieval in a python session as part of my scripts.

Basic file retrieval with git-annex fails so far:

(studiolab) studio-lab-user@default:~/datatest/machinelearning-books$  git annex get B.Efron_T.Hastie-Computer_Age_Statistical_Inference.pdf 
get B.Efron_T.Hastie-Computer_Age_Statistical_Inference.pdf (from web...) 

  https://web.stanford.edu/~hastie/CASI_files/PDF/casi.pdf download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp)

  downloading from all 1 known url(s) failed

  Unable to access these remotes: web

  Maybe add some of these git remotes (git remote add ...):
        d5f231e1-6901-456a-9398-39299242baf6 -- mih@meiner:/tmp/machinelearning-books

  (Note that these git remotes have annex-ignore set: origin)
failed
get: 1 failed

The cause of the failure lies in ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp); I believe this is because netbase isn't installed and /etc/protocols thus doesn't exist.

(studiolab) studio-lab-user@default:~/datatest/machinelearning-books$ apt-cache policy netbase
netbase:
  Installed: (none)
  Candidate: (none)
  Version table:

Is there a way to have it installed, or a solution I have missed in the documentation so far? Thanks in advance!

EmilyWebber commented 2 years ago

Hi - thanks for trying out Studio Lab! We'll look into this and get back to you shortly.

EmilyWebber commented 2 years ago

Wow, I haven't worked with DataLad before, and it looks really nice. Could you send me a link of the lines you're trying to run start to finish, maybe even in a notebook, so I can replicate the issue and root cause it? Thanks.

adswa commented 2 years ago

Sure, and thanks a lot for looking into this! Here is a short script from a notebook with a Python kernel, including installation:

# installation and set up
conda install -c conda-forge datalad
!git config --global --add user.name "Adina Wagner"
!git config --global --add user.email "adina.wagner@t-online.de"
# import
import datalad.api as dl
# clone a "superdataset" with many datasets underneath
dl.clone('https://github.com/dandi/dandisets')
ds = dl.Dataset('/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets')
# install a single dataset from the collection but without getting data, just to browse its files
ds.get(path='000003', get_data=False)
!ls 'dandisets/000003'
# get a directory (3 files)
ds.get('000003/sub-YutaMouse20')
Here is the full output, including the trace back of the current failure: ```py [1] conda install -c conda-forge datalad Collecting package metadata (current_repodata.json): done Solving environment: done ==> WARNING: A newer version of conda exists. <== current version: 4.10.3 latest version: 4.11.0 Please update conda by running $ conda update -n base conda ## Package Plan ## environment location: /home/studio-lab-user/.conda/envs/default added / updated specs: - datalad The following packages will be downloaded: package | build ---------------------------|----------------- brotlipy-0.7.0 |py39h3811e60_1003 342 KB conda-forge certifi-2021.10.8 | py39hf3d152e_1 145 KB conda-forge charset-normalizer-2.0.8 | pyhd8ed1ab_0 34 KB conda-forge colorama-0.4.4 | pyh9f0ad1d_0 18 KB conda-forge idna-3.1 | pyhd3deb0d_0 52 KB conda-forge openssl-1.1.1l | h7f98852_0 2.1 MB conda-forge pyopenssl-21.0.0 | pyhd8ed1ab_0 48 KB conda-forge pysocks-1.7.1 | py39hf3d152e_4 28 KB conda-forge python-3.9.7 |hb7a2778_3_cpython 27.5 MB conda-forge requests-2.26.0 | pyhd8ed1ab_1 52 KB conda-forge urllib3-1.26.7 | pyhd8ed1ab_0 100 KB conda-forge ------------------------------------------------------------ Total: 30.4 MB The following NEW packages will be INSTALLED: annexremote conda-forge/linux-64::annexremote-1.5.0-py39hf3d152e_0 appdirs conda-forge/noarch::appdirs-1.4.4-pyh9f0ad1d_0 boto conda-forge/noarch::boto-2.49.0-py_0 brotlipy conda-forge/linux-64::brotlipy-0.7.0-py39h3811e60_1003 bzip2 conda-forge/linux-64::bzip2-1.0.8-h7f98852_4 c-ares conda-forge/linux-64::c-ares-1.18.1-h7f98852_0 certifi conda-forge/linux-64::certifi-2021.10.8-py39hf3d152e_1 chardet conda-forge/linux-64::chardet-4.0.0-py39hf3d152e_2 charset-normalizer conda-forge/noarch::charset-normalizer-2.0.8-pyhd8ed1ab_0 colorama conda-forge/noarch::colorama-0.4.4-pyh9f0ad1d_0 cryptography conda-forge/linux-64::cryptography-3.4.8-py39h95dcef6_1 curl conda-forge/linux-64::curl-7.80.0-h2574ce0_0 datalad conda-forge/linux-64::datalad-0.15.3-py39hf3d152e_0 dbus conda-forge/linux-64::dbus-1.13.6-h48d8840_2 deprecated conda-forge/noarch::deprecated-1.2.13-pyh6c4a22f_0 distro conda-forge/noarch::distro-1.6.0-pyhd8ed1ab_0 exifread conda-forge/noarch::exifread-2.3.2-pyhd8ed1ab_0 expat conda-forge/linux-64::expat-2.4.1-h9c3ff4c_0 fasteners conda-forge/noarch::fasteners-0.16-pyhd8ed1ab_0 freetype conda-forge/linux-64::freetype-2.10.4-h0708190_1 future conda-forge/linux-64::future-0.18.2-py39hf3d152e_4 gettext conda-forge/linux-64::gettext-0.19.8.1-h73d1719_1008 git conda-forge/linux-64::git-2.34.1-pl5321hc30692c_0 git-annex conda-forge/linux-64::git-annex-8.20211123-nodep_h1234567_0 glib conda-forge/linux-64::glib-2.70.1-h780b84a_0 glib-tools conda-forge/linux-64::glib-tools-2.70.1-h780b84a_0 humanize conda-forge/noarch::humanize-3.13.1-pyhd8ed1ab_0 idna conda-forge/noarch::idna-3.1-pyhd3deb0d_0 importlib_metadata conda-forge/noarch::importlib_metadata-4.8.2-hd8ed1ab_0 iso8601 conda-forge/noarch::iso8601-1.0.2-pyhd8ed1ab_0 jbig conda-forge/linux-64::jbig-2.1-h7f98852_2003 jeepney conda-forge/noarch::jeepney-0.7.1-pyhd8ed1ab_0 jpeg conda-forge/linux-64::jpeg-9d-h36c2ea0_0 keyring conda-forge/linux-64::keyring-23.4.0-py39hf3d152e_0 keyrings.alt conda-forge/noarch::keyrings.alt-4.0.2-pyhd3deb0d_0 krb5 conda-forge/linux-64::krb5-1.19.2-hcc1bbae_3 lcms2 conda-forge/linux-64::lcms2-2.12-hddcbb42_0 lerc conda-forge/linux-64::lerc-3.0-h9c3ff4c_0 libcurl conda-forge/linux-64::libcurl-7.80.0-h2574ce0_0 libdeflate conda-forge/linux-64::libdeflate-1.8-h7f98852_0 libedit conda-forge/linux-64::libedit-3.1.20191231-he28a2e2_2 libev conda-forge/linux-64::libev-4.33-h516909a_1 libglib conda-forge/linux-64::libglib-2.70.1-h174f98d_0 libiconv conda-forge/linux-64::libiconv-1.16-h516909a_0 libnghttp2 conda-forge/linux-64::libnghttp2-1.43.0-h812cca2_1 libpng conda-forge/linux-64::libpng-1.6.37-h21135ba_2 libssh2 conda-forge/linux-64::libssh2-1.10.0-ha56f1ee_2 libtiff conda-forge/linux-64::libtiff-4.3.0-h6f004c6_2 libwebp-base conda-forge/linux-64::libwebp-base-1.2.1-h7f98852_0 lz4-c conda-forge/linux-64::lz4-c-1.9.3-h9c3ff4c_1 monotonic conda-forge/noarch::monotonic-1.5-py_0 msgpack-python conda-forge/linux-64::msgpack-python-1.0.3-py39h1a9c180_0 mutagen conda-forge/noarch::mutagen-1.45.1-pyhd8ed1ab_0 olefile conda-forge/noarch::olefile-0.46-pyh9f0ad1d_1 openjpeg conda-forge/linux-64::openjpeg-2.4.0-hb52868f_1 p7zip conda-forge/linux-64::p7zip-16.02-he1b5a44_1000 patool conda-forge/linux-64::patool-1.12-py39hf3d152e_1004 pcre conda-forge/linux-64::pcre-8.45-h9c3ff4c_0 pcre2 conda-forge/linux-64::pcre2-10.37-h032f7d1_0 perl conda-forge/linux-64::perl-5.32.1-1_h7f98852_perl5 pillow conda-forge/linux-64::pillow-8.4.0-py39ha612740_0 psutil conda-forge/linux-64::psutil-5.8.0-py39h3811e60_2 pygithub conda-forge/noarch::pygithub-1.55-pyh6c4a22f_0 pyjwt conda-forge/noarch::pyjwt-2.3.0-pyhd8ed1ab_0 pynacl conda-forge/linux-64::pynacl-1.4.0-py39h3811e60_3 pyopenssl conda-forge/noarch::pyopenssl-21.0.0-pyhd8ed1ab_0 pyperclip conda-forge/noarch::pyperclip-1.8.2-pyhd8ed1ab_2 pysocks conda-forge/linux-64::pysocks-1.7.1-py39hf3d152e_4 python-gitlab conda-forge/noarch::python-gitlab-2.10.1-pyhd8ed1ab_0 requests conda-forge/noarch::requests-2.26.0-pyhd8ed1ab_1 requests-ftp conda-forge/noarch::requests-ftp-0.3.1-py_1 requests-toolbelt conda-forge/noarch::requests-toolbelt-0.9.1-py_0 secretstorage conda-forge/linux-64::secretstorage-3.3.1-py39hf3d152e_1 simplejson conda-forge/linux-64::simplejson-3.17.6-py39h3811e60_0 tqdm conda-forge/noarch::tqdm-4.62.3-pyhd8ed1ab_0 urllib3 conda-forge/noarch::urllib3-1.26.7-pyhd8ed1ab_0 whoosh conda-forge/linux-64::whoosh-2.7.4-py39hf3d152e_5 wrapt conda-forge/linux-64::wrapt-1.13.3-py39h3811e60_1 zstd conda-forge/linux-64::zstd-1.5.0-ha95c52a_0 The following packages will be DOWNGRADED: openssl 3.0.0-h7f98852_2 --> 1.1.1l-h7f98852_0 python 3.9.7-hf930737_3_cpython --> 3.9.7-hb7a2778_3_cpython Downloading and Extracting Packages requests-2.26.0 | 52 KB | ##################################### | 100% pysocks-1.7.1 | 28 KB | ##################################### | 100% urllib3-1.26.7 | 100 KB | ##################################### | 100% pyopenssl-21.0.0 | 48 KB | ##################################### | 100% charset-normalizer-2 | 34 KB | ##################################### | 100% openssl-1.1.1l | 2.1 MB | ##################################### | 100% python-3.9.7 | 27.5 MB | ##################################### | 100% idna-3.1 | 52 KB | ##################################### | 100% colorama-0.4.4 | 18 KB | ##################################### | 100% brotlipy-0.7.0 | 342 KB | ##################################### | 100% certifi-2021.10.8 | 145 KB | ##################################### | 100% Preparing transaction: done Verifying transaction: done Executing transaction: | ############################################################################## # # # Standalone distribution of git-annex was installed, instead of the # # standard distribution, likely due to package conflicts in the target # # environment. The standalone distribution may have issues (e.g. be slower, # # or not pass the expected environment to some external programs); # # the standard distribution should be used when possible. # # You can force installation of the standard version by adding =alldep* to # # the build string of the package specification, e.g. # # # # conda install -c conda-forge git-annex=*=alldep* # # # # However, this might cause an older git-annex version to be installed, # # if later versions' dependencies conflict with other packages # # in the target environment. # # # # For more info on the standalone git-annex distribution see # # https://git-annex.branchable.com/install/Linux_standalone/ # # # ############################################################################## done Note: you may need to restart the kernel to use updated packages. [2] !git config --global --add user.name "Adina Wagner" !git config --global --add user.email "adina.wagner@t-online.de" [3] import datalad.api as dl dl.clone('https://github.com/dandi/dandisets') ds = dl.Dataset('/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets') [4] ds.get(path='000003', get_data=False) [{'action': 'get', 'type': 'dataset', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003', 'status': 'ok', 'contains': [PosixPath('/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003')], 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets'}] [5] !ls 'dandisets/000003' dandiset.yaml sub-YutaMouse38 sub-YutaMouse44 sub-YutaMouse56 sub-YutaMouse20 sub-YutaMouse39 sub-YutaMouse45 sub-YutaMouse57 sub-YutaMouse23 sub-YutaMouse40 sub-YutaMouse51 sub-YutaMouse33 sub-YutaMouse41 sub-YutaMouse54 sub-YutaMouse37 sub-YutaMouse42 sub-YutaMouse55 [6] ds.get('000003/sub-YutaMouse20') --------------------------------------------------------------------------- IncompleteResultsError Traceback (most recent call last) /tmp/ipykernel_3342/1819280635.py in ----> 1 ds.get('000003/sub-YutaMouse20') ~/.conda/envs/default/lib/python3.9/site-packages/datalad/distribution/dataset.py in apply_func(wrapped, instance, args, kwargs) 482 elif i >= ds_index: 483 kwargs[orig_pos[i+1]] = args[i] --> 484 return f(**kwargs) 485 486 setattr(Dataset, name, apply_func(f)) ~/.conda/envs/default/lib/python3.9/site-packages/datalad/interface/utils.py in eval_func(wrapped, instance, args, kwargs) 482 483 lgr.log(2, "Returning return_func from eval_func for %s", wrapped_class) --> 484 return return_func(generator_func)(*args, **kwargs) 485 486 ret = eval_func(func) ~/.conda/envs/default/lib/python3.9/site-packages/datalad/interface/utils.py in return_func(wrapped_, instance_, args_, kwargs_) 474 # unwind generator if there is one, this actually runs 475 # any processing --> 476 results = list(results) 477 if return_type == 'item-or-list' and \ 478 len(results) < 2: ~/.conda/envs/default/lib/python3.9/site-packages/datalad/interface/utils.py in generator_func(*_args, **_kwargs) 459 460 if incomplete_results: --> 461 raise IncompleteResultsError( 462 failed=incomplete_results, 463 msg="Command did not complete successfully") IncompleteResultsError: Command did not complete successfully. 6 failed: [{'action': 'get', 'annexkey': 'SHA256E-s11694713998--37445940b3c895ae65cf943d7352ef73db884093f9944b17ee72e912218cde06.nwb', 'error_message': 'https://dandiarchive.s3.amazonaws.com/blobs/bb8/1f7/bb81f7b3-4cfa-40e7-aa89-95beb1954d8c?versionId=F33RzmXlfGyL4rcwBIBenrW2eqDSr4qZ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/05a80228-04a7-4c3b-88d3-44a0c6b831b1/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/bb8/1f7/bb81f7b3-4cfa-40e7-aa89-95beb1954d8c?versionId=F33RzmXlfGyL4rcwBIBenrW2eqDSr4qZ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/05a80228-04a7-4c3b-88d3-44a0c6b831b1/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/bb8/1f7/bb81f7b3-4cfa-40e7-aa89-95beb1954d8c?versionId=F33RzmXlfGyL4rcwBIBenrW2eqDSr4qZ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/05a80228-04a7-4c3b-88d3-44a0c6b831b1/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140321_behavior+ecephys.nwb', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'error', 'type': 'file'}, {'action': 'get', 'annexkey': 'SHA256E-s15090018364--8aad2311db629a91f60dc5dd15bc2ddb06217e97957b5388e60485ab34b57138.nwb', 'error_message': 'https://dandiarchive.s3.amazonaws.com/blobs/b1c/be7/b1cbe7d9-d3b0-40ea-b53f-3429627e56f2?versionId=_ggbXE1e8XXRniaW4iBfEJdGBqPetDcG ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/948a507c-fd65-4e84-a7e6-a87b1e3ac706/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/b1c/be7/b1cbe7d9-d3b0-40ea-b53f-3429627e56f2?versionId=_ggbXE1e8XXRniaW4iBfEJdGBqPetDcG ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/948a507c-fd65-4e84-a7e6-a87b1e3ac706/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/b1c/be7/b1cbe7d9-d3b0-40ea-b53f-3429627e56f2?versionId=_ggbXE1e8XXRniaW4iBfEJdGBqPetDcG ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/948a507c-fd65-4e84-a7e6-a87b1e3ac706/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140324_behavior+ecephys.nwb', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'error', 'type': 'file'}, {'action': 'get', 'annexkey': 'SHA256E-s17250047255--32baa37a985fd9451e6e6a8b602cfd8f866315a442ff213e952f5085a7bc1173.nwb', 'error_message': 'https://dandiarchive.s3.amazonaws.com/blobs/de0/967/de096722-f12a-460d-a572-354092a36e8f?versionId=6.XW5Gzs1cQdol3SciuHWicQLHD0Ni21 ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/006380d2-c5a6-4394-afe9-1361c91f56db/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/de0/967/de096722-f12a-460d-a572-354092a36e8f?versionId=6.XW5Gzs1cQdol3SciuHWicQLHD0Ni21 ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/006380d2-c5a6-4394-afe9-1361c91f56db/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/de0/967/de096722-f12a-460d-a572-354092a36e8f?versionId=6.XW5Gzs1cQdol3SciuHWicQLHD0Ni21 ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/006380d2-c5a6-4394-afe9-1361c91f56db/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140325_behavior+ecephys.nwb', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'error', 'type': 'file'}, {'action': 'get', 'annexkey': 'SHA256E-s8417855886--918e541a92b934d7543acc92981ed0425eadc0d24dcabf9d8e5b44924dbb1046.nwb', 'error_message': 'https://dandiarchive.s3.amazonaws.com/blobs/58c/537/58c53789-eec4-4080-ad3b-207cf2a1cac9?versionId=3n1DKIXCIRuiY_nl72A.30zT7QlQGXXF ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/4112d7b0-071e-4193-bd30-73d52e7576c6/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/58c/537/58c53789-eec4-4080-ad3b-207cf2a1cac9?versionId=3n1DKIXCIRuiY_nl72A.30zT7QlQGXXF ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/4112d7b0-071e-4193-bd30-73d52e7576c6/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/58c/537/58c53789-eec4-4080-ad3b-207cf2a1cac9?versionId=3n1DKIXCIRuiY_nl72A.30zT7QlQGXXF ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/4112d7b0-071e-4193-bd30-73d52e7576c6/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140327_behavior+ecephys.nwb', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'error', 'type': 'file'}, {'action': 'get', 'annexkey': 'SHA256E-s18364048064--927a3c6e19cea5fefd3f7eb73aba1ce39cf8b0446d3008f5c4cc7ec57544f129.nwb', 'error_message': 'https://dandiarchive.s3.amazonaws.com/blobs/191/a42/191a42b2-1245-45ee-b664-1ff160a01ba9?versionId=8JM0JFJvf43BjyyphYN9GZCazLRSSTyX ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/8538db7e-b726-417c-b26d-74b6d5c0b640/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/191/a42/191a42b2-1245-45ee-b664-1ff160a01ba9?versionId=8JM0JFJvf43BjyyphYN9GZCazLRSSTyX ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/8538db7e-b726-417c-b26d-74b6d5c0b640/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed\n' 'https://dandiarchive.s3.amazonaws.com/blobs/191/a42/191a42b2-1245-45ee-b664-1ff160a01ba9?versionId=8JM0JFJvf43BjyyphYN9GZCazLRSSTyX ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'https://api.dandiarchive.org/api/assets/8538db7e-b726-417c-b26d-74b6d5c0b640/download/ ' 'download failed: ConnectionFailure ' 'Network.BSD.getProtocolByName: does not exist (no such ' 'protocol name: tcp)\n' 'downloading from all 2 known url(s) failed', 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140328_behavior+ecephys.nwb', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'error', 'type': 'file'}, {'action': 'get', 'message': ('could not get some content in %s %s', '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20', ['/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140321_behavior+ecephys.nwb', '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140324_behavior+ecephys.nwb', '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140325_behavior+ecephys.nwb', '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140327_behavior+ecephys.nwb', '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140328_behavior+ecephys.nwb']), 'path': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets/000003/sub-YutaMouse20', 'refds': '/home/studio-lab-user/sagemaker-studiolab-notebooks/dandisets', 'status': 'impossible', 'type': 'directory'}] ```

You could do the same thing in a terminal, and get more debug output by using the git-annex calls directly:

(installation would be the same as above if not done already)
datalad clone https://github.com/dandi/dandisets.git
cd dandisets && datalad get -n 000003
ls 000003
cd 000003
git annex -dbg get sub-YutaMouse20
Output: ```sh (studiolab) studio-lab-user@default:~$ datalad clone https://github.com/dandi/dandisets.git [INFO ] Remote origin not usable by git-annex; setting annex-ignore [INFO ] https://github.com/dandi/dandisets.git/config download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) [INFO ] access to 1 dataset sibling dandi-dandisets-dropbox not auto-enabled, enable with: | datalad siblings -d "/home/studio-lab-user/dandisets" enable -s dandi-dandisets-dropbox install(ok): /home/studio-lab-user/dandisets (dataset) (studiolab) studio-lab-user@default:~$ cd dandisets && datalad get -n 000003 [INFO ] Remote origin not usable by git-annex; setting annex-ignore [INFO ] https://github.com/dandisets/000003.git/config download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) [INFO ] access to 1 dataset sibling dandi-dandisets-dropbox not auto-enabled, enable with: | datalad siblings -d "/home/studio-lab-user/dandisets/000003" enable -s dandi-dandisets-dropbox install(ok): /home/studio-lab-user/dandisets/000003 (dataset) [Installed subdataset in order to get /home/studio-lab-user/dandisets/000003] (studiolab) studio-lab-user@default:~/dandisets$ ls 000003 dandiset.yaml sub-YutaMouse33 sub-YutaMouse39 sub-YutaMouse42 sub-YutaMouse51 sub-YutaMouse56 sub-YutaMouse20 sub-YutaMouse37 sub-YutaMouse40 sub-YutaMouse44 sub-YutaMouse54 sub-YutaMouse57 sub-YutaMouse23 sub-YutaMouse38 sub-YutaMouse41 sub-YutaMouse45 sub-YutaMouse55 (studiolab) studio-lab-user@default:~/dandisets$ cd 000003 (studiolab) studio-lab-user@default:~/dandisets/000003$ git annex -dbg get sub-YutaMouse20 [2021-12-04 08:38:14.007374579] (Utility.Process) process [6213] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","symbolic-ref","-q","HEAD"] [2021-12-04 08:38:14.010746629] (Utility.Process) process [6213] done ExitSuccess [2021-12-04 08:38:14.011015593] (Utility.Process) process [6214] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","show-ref","refs/heads/draft"] [2021-12-04 08:38:14.014677366] (Utility.Process) process [6214] done ExitSuccess [2021-12-04 08:38:14.015065041] (Utility.Process) process [6215] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","ls-files","--stage","-z","--","sub-YutaMouse20"] [2021-12-04 08:38:14.015965618] (Utility.Process) process [6216] chat: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","cat-file","--batch-check=%(objectname) %(objecttype) %(objectsize)","--buffer"] [2021-12-04 08:38:14.016429655] (Utility.Process) process [6217] chat: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","cat-file","--batch=%(objectname) %(objecttype) %(objectsize)","--buffer"] [2021-12-04 08:38:14.017061836] (Utility.Process) process [6218] chat: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","cat-file","--batch=%(objectname) %(objecttype) %(objectsize)","--buffer"] get sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140321_behavior+ecephys.nwb [2021-12-04 08:38:14.022783612] (Utility.Process) process [6219] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","show-ref","git-annex"] [2021-12-04 08:38:14.026330478] (Utility.Process) process [6219] done ExitSuccess [2021-12-04 08:38:14.0267013] (Utility.Process) process [6220] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","show-ref","--hash","refs/heads/git-annex"] [2021-12-04 08:38:14.030144113] (Utility.Process) process [6220] done ExitSuccess [2021-12-04 08:38:14.030513548] (Utility.Process) process [6221] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","log","refs/heads/git-annex..0106cc2611bf3d0419076d703f6c7788cc2cb323","--pretty=%H","-n1"] [2021-12-04 08:38:14.034127665] (Utility.Process) process [6221] done ExitSuccess [2021-12-04 08:38:14.034366555] (Utility.Process) process [6222] read: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","log","refs/heads/git-annex..14d863fc1b4856c9d2893d28cd81ff34ce1a0df8","--pretty=%H","-n1"] [2021-12-04 08:38:14.038086401] (Utility.Process) process [6222] done ExitSuccess [2021-12-04 08:38:14.04460337] (Utility.Process) process [6223] chat: git ["--git-dir=.git","--work-tree=.","--literal-pathspecs","-c","annex.debug=true","cat-file","--batch"] (from web...) [2021-12-04 08:38:14.065914718] (Utility.Url) Request { host = "api.dandiarchive.org" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/api/assets/05a80228-04a7-4c3b-88d3-44a0c6b831b1/download/" queryString = "" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } [2021-12-04 08:38:14.067077002] (Utility.Url) Request { host = "dandiarchive.s3.amazonaws.com" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/blobs/bb8/1f7/bb81f7b3-4cfa-40e7-aa89-95beb1954d8c" queryString = "?versionId=F33RzmXlfGyL4rcwBIBenrW2eqDSr4qZ" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } https://dandiarchive.s3.amazonaws.com/blobs/bb8/1f7/bb81f7b3-4cfa-40e7-aa89-95beb1954d8c?versionId=F33RzmXlfGyL4rcwBIBenrW2eqDSr4qZ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) https://api.dandiarchive.org/api/assets/05a80228-04a7-4c3b-88d3-44a0c6b831b1/download/ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) downloading from all 2 known url(s) failed Unable to access these remotes: web Maybe enable some of these special remotes (git annex initremote ...): 727f466f-60c3-4778-90b2-b2332856c2f8 -- dandi-dandisets-dropbox (Note that these git remotes have annex-ignore set: origin) failed get sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140324_behavior+ecephys.nwb (from web...) [2021-12-04 08:38:14.071279713] (Utility.Url) Request { host = "api.dandiarchive.org" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/api/assets/948a507c-fd65-4e84-a7e6-a87b1e3ac706/download/" queryString = "" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } [2021-12-04 08:38:14.071662554] (Utility.Url) Request { host = "dandiarchive.s3.amazonaws.com" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/blobs/b1c/be7/b1cbe7d9-d3b0-40ea-b53f-3429627e56f2" queryString = "?versionId=_ggbXE1e8XXRniaW4iBfEJdGBqPetDcG" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } https://dandiarchive.s3.amazonaws.com/blobs/b1c/be7/b1cbe7d9-d3b0-40ea-b53f-3429627e56f2?versionId=_ggbXE1e8XXRniaW4iBfEJdGBqPetDcG download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) https://api.dandiarchive.org/api/assets/948a507c-fd65-4e84-a7e6-a87b1e3ac706/download/ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) downloading from all 2 known url(s) failed Unable to access these remotes: web Maybe enable some of these special remotes (git annex initremote ...): 727f466f-60c3-4778-90b2-b2332856c2f8 -- dandi-dandisets-dropbox (Note that these git remotes have annex-ignore set: origin) failed get sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140325_behavior+ecephys.nwb (from web...) [2021-12-04 08:38:14.073198382] (Utility.Url) Request { host = "api.dandiarchive.org" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/api/assets/006380d2-c5a6-4394-afe9-1361c91f56db/download/" queryString = "" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } [2021-12-04 08:38:14.073548637] (Utility.Url) Request { host = "dandiarchive.s3.amazonaws.com" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/blobs/de0/967/de096722-f12a-460d-a572-354092a36e8f" queryString = "?versionId=6.XW5Gzs1cQdol3SciuHWicQLHD0Ni21" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } https://dandiarchive.s3.amazonaws.com/blobs/de0/967/de096722-f12a-460d-a572-354092a36e8f?versionId=6.XW5Gzs1cQdol3SciuHWicQLHD0Ni21 download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) https://api.dandiarchive.org/api/assets/006380d2-c5a6-4394-afe9-1361c91f56db/download/ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) downloading from all 2 known url(s) failed Unable to access these remotes: web Maybe enable some of these special remotes (git annex initremote ...): 727f466f-60c3-4778-90b2-b2332856c2f8 -- dandi-dandisets-dropbox (Note that these git remotes have annex-ignore set: origin) failed get sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140327_behavior+ecephys.nwb (from web...) [2021-12-04 08:38:14.075075466] (Utility.Url) Request { host = "api.dandiarchive.org" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/api/assets/4112d7b0-071e-4193-bd30-73d52e7576c6/download/" queryString = "" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } [2021-12-04 08:38:14.075351715] (Utility.Url) Request { host = "dandiarchive.s3.amazonaws.com" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/blobs/58c/537/58c53789-eec4-4080-ad3b-207cf2a1cac9" queryString = "?versionId=3n1DKIXCIRuiY_nl72A.30zT7QlQGXXF" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } https://dandiarchive.s3.amazonaws.com/blobs/58c/537/58c53789-eec4-4080-ad3b-207cf2a1cac9?versionId=3n1DKIXCIRuiY_nl72A.30zT7QlQGXXF download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) https://api.dandiarchive.org/api/assets/4112d7b0-071e-4193-bd30-73d52e7576c6/download/ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) downloading from all 2 known url(s) failed Unable to access these remotes: web Maybe enable some of these special remotes (git annex initremote ...): 727f466f-60c3-4778-90b2-b2332856c2f8 -- dandi-dandisets-dropbox (Note that these git remotes have annex-ignore set: origin) failed get sub-YutaMouse20/sub-YutaMouse20_ses-YutaMouse20-140328_behavior+ecephys.nwb (from web...) [2021-12-04 08:38:14.076752813] (Utility.Url) Request { host = "api.dandiarchive.org" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/api/assets/8538db7e-b726-417c-b26d-74b6d5c0b640/download/" queryString = "" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } [2021-12-04 08:38:14.07704376] (Utility.Url) Request { host = "dandiarchive.s3.amazonaws.com" port = 443 secure = True requestHeaders = [("Accept-Encoding","identity"),("User-Agent","git-annex/8.20211118-g23ee48898")] path = "/blobs/191/a42/191a42b2-1245-45ee-b664-1ff160a01ba9" queryString = "?versionId=8JM0JFJvf43BjyyphYN9GZCazLRSSTyX" method = "GET" proxy = Nothing rawBody = False redirectCount = 10 responseTimeout = ResponseTimeoutDefault requestVersion = HTTP/1.1 } https://dandiarchive.s3.amazonaws.com/blobs/191/a42/191a42b2-1245-45ee-b664-1ff160a01ba9?versionId=8JM0JFJvf43BjyyphYN9GZCazLRSSTyX download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) https://api.dandiarchive.org/api/assets/8538db7e-b726-417c-b26d-74b6d5c0b640/download/ download failed: ConnectionFailure Network.BSD.getProtocolByName: does not exist (no such protocol name: tcp) downloading from all 2 known url(s) failed Unable to access these remotes: web Maybe enable some of these special remotes (git annex initremote ...): 727f466f-60c3-4778-90b2-b2332856c2f8 -- dandi-dandisets-dropbox (Note that these git remotes have annex-ignore set: origin) failed [2021-12-04 08:38:14.07767133] (Utility.Process) process [6218] done ExitSuccess [2021-12-04 08:38:14.077721664] (Utility.Process) process [6217] done ExitSuccess [2021-12-04 08:38:14.077757014] (Utility.Process) process [6216] done ExitSuccess [2021-12-04 08:38:14.077786862] (Utility.Process) process [6215] done ExitSuccess [2021-12-04 08:38:14.078016471] (Utility.Process) process [6223] done ExitSuccess get: 5 failed (studiolab) studio-lab-user@default:~/dandisets/000003$ ``` Hope this helps - please let me know if I forgot to include relevant information!
adswa commented 2 years ago

Sorry, I accidentally just took an example with unnecessarily big data (the three files are each 10GB in size). A smaller example dataset would be https://github.com/datalad-datasets/machinelearning-books.

datalad clone  https://github.com/datalad-datasets/machinelearning-books.git
cd machine-learning-books
datalad get A.Shashua-Introduction_to_Machine_Learning.pdf
import datalad.api as dl
dl.clone('https://github.com/datalad-datasets/machinelearning-books.git')
ds = dl.Dataset('machinelearning-books')
ds.get('A.Shashua-Introduction_to_Machine_Learning.pdf')
EmilyWebber commented 2 years ago

Confirmed! I'm getting the same error - please standby.

EmilyWebber commented 2 years ago

Closing this for now - I've created a ticket with the team. Are you still blocked on this?

adswa commented 2 years ago

Closing this for now - I've created a ticket with the team. Are you still blocked on this?

Sadly, yes, but thanks for creating a ticket.