G-Node / gogs

Fork of Gogs (https://github.com/gogs/gogs) "a painless self-hosted Git service" with added features for research data management
https://gin.g-node.org
MIT License

Cannot get annexed data over https ? #117

Open lucasgautheron opened 3 years ago

lucasgautheron commented 3 years ago

I am unable to datalad get annexed contents from a GIN repository.

See this repository for example: https://gin.g-node.org/lucasgautheron/test

(base) alejandrinas-MacBook-Air:test acristia$ datalad install https://gin.g-node.org/lucasgautheron/test.git
[INFO   ] Remote origin not usable by git-annex; setting annex-ignore
[INFO   ] https://gin.g-node.org/lucasgautheron/test.git/config download failed: Not Found
install(ok): /Users/acristia/Documents/Lucas/data/EL1000/test/test (dataset)
(base) alejandrinas-MacBook-Air:test acristia$ cd test
(base) alejandrinas-MacBook-Air:test acristia$ datalad get annexed_content
get(error): annexed_content (file) [not available; (Note that these git remotes have annex-ignore set: origin)]
(base) alejandrinas-MacBook-Air:test acristia$

Is this a limitation of git-annex, a limitation of GIN, or something else? Downloading data over https is especially important for us, since the French supercomputer Jean Zay does not allow cloning over ssh.

Environment

$ git --version
git version 2.30.1

$ git annex version
git-annex version: 8.20210903
build flags: Assistant Webapp Pairing FsEvents TorrentParser MagicMime Feeds Testsuite S3 WebDAV
dependency versions: aws-0.22 bloomfilter-2.0.1.0 cryptonite-0.29 DAV-1.3.4 feed-1.3.2.0 ghc-8.10.7 http-client-0.7.8 persistent-sqlite-2.13.0.3 torrent-10000.1.1 uuid-1.3.15 yesod-1.6.1.2
key/value backends: SHA256E SHA256 SHA512E SHA512 SHA224E SHA224 SHA384E SHA384 SHA3_256E SHA3_256 SHA3_512E SHA3_512 SHA3_224E SHA3_224 SHA3_384E SHA3_384 SKEIN256E SKEIN256 SKEIN512E SKEIN512 BLAKE2B256E BLAKE2B256 BLAKE2B512E BLAKE2B512 BLAKE2B160E BLAKE2B160 BLAKE2B224E BLAKE2B224 BLAKE2B384E BLAKE2B384 BLAKE2BP512E BLAKE2BP512 BLAKE2S256E BLAKE2S256 BLAKE2S160E BLAKE2S160 BLAKE2S224E BLAKE2S224 BLAKE2SP256E BLAKE2SP256 BLAKE2SP224E BLAKE2SP224 SHA1E SHA1 MD5E MD5 WORM URL X*
remote types: git gcrypt p2p S3 bup directory rsync web bittorrent webdav adb tahoe glacier ddar git-lfs httpalso borg hook external
operating system: darwin x86_64
supported repository versions: 8
upgrade supported from repository versions: 0 1 2 3 4 5 6 7
local repository version: 8

$ datalad --version
datalad 0.15.2+88.gdfa956984

effigies commented 2 years ago

@lucasgautheron For some reason, the .git suffix tells datalad (or git-annex) that the remote should be considered git-only. I was able to get this working by installing from the URL without the suffix:

$ datalad install https://gin.g-node.org/lucasgautheron/test
install(ok): /home/chris/tmp2/test (dataset)
$ cd test
$ datalad get annexed_content
get(ok): annexed_content (file) [from origin...]

FPa-riken commented 1 year ago

For those who land here with the same issue: this behavior is documented in the DataLad Handbook: https://handbook.datalad.org/en/latest/basics/101-139-gin.html#sharing-and-accessing-the-dataset
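
For anyone scripting this, here is a minimal sketch of the workaround discussed in this thread: drop the trailing .git from the GIN URL before installing. The helper names strip_git_suffix and repair_remote are hypothetical and not part of datalad or git-annex. The second helper is an untested assumption on my part: it supposes that an existing clone can be recovered by repointing the remote and clearing the annex-ignore flag reported in the log above.

```shell
#!/bin/sh

# Hypothetical helper: print the given URL without a trailing ".git".
# URLs that do not end in ".git" are printed unchanged.
strip_git_suffix() {
    printf '%s\n' "${1%.git}"
}

# Hypothetical helper (assumption, not verified against GIN): for a clone
# that was already made from the ".git" URL, point the remote at the
# suffix-free URL and clear the annex-ignore flag set on it.
# Run from inside the clone; the remote name defaults to "origin".
repair_remote() {
    remote="${1:-origin}"
    url="$(git remote get-url "$remote")" || return 1
    git remote set-url "$remote" "$(strip_git_suffix "$url")" || return 1
    # --unset exits non-zero when the key is absent; that is harmless here.
    git config --unset "remote.$remote.annex-ignore" || true
}
```

With these defined, a fresh install would look like datalad install "$(strip_git_suffix https://gin.g-node.org/lucasgautheron/test.git)", and an existing clone could be tried with repair_remote followed by datalad get annexed_content.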