opencb / cellbase

High-Performance NoSQL database and RESTful web services to access to most relevant biological data
Apache License 2.0
90 stars 53 forks source link

JSON version files duplicated #442

Open julie-sullivan opened 5 years ago

julie-sullivan commented 5 years ago

Pretty sure this is because I did something wrong but I have this:

/tmp:$ find . -name gnomadVersion.json
./gnomadVersion.json
./homo_sapiens_grch37/gnomadVersion.json

I think there should only be one?

julie-sullivan commented 5 years ago

Also my code repo looks like this:

Jenkinsfile                     cellbase-core                       disgenetReadme.txt
LICENSE                         cellbase-lib                        disgenetReadme.txt.log
README.md                       cellbase-server                     download_summary.log
all_gene_disease_associations.tsv.gz            cellbase-test                       gnomad.v2.1.1.lof_metrics.by_transcript.txt.bgz
all_gene_disease_associations.tsv.gz.log        cellbase-web                        gnomad.v2.1.1.lof_metrics.by_transcript.txt.bgz.log
build                           cellbase.iml                        pom.xml
cellbase-app                        checkstyle.xml                      target

I guess in one of my download commands, I had the wrong config? But then why didn't everything download to my current location?

imedina commented 5 years ago

Just tried branch issue-386 and all looks good:

image

julie-sullivan commented 5 years ago

Nacho, can you post which command you used? Maybe that's my problem?

This is what I am doing to get the errant JSON files:

rm -rf /tmp/homo_sapiens_grch37/; build/bin/cellbase.sh download --assembly GRCh37 
--common /tmp/ --data gene,genome --output /tmp/ --species hsapiens
julie-sullivan commented 5 years ago

I also get stray download files?

~/git/cellbase (next) $ git status
On branch next
Your branch is ahead of 'upstream/next' by 4 commits.
  (use "git push" to publish your local commits)

Untracked files:
  (use "git add <file>..." to include in what will be committed)

    all_gene_disease_associations.tsv.gz
    disgenetReadme.txt
    gnomad.v2.1.1.lof_metrics.by_transcript.txt.bgz

My config is wrong somewhere!

imedina commented 5 years ago

I'm running this CLI:

http://bioinfo.hpc.cam.ac.uk/downloads/cellbase/