Closed michael-kotliar closed 2 years ago
These things are hardcoded. What if we want to use hg19 or even something else?
# Genomic resource constants
blacklist_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bed")
blacklist_bigwig_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bw")
chrom_sizes_path = os.path.join(os.path.dirname(__file__), "../../data/hg38.chrom.sizes")
These things are hardcoded. What if we want to use hg19 or even something else?
# Genomic resource constants blacklist_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bed") blacklist_bigwig_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bw") chrom_sizes_path = os.path.join(os.path.dirname(__file__), "../../data/hg38.chrom.sizes")
These files are hardcoded as the default options that are referenced by the argparser if none are provided. It is probably best to have the user specify these with each run instead of defaulting to a specific reference.
Couple comments on original task
- [ ] Repository is too big. We will need to remove all files that were added here by mistake. Removing files would not reduce the size. Repository has to be rebased too!
- [ ] If possible add tests and set up some Continuous Integration (CI) CI means deploying it somewhere or doing something after test. So KISS limit this to tests only.
These things are hardcoded. What if we want to use hg19 or even something else?
# Genomic resource constants blacklist_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bed") blacklist_bigwig_path = os.path.join(os.path.dirname(__file__), "../../data/hg38_maxatac_blacklist.bw") chrom_sizes_path = os.path.join(os.path.dirname(__file__), "../../data/hg38.chrom.sizes")
These files are hardcoded as the default options that are referenced by the argparser if none are provided. It is probably best to have the user specify these with each run instead of defaulting to a specific reference.
Ok, then for now it's better to keep them in a separate repository and include it as a submodule in the main one. This will allow us to separate the code and data and do not create a new commit on each change in the data files.
We tested a few architectures earlier, but have been sticking with using a dilated convolutional neural network (/GitHub/maxATAC/maxatac/architectures). Is it ok to delete the other archs for publication? (@emiraldi, @dlab-arp)
@michael-kotliar , we have the same dir here: /GitHub/maxATAC/packaging/scripts and /Users/war9qi/Documents/GitHub/maxATAC/data/scripts, which one should we delete?
@emiraldi should we remove all quantitative options from maxATAC?
For the public codebase, I would remove the quantitative options, but we will want to build on them in the future.
On Wed, Oct 13, 2021 at 10:47 AM FaizRizvi @.***> wrote:
@emiraldi https://github.com/emiraldi should we remove all quantitative options from maxATAC?
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/MiraldiLab/maxATAC/pull/62#issuecomment-942381816, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACUJXVOGTSTUXNSGULW7DEDUGWLYHANCNFSM5FKLCVPQ . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.
-- Emily Miraldi, Ph.D. Assistant Professor Divisions of Immunobiology and Biomedical Informatics Cincinnati Children's Hospital
@emiraldi should we remove all quantitative options from maxATAC?
@michael-kotliar what is the best way to save this work that includes quantitative work and other archs for future development private?
Questions to answer
Things to consider adding to TODO list https://github.com/MiraldiLab/maxATAC/projects/1
author
andauthor_email
insetup.py
docs
with https://readthedocs.org from (need to be public repo for that)