GenevieveHaliburton / project_planning

Early stage project planning space
1 stars 0 forks source link

[WIP] Using tabula muris to understand HCA use #1

Open GenevieveHaliburton opened 6 years ago

GenevieveHaliburton commented 6 years ago

Note: this is my open planning doc for this project. Any feedback/suggestions welcome! cc @freeman-lab @mckinsel @olgabot

Overall goals

Use tabula muris data and experiences to help understand how computational biologists may be interacting with the HCA, and make sure that the current approach to HCA will meet those needs

First pass goals

Understanding

Putting tabula muris data into HCA DCP Staging

Before loading any data, give a heads up to HCA collaborators to expect some rando mouse data in there!

Got source bucket for tabula muris data from Olga and bundling/upload tips and code from Marcus.

Still thinking about:

Querying tabula muris data from HCA DCP staging

FASTQs

Still thinking about: What other analyses should I try here?

olgabot commented 6 years ago

Looks exciting! For the paper, we didn't work with anything outside of gene count matrices or csvs. I'm also doing some kmer hashing to see if we can use kmer signatures to compare single cells but that's a rabbit hole I'm willing to go down

olgabot commented 6 years ago

Sharing data was definitely a struggle as people couldn't easily find or navigate our figshare project to find what they watned

ambrosejcarr commented 6 years ago

Question: What's an mm10plus genome?

I can help here:

What pre-processing and analysis steps are being done on HCA fastqs and CSVs? Not sure where to find this

Secondary Analysis is working on docs, but, the answers are in skylab. It's probably a good idea to check out the pipelines (3', SS2) and familiarize yourself with how they're specified.

If things are confusing, you can dump all such comments into #humancellatlas/mintteam or make issues on the github repository and we'll address them.

Suggestion: linking this issue to a public google doc could be a good place to start if you expect comments -- I find git's conversations unhelpfully linear.