choderalab / assaytools

Modeling and Bayesian analysis of fluorescence and absorbance assays.
http://assaytools.readthedocs.org
GNU Lesser General Public License v2.1

Raw Data Organization #55

Open sonyahanson opened 8 years ago

sonyahanson commented 8 years ago

Probably good to change how/where we are putting the raw data. I guess I have started doing this with the assaytools/data folder, where the data is separated out by data type (singlet, spectra, etc.). Though right now there are also snippets of analysis in there, which I should move.

Also, do we want to keep the data we are analyzing for the paper separate from the data we have for other projects (Mehtap, Andrea, etc.)? Possibly in a different repository? Or is just a separate folder within assaytools/data sufficient?

sonyahanson commented 8 years ago

Or... should we not have data in this repository at all? Just have a bare minimum for tests? We were planning on putting all the raw data for the manuscript in the manuscript repository anyway...

jchodera commented 8 years ago

Here's my thinking:

sonyahanson commented 8 years ago

I think this sounds reasonable. I will shift to working in the manuscript repo for the datasets relevant to that. We can revisit if this becomes stranger than expected.

sonyahanson commented 8 years ago

I'm guessing we want to keep the files in data/full_example/, but maybe move everything to the examples directory. Maybe you can do this in your PR @jchodera ?

sonyahanson commented 8 years ago

So after discussing with @MehtapIsik, it seems like it is a good idea to make a new branch, e.g. assaytools/July2016 that could keep all the data and notebooks relatively as is, while we work on cleaning up the master branch. In the master branch we will remove/clean up any raw data and notebooks that don't serve as a minimal example.

MehtapIsik commented 8 years ago

I copied all the HSA experimental data from the examples/.../hsa folder so that it can be organized under another repository (hsa-affinity).

jchodera commented 8 years ago

> I'm guessing we want to keep the files in data/full_example/, but maybe move everything to the examples directory. Maybe you can do this in your PR @jchodera ?

Let's leave this for now since I use that data in my new implementation. We can clean it up next week.

> So after discussing with @MehtapIsik, it seems like it is a good idea to make a new branch, e.g. assaytools/July2016 that could keep all the data and notebooks relatively as is, while we work on cleaning up the master branch. In the master branch we will remove/clean up any raw data and notebooks that don't serve as a minimal example.

Sounds great!

sonyahanson commented 8 years ago

Great, Mehtap! Thanks!

sonyahanson commented 7 years ago

Just had a quick meeting with @MehtapIsik where we mapped out a better organization for the examples directory (among other things):

Also note, according to the discussion above the plan is to completely delete the data directory.

Also planning to make a branch called 'Nov2016' that will just be the repo as it currently is.

jchodera commented 7 years ago

Sounds good!

sonyahanson commented 7 years ago

Branch now made: https://github.com/choderalab/assaytools/tree/Nov2016

sonyahanson commented 7 years ago

@jchodera do you have any opinion about whether this notebook stays: https://github.com/choderalab/assaytools/blob/master/examples/ipynbs/models/competition-assay-modeling/competition-assay-modeling.ipynb

jchodera commented 7 years ago

What if we move it to https://github.com/choderalab/fluorescence-assay-manuscript in case we use a derivative of it to model some fluorescence assays?

sonyahanson commented 7 years ago

If we use a derivative to model fluorescence assays, we can just include that, no?

sonyahanson commented 7 years ago

Do you have it somewhere else? I feel like maybe you do?

jchodera commented 7 years ago

I don't believe there is a copy somewhere else.

Since it's not an example of analyzing an experimental assay, I think we should add it to https://github.com/choderalab/fluorescence-assay-manuscript, perhaps under a notebooks/ or modeling/ or figures/ directory, and delete it from here.

sonyahanson commented 7 years ago

I think it makes less sense in fluorescence-assay-manuscript than here, so I will just keep it here.

sonyahanson commented 7 years ago

I have a derivative that I have used in the past, and will add it here when we get something going for the simpler competition assay prediction. If these two notebooks are redundant, we can delete one.

jchodera commented 7 years ago

Our fluorescence assay manuscript outline contains a figure on modeling the competition assay, so I thought it was much more relevant to have this there (where we actually need to make figures depicting a modeled competition assay) than in this repo, which contains examples of real data and scripts/notebooks to analyze them. But I'm happy with whatever you think is best!

sonyahanson commented 7 years ago

We have both the modeling and analysis here right now, and I think this makes sense for testing our methods. What do you think of this: https://github.com/choderalab/assaytools/tree/data_clean/examples/probe-assay ? We can add more description in the readme about what actually happens in these notebooks:

jchodera commented 7 years ago

Looks good!

Would definitely appreciate a bit more description in the readme of the assays that are covered in more detail in the notebooks. Think of yourself as a potential user trying to figure out whether these examples are the most similar to what you are trying to do: what would you want to see?