Open alberskib opened 10 years ago
@bio4j/dynamograph Please take a look into Roadmap and express your opinion. Do you think that presented plan is reasonable I mean it contains too much aims or not enough? I added UniProtKB and UniRef as additional data types but if you think that there is some better data I will replace it.
@alberskib I just added checkboxes in the current period, so that the current progress is more visual, check please what is already more or less done.
Looks good in general :)
I'd add aws resource management in general (create and destroy tables, autoscaling groups etc). About the dataset uniprot and all that is maybe too much, and I think that refseq could be more interesting, also for seeing how a mixed dynamo/s3 solution performs (refseq includes a lot of seq data with the need for range access).
@eparejatobes By aws resource management you mean creation code that will provide such functionality or manually do such thing? Mixing dynamoDb with s3 seems extremely interesting so I definitiely will handle this dataset. I sligthly modify RoadMap.
If you know any other datasets that should be handled please let me know (generally if you suggest modification of selected datasets).
@alberskib I mean code of course, like what we talk about during our previous meeting. @evdokim can probably show you some examples
We're taking the midterm evaluation as an opportunity for refining and updating this. Some comments about it:
19.05 : 08.06 - First iteration GO:
Steps:
Artifacts:
09.06 : 29.06 - Second iteration ncbiTaxonomy:
Steps:
Artifacts:
30.06 : 13.07 - Third iteration RefSeq:
Steps:
Artifacts:
14.07 : 27.07 - Fourth iteration UniRef: // or futher work on steps/artifacts from previous iteration
Steps:
Artifacts:
28.07 : 10.08 - Fifth iteration:
Steps:
Artifacts:
11.08 : 18.08 - Final delivery/release
Steps:
Artifcats:
Each iteraton also focus on code quality(includes refactoring etc).