Closed evdokim closed 10 years ago
@evdokim
Ok, in other words:
For each dataset like GO, uniprot etc there shoul be parser that from files with data create abstract model (Map[String, AtributeValue]
).
Next there will be single Writer(the same for each dataset) that takes abstract model and save it to the database
Thanks to it: hadling additional dataset require only writing specific parser.
Correct?
yes something like this. I think that writer can even don't know about dataset, it should have only maps with items together with tables names
@evdokim Exactly it should operate on map(which is something like intermediate representation). Really similar concept could be found inside architecture of compilers.
yes quite similar to IL
I'm going to clarify things that I we were talking during the last meeting.
Map[String, AttributeValue]
.DynamoDBWriter
will take that maps and do real uploading of data. It can be implemented in a different ways: straightforward synchronous writing, local blocking queue + working threads, SQS queue + worker instances, ....