i40-Tools / CPSDocumentGenerator

A Java-based tool that generates a Gold Standard for AutomationML files. It takes the semantic heterogeneity of AutomationML files into account and produces a Gold Standard for them. The final output can be used to measure precision and recall for integrated AML files.

Randomize splitting of documents #20

Open igrangel opened 7 years ago

igrangel commented 7 years ago

How can you ensure here that elements are randomly generated?

omarrana commented 7 years ago

Elements are randomly generated once at the start, and then they are just appended to seed.aml, seed-0, and seed-1. The code linked above just randomly splits a few items into one of the files so that they differ a bit more.

igrangel commented 7 years ago

The idea to implement would be the following. The algorithm should randomly take elements and create conflicts, i.e., newly added elements in seed-0 or seed-1. Whenever such elements are created, the Gold Standard should state that they are 0, since they will be in one seed but not in the other.
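The idea above can be sketched in Java as follows. This is only an illustration under stated assumptions, not the code in this repository: the class `ConflictInjector`, the method `inject`, the `conflict-N` identifiers, and the use of 1 for elements present in both seeds are all hypothetical. Elements are represented as plain strings instead of AutomationML nodes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Random;

// Hypothetical sketch: names and the 1/0 labelling scheme are assumptions.
public class ConflictInjector {

    /** Holds the two seed element lists and the gold-standard labels. */
    public static final class Result {
        public final List<String> seed0 = new ArrayList<>();
        public final List<String> seed1 = new ArrayList<>();
        // element id -> 1 if present in both seeds (assumed), 0 if in only one
        public final Map<String, Integer> goldStandard = new LinkedHashMap<>();
    }

    /**
     * Copies the shared elements into both seeds, then adds `conflicts` new
     * elements, each to exactly one randomly chosen seed, labelled 0.
     */
    public static Result inject(List<String> sharedElements, int conflicts, long seed) {
        Random rnd = new Random(seed); // fixed seed keeps runs reproducible
        Result r = new Result();
        for (String e : sharedElements) {
            r.seed0.add(e);
            r.seed1.add(e);
            r.goldStandard.put(e, 1);
        }
        for (int i = 0; i < conflicts; i++) {
            String id = "conflict-" + i; // hypothetical id for a new element
            if (rnd.nextBoolean()) {
                r.seed0.add(id);
            } else {
                r.seed1.add(id);
            }
            r.goldStandard.put(id, 0); // present in one seed but not the other
        }
        return r;
    }

    public static void main(String[] args) {
        Result r = inject(Arrays.asList("a", "b", "c"), 2, 42L);
        System.out.println("seed-0: " + r.seed0);
        System.out.println("seed-1: " + r.seed1);
        System.out.println("gold standard: " + r.goldStandard);
    }
}
```

Recording the 0 label at the moment the conflicting element is created keeps the Gold Standard and the seeds in sync, which is the point raised in the comment above.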

omarrana commented 7 years ago

Yes, the generation of new elements in seed-0 or seed-1 with a 0 value in the GoldStandard is not implemented. So technically we don't need to tell the GoldStandard they are 0, since the GoldStandard takes its values from seed.aml. We just add some new random values to seed-1 and seed-0. Is that really what we are looking for?

omarrana commented 7 years ago

close

igrangel commented 7 years ago

Which distribution is this random generation based on? Please point to the line of code.
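The thread does not say which distribution the tool uses. As a point of reference, here is a minimal sketch of a split whose distribution is explicit: a seeded `java.util.Random` drives `Collections.shuffle` and `nextInt(2)`, so each element goes to either file with equal probability (a uniform choice). The class `SeededSplit` and the method `split` are hypothetical names, not identifiers from this repository.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Hypothetical sketch of a reproducible, uniformly random split.
public class SeededSplit {

    /** Returns two lists; each element lands in one of them with probability 1/2. */
    public static List<List<String>> split(List<String> elements, long seed) {
        Random rnd = new Random(seed); // same seed -> same split
        List<String> shuffled = new ArrayList<>(elements);
        Collections.shuffle(shuffled, rnd); // randomize the order of elements

        List<String> first = new ArrayList<>();
        List<String> second = new ArrayList<>();
        for (String e : shuffled) {
            // nextInt(2) yields 0 or 1 with (approximately) equal probability
            if (rnd.nextInt(2) == 0) {
                first.add(e);
            } else {
                second.add(e);
            }
        }
        return Arrays.asList(first, second);
    }

    public static void main(String[] args) {
        List<List<String>> parts = split(Arrays.asList("a", "b", "c", "d", "e"), 1L);
        System.out.println("seed-0 only: " + parts.get(0));
        System.out.println("seed-1 only: " + parts.get(1));
    }
}
```

Passing the seed in as a parameter makes it possible to point at one line of code that defines the randomness and to regenerate exactly the same files later.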