Open RasmusOrsoe opened 1 year ago
If I understand correctly, do you want pattern recognition (for example regex) to analyse the string name of an i3 file (or its keys) to infer sim type, but without saving it as a column in the database? As I recall the naming convention on i3 file names can vary a lot and I am unsure if it is a well-composed problem, in that this information is always included and I am unsure whether we can include all uniqueness of a name.
Hey @MortenHolmRep! I completely agree that using file names for this is sub-optimal; we could use a more elegant approach to string parsing of file names, but it would not change that it seems like we cannot robustly rely on the file names for this.
I think the path forward would be to triple check that there indeed is no consistent frame keys we can rely on instead. If not, then we could attempt to rewrite the extractor with try/except to catch the cases that the sim_type
variables toggle between. From memory, I think only the data/mc/noise
labels are important; the rest can be removed.
Is your feature request related to a problem? Please describe. In I3TruthExtractor we rely on the variable
sim_type
to modify the extractor's behavior. The variable is inferred by the extractor in a very crude way:A more elegant solution is needed.
Describe the solution you'd like We should try to come up with a way to either remove the need for the
sim_type
variable, or a more robust way to infer it.Additional context To my knowledge, it is often that i3 files doesn't contain identifying markers that we can use for this.