Closed blcham closed 1 year ago
Yes, we discussed it, but I think this was generated before the discussion. I will regenerate it after text-analysis is finished
Moreover, I suggest to change columns in the output data: 1) rename column MultipleComponents to FoundComponentLabels (change of semantics here -- it should be filled in even there is only one component here) 2) rename column MultipleFailures --> FoundFailureLabels (see explanation above) 3) add column FoundComponentsCount (count of components above) 4) add column FoundFailuresCount (count of failures above) 5) add column SelectedComponentLabels (in case the score is same and we don't have rule to select one component, we return here multiple) 6) add column SelectedFailureLabels (see explanation above) 7) add column SelectedComponentsCount 8) add column SelectedFailuresCount 9) rename column ComponentScore --> SelectedComponentsScore 10) rename column FailureScore --> SelectedFailuresScore 11) remove column ComponentLabel 12) remove column FailureLabel
Ok, should I change it now? Or commit changes and finish the script and change it later?
Finish the script, regenerate data and put it to the google sheet file as new tab.
Done, it is in the new tab Regenerated raw data
here
I see this imported data from text analysis within line 3:
pls charge the cylinder bottle .
Those are the annotations:
I believe there is a mistake in selecting component "oxy bottle" because it has same score as "emergency cylinder". I believe we discussed that and decided that when we do not know to chose we will not pick any component or failure.