Open Yvonne-Han opened 4 years ago
I think "unidentified" cases will be impossible to handle with the current code. But I think in most of these cases, we could figure out whether the speaker is an analyst or with the company. I wouldn't change the call-parsing code to achieve this unless we can extract information that we're currently missing.
In table
speaker_data
, quite a few entries have missingrole
values. Some of them are missing because the speaker is an operator (which is fine).After excluding the "Operator" case, we still have quite a few entries (~8%) that have missing
role
values.The top 10 categories for
speaker_name
associated with missingrole
are listed below:I think we can at least fix some of the missing
role
values. For instance, we can easily label 4 and 9 asanalyst
and 3 ascompany employee
.