Our prompt & model should identify claims worth checking from a video, but should also indicate how checkworthy it is. One form of the prompt (for example) labels each claim as one of:
"not worth checking", "worth checking", "may be worth checking"
These labels should be reflected in the front end by grouping the claims by that "summary" label so that the claims "worth checking" are shown at the top, clearly demarkated (a text label and/or maybe a background colour?); then the "may be worth checking" as a separate block below; and finally the "not worth checking" at the bottom.
The on-screen labels should reflect our caveats; e.g. in order: "mostly likely to be worth checking", "may be worth checking" and "less likely to be worth checking" for these three labels.
Requirements
[ ] make this prompt the default so that newly-processed videos also get the "summary" label
[ ] group claims by this label and sort the groups from most to least checkworthy down the screen
[ ] choose and add suitable text labels for each group
Notes and additional information
Later on, we might have more than 3 groups, poss. even a continuous score. But let's see how this crude 'traffic light' sorting works first.
Overview
Our prompt & model should identify claims worth checking from a video, but should also indicate how checkworthy it is. One form of the prompt (for example) labels each claim as one of: "not worth checking", "worth checking", "may be worth checking"
These labels should be reflected in the front end by grouping the claims by that "summary" label so that the claims "worth checking" are shown at the top, clearly demarkated (a text label and/or maybe a background colour?); then the "may be worth checking" as a separate block below; and finally the "not worth checking" at the bottom.
The on-screen labels should reflect our caveats; e.g. in order: "mostly likely to be worth checking", "may be worth checking" and "less likely to be worth checking" for these three labels.
Requirements
Notes and additional information
Later on, we might have more than 3 groups, poss. even a continuous score. But let's see how this crude 'traffic light' sorting works first.