646e62 / case-brief

Generates a FIRAC-style case brief from a reported decision
GNU General Public License v3.0
2 stars 0 forks source link

Remove citations or replace with short forms #18

Closed 646e62 closed 1 year ago

646e62 commented 1 year ago

Citations are generally not semantically important for text summarization, though they may be useful for weight analysis, relational values, identifying legal tests, etc. Citations are important when they are used as labels for legal tests (eg, "Grant test", "Stinchcombe obligation", "WD analysis", and so forth).

This issue will be fully resolved once I've trained a model that can distinguish these citation references from their comparatively superfluous counterparts. An interim (and possibly good enough) solution will simply detect citations inside texts and exclude them from the text that ultimately gets submitted to GPT for summarization.

646e62 commented 1 year ago

GPT-3.5's inexpensiveness makes removing citations less a priority than it used to be. The program is currently set up to run certain legal analyses based on whether it detects certain citations.