hkkenneth closed 7 years ago
list of named entity (step1)
list of string, divided by category: Map<String, List<String>>
{ "category1": ["str1", "str2"], "category2": ["str1", "str2"] }
vocabAnnotator (step2)
Create a GATE Gazetteer?
scrubberProcessor (step2)
?
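The category-keyed shape proposed above for the step1 named entity list could look roughly like this in Java. This is only a sketch: the class `NamedEntityLists` and its methods are hypothetical names used for illustration and are not taken from texscrubber.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical holder for the step1 result; not part of texscrubber.
class NamedEntityLists {
    // Category name -> entity strings found for that category.
    // LinkedHashMap keeps categories in insertion order.
    private final Map<String, List<String>> byCategory = new LinkedHashMap<>();

    // Add one entity string under a category, creating the list on first use.
    void add(String category, String entity) {
        byCategory.computeIfAbsent(category, k -> new ArrayList<>()).add(entity);
    }

    // Expose the underlying Map<String, List<String>> for step2 consumers.
    Map<String, List<String>> asMap() {
        return byCategory;
    }
}
```

Whether vocabAnnotator and scrubberProcessor would consume this map directly or a GATE Gazetteer built from it is still the open question listed above.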
What's the file output of step1?
1 output file per person in GATE gazetteer format (*.lst)
Need a temporary folder for each patient
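A minimal sketch of the step1 file output described here: one `.lst` file per person, written into a temporary folder created for that person. It assumes a plain list file with one entry per line, and it merges all categories into a single deduplicated file because the note asks for one file per person; that merge is an assumption. `PersonGazetteerWriter` and `writeForPerson` are hypothetical names, and any index file GATE may need alongside the list is not covered.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper; not part of texscrubber.
class PersonGazetteerWriter {
    // Writes <personId>.lst into a fresh temporary directory and returns its path.
    // Assumes personId contains no path separators.
    static Path writeForPerson(String personId, Map<String, List<String>> entities)
            throws IOException {
        // One temporary folder per person/patient.
        Path dir = Files.createTempDirectory("gazetteer-" + personId + "-");

        // Assumption: all categories are merged; duplicates are dropped, order is kept.
        Set<String> entries = new LinkedHashSet<>();
        for (List<String> values : entities.values()) {
            entries.addAll(values);
        }

        // One entry per line.
        Path lst = dir.resolve(personId + ".lst");
        Files.write(lst, entries, StandardCharsets.UTF_8);
        return lst;
    }
}
```

The caller is responsible for deleting the temporary folder once the second pass for that person has finished.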
Notes: not all NERs will be two-pass
document skipping logic (i.e. error handling)
do we skip at person level or document level if the document failed in the first pass at document level?
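To make the two options concrete, here is a sketch of how each skip level would change the set of documents that reach the second pass. It does not settle the question; `SkipLogic`, `SkipLevel`, and `documentsForSecondPass` are hypothetical names, and the inputs (documents grouped by person, plus the ids that failed the first pass) are assumptions about what the pipeline has available.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration of the two skipping options; not part of texscrubber.
class SkipLogic {
    enum SkipLevel { DOCUMENT, PERSON }

    // docsByPerson: person id -> document ids.
    // failedDocs: ids of documents that failed in the first pass.
    static Map<String, List<String>> documentsForSecondPass(
            Map<String, List<String>> docsByPerson,
            Set<String> failedDocs,
            SkipLevel level) {
        Map<String, List<String>> result = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> e : docsByPerson.entrySet()) {
            List<String> kept = new ArrayList<>();
            boolean anyFailed = false;
            for (String doc : e.getValue()) {
                if (failedDocs.contains(doc)) {
                    anyFailed = true;   // failed documents are never kept
                } else {
                    kept.add(doc);
                }
            }
            // PERSON level: one failed document drops the whole person.
            if (level == SkipLevel.PERSON && anyFailed) {
                continue;
            }
            // DOCUMENT level: only the failed documents are dropped.
            result.put(e.getKey(), kept);
        }
        return result;
    }
}
```

Document-level skipping keeps more data, while person-level skipping avoids a second pass that uses a gazetteer built from an incomplete first pass.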
More notes:
Points needed to clarify:
What's the variable type expected for:
What's the file output of step1?
document skipping logic (i.e. error handling)
How to run?
/Users/kennethlui/workspace/texscrubber
will help you to find them)
java -jar build/libs/texscrubber-0.1.0.jar