tmills / ctakes-docker

getSentenceCoverage doesn't limit the length of sentence #23

Open jamesmasanz opened 6 years ago

jamesmasanz commented 6 years ago

If the sentence detector ends up creating a very long sentence, I2b2ReadyFileWriter.getSentenceCoverage just includes the entire "sentence". This is not useful if the "sentence" is thousands of characters long, and it can be problematic if you try to load the data into a database.

I created a JDBC writer based on I2b2ReadyFileWriter.java and used it with an Oracle database, and received the following error because the sentence was longer than the VARCHAR2 column: Caused by: java.sql.BatchUpdateException: ORA-01461: can bind a LONG value only for insert into a LONG column

Note that the column was defined as VARCHAR2, yet the error message refers to LONG.
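
One possible way to cap the covered text before it is written out is sketched below. This is only an illustration, not the project's code: the class name SentenceCoverageLimiter, the method limit, and the 4000-character cap are all hypothetical, and the cap is counted in Java chars, whereas an Oracle column sized in bytes may need a smaller value for multi-byte text.

```java
public final class SentenceCoverageLimiter {

    private SentenceCoverageLimiter() {
        // utility class, not meant to be instantiated
    }

    /**
     * Hypothetical helper: returns coveredText cut down to at most maxChars
     * Java chars. Text that is null or already short enough is returned as-is.
     */
    public static String limit(String coveredText, int maxChars) {
        if (maxChars < 0) {
            throw new IllegalArgumentException("maxChars must not be negative");
        }
        if (coveredText == null || coveredText.length() <= maxChars) {
            return coveredText;
        }
        int end = maxChars;
        // Do not cut between the two halves of a surrogate pair.
        if (end > 0 && Character.isHighSurrogate(coveredText.charAt(end - 1))) {
            end--;
        }
        return coveredText.substring(0, end);
    }

    public static void main(String[] args) {
        // Build a 5000-character stand-in for an over-long "sentence".
        String longSentence = new String(new char[5000]).replace('\0', 'x');
        // The cap of 4000 is an assumed column size, chosen for illustration.
        System.out.println(limit(longSentence, 4000).length()); // prints 4000
    }
}
```

A writer could pass the sentence text through a helper like this just before building the output row, so a runaway "sentence" never exceeds the column size.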

-- James