AmyOlex / Chrono

Parsing time normalizations from text.
GNU General Public License v3.0
15 stars 4 forks source link

i2b2 file formatting inconsistency #103

Open AmyOlex opened 4 years ago

AmyOlex commented 4 years ago

Parsing /Users/alolex/Desktop/CCTR_Git_Repos/Chrono/i2b2_train/511.xml.txt ... In get DocTime. Admit Date: Report Status :

In get DocTime. Discharge Date: 01/23/2003

Traceback (most recent call last): File "Chrono.py", line 163, in doctime = utils.getDocTime(infiles[f], i2b2=True) File "/Users/alolex/Desktop/CCTR_Git_Repos/Chrono/Chrono/utils.py", line 136, in getDocTime return(dateutil.parser.parse(lines[1])) File "/Users/alolex/anaconda3/lib/python3.7/site-packages/dateutil/parser/_parser.py", line 1374, in parse return DEFAULTPARSER.parse(timestr, **kwargs) File "/Users/alolex/anaconda3/lib/python3.7/site-packages/dateutil/parser/_parser.py", line 649, in parse raise ParserError("Unknown string format: %s", timestr) dateutil.parser._parser.ParserError: Unknown string format: Report Status :