igematberkeley / NLPChemExtractor

3 stars 0 forks source link

Training data cleanup #4

Open mrunalimanj opened 3 years ago

mrunalimanj commented 3 years ago

from files in training_data_csvs, we'd like to remove all "extraneous" sentences (data cleaning):

removing references what else? add it here. @jierui-cell I think you wanted to work on this? Add any updates/asks here :)