WolfgangFahl / ProceedingsTitleParser

Shallow Semantic Parser to extract metadata from scientific proceedings titles
Apache License 2.0
3 stars 1 forks source link

Dealing with &#x0D in Proceeding titles #38

Closed WolfgangFahl closed 4 years ago

WolfgangFahl commented 4 years ago

SQL storage for crossref records fails due to: is the culprint here - it should be ignored

SQL error unrecognized token: "'Animal Agriculture and the Environment, National Center for Manure & Animal Waste Management White Papers&#x0D" in line 20698:
    INSERT INTO "Event_crossref" VALUES(NULL,NULL,'aae2006',NULL,NULL,'Animal Agriculture and the Environment, National Center for Manure & Animal Waste Management White Papers&#x0D
SQL error near "&": syntax error in line 20699:
    &#x0D
SQL error near "&": syntax error in line 20700:
     
',NULL,'crossref','Animal Agriculture and the Environment, National Center for Manure & Animal Waste Management White Papers&#x0D
SQL error near "&": syntax error in line 20701:
    &#x0D
SQL error near "&": syntax error in line 20702:
     
','10.13031/aae2006.2013','https://api.crossref.org/v1/works/10.13031/aae2006.2013')
SQL error unrecognized token: "'A Quality Assurance Project Plan for Monitoring Gaseous and Particulate&#x0D" in line 20722:
    INSERT INTO "Event_crossref" VALUES(NULL,NULL,'aqap2008',NULL,NULL,'A Quality Assurance Project Plan for Monitoring Gaseous and Particulate&#x0D
SQL error near "&": syntax error in line 20723:
    &#x0D
SQL error near "Matter": syntax error in line 20724:
    Matter Emissions from Broiler Housing',NULL,'crossref','A Quality Assurance Project Plan for Monitoring Gaseous and Particulate&#x0D
SQL error near "&": syntax error in line 20725:
    &#x0D
SQL error near "Matter": syntax error in line 20726:
    Matter Emissions from Broiler Housing','10.13031/aqap2008.2013','https://api.crossref.org/v1/works/10.13031/aqap2008.2013')