VirtualFlyBrain / VFB_neo4j

A python package for writing schema-compliant content to VFB neo4J DBs
Apache License 2.0

Non Breaking Space unicode needs handling #279

Open Robbie1977 opened 8 months ago

Robbie1977 commented 8 months ago

The Unicode non-breaking space (U+00A0, a.k.a. \xa0) is breaking the pub load, causing this error:

/opt/VFB_neo4j/src/uk/ac/ebi/vfb/neo4j/neo4j_tools.py:148: UserWarning: {'code': 'Neo.ClientError.Statement.SyntaxError', 'message': 'Invalid input \'x\': expected \'\\\', \'\'\', \'"\', \'b\', \'f\', \'n\', \'r\', \'t\', UTF16 or UTF32 (line 1, column 223 (offset: 222))\n"MERGE (n:pub:Individual { iri: \'http://flybase.org/reports/FBrf0244422\' }) set n.short_form = \'FBrf0244422\' set n:Entity SET n.title = [\'zfh2 controls progenitor cell activation and differentiation in the adult Drosophila\\xa0intestinal absorptive lineage.\'] SET n.miniref = [\'Rojas Villa et al., 2019, PLoS Genet. 15(12): e1008553\'] SET n.year = [\'2019\'] SET n.pages = [\'e1008553\'] SET n.volume = [\'15\'] SET n.label = "Rojas Villa et al., 2019, PLoS Genet. 15(12): e1008553" SET n.self_xref = [\'FlyBase\'] "\n                                                                                                                                                                                                                               ^'}

dosumis commented 8 months ago

Surprising. The whole stack should be Unicode-compliant, and I'm sure we have many other examples of special characters in author names.
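
One reading of the error message above: the statement appears to contain the four literal characters \xa0 (backslash, x, a, 0) rather than the non-breaking space itself, and the parser lists only \\, \', \", \b, \f, \n, \r, \t and UTF16/UTF32 escapes as valid after a backslash. That would explain why other special characters load fine while this one fails. A minimal sketch of a workaround follows, assuming the title reaches the loader as a Python str. The helper name clean_for_cypher is hypothetical and is not part of VFB_neo4j; where the escaping actually happens in the pipeline is not known from this thread.

```python
import re

NBSP = "\u00a0"  # Unicode non-breaking space

def clean_for_cypher(text: str) -> str:
    """Hypothetical helper: tidy a string before embedding it in a Cypher literal.

    Assumption: the input may contain either a real non-breaking space
    or the literal four-character sequence backslash-x-a-0.
    """
    # Decode literal \xNN sequences (backslash, 'x', two hex digits) into
    # the character they stand for; Cypher does not accept \x escapes.
    text = re.sub(
        r"\\x([0-9a-fA-F]{2})",
        lambda m: chr(int(m.group(1), 16)),
        text,
    )
    # Replace non-breaking spaces with ordinary spaces.
    return text.replace(NBSP, " ")


if __name__ == "__main__":
    escaped = "adult Drosophila\\xa0intestinal absorptive lineage."
    print(clean_for_cypher(escaped))
```

Passing values as query parameters instead of interpolating them into the statement text would likely avoid this class of problem altogether, but that is a larger change than the sketch above.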