I get the following erro when I try to add text containing unicode characters:
Traceback (most recent call last):
File "/home/jayron/Documentos/Projeto Dengue/scholarscrap/extraction/src/dengue/pdftext.py", line 78, in <module>
Salva_PDF(lis)
File "/home/jayron/Documentos/Projeto Dengue/scholarscrap/extraction/src/dengue/pdftext.py", line 64, in Salva_PDF
s.add(**i)
File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 678, in add
return Solr.add_many(self, [fields], commit=_commit)
File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 326, in wrapper
return self._update(content, query)
File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 550, in _update
rsp = self._post(selector, request, self.xmlheaders)
File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 639, in _post
return check_response_status(self.conn.getresponse())
File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 1090, in check_response_status
raise ex
solr.core.SolrException: HTTP code=400, reason=ParseError at [row,col]:[55,1]
Message: An invalid XML character (Unicode: 0xc) was found in the element
content of the document.
Original issue reported on code.google.com by jayronso...@gmail.com on 2 Mar 2011 at 6:11
Original issue reported on code.google.com by
jayronso...@gmail.com
on 2 Mar 2011 at 6:11