leschweitzer / solrpy

Automatically exported from code.google.com/p/solrpy
0 stars 0 forks source link

No support for Unicode #25

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
I get the following erro when I try to add text containing unicode characters:

Traceback (most recent call last):
  File "/home/jayron/Documentos/Projeto Dengue/scholarscrap/extraction/src/dengue/pdftext.py", line 78, in <module>
    Salva_PDF(lis)
  File "/home/jayron/Documentos/Projeto Dengue/scholarscrap/extraction/src/dengue/pdftext.py", line 64, in Salva_PDF
    s.add(**i)
  File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 678, in add
    return Solr.add_many(self, [fields], commit=_commit)
  File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 326, in wrapper
    return self._update(content, query)
  File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 550, in _update
    rsp = self._post(selector, request, self.xmlheaders)
  File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 639, in _post
    return check_response_status(self.conn.getresponse())
  File "/usr/local/lib/python2.6/dist-packages/solrpy-0.9.3-py2.6.egg/solr/core.py", line 1090, in check_response_status
    raise ex
solr.core.SolrException: HTTP code=400, reason=ParseError at [row,col]:[55,1] 
Message: An invalid XML character (Unicode: 0xc) was found in the element 
content of the document.

Original issue reported on code.google.com by jayronso...@gmail.com on 2 Mar 2011 at 6:11