WGLab / CancerVar

Clinical interpretation of somatic mutations in cancer

Unrecognized character in orpha.txt #3

Closed ftucos closed 4 years ago

ftucos commented 4 years ago

In the file orpha.txt, the byte at position 6538 (the ü in "Strümpel disease") causes the following error:

    NOTICE: Multianno output file is written to CanVar.hg19_multianno.txt
    Traceback (most recent call last):
      File "CancerVar.py", line 1885, in <module>
        main()
      File "CancerVar.py", line 1845, in main
        read_datasets()
      File "CancerVar.py", line 460, in read_datasets
        strs = fh.read()
      File "/home/tucos/miniconda3/lib/python3.7/codecs.py", line 322, in decode
        (result, consumed) = self._buffer_decode(data, self.errors, final)
    UnicodeDecodeError: 'utf-8' codec can't decode byte 0xfc in position 6538: invalid start byte

quanliustc commented 4 years ago

I have decoded/removed the offending characters. Please re-download the file.
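
For anyone who still has the old copy of orpha.txt, the error in the traceback above can be reproduced and worked around locally. The sketch below is only an illustration: `decode_with_fallback` and `read_text_with_fallback` are hypothetical helpers, not part of CancerVar, and it assumes the old file was saved in Latin-1 (ISO-8859-1), which is consistent with the byte 0xfc standing for "ü" but is not confirmed in this thread.

```python
# Hypothetical helpers, not part of CancerVar.
# Assumption: the old orpha.txt was Latin-1 encoded; in Latin-1 the byte
# 0xfc is "ü", while in UTF-8 it is never a valid start byte.

def decode_with_fallback(raw, encodings=("utf-8", "latin-1")):
    """Decode bytes by trying each encoding in order."""
    for enc in encodings:
        try:
            return raw.decode(enc)
        except UnicodeDecodeError:
            continue  # try the next candidate encoding
    # Only reachable if "latin-1" is dropped from the list, because
    # Latin-1 maps every possible byte value to a character.
    raise ValueError("none of the encodings could decode the data")


def read_text_with_fallback(path, encodings=("utf-8", "latin-1")):
    """Read a file as bytes, then decode it with decode_with_fallback."""
    with open(path, "rb") as fh:
        return decode_with_fallback(fh.read(), encodings)


# The same kind of failure as in the traceback, on a short sample.
sample = b"Str\xfcmpel disease"
try:
    sample.decode("utf-8")
except UnicodeDecodeError as exc:
    print("utf-8 failed:", exc.reason)

print(decode_with_fallback(sample))
```

Because Latin-1 accepts any byte sequence, the fallback never fails, but it will silently produce wrong characters if the file is actually in some other encoding. Re-downloading the corrected file, as suggested above, remains the simpler fix.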