sv99 / lsdreader

decompile Lingvo LSD dictionary to DSL
MIT License
93 stars 18 forks source link

Error: 'ascii' codec can't decode byte 0xd0 in position 15: ordinal not in range(128) #3

Closed pantlmn closed 7 years ago

pantlmn commented 8 years ago

Can't run it on UniversalLaRu_abrv.lsd.

Get this kind of errors:

Unpacking dict: /some-dir/UniversalLaRu_abrv.lsd
Header:
    Magic:             LingVo
    Checksume:         0xee97b857
    Version:           0x13 (0x131001)
    Entries:           303
    AnnotationOffset:  0xcd
    DictionaryEncoderOffset: 0xd2
    ArticlesOffset:    0x4bd4
    Pages start:       0x51fe
    Source language:   1049 Russian
    Target language:   1049 Russian
Name:                  Abbrev
First heading:         *
Last heading:          юр.
Capitals:              *123ABCDEFGHIJLMNOPQRSTVАБВГДЕЗИКЛМНОПРСТУФЧШЭЮ
Pages end:             0x5bfe
Overlay data:          0x5c02
Pages count:           5
Icon enable:           False
Decoder:               SystemDictionaryDecoder13
    ArticleSymbols:    131
    HeadingSymbols:    81
LenTable:              Articles
    Count:             131
    bitsPerLen:        5
    IdxBitSize:        8
LenTable:              Headings
    Count:             81
    bitsPerLen:        4
    IdxBitSize:        7
LenTable:              PrefixLengths
    Count:             6
    bitsPerLen:        3
    IdxBitSize:        3
LenTable:              PostfixLengths
    Count:             16
    bitsPerLen:        4
    IdxBitSize:        5
Overlay:
    EntriesCount:      0
/some-dir/lsdreader/lingvoreader/lsdfile.py:335: UnicodeWarning: Unicode unequal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
  if path != "":
Error: 'ascii' codec can't decode byte 0xd0 in position 15: ordinal not in range(128)
sv99 commented 8 years ago

Python version??

Python 2.7.10 (default, Oct 23 2015, 19:19:21) 
[GCC 4.2.1 Compatible Apple LLVM 7.0.0 (clang-700.0.59.5)] on darwin
Unpacking dict: testdata/x3/UniversalLaRu_abrv.lsd
reading dictionary..
decoding overlay..
decoding headings: 303
decoding articles: 303
OK
Header:
    Magic:             LingVo
    Checksume:         0xee97b857
    Version:           0x13 (0x131001)
    Entries:           303
    AnnotationOffset:  0xcd
    DictionaryEncoderOffset: 0xd2
    ArticlesOffset:    0x4bd4
    Pages start:       0x51fe
    Source language:   1049 Russian
    Target language:   1049 Russian
Name:                  Abbrev
First heading:         *
Last heading:          юр.
Capitals:              *123ABCDEFGHIJLMNOPQRSTVАБВГДЕЗИКЛМНОПРСТУФЧШЭЮ
Pages end:             0x5bfe
Overlay data:          0x5c02
Pages count:           5
Icon enable:           False
Decoder:               SystemDictionaryDecoder13
    ArticleSymbols:    131
    HeadingSymbols:    81
LenTable:              Articles
    Count:             131
    bitsPerLen:        5
    IdxBitSize:        8
LenTable:              Headings
    Count:             81
    bitsPerLen:        4
    IdxBitSize:        7
LenTable:              PrefixLengths
    Count:             6
    bitsPerLen:        3
    IdxBitSize:        3
LenTable:              PostfixLengths
    Count:             16
    bitsPerLen:        4
    IdxBitSize:        5
Overlay:
    EntriesCount:      0
Write annotation: testdata/x3/UniversalLaRu_abrv.ann
Write dsl:        testdata/x3/UniversalLaRu_abrv.dsl
Write prefix:     testdata/x3/UniversalLaRu_abrv.pref
Unpack OK (0.33 sec)
sv99 commented 8 years ago

try english only path

pantlmn commented 8 years ago
$ python
Python 2.7.11 (default, Jan 21 2016, 02:02:54)
[GCC 4.2.1 Compatible Apple LLVM 5.1 (clang-503.0.40)] on darwin
pantlmn commented 8 years ago

Oh that was the problem — Russian letters in the path. Works OK, thanks.