nviet / lingoes-converter

Convert *.LD2 dictionaries format into human-readable text files
http://windylea.com
BSD 4-Clause "Original" or "Old" License
64 stars 21 forks source link

Decompression failed at around 8000 bytes #1

Open yangchenyun opened 10 years ago

yangchenyun commented 10 years ago

Hi, Thanks for work of the converter. I encountered some issues to convert *.ld2 dictionary. Most of the attempts stopped at around 8000 bytes.

I also tried the encoding options, but the result will remain at the same offset and bytes.

You could find the dictionary I used to test here and the result is below.

Lingoes Converter v0.1
> by WindyLea

---
+ Input file: Shogakukan Chinese-Japanese Dictionary.ld2
+ Output file (Optional):
+ Entry word encoding (Optional / Default is UTF-8):
+ Entry definition encoding (Optional / Default is UTF-16LE):

Converting...
* Error: Decompression failed at offset 0x695fd0 (tried to uncompress 7749 bytes of data)

# Execution time: 5.41 (s)
# Memory usage: 1536 KB
# Peak memory usage: 237568 KB

---
+ Input file: Advanced Chinese Big Dictionary.ld2
+ Output file (Optional):
+ Entry word encoding (Optional / Default is UTF-8):
+ Entry definition encoding (Optional / Default is UTF-16LE): UTF-8

Converting...
* Error: Decompression failed at offset 0x6fb7c8 (tried to uncompress 7800 bytes of data)

# Execution time: 5.34 (s)
# Memory usage: 1792 KB
# Peak memory usage: 253696 KB

---
+ Input file: Collins COBUILD Advanced Learner's Dictionary 5th Edition.ld2
+ Output file (Optional):
+ Entry word encoding (Optional / Default is UTF-8):
+ Entry definition encoding (Optional / Default is UTF-16LE):

Converting...
* Error: Decompression failed at offset 0x695fd0 (tried to uncompress 8237 bytes of data)

# Execution time: 5.14 (s)
# Memory usage: 1536 KB
# Peak memory usage: 241920 KB
lvchakele commented 10 years ago

Hi Thank you for sharing lingoes-converter, but I have no contact PHP program, can you tell me what software you run this program, how should I do?

At 2014-07-27 03:21:49, "Steven Yang" notifications@github.com wrote:

Hi, Thanks for work of the converter. I encountered some issues to convert *.ld2 dictionary. Most of the attempts stopped at around 8000 bytes.

I also tried the encoding options, but the result will remain at the same offset and bytes.

You could find the dictionary I used to test here and the result is below.

Lingoes Converter v0.1

by WindyLea

  • Input file: Shogakukan Chinese-Japanese Dictionary.ld2
  • Output file (Optional):
  • Entry word encoding (Optional / Default is UTF-8):
  • Entry definition encoding (Optional / Default is UTF-16LE):

Converting...

Execution time: 5.41 (s)

Memory usage: 1536 KB

Peak memory usage: 237568 KB


Converting...

Execution time: 5.34 (s)

Memory usage: 1792 KB

Peak memory usage: 253696 KB


Converting...

Execution time: 5.14 (s)

Memory usage: 1536 KB

Peak memory usage: 241920 KB

— Reply to this email directly or view it on GitHub.

yangchenyun commented 10 years ago

@lvchakele I am not the author :)

I am running PHP 5.3.28 on my Mac OSX and follow the author's instruction to run the program.

simonwiles commented 9 years ago

same issue here, fwiw

nijianmo commented 7 years ago

Same issue here. Does anybody have ever resolved this issue?

DataBranner commented 5 years ago

@yangchenyun, @simonwiles, @nijianmo, @lvchakele: Have you tried GB2312 or GB18030 encodings? Not all Chinese dictionaries are encoded in Unicode.