lifthrasiir / rust-encoding

Character encoding support for Rust
MIT License
284 stars 59 forks

Implement missing encodings #18

Open lifthrasiir opened 10 years ago

lifthrasiir commented 10 years ago

This is a master list for important missing encodings. What is considered "important" is a delicate question, but for now I have the following list:


WHATWG multibyte encodings

Non-WHATWG multibyte encodings of special interest

Required for completeness

lifthrasiir commented 10 years ago

As of b5bdc62d2582388f043600bb07efc75280146260 all WHATWG encodings have been implemented. Not all encodings are fully verified though.

zeld commented 8 years ago

Do you have any interest in supporting EBCDIC encodings, maybe behind a feature, since they are not the most commonly used? I know that Java provides built-in support for a limited set, while iconv and icu offer more extensive support. If so, I could help, for example by providing tables in a syntax similar to https://encoding.spec.whatwg.org/index-iso-8859-15.txt
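For illustration only, here is a minimal sketch of how tables in that index syntax could be read. It assumes each data line holds a decimal pointer, a `0x`-prefixed code point, and a free-form description separated by whitespace, with `#` starting a comment line. The name `parse_index` is hypothetical and is not part of rust-encoding.

```rust
use std::collections::HashMap;

/// Parse text in the WHATWG-style index syntax into a pointer -> code point map.
/// Hypothetical helper for illustration; not an API of rust-encoding.
fn parse_index(text: &str) -> HashMap<usize, u32> {
    let mut table = HashMap::new();
    for line in text.lines() {
        let line = line.trim();
        // Skip blank lines and comment lines.
        if line.is_empty() || line.starts_with('#') {
            continue;
        }
        let mut fields = line.split_whitespace();
        // First field: the pointer (index into the table), in decimal.
        let pointer = fields.next().and_then(|p| p.parse::<usize>().ok());
        // Second field: the code point, written as 0xNNNN.
        let code_point = fields
            .next()
            .and_then(|c| c.strip_prefix("0x"))
            .and_then(|h| u32::from_str_radix(h, 16).ok());
        // Lines that do not match the assumed layout are ignored.
        if let (Some(p), Some(c)) = (pointer, code_point) {
            table.insert(p, c);
        }
    }
    table
}
```

Calling `parse_index` on the contents of such a file would give a lookup table from pointer to Unicode code point. An EBCDIC table in the same syntax could presumably be loaded the same way, although how the pointers map onto byte values would have to be defined per code page.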

ssokolow commented 7 years ago

Would you be willing to consider supporting IBM code page 437? (The original DOS/IBM PC codepage)

So far, the only Rust support I've been able to find is a read-only crate named cp437 which doesn't inspire confidence. (The heading correctly says "cp437", but then it's typo'd as "cp537" in the very first non-heading line of the README and there's no unit test badge.)

I ask because I have a Python script for generating batch file menus for DOSBox and my retro PC (which means I need to encode box-drawing characters to cp437) and it'd be nice if I could port it to Rust to get more compile-time correctness enforcement and easier distribution to others.
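To make the use case concrete, here is a rough sketch of the kind of encoding step I mean. It covers only printable ASCII, CR/LF, and a handful of box-drawing characters, so it is nowhere near a full cp437 table. The function names `cp437_byte` and `encode_cp437` are made up for this example and do not exist in rust-encoding or the cp437 crate.

```rust
/// Map one char to its cp437 byte.
/// Partial table for illustration: printable ASCII, CR/LF, and a few box-drawing characters.
fn cp437_byte(c: char) -> Option<u8> {
    match c {
        ' '..='~' | '\r' | '\n' => Some(c as u8),
        // Single-line box drawing
        '\u{2500}' => Some(0xC4), // horizontal line
        '\u{2502}' => Some(0xB3), // vertical line
        '\u{250C}' => Some(0xDA), // top-left corner
        '\u{2510}' => Some(0xBF), // top-right corner
        '\u{2514}' => Some(0xC0), // bottom-left corner
        '\u{2518}' => Some(0xD9), // bottom-right corner
        // Double-line box drawing
        '\u{2550}' => Some(0xCD), // horizontal line
        '\u{2551}' => Some(0xBA), // vertical line
        '\u{2554}' => Some(0xC9), // top-left corner
        '\u{2557}' => Some(0xBB), // top-right corner
        '\u{255A}' => Some(0xC8), // bottom-left corner
        '\u{255D}' => Some(0xBC), // bottom-right corner
        _ => None,
    }
}

/// Encode a string to cp437 bytes, returning the first unmappable char as the error.
fn encode_cp437(s: &str) -> Result<Vec<u8>, char> {
    s.chars().map(|c| cp437_byte(c).ok_or(c)).collect()
}
```

With this, `encode_cp437("\u{250C}\u{2500}\u{2510}")` yields the three bytes for the top edge of a single-line box, and any character outside the partial table is reported as an error rather than silently replaced.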