cppp-reiconv
A character set conversion library based on GNU LIBICONV.
Instruction
This library provides some charset conversation function for The C++ Plus Project.
It provides support for the encodings:
- European languages
- ASCII, ISO-8859-{1,2,3,4,5,7,9,10,13,14,15,16},
- KOI8-R, KOI8-U, KOI8-RU,
- CP{1250,1251,1252,1253,1254,1257}, CP{850,866,1131},
- Mac{Roman,CentralEurope,Iceland,Croatian,Romania},
- Mac{Cyrillic,Ukraine,Greek,Turkish},
- Macintosh
- Semitic languages
- ISO-8859-{6,8}, CP{1255,1256}, CP862, Mac{Hebrew,Arabic}
- Japanese
- EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP, ISO-2022-JP-2, ISO-2022-JP-1,
- ISO-2022-JP-MS
- Chinese
- EUC-CN, HZ, GBK, CP936, GB18030, GB18030:2022, EUC-TW, BIG5, CP950,
- BIG5-HKSCS, BIG5-HKSCS:2004, BIG5-HKSCS:2001, BIG5-HKSCS:1999,
- ISO-2022-CN, ISO-2022-CN-EXT
- Korean
- EUC-KR, CP949, ISO-2022-KR, JOHAB
- Armenian
- Georgian
- Georgian-Academy, Georgian-PS
- Tajik
- Kazakh
- Thai
- ISO-8859-11, TIS-620, CP874, MacThai
- Laotian
- Vietnamese
- Platform specifics
- Full Unicode
- UTF-8
- UCS-2, UCS-2BE, UCS-2LE
- UCS-4, UCS-4BE, UCS-4LE
- UTF-16, UTF-16BE, UTF-16LE
- UTF-32, UTF-32BE, UTF-32LE
- UTF-7
- C99, JAVA
- Full Unicode, in terms of 'uint16_t' or 'uint32_t'
- (with machine dependent endianness and alignment)
- UCS-2-INTERNAL, UCS-4-INTERNAL
When configured with the option -DENABLE_EXTRA, it also provides
support for a few extra encodings:
- European languages
- CP{437,737,775,852,853,855,857,858,860,861,863,865,869,1125}
- Semitic languages
- Japanese
- EUC-JISX0213, Shift_JISX0213, ISO-2022-JP-3
- Chinese
- Turkmen
- Platform specifics
- EBCDIC compatible (not ASCII compatible, very rarely used)
- European languages
-
- IBM-{037,273,277,278,280,282,284,285,297,423,500,870,871,875,880},
-
- IBM-{905,924,1025,1026,1047,1112,1122,1123,1140,1141,1142,1143},
-
- IBM-{1144,1145,1146,1147,1148,1149,1153,1154,1155,1156,1157,1158},
-
- Semitic languages
-
- IBM-{424,425,12712,16804}
- Persian
-
- Thai
-
- Laotian
-
- Vietnamese
-
- Indic languages
-
It can convert from any of these encodings to any other, through Unicode
conversion.
cppp-reiconv is for you if your application needs to support multiple character
encodings, but that support lacks from your system.
Build and installation
We use CMake for build
-
Standard build
mkdir build
cd build
cmake .. -DCMAKE_INSTALL_PREFIX=[[PREFIX]]
cmake --build . --config=Release
cmake --install .
-
Full build
mkdir build
cd build
cmake .. -DCMAKE_INSTALL_PREFIX=[[PREFIX]] -DENABLE_EXTRA=ON
cmake --build . --config=Release
cmake --install .
When use full build, extra encodings will be enabled.
Install
This library installs:
- a shared library 'libcppp-reiconv'.
- a static library 'libcppp-reiconv.static'.
- a header file '<cppp/reiconv.hpp>'.
To use it, simply #include <cppp/reiconv.hpp> and import the lib to use the functions.
Copyright
The cppp-reiconv is under LGPLv3,
see file COPYING.
Download
https://github.com/cppp-project/cppp-reiconv/releases
Homepage
https://github.com/cppp-project/cppp-reiconv
Bug reports