Enca - guess and convert encoding of text files

Copyright

Copyright (C) 2000-2003 David Necas (Yeti) <yeti@physics.muni.cz>

Copyright (C) 2009-2016 Michal Cihar <michal@cihar.com>

Description

Enca (Extremely Naive Charset Analyser) consists of two main components:

Installation

Enca should compile and work on every POSIX.1 compliant system with an ISO C compiler, and it actually compiles on many noncompliant systems too (see below for the list of dependencies). If you have any of the following additional tools, Enca can use them as external converters:

Optional features:

Don't even try to compile Enca on a system that does not support the following ISO C and POSIX features:

For the impatient: Run

./configure
make
make check
make install

as usual.
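
If you would rather not install system-wide, the same steps can be wrapped in a small script. The following is only a sketch: it assumes Enca's configure script honours the standard autoconf `--prefix` option, and the `PREFIX` and `DRY_RUN` variables and the `run` and `build_enca` helpers are illustrative names, not part of Enca.

```shell
#!/bin/sh
# Sketch: build and install Enca under a user-chosen prefix.
# PREFIX, DRY_RUN, run and build_enca are hypothetical names used for
# illustration only; they are not provided by Enca.
PREFIX="${PREFIX:-$HOME/.local}"
DRY_RUN="${DRY_RUN:-1}"

run() {
    # Print the command; execute it only when DRY_RUN is 0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

build_enca() {
    # Same four steps as above; stop at the first one that fails.
    run ./configure --prefix="$PREFIX" &&
    run make &&
    run make check &&
    run make install
}
```

Calling `build_enca` from the unpacked source tree prints the four commands; setting `DRY_RUN=0` beforehand actually runs them.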

License

Enca can be copied and/or modified under the terms of version 2 of the GNU General Public License. Please see COPYING for details.

Web resources

Enca can be found at https://github.com/nijel/enca/; tarballs can be downloaded from https://cihar.com/software/enca/.

Bugs

Report problems at https://github.com/nijel/enca/issues. Some known bugs are collected in the BUGS section of the enca manual page.

Hacking (with) Enca

Please see TODO for the list of things that should be fixed and features to be implemented, along with their priorities, and for the list of things that definitely shouldn't be implemented.

The file DEVELOP.md describes what needs to be done to add a new encoding or language to Enca.

The directory devel-docs/html contains the Enca library API documentation in HTML form.