TeXworks / texworks

Main codebase for TeXworks, a simple interface for working with TeX documents
https://tug.org/texworks/
GNU General Public License v2.0

feature request: auto-detect text encoding #371

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
When collaborating, we receive files from other people, and they may use
encodings other than utf-8.  I have to deal with files in both utf-8 and
latin-1 encodings every day.

It is feasible for an editor to auto-detect the text encoding when opening
files, as Emacs does.  Encoding cookies are not friendly to new users, so they
go against the principles of TeXworks.  TeXworks should therefore be able to
auto-detect the encoding.
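
To illustrate the idea for the utf-8 versus latin-1 case, here is a minimal sketch in Python.  It is not TeXworks code, and the names `detect_encoding` and `read_text` are hypothetical.  It relies on the assumption that text in latin-1 containing non-ASCII characters almost never forms valid utf-8, so a strict utf-8 decode that succeeds is taken as utf-8 and anything else falls back to latin-1.

```python
def detect_encoding(data: bytes) -> str:
    """Guess whether a byte string is utf-8 or latin-1 (hypothetical helper)."""
    # A utf-8 byte order mark is an unambiguous signal.
    if data.startswith(b"\xef\xbb\xbf"):
        return "utf-8-sig"
    try:
        # Strict decoding raises on any byte sequence that is not valid utf-8.
        data.decode("utf-8", errors="strict")
        return "utf-8"
    except UnicodeDecodeError:
        # Every byte is a valid latin-1 character, so this fallback never fails.
        return "latin-1"


def read_text(path: str) -> tuple[str, str]:
    """Read a file and return (text, guessed encoding)."""
    with open(path, "rb") as handle:
        raw = handle.read()
    encoding = detect_encoding(raw)
    return raw.decode(encoding), encoding
```

This only distinguishes two encodings.  A file in any other 8-bit encoding would be reported as latin-1, which is why a real detector such as the ones from Mozilla or ICU uses statistical checks instead.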

What version of the product are you using? On what operating system?

Release 649, Ubuntu Linux.
Release 514, Windows 7.

Please provide any additional information below.

I have found the following article on how to do the detection using Mozilla
code.  I don't know if the licensing works.

"How to build standalone universal charset detector from Mozilla source"
http://www.mozilla.org/projects/intl/detectorsrc.html

Original issue reported on code.google.com by mvsfras...@gmail.com on 26 May 2010 at 5:58

GoogleCodeExporter commented 9 years ago
It is also worth taking a look at the ICU library
http://site.icu-project.org/

"Character Set Detection"
http://userguide.icu-project.org/conversion/detection

Original comment by mvsfras...@gmail.com on 26 May 2010 at 6:45

GoogleCodeExporter commented 9 years ago

Original comment by st.loeffler on 15 Jun 2010 at 6:00