-
Another long-standing issue with Notepad2 :)
I understand 100% accuracy may be impossible even with uchardet, but Notepad2 is a bit off.
Try opening this file, for example:
[VBoxHardening.log](https://github.…
-
This is a common task for all crawlers; see for instance [this discussion in StormCrawler](https://github.com/DigitalPebble/storm-crawler/issues/438).
There is code for that in [Tika](https://github…
-
## Describe the bug
It seems that the charset detection implementation at [L72 of `Plugin.cs`](https://github.com/QL-Win/QuickLook/blob/master/QuickLook.Plugin/QuickLook.Plugin.MarkdownViewer/Plugi…
-
**What version of this package are you using?**
usually the latest
**What problem do you want to solve?**
https://certitude.consulting/blog/en/invisible-backdoor/
**What do you think is th…
-
Very similar issue to #269, also tested in XP and Win10. Under Japanese locale, Notepad2 thinks the attached file is Unicode-encoded. Under another locale, it correctly detects it as UTF-8. I found th…
-
https://github.com/tomas/needle/issues/170
This is basically a clone of that issue.
According to https://www.w3.org/TR/encoding/, it looks like UTF-8 is now the default?
Are you open to a pull request?
-
feedparser implements charset detection according to [RFC 3023](https://datatracker.ietf.org/doc/html/rfc3023), which has been superseded by [RFC 7303](https://datatracker.ietf.org/doc/html/rfc7303). …
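
One practical difference between the two RFCs is that RFC 7303 treats a byte order mark, when present, as taking priority over the `charset` parameter. The sketch below illustrates that ordering only. It is not feedparser code: `sniffBom` and `pickXmlEncoding` are hypothetical names, and UTF-32 BOMs and the in-document XML encoding declaration are deliberately left out.

```javascript
// Hypothetical helpers for illustration; not part of feedparser.

// Map a leading byte order mark to an encoding name, or null if none is found.
// Simplification: UTF-32 BOMs are not recognised here.
function sniffBom(bytes) {
  if (bytes.length >= 3 && bytes[0] === 0xef && bytes[1] === 0xbb && bytes[2] === 0xbf) {
    return 'utf-8';
  }
  if (bytes.length >= 2 && bytes[0] === 0xfe && bytes[1] === 0xff) {
    return 'utf-16be';
  }
  if (bytes.length >= 2 && bytes[0] === 0xff && bytes[1] === 0xfe) {
    return 'utf-16le';
  }
  return null;
}

// Prefer the BOM; fall back to the charset parameter from Content-Type.
// Returns null when neither source says anything about the encoding.
function pickXmlEncoding(bytes, charsetParam) {
  const fromBom = sniffBom(bytes);
  if (fromBom) return fromBom;
  return charsetParam ? charsetParam.toLowerCase() : null;
}
```

When this returns `null`, a real parser would still have to look at the XML declaration or apply a default.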
-
In https://github.com/elastic/elasticsearch/pull/22691#discussion_r96935452, I added a comment which points out that our code currently ignores the `charset` parameter of the `Content-Type` header and…
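
For illustration, honouring the `charset` parameter starts with reading it out of the header value. The following is a minimal sketch and not Elasticsearch code: `charsetFromContentType` is a hypothetical name, and the parser is simplified (it does not handle a `;` inside a quoted parameter value).

```javascript
// Hypothetical helper for illustration; not taken from Elasticsearch.
// Returns the lowercased charset parameter of a Content-Type value, or null.
function charsetFromContentType(header) {
  if (typeof header !== 'string') return null;
  // Drop the media type, then scan the ';'-separated parameters.
  const params = header.split(';').slice(1);
  for (const param of params) {
    const eq = param.indexOf('=');
    if (eq === -1) continue;
    const name = param.slice(0, eq).trim().toLowerCase();
    if (name !== 'charset') continue;
    let value = param.slice(eq + 1).trim();
    // Strip optional surrounding double quotes.
    if (value.length >= 2 && value.startsWith('"') && value.endsWith('"')) {
      value = value.slice(1, -1);
    }
    return value ? value.toLowerCase() : null;
  }
  return null;
}

console.log(charsetFromContentType('application/json; charset=UTF-8'));
```

A caller can then decide whether to accept the declared charset, reject the request, or fall back to a default when the function returns `null`.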
-
Your idea for a subtitle correction service could be very useful, especially for those who encounter character encoding issues in subtitles (e.g., with special characters or accented letters in differ…
-
Hi!
Right now some charsets seem not to be supported when subsets are limited. For example:
```
const langSubset = ['en', 'es'];
console.log(eld.detect('Cześć! Jak się masz?').language);
console…