-
another long time issue with notepad2 :)
I understand 100% accuracy may impossible even with uchardet, but notepad2 is a bit off
opening this file for example
[VBoxHardening.log](https://github.…
-
**Describe the bug**
The detection of encoding did change recently, and IMHO regressed (I found that in a CI failure https://dev.azure.com/nexB/commoncode/_build/results?buildId=14502&view=logs&jobId…
-
This is a common task for all crawlers, see for instance [this discussion in StormCrawler](https://github.com/DigitalPebble/storm-crawler/issues/438)
There is code for that in [Tika](https://github…
-
## Describe the bug
It seems that the charset detection implementation at [L72 of `Plugin.cs`](https://github.com/QL-Win/QuickLook/blob/master/QuickLook.Plugin/QuickLook.Plugin.MarkdownViewer/Plugi…
-
**What version of this package are you using?**
usually the latest
**What problem do you want to solve?**
https://certitude.consulting/blog/en/invisible-backdoor/
**What do you think is th…
-
### Description
I opened a file UTF-8 encoded with the char "😵💫" in it, but it looks like that "亰窶昨汳ォ" in the text. So I assume, the detection of a charset makes an Error. It seems to be a Japane…
-
**Describe the bug**
Test cases related to TestCreateNeedleFromRequest are failing
**System Setup**
> git clone https://github.com/seaweedfs/seaweedfs.git
> #install package
> cd seaweedfs
…
-
feedparser implements charset detection according to [RFC 3023](https://datatracker.ietf.org/doc/html/rfc3023), which has been superseded by [RFC 7303](https://datatracker.ietf.org/doc/html/rfc7303). …
-
Very similar issue to #269, also tested in XP and Win10. Under Japanese locale, Notepad2 thinks the attached file is Unicode-encoded. Under another locale, it correctly detects it as UTF-8. I found th…
-
https://github.com/tomas/needle/issues/170
Basically a clone of this issue
https://www.w3.org/TR/encoding/ looks like utf-8 is now default?
Are you open to pull request?