Closed wuyu8512 closed 2 years ago
Hm I may be missing something here. The \u3000
is already handled by C# - there is nothing AngleSharp can do here. It should output
. So you enter "<h4>第三章 夢與超能力</h4>"
to AngleSharp. Is this special character now removed? What does the inner text actually look like?
What does the inner text actually look like?
第三章\u0020夢與超能力
\u3000
became \u0020
Bug Report
Prerequisites
AngleSharp.Css
for CSS support)For more information, see the
CONTRIBUTING
guide.Description
[Description of the bug]
Steps to Reproduce
var doc = HtmlParser.ParseDocument("
第三章\u3000夢與超能力
"); Console.WriteLine(doc.DocumentElement.GetInnerText()); Console.WriteLine(doc.DocumentElement.GetInnerText() == "第三章\u3000夢與超能力");