Open nosgnoh opened 5 years ago
same
Similar here: UTF-8 with or without a BOM fails on some characters, and permitted line breaks throw an error too. The error message claims that the submitted input is not valid UTF-8, but these characters and line breaks are allowed, so this looks like a bug.
Similar for Polish characters. Simplified test case:
<?xml version="1.0" encoding="UTF-8"?>
<czytelnicy xsi:noNamespaceSchemaLocation="ImpCz.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<czytelnicy>Ząb</czytelnicy>
</czytelnicy>
XSD:
<?xml version="1.0" encoding="UTF-8"?>
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema" elementFormDefault="qualified" attributeFormDefault="unqualified">
<xs:element name="czytelnicy"></xs:element>
</xs:schema>
Error shown on demo page:
file.xml:3: parser error : PCDATA invalid Char value 5
<czytelnicy>Ząb</czytelnicy>
^
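One possible reading of "Char value 5", offered as a hypothesis rather than a confirmed diagnosis: `ą` is U+0105, and keeping only the low 8 bits of that code point gives 0x05, which is not a legal XML character. The sketch below compares the proper UTF-8 bytes of the test string with what such a truncation would produce. The function names are made up for illustration and are not part of the library.

```python
def utf8_bytes(text: str) -> list:
    """Bytes a correct UTF-8 encoder produces for the text."""
    return list(text.encode("utf-8"))


def low_byte_truncation(text: str) -> list:
    """Hypothetical faulty conversion: keep only the low 8 bits of each code point.

    This is an assumption about what might be happening, not the library's code.
    """
    return [ord(ch) & 0xFF for ch in text]


print(utf8_bytes("Ząb"))           # [90, 196, 133, 98]
print(low_byte_truncation("Ząb"))  # [90, 5, 98]
```

If the input string were converted this way, the parser would see byte 5 where `ą` should be, which would match the reported error.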
Hi Kripken,
I have used your library in my project and ran into an issue, but I don't know whether it comes from your library or from my code, so I am reporting it here:
I am validating an XML file against an XSD schema, both encoded as UTF-8. The XML file contains some CJK characters, and validation fails. I have looked for a way to resolve this but have no ideas. Here are my schema and XML file:
` <?xml version="1.0" encoding="utf-8"?> <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:usdm="http://usdm.asia/usdm">
</xs:schema> `
xml : `<?xml version="1.0" encoding="utf-8"?> <usdm version="0.0.0" xmlns:usdm="http://usdm.asia/usdm">
`
From this page, https://www.utf8-chartable.de/unicode-utf8-table.pl?start=12288&number=512&names=-, I found that every character from U+3081 | め | e3 82 81 to the end of the table fails when the file is encoded as UTF-8.
Thanks for your attention!
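A possible workaround until this is fixed, sketched under the assumption that the failure only affects non-ASCII characters in the input: replace each non-ASCII character with an XML numeric character reference before validating, so the document handed to the validator is pure ASCII. The helper name `to_ascii_xml` is made up for illustration. This only works for text content and attribute values, because character references are not allowed in element or attribute names.

```python
def to_ascii_xml(xml_text: str) -> str:
    """Replace every non-ASCII character with a decimal character reference.

    ASCII characters, including the markup itself, are left unchanged.
    """
    return xml_text.encode("ascii", "xmlcharrefreplace").decode("ascii")


print(to_ascii_xml("<czytelnicy>Ząb</czytelnicy>"))  # <czytelnicy>Z&#261;b</czytelnicy>
print(to_ascii_xml("め"))                            # &#12417;
```

An XML parser treats `&#261;` and `ą` as the same character, so schema validation results should not change.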