[JAXP-76] StaX: data corruption when reading Unicode SMP characters in UTF-8 XML Created: 17/Jan/13 Updated: 09/Apr/15 Resolved: 09/Apr/15
|Remaining Estimate:||Not Specified|
|Time Spent:||Not Specified|
|Original Estimate:||Not Specified|
The attached small XML file contains a chinese character and the first gothic character (U+10330 : http://www.unicode.org/charts/PDF/U10330.pdf)
When parsing this file using StaX, the attribute value containing the gothic character is corrupted: it contains also the chinese character from the previous attribute.
See the console output:
From XML chinese:[-16, -92, -83, -94]
This issue comes from JOSM bug tracker: http://josm.openstreetmap.de/ticket/3290
|Comment by donvip [ 17/Jan/13 ]|
Sorry, how do we attach files ? If I am not allowed, they can be found here:
|Comment by donvip [ 16/Sep/13 ]|
Does anyone care about this public JIRA ?
|Comment by donvip [ 29/Nov/14 ]|
This bug has finally been addressed through https://bugs.openjdk.java.net/browse/JDK-8058175. Any chance to see it backported to JDK7 and JDK8?
|Comment by Joe Wang [ 09/Apr/15 ]|
Please note that the JAXP standalone (https://jaxp.java.net/) was retired. Please report issues to <a href="https://bugs.openjdk.java.net">the OpenJDK Bug System</a>.