jaxp
  1. jaxp
  2. JAXP-76

StaX: data corruption when reading Unicode SMP characters in UTF-8 XML

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: current
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None
    • Environment:

      JRE 7u11

      Description

      The attached small XML file contains a chinese character and the first gothic character (U+10330 : http://www.unicode.org/charts/PDF/U10330.pdf)

      When parsing this file using StaX, the attribute value containing the gothic character is corrupted: it contains also the chinese character from the previous attribute.

      See the console output:

      From XML chinese:[-16, -92, -83, -94]
      Expected chinese:[-16, -92, -83, -94]
      From XML gothic:[-16, -92, -83, -94, -16, -112, -116, -80]
      Expected gothic:[-16, -112, -116, -80]

      This issue comes from JOSM bug tracker: http://josm.openstreetmap.de/ticket/3290

        Activity

        donvip created issue -
        Hide
        donvip added a comment -
        Show
        donvip added a comment - Sorry, how do we attach files ? If I am not allowed, they can be found here: http://josm.openstreetmap.de/attachment/ticket/3290/gottic.osm http://josm.openstreetmap.de/attachment/ticket/3290/Test.java
        Hide
        donvip added a comment -

        Does anyone care about this public JIRA ?

        Show
        donvip added a comment - Does anyone care about this public JIRA ?

          People

          • Assignee:
            Unassigned
            Reporter:
            donvip
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated: