Incorrectly parsing XML with duplicated tag names #11

nurkiewicz · 2011-06-08T12:09:53Z

Trying to parse the following XML document:

<data>
  <r><a>A</a></r>
  <r><b>B</b><c>C</c></r>
</data>

with:

new XmlMapper().readValue(xml, Map.class)

ignores the first "r" (r -> {a -> A}) node, overriding it with a second one (r -> {b -> B, c -> C}). It should generate a map with a single key and array value instead: r -> [{a -> A}, {b -> B, c -> C}]. The problem is here (last line of org.codehaus.jackson.map.deser.MapDeserializer#_readAndBind):

            /* !!! 23-Dec-2008, tatu: should there be an option to verify
             *   that there are no duplicate field names? (and/or what
             *   to do, keep-first or keep-last)
             */
            result.put(key, value);

Although this can be worked around by using special map implementation instead of Map.class, but if the duplicated tags appear deeper in XML document (not at top level), there is no easy workaround, see org.codehaus.jackson.map.deser.UntypedObjectDeserializer#mapObject class (LinkedHashMap creation).

Of course the root cause of this problem is the assumption that there are no duplicate properties in JSON. In XML such nodes should be treated as arrays.

The text was updated successfully, but these errors were encountered:

cowtowncoder · 2011-06-08T16:33:46Z

Correct, this problem does result from impedance between XML and JSON.
Another question is whether wrappers for elements were supported: I think structure that works is one without elements; and there is a known issue wrt handling of wrapper vs unwrapped lists.

But Map is pretty specific type, so I wonder if it might be possible to add bit more interaction to make it work.
The way Lists are handled does in fact rely on a somewhat specific low-level method, so it might be possible to add something similar for cases where Map-type content is expected.

cowtowncoder · 2012-03-07T23:35:50Z

I am not sure there is generic solution to this problem: your solution assumes that we can use heuristic to combine sub-trees, but this would not be guaranteed for all kinds of structures.

But it could work for some subset of cases; so question then is whether to try to work on something that would work with the standard Map deserializer (which is not format specific), or to add XML-specific Map deserializer.
Latter might make more sense, given that this is "impossible" case for JSON (and in fact I would argue should probably throw an exception so that users do not rely on being able to handle duplicates).

As to XML: there is the immediate problem wherein value of duplicate property may well be something other than another Map; so it is not clear what would be the proper way to merge things. For example:

<data>
  <r>A</r>
  <r><b>B</b><c>C</c></r>
</data>

would not quite work, as value for entry "r" would be String "A". So what should be done for the following entry?

cowtowncoder · 2012-04-05T21:40:38Z

I think the answer here is "works as designed" -- 'untyped' binding to Maps and Lists will not be working correctly without assuming more advanced rules, and I don't want to move to that direction.
Will close the issue as 'wont fix'.

arakelian · 2017-09-13T15:02:11Z

For people who stumble upon this issue, see the Gist provided in #205 for a work around.

cowtowncoder closed this as completed Apr 5, 2012

calebkiage mentioned this issue Feb 14, 2017

Issue with RemoteTokenServices decoding the /oauth/check_token result spring-attic/spring-security-oauth#976

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Incorrectly parsing XML with duplicated tag names #11

Incorrectly parsing XML with duplicated tag names #11

nurkiewicz commented Jun 8, 2011

cowtowncoder commented Jun 8, 2011

Uh oh!

cowtowncoder commented Mar 7, 2012

Uh oh!

cowtowncoder commented Apr 5, 2012

Uh oh!

arakelian commented Sep 13, 2017

Uh oh!

Uh oh!

Incorrectly parsing XML with duplicated tag names #11

Incorrectly parsing XML with duplicated tag names #11

Comments

nurkiewicz commented Jun 8, 2011

cowtowncoder commented Jun 8, 2011

Uh oh!

cowtowncoder commented Mar 7, 2012

Uh oh!

cowtowncoder commented Apr 5, 2012

Uh oh!

arakelian commented Sep 13, 2017

Uh oh!