libervia-backend: sat/tools/xml

comparison sat/tools/xml_tools.py @ 3393:2b6f69f6df8c

tools(xml_tools): fixed `<div>` unwrapping + added `parse` instance: `<div>` unwrapping could fail when a text node was a sibling of the top element (could easily happen with a `\n` line feed added by an editor). This is fixed by filtering on IElement with `elements()`. A `parse` instance has been added, as it is not necessary to create a new `ElementParser` each time we want to parse something.
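
To illustrate the unwrapping fix described above, here is a minimal sketch using the standard library's `xml.dom.minidom` instead of Twisted's `domish`. It is not the code from this changeset: the function name `unwrap_single_child` is hypothetical, and filtering on `nodeType` only stands in for what `elements()` does in the actual patch.

```python
from xml.dom import minidom


def unwrap_single_child(raw_xml):
    """Wrap raw_xml in a <div/>, parse it, and unwrap when possible.

    Hypothetical helper, written with minidom as a stand-in for domish.
    """
    doc = minidom.parseString(f"<div>{raw_xml}</div>")
    top_elt = doc.documentElement
    # Keep element nodes only, so that a stray text sibling (for instance
    # a trailing line feed) does not prevent the unwrapping.
    element_children = [
        node for node in top_elt.childNodes
        if node.nodeType == node.ELEMENT_NODE
    ]
    if len(element_children) == 1:
        # A single element at the top: drop the wrapping <div/>
        return element_children[0]
    return top_elt


# A trailing line feed adds a text node next to <p/>, so counting all
# child nodes would give 2 and the <div/> would not be removed.
with_newline = unwrap_single_child("<p>hello</p>\n")
several = unwrap_single_child("<p>a</p><p>b</p>")
```

With this filtering, `with_newline` is the `<p/>` element even though a text node sits next to it, while `several` keeps the wrapping `<div/>` because there are two top-level elements.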

author	Goffi <goffi@goffi.org>
date	Thu, 12 Nov 2020 14:53:15 +0100
parents	8770397f8f82
children	d6a482a78bda

-:0957ea9137b8
+:2b6f69f6df8c
 """Check if a data_form.Field is an XHTML one"""
 return (field.fieldType is None and field.ext_type == "xml" and
 field.value.uri == C.NS_XHTML)
-class ElementParser(object):
+class ElementParser:
-"""callable class to parse XML string into Element"""
+"""Callable class to parse XML string into Element"""
 # XXX: Found at http://stackoverflow.com/questions/2093400/how-to-create-twisted-words-xish-domish-element-entirely-from-raw-xml/2095942#2095942
 def _escapeHTML(self, matchobj):
 entity = matchobj.group(1)
 raw_xml = raw_xml.replace("\n", " ").replace("\t", " ")
 tmp.addRawXml(raw_xml)
 parser.parse(tmp.toXml().encode("utf-8"))
 top_elt = self.result.firstChildElement()
 # we now can check if there was a unique element on the top
-# and remove our wrapping <div/> is this was the case
+# and remove our wrapping <div/> is this is the case
-if len(top_elt.children) == 1 and domish.IElement.providedBy(top_elt.children[0]):
+top_elt_children = list(top_elt.elements())
-top_elt = top_elt.firstChildElement()
+if len(top_elt_children) == 1:
+top_elt = top_elt_children[0]
 return top_elt
+parse = ElementParser()
 # FIXME: this method is duplicated from frontends.tools.xmlui.getText
 def getText(node):
 """Get child text nodes of a domish.Element.
