What is the best way to deal with the lack of a namespace on some nodes in an XML document using lxml? Should I first change all the No nodes to add the name gmd and then change the attributes of the tree to name http://www.isotc211.org/2005/gmd as gmd? If so, is there a clean way to do this with lxml or something else that would be relatively clean / safe?
from lxml import etree
nsmap = charts_tree.nsmap
nsmap.pop(None)
len (charts_tree.xpath('//*/gml:Polygon',namespaces=nsmap))
len (charts_tree.xpath('//*/DS_DataSet',namespaces=nsmap))
len (charts_tree.xpath('//*/DS_DataSet'))
eg. http://www.charts.noaa.gov/ENCs/ENCProdCat_19115.xml
<DS_Series xmlns="http://www.isotc211.org/2005/gmd" xmlns:gco="http://www.isotc211.org/2005/gco" xmlns:gml="http://www.opengis.net/gml/3.2" xmlns:gsr="http://www.isotc211.org/2005/gsr" xmlns:gss="http://www.isotc211.org/2005/gss" xmlns:gts="http://www.isotc211.org/2005/gts" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.isotc211.org/2005/gmd http://schemas.opengis.net/iso/19139/20070417/gmd/gmd.xsd">
<composedOf>
<DS_DataSet>
<has>
<MD_Metadata>
<parentIdentifier>
<gco:CharacterString>NOAA ENC Product Catalog</gco:CharacterString>
</parentIdentifier>
...
<EX_BoundingPolygon>
<polygon>
<gml:Polygon gml:id="US1AK90M_P1">
<gml:exterior>
<gml:LinearRing>
<gml:pos>67.61505 -178.99979</gml:pos>
<gml:pos>73.99999 -178.99979</gml:pos>
...
<gml:pos>64.99997 -178.99979</gml:pos>
<gml:pos>67.61505 -178.99979</gml:pos>
</gml:LinearRing>
source
share