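Parsing OSM data to get nodes that have child nodes

Asked: 2013-04-21 08:09:36

Tags: php xml simplexml openstreetmap

I downloaded OpenStreetMap data for a small area, and I want to filter it to get only the nodes that belong to particular categories.

Here is a sample of the OSM data: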

 <node id="505126369" lat="31.2933856" lon="34.2687443" user="JumpStart International" uid="125156" visible="true" version="1" changeset="2568758" timestamp="2009-09-22T13:05:10Z"/>
 <node id="505126372" lat="31.2682934" lon="34.2745680" user="JumpStart International" uid="125156" visible="true" version="1" changeset="2568758" timestamp="2009-09-22T13:05:10Z"/>
 <node id="505126375" lat="31.2953082" lon="34.3471630" user="JumpStart International" uid="125156" visible="true" version="1" changeset="2568758" timestamp="2009-09-22T13:05:10Z"/>
 <node id="505126378" lat="31.2807872" lon="34.2757999" user="JumpStart International" uid="125156" visible="true" version="1" changeset="2568758" timestamp="2009-09-22T13:05:11Z">
   <tag k="amenity" v="school"/>
   <tag k="name" v="Al Aqqad Basic &amp; Secondary Female School"/>
   <tag k="name:ar" v="مدرسة العقاد الأساسية والثانوية للبنات"/>
  </node>

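I want to get all the data for schools and hospitals.

If anyone has done this kind of XML parsing in PHP or Java, I would be very grateful if you shared it with me and everyone else.

Edit: Here is a simple start; I just have: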

$dataFile = base_url() . 'media/files/osmdata/map_3.xml';
    //echo ($dataFile);

    $xml = simplexml_load_file($dataFile);

    //    $countTotal = count($xml->node);
    //   echo 'here'.$countTotal;
    foreach ($xml as $key => $val) {
        var_dump($val);
               // can't manage things over here

    }

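1 answer:

Answer 0 (score: 8)

Below is a small OSM Overpass API example with PHP SimpleXML that I put together, because we don't have one for PHP here and I love OSM, so let's show some useful examples.

The first part shows how to query an Overpass endpoint with standard PHP. You don't need that part, because you already have the data saved on your hard disk: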

<?php
/**
 * OSM Overpass API with PHP SimpleXML / XPath
 *
 * PHP Version: 5.4 - Can be back-ported to 5.3 by using 5.3 Array-Syntax (not PHP 5.4's square brackets)
 */


//
// 1.) Query an OSM Overpass API Endpoint
//

$query = 'node
  ["amenity"~".*"]
  (38.415938460513274,16.06338500976562,39.52205163048525,17.51220703125);
out;';

$context = stream_context_create(['http' => [
    'method'  => 'POST',
    'header' => ['Content-Type: application/x-www-form-urlencoded'],
    'content' => 'data=' . urlencode($query),
]]);

# please do not stress this service, this example is for demonstration purposes only.
$endpoint = 'http://overpass-api.de/api/interpreter';
libxml_set_streams_context($context);
$start = microtime(true);

$result = simplexml_load_file($endpoint);
printf("Query returned %2\$d node(s) and took %1\$.5f seconds.\n\n", microtime(true) - $start, count($result->node));

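The second part is the more interesting one for you: querying the XML data you already have. This is easiest with XPath. The PHP XML libraries used here are based on libxml, which supports XPath 1.0 and covers a wide range of querying needs well.

The following example lists all schools and tries to obtain their names as well. I have not covered translations, because my sample data did not contain any, but you can also look for all kinds of names, including translations, and simply prefer a specific one: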

//
// 2.) Work with the XML Result
//

# get all school nodes with xpath
$xpath = '//node[tag[@k = "amenity" and @v = "school"]]';
$schools = $result->xpath($xpath);
printf("%d School(s) found:\n", count($schools));
foreach ($schools as $index => $school)
{
    # Get the name of the school (if any), again with xpath
    list($name) = $school->xpath('tag[@k = "name"]/@v') + ['(unnamed)'];
    printf("#%02d: ID:%' -10s  [%s,%s]  %s\n", $index, $school['id'], $school['lat'], $school['lon'], $name);
}

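The key point here is the XPath queries. Two are used. The first one gets the nodes that carry a specific tag, and I think it is the most interesting one for you: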

//node[tag[@k = "amenity" and @v = "school"]]

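This line says: give me all node elements that contain a tag element whose k attribute has the value "amenity" and whose v attribute has the value "school". This is the condition you need in order to filter for the nodes that are tagged amenity=school.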
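Since the question asks for schools and hospitals together, here is a minimal sketch of the same predicate extended with an "or". It runs against an inline string modelled on the question's sample; the <osm> root element and the nodes with id 1 and 2 are made up for illustration. For a file on disk you would presumably use simplexml_load_file() with a local path instead.

```php
<?php
// Sample data modelled on the question's excerpt. The <osm> wrapper and the
// nodes with id="1" and id="2" are hypothetical, added for illustration.
$xml = <<<'XML'
<osm>
  <node id="505126375" lat="31.2953082" lon="34.3471630"/>
  <node id="505126378" lat="31.2807872" lon="34.2757999">
    <tag k="amenity" v="school"/>
    <tag k="name" v="Al Aqqad Basic &amp; Secondary Female School"/>
  </node>
  <node id="1" lat="31.30" lon="34.30">
    <tag k="amenity" v="hospital"/>
  </node>
  <node id="2" lat="31.31" lon="34.31">
    <tag k="amenity" v="cafe"/>
  </node>
</osm>
XML;

$osm = simplexml_load_string($xml);

// Keep nodes that have an amenity tag whose value is "school" or "hospital".
$xpath = '//node[tag[@k = "amenity" and (@v = "school" or @v = "hospital")]]';
$matches = $osm->xpath($xpath);

// Collect the ids of the matching nodes.
$ids = [];
foreach ($matches as $node) {
    $ids[] = (string) $node['id'];
}
print_r($ids);
```

Nodes without any tag children, and nodes with a different amenity, do not satisfy the predicate and are skipped.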

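XPath is then used a second time, now relative to each school node, to see whether there is a name and, if so, to fetch it:

tag[@k = "name"]/@v

This line says: relative to the current node, give me the v attribute of a tag element whose k attribute has the value "name". As you can see, some parts are similar to the earlier line. I think you can adapt both to your needs.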
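For translated names such as the name:ar tag in the question's sample, one possible approach is the XPath 1.0 function starts-with(). The sketch below collects every name variant of one node and then picks one; which variant to prefer (Arabic first here) is purely an assumption for illustration.

```php
<?php
// The school node from the question's sample, used on its own as the root.
$xml = <<<'XML'
<node id="505126378" lat="31.2807872" lon="34.2757999">
  <tag k="amenity" v="school"/>
  <tag k="name" v="Al Aqqad Basic &amp; Secondary Female School"/>
  <tag k="name:ar" v="مدرسة العقاد الأساسية والثانوية للبنات"/>
</node>
XML;

$school = simplexml_load_string($xml);

// Collect every tag whose key is "name" or starts with "name:" (e.g. name:ar).
$names = [];
foreach ($school->xpath('tag[@k = "name" or starts-with(@k, "name:")]') as $tag) {
    $names[(string) $tag['k']] = (string) $tag['v'];
}

// Assumed preference order: Arabic name, then plain name, then a placeholder.
$display = isset($names['name:ar']) ? $names['name:ar']
    : (isset($names['name']) ? $names['name'] : '(unnamed)');

echo $display, "\n";
```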

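Because not every school node has a name, a default string is provided for display purposes by adding it to the result array, which is empty when no name was found: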

list($name) = $school->xpath('tag[@k = "name"]/@v') + ['(unnamed)'];
                                                    ^^^^^^^^^^^^^^^
                                                Provide Default Value
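The default works because of PHP's array union operator: + keeps the left-hand element for every key that already exists and takes the right-hand element only for missing keys. A small sketch with plain strings standing in for the XPath result:

```php
<?php
// With a match, index 0 of the left array wins and the default is ignored.
list($found) = ['Scuola Primaria'] + ['(unnamed)'];

// With no match the left array is empty, so index 0 comes from the default.
list($missing) = [] + ['(unnamed)'];

printf("%s / %s\n", $found, $missing);
```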

So here is the output of my code example:

Query returned 907 node(s) and took 1.10735 seconds.
10 School(s) found:
#00: ID:332534486   [39.5017565,16.2721899]  Scuola Primaria
#01: ID:1428094278  [39.3320912,16.1862820]  (unnamed)
#02: ID:1822746784  [38.9075566,16.5776597]  (unnamed)
#03: ID:1822755951  [38.9120272,16.5713431]  (unnamed)
#04: ID:1903859699  [38.6830409,16.5522243]  Liceo Scientifico Statale A. Guarasci
#05: ID:2002566438  [39.1347698,16.0736924]  (unnamed)
#06: ID:2056891127  [39.4106679,16.8254844]  (unnamed)
#07: ID:2056892999  [39.4124687,16.8286119]  (unnamed)
#08: ID:2272010226  [39.4481717,16.2894353]  SCUOLA DELL'INFANZIA SAN FRANCESCO
#09: ID:2272017152  [39.4502366,16.2807664]  SCUOLA MEDIA 

I hope this is helpful. Let me know if you have further questions.


  

(by rbwilkinson): This is how you can add further parameters to look for other values. The following example finds other properties within one kilometre:

$query = 'node
  ["addr:postcode"~"RM12"]
  (51.5557914,0.2118915,51.5673083,0.2369398);
   node
  (around:1000)
  ["amenity"~"fast_food"];
           out;';

$context = stream_context_create(['http' => [
    'method'  => 'POST',
    'header' => ['Content-Type: application/x-www-form-urlencoded'],
    'content' => 'data=' . urlencode($query),
]]);

$endpoint = 'http://overpass-api.de/api/interpreter';
libxml_set_streams_context($context);
$start = microtime(true); // start the timer used by the printf below

$result = simplexml_load_file($endpoint);
printf("Query returned %2\$d node(s) and took %1\$.5f seconds.\n\n", microtime(true) - $start, count($result->node));
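If the postcode, bounding box, radius and amenity change often, the query string could be assembled from its parts. The function buildAroundQuery() below is a hypothetical helper, not part of PHP or of the Overpass API; it only builds the string and does not escape its inputs, so treat it as a sketch.

```php
<?php
/**
 * Hypothetical helper (not part of any library): builds an Overpass QL query
 * with the same statements as the example above. Inputs are not escaped.
 */
function buildAroundQuery($postcode, array $bbox, $radiusMetres, $amenity)
{
    return sprintf(
        'node["addr:postcode"~"%s"](%s);' . "\n" .
        'node(around:%d)["amenity"~"%s"];' . "\n" .
        'out;',
        $postcode,
        implode(',', $bbox), // south,west,north,east, passed as strings
        $radiusMetres,
        $amenity
    );
}

$query = buildAroundQuery(
    'RM12',
    ['51.5557914', '0.2118915', '51.5673083', '0.2369398'],
    1000,
    'fast_food'
);

// Request body in the same form as the stream context above expects.
$body = 'data=' . urlencode($query);

echo $query, "\n";
```

The resulting $body can be passed as the 'content' entry of the stream context shown earlier.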