匹配URL与规范化/规范化

时间:2011-12-28 03:26:58

标签: php url canonicalization

我发现这个解决方案规范化并匹配相同的URL,但我想知道是否有更优雅的东西,PHP是否有URL规范化功能?

function urlMatch($url1, $url2)
{
    // parse the urls
    $r1 = parse_url($url1);
    $r2 = parse_url($url2);

    // get the variables out of the queries
    parse_str($r1['query'], $v1);
    parse_str($r2['query'], $v2);

    // match the domains and paths
    if ($r1['host'] != $r2['host'] || $r1['path'] != $r2['path'])
        return false;

    // match the arrays
    foreach ($v1 as $key => $value)
        if (array_key_exists($key, $v2) && $value != $v2[$key])
            return false;

    // if we haven't returned already, then the queries match
    return true;
}

1 个答案:

答案 0 :(得分:1)

这样的事情可能更适合您的需求:

function urlMatch($url1, $url2){
  // parse the urls
  $r1 = parse_url($url1);
  $r2 = parse_url($url2);

  if (isset($r1['query'])){
    // get the variables out of the query
    parse_str($r1['query'], $v1);
    // sort arguments so they be in exactly same order
    asort($v1);
    // place sorted arguments back
    $r1['query'] = http_build_query($v1);
  }

  if (isset($r2['query'])){
    parse_str($r2['query'], $v2);
    asort($v2);
    $r2['query'] = http_build_query($v2);
  }

  // Match re-constructed urls (you'll need pecl_http extension for this)
  $matched = http_build_url($r1) === http_build_url($r2);

  return $matched;
}

更新:我已经更改了一些代码来处理空查询......