如何使用PHP最好地解码SEO目的字符串中的特殊字符

时间:2012-03-23 12:27:54

标签: php seo html-entities

我正在运行CMS。我看到我的CMS用户正在输入来自copy& amp ;;在Word等中粘贴动作

因此,在元描述内容属性中,我看到一颗钻石代替倾斜的单一右引号。

我进入数据库并将单引号更改为’

因此,我的CMS现在呈现it’s,例如。

我正在尝试使用PHP的htmlentities(),htmlspecialchars_decode()和html_entity_decode()在渲染期间对字符串进行转换。

每条清理指令只需呈现it’s

我应该使用PHP函数进行翻译吗?它甚至重要吗?也就是说,我可以在元标记中使用对于SEO至关重要的HTML代码吗?

希望这很清楚......感谢任何反馈。

1 个答案:

答案 0 :(得分:1)

这是我为了同样的理由而整理的那个。如果您乐意将HTML保留在帖子中,则可以删除strip_tag()行。

function convert_to_plaintext($message)
  {
    $message = strip_tags($message);

    //Quotes: Replace smart double quotes with straight double quotes.
    //ANSI version for use with 8-bit regex engines and the Windows code page 1252.
    preg_replace('[\x84\x93\x94]', '"', $message);

    //Quotes: Replace smart single quotes and apostrophes with straight single quotes.
    //ANSI version for use with 8-bit regex engines and the Windows code page 1252.
    preg_replace("[\x82\x91\x92]", "'", $message);

    //Quotes: Replace straight apostrophes with smart apostrophes
    preg_replace("/(\b'\b)/", "?", $message);

    //Quotes: Replace straight double quotes with smart double quotes.
    //ANSI version for use with 8-bit regex engines and the Windows code page 1252.
    preg_replace('/(\B"\b([^"\x84\x93\x94\r\n]+)\b"\B)/', '?\1?', $message);

    //Quotes: Replace straight double quotes with smart double quotes.
    //Unicode version for use with Unicode regex engines.
    //preg_replace('/(\B"\b([^"\u201C\u201D\u201E\u201F\u2033\u2036\r\n]+)\b"\B)/', '?\1?', $message);
    $message = str_replace(array('%u201C','%u201D','%u201E','%u201F','%u2033','%u2036'),'"',$message);

    //Quotes: Replace straight single quotes with smart single quotes.
    //Unicode version for use with Unicode regex engines.
    //preg_replace("/(\B'\b([^'\u2018\u2019\u201A\u201B\u2032\u2035\r\n]+)\b'\B)/", "?\1?", $message);
    $message = str_replace(array('%u2018','%u2019','%u201A','%u201B','%u2032','%u2035'),"'",$message);

    //Quotes: Replace straight single quotes with smart single quotes.
    //ANSI version for use with 8-bit regex engines and the Windows code page 1252.
    preg_replace("/(\B'\b([^'\x82\x91\x92\r\n]+)\b'\B)/", "?\1?", $message);

    $message = str_replace("\n\n\n","\n\n",$message);
    $message = str_replace("\n\n\n","\n\n",$message);
    $message = str_replace("\n\n\n","\n\n",$message);
    $message = str_replace("\n\n\n","\n\n",$message);
    $message = str_replace("\n\n\n","\n\n",$message);
    $message = str_replace("\n",'<br/>',$message);

    return $message;
  }
相关问题