读取csv文件时的字符编码问题

时间:2013-04-26 08:09:28

标签: php csv utf-8 character-encoding

我的csv文件包含“æ”,“å”等特殊字符。当我读取并打印文件时,文件中的特殊字符会转换为“ ”。 我尝试将页面编码设置为UTF-8和ISO 8859-1。但这些都没有帮助。

smb可以建议解决方案吗?

1 个答案:

答案 0 :(得分:2)

我认为您必须检测并更改原始编码,如下所示(如果您使用的是php):

  <?php
        header( "Content-Type: text/html; charset=utf-8");
        $csvContent = file_get_contents( $fileName );
        $encoding = mb_detect_encoding( $csvContent, 
                                        array("UTF-8","UTF-32","UTF-32BE","UTF-32LE","UTF-16","UTF-16BE","UTF-16LE"), 
                                        TRUE );

        if( $fileEncoding !== "UTF-8" ) {
             $csvContent = mb_convert_encoding($csvContent, "UTF-8", $fileEncoding );
        }

        foreach( explode( PHP_EOL, $csvContent ) as $item ) {
           var_dump($item );
        }
 ?>