Wget日本人物

时间:2014-11-21 05:35:36

标签: utf-8 wget

我想下载包含日文字符的网址

http://ja.wikipedia.org/wiki/%E3%82%A2%E3%82%A4%E3%82%B6%E3%83%83%E3%82%AF%E3%83%BB%E3%82%A2%E3%82%B7%E3%83%A2%E3%83%95

使用wget下载文件时,文件名变得乱码。

在wget的手册中,有一些关于这个问题:

If you specify ‘nocontrol’, then the escaping of the control characters is also switched off. This option may make sense when you are downloading URLs whose names contain UTF-8 characters, on a system which can save and display filenames in UTF-8 (some possible byte values used in UTF-8 byte sequences fall in the range of values designated by Wget as “controls”).

所以我试着写这个。但是,它没有用。有什么问题?

wget --restrict-file-names=nocontrol http://ja.wikipedia.org/wiki/%E3%82%A2%E3%82%A4%E3%82%B6%E3%83%83%E3%82%AF%E3%83%BB%E3%82%A2%E3%82%B7%E3%83%A2%E3%83%95

0 个答案:

没有答案