我要下载数据集(SemEval 2016/2017),如
所述http://alt.qcri.org/semeval2017/task4/index.php?id=data-and-tools
这反过来提醒
https://github.com/seirasto/twitter_download
最后一个链接提供了有关如何运行脚本以下载数据集的说明。
我使用Ubuntu,然后在命令提示符下运行
twitter
,如自述文件中所述。它会打开Web浏览器,并使用我的Twitter用户名和密码从中登录。我获得了PIN码,然后将其输入到脚本生成的提示中。
然后,我尝试使用命令下载数据集
python3 download_tweets_api.py --dist=tweeti-a.dist.tsv --output=downloaded.tsv
但我知道
fp@fp-X58LE:~/twitter_download-master$ python3 download_tweets_api.py --dist=tweeti-a.dist.tsv --output=downloaded.tsv
Hi there! We're gonna get you all set up to use Semeval sentiment analysis.
Traceback (most recent call last):
File "/usr/local/lib/python3.4/dist-packages/twitter/api.py", line 341, in _handle_response
handle = urllib_request.urlopen(req, **kwargs)
File "/usr/lib/python3.4/urllib/request.py", line 161, in urlopen
return opener.open(url, data, timeout)
File "/usr/lib/python3.4/urllib/request.py", line 469, in open
response = meth(req, response)
File "/usr/lib/python3.4/urllib/request.py", line 579, in http_response
'http', request, response, code, msg, hdrs)
File "/usr/lib/python3.4/urllib/request.py", line 507, in error
return self._call_chain(*args)
File "/usr/lib/python3.4/urllib/request.py", line 441, in _call_chain
result = func(*args)
File "/usr/lib/python3.4/urllib/request.py", line 587, in http_error_default
raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 401: Authorization Required
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "download_tweets_api.py", line 21, in <module>
oauth_dance("Semeval sentiment analysis", CONSUMER_KEY, CONSUMER_SECRET, MY_TWITTER_CREDS)
File "/usr/local/lib/python3.4/dist-packages/twitter/oauth_dance.py", line 93, in oauth_dance
twitter.oauth.request_token(oauth_callback="oob"))
File "/usr/local/lib/python3.4/dist-packages/twitter/api.py", line 334, in __call__
return self._handle_response(req, uri, arg_data, _timeout)
File "/usr/local/lib/python3.4/dist-packages/twitter/api.py", line 367, in _handle_response
raise TwitterHTTPError(e, uri, self.format, arg_data)
twitter.api.TwitterHTTPError: Twitter sent status 401 for URL: oauth/request_token using parameters: (oauth_callback=oob&oauth_consumer_key=JEdRRoDsfwzCtupkir4ivQ&oauth_nonce=6666274229238613502&oauth_signature_method=HMAC-SHA1&oauth_timestamp=1592954865&oauth_version=1.0&oauth_signature=T8xiOX4HMOLQcertNeMC1vlnoHM%3D)
details: {"errors":[{"code":32,"message":"Could not authenticate you."}]}
我已卸载并重新安装了python和twitter,目前我不知道如何进行操作。
在运行脚本之前,在groups.google.com/forum/#!topic/semevaltweet/4e45k1QkHk4之后,我首先复制了训练数据ID文件 twitter-2016train-A.txt (从数据和工具页面alt.qcri.org/semeval2017/task4/index.php?id=data-and-tools下载)在脚本的同一目录中,我将其重命名为 tweeti-a.dist.tsv 。