按位置过滤推文

时间:2015-05-27 23:43:23

标签: python twitter geolocation

我正在尝试修改此脚本以仅保存附加了位置的推文的JSON,并且遇到Python的问题,其中检查某些内容不为null似乎不起作用。 Key没有正常工作,因为它们都有密钥,大多数都只是'null'。是不是没有工作,因为Python认为null和None是不同的,并将其作为文本检查不是“null”也不起作用。有没有人对如何解决这个问题有一个聪明的想法?

from tweepy.streaming import StreamListener
from tweepy import OAuthHandler
from tweepy import Stream
import pymongo
import tweepy
import json


#Variables that contains the user credentials to access Twitter API 
access_key = '' #redacted for privacy and such
access_secret = ''
consumer_key = ''
consumer_secret = ''

#Runs auth to Twitter API
auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_key, access_secret)
api = tweepy.API(auth)


#This is a basic listener that will print incoming data to stdout
class StdOutListener(StreamListener):

    def on_data(self, data):
        print data
        return True

    def on_error(self, status):
        print status


#Customizes the stream and saves text and lang to databases 
class CustomStreamListener(tweepy.StreamListener):
    def __init__(self, api):
        self.api = api
        super(tweepy.StreamListener, self).__init__()
        self.db = pymongo.MongoClient('localhost', 27017).crime



    def on_data(self, data):
        jd = json.loads(data)
        if jd.has_key('coordinates') :
            self.db.tweets.insert( { 'text' : jd['text'], 'coordinates' : jd['coordinates'], 'lang' : jd['lang'] } )




    def on_error(self, status_code):
        return True # Don't kill the stream

    def on_timeout(self):
        return True # Don't kill the stream

#Calls on StreamListerner and provides specifications of tracking
l = tweepy.streaming.Stream(auth, CustomStreamListener(api))
l.filter(track=['guns'])

1 个答案:

答案 0 :(得分:0)

您可以尝试检查字符串的长度:

if len( jd['coordinates'] ) > 1:
    self.db.tweets.insert( { 'text' : jd['text'], 'coordinates' : jd['coordinates'], 'lang' : jd['lang'] } )