【发布时间】:2017-04-25 12:05:59
【问题描述】:
我使用的文件是从 twitter 流中导出的。但是当运行代码时 然后错误: KeyError: 'text' on tweets['python'] = tweets['text'].apply(lambda tweet: word_in_text('python', tweet))。 如何解决?谢谢大家...!
import re
import json
import string
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib
tweets_data_path = '...\\..\\log1000.txt'
tweets_data = []
tweets_file = open(tweets_data_path, "r")
for line in tweets_file:
try:
tweet = json.loads(line)
tweets_data.append(tweet)
except:
continue
def word_in_text(word, text):
word = word.lower()
text = text.lower()
match = re.search(word, text)
if match:
return True
return False
#------------------------DataFrame--------------------------
tweets = pd.DataFrame()
#------------------------------------------------------------------------
tweets['python'] = tweets['text'].apply(lambda tweet: word_in_text('python', tweet))
#----------------------------------------------------------------
print (tweets['python'].value_counts()[True])
【问题讨论】: