【发布时间】:2020-05-28 12:22:34
【问题描述】:
这里我正在尝试合并文本文件并仅将文件的消息部分提取到单独的文件中
import os
import re
message_data=[]
path=r'C:\Users\Multiple Text files/'
filenames=['2019-01-01.text','2019-01-02.text','2019-01-03.text','2019-01-04.text','2019-01-
05.text','2019-01-06.text','2019-01-07.text']
#inside each file there is a message and I'm trying to extract that particular message only
with open(os.path.join(path,filenames),encoding='utf8') as f:
for line in f.readlines():
m=re.findall('.*?Message:.*',line)
for line in m:
message_data.append(line)
【问题讨论】:
-
问题与
machine-learning、sentiment-analysis或nlp无关——请不要向无关标签发送垃圾邮件(已删除)。
标签: python file text text-processing