【发布时间】:2020-07-24 02:33:08
【问题描述】:
我有一个包含电子邮件的 TXT 文件,例如:
From r Wed Oct 30 21:41:56 2002
Return ...
...
From r Thu Oct 31 08:11:39 2002
Return ...
...
我想将每封电子邮件提取到一个数组中,例如:
["From r Wed Oct 30 21:41:56 2002 Return ...", "From r Thu Oct 31 08:11:39 2002 Return ...", ..., "From r ..."]
我正在使用 python
with open(self.file, encoding="utf8", errors='ignore') as data_file:
lines = ''
first_line = True
for line in data_file:
if line.startswith("From r") and not first_line:
emails.append(lines)
lines = ''
else:
first_line = False
lines = lines + line
【问题讨论】:
-
您希望根据什么拆分电子邮件?每封电子邮件只有 2 行 - From 和 Return?
-
每封邮件有n行,但都以'From r'开头
-
@vitorcarvalho 提取数组的具体原因?