【发布时间】:2016-06-01 16:08:44
【问题描述】:
我正在尝试将标签的所有命中写入 Python 中的 csv 文件。 我的字符串是:
<pre class="CodeRay highlight"><code data-lang="java"><span class="annotation">@CDIUI</span>(<span class="string"><span class="delimiter">"</span><span class="content">cdievents</span><span class="delimiter">"</span></span>)
<span class="annotation">@Theme</span>(<span class="string"><span class="delimiter">"</span><span class="content">valo</span><span class="delimiter">"</span></span>)
<span class="directive">public</span> <span class="type">class</span> <span class="class">CDIEventUI</span> <span class="directive">extends</span> UI {
<span class="annotation">@Inject</span>
InputPanel inputPanel;
<span class="annotation">@Inject</span>
DisplayPanel displayPanel;
<span class="annotation">@Override</span>
<span class="directive">protected</span> <span class="type">void</span> init(VaadinRequest request) {
Layout content =
<span class="keyword">new</span> HorizontalLayout(inputPanel, displayPanel);
setContent(content);
}
}</code></pre>
我用于将命中写入 csv 文件的 python 代码是:
hits = soup.find_all("pre", "CodeRay highlight")# "programlisting")
f = open('extractedsuorceTEST2.csv','ab')
writer = csv.writer(f)
writer.writerow(('page', hits[0].text.encode('UTF-8').replace('Â',' ')))
通过这段代码, hits[0] 是:
'@CDIUI("cdievents")\n@Theme("valo")\npublic class CDIEventUI extends UI {\n @Inject\n InputPanel inputPanel;\n\n @Inject\n DisplayPanel displayPanel;\n\n @Override\n protected void init(VaadinRequest request) {\n Layout content =\n new HorizontalLayout(inputPanel, displayPanel);\n setContent(content);\n }\n}'
但是写入csv文件的结果是:
@CDIUI(""cdievents"")
@Theme(""valo"")
public class CDIEventUI extends UI {
@Inject
InputPanel inputPanel;
@Inject
DisplayPanel displayPanel;
@Override
protected void init(VaadinRequest request) {
Layout content =
什么时候应该:
@CDIUI("cdievents")
@Theme("valo")
public class CDIEventUI extends UI {
@Inject
InputPanel inputPanel;
@Inject
DisplayPanel displayPanel;
@Override
protected void init(VaadinRequest request) {
Layout content =
new HorizontalLayout(inputPanel, displayPanel);
setContent(content);
}
}
有人可以提出解决方案吗? 谢谢
【问题讨论】:
-
hits[0].text.encode('UTF-8').replace('Â',' ')包含什么内容? -
hits[0].text.encode('UTF-8').replace('Â',' ') : '@CDIUI("cdievents")\n@Theme("valo" )\npublic class CDIEventUI extends UI {\n @Inject\n InputPanel inputPanel;\n\n @Inject\n DisplayPanel displayPanel;\n\n @Override\n protected void init(VaadinRequest request) {\n 布局内容 =\ n new HorizontalLayout(inputPanel, displayPanel);\n setContent(content);\n }\n}'
-
这里的字符串是完整的,但是当我使用“writer.writerow”将它写入CSV文件时它被截断了!!!
标签: python csv beautifulsoup