【发布时间】:2019-09-30 11:40:35
【问题描述】:
我有一个类似的 XML 文件,
<?xml version="1.0" encoding="utf-8"?>
<source>
<publisher>Job App</publisher>
<publisherurl>https://jldfsfsd.jlfdfs.com/Jobs/</publisherurl>
<lastBuildDate>10-19-2015 00:00:00</lastBuildDate>
<job>
<title><![CDATA[Barista/Sandwich Prep]]></title>
<date><![CDATA[10-19-2015]]></date>
<referencenumber><![CDATA[83]]></referencenumber>
<url><![CDATA[https://test/Jobs/Job.aspx?JobPostingId=83&SourceId=3]]></url>
<company><![CDATA[Another Cafe]]></company>
<city><![CDATA[San Francisco]]></city>
<state><![CDATA[California]]></state>
<country><![CDATA[United States of America]]></country>
<postalcode><![CDATA[94123]]></postalcode>
<description><![CDATA[ TESTTESTTESTTESTTESTTESTTESTTEST <br> STEdsasjflsdf<p> dfjhdjlas </p>]]></description>
</job>
<job>
<title><![CDATA[MV Drivers]]></title>
<date><![CDATA[01-01-1900]]></date>
<referencenumber><![CDATA[147]]></referencenumber>
<url><![CDATA[https://sdf.dsfs.com/Jobs/Job.aspx?JobPostingId=147&SourceId=3]]></url>
<company><![CDATA[Papa Johns Pizza]]></company>
<city><![CDATA[Mountain View]]></city>
<state><![CDATA[California]]></state>
<country><![CDATA[United States of America]]></country>
<book><![CDATA[BOOKTEST]]></book>
<postalcode><![CDATA[94404]]></postalcode>
<description><![CDATA[Fun sfsf job while makingfsfup to $20/hour!]]></description>
</job>
在 lxml 解析器中,如何仅获取第二个作业标记及其子节点意味着我只想获取以下数据作为输出。请注意,这不是固定格式,它取决于 XML 文件结构。
<job>
<title><![CDATA[MV Drivers]]></title>
<date><![CDATA[01-01-1900]]></date>
<referencenumber><![CDATA[147]]></referencenumber>
<url><![CDATA[https://sdf.dsfs.com/Jobs/Job.aspx?JobPostingId=147&SourceId=3]]></url>
<company><![CDATA[Papa Johns Pizza]]></company>
<city><![CDATA[Mountain View]]></city>
<state><![CDATA[California]]></state>
<country><![CDATA[United States of America]]></country>
<postalcode><![CDATA[94404]]></postalcode>
<description><![CDATA[Fun sfsf job while makingfsfup to $20/hour!]]></description>
</job>
【问题讨论】:
-
如果您尝试了任何东西,最好展示您的尝试。