【Posted】: 2018-02-17 23:51:05
【Question】:
I have to parse an HTML structure like this:
<div class='container'>
<div class='inner-div'>
<span class='text'>...</span>
<div class='author'>
<span data-author='Alpha'>...</span>
</div>
<div class='summary'>
<span data-summary='Exclusive'>Text 1</span>
</div>
</div>
<div class='inner-div'>
<span class='text'>...</span>
<div class='author'>
<span data-author='Beta'>...</span>
</div>
<div class='summary'>
<span data-summary='Non-Exclusive'>Text 2</span>
</div>
</div>
<div class='inner-div'>
<span class='text'>...</span>
<div class='author'>
<span data-author='Gamma'>...</span>
</div>
<div class='summary'>
<span data-summary='Exclusive'>Text 3</span>
</div>
</div>
<div class='inner-div'>
<span class='text'>...</span>
<div class='author'>
<span data-author='Delta'>...</span>
</div>
<div class='summary'>
<span data-summary='Non-Exclusive'>Text 4</span>
</div>
</div>
...
<div class='inner-div'>
<span class='text'>...</span>
<div class='author'>
<span data-author='Zeta'>...</span>
</div>
<div class='summary'>
<span data-summary='Exclusive'>Text 5</span>
</div>
</div>
</div>
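I want to get the first "Exclusive" summary whose author is not "Alpha". In the example above, that would be "Text 3". How can I parse this with Simple HTML DOM, or even the XML DOM?
Addendum: I am looking to parse the HTML with the PHP Simple HTML DOM library. I know how to do this in jQuery, but the Simple HTML DOM library does not seem to support any equivalent of (:has).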
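One possible approach, sketched here with PHP's built-in DOMDocument and DOMXPath rather than Simple HTML DOM: an XPath predicate can play the role of jQuery's `:has` by filtering each `inner-div` on its descendants. The sample markup below is a shortened, hypothetical copy of the structure in the question, and the variable names (`$html`, `$query`, `$firstSummary`) are illustrative only.

```php
<?php
// Hypothetical, shortened copy of the structure from the question.
$html = <<<'HTML'
<div class='container'>
  <div class='inner-div'>
    <div class='author'><span data-author='Alpha'>...</span></div>
    <div class='summary'><span data-summary='Exclusive'>Text 1</span></div>
  </div>
  <div class='inner-div'>
    <div class='author'><span data-author='Beta'>...</span></div>
    <div class='summary'><span data-summary='Non-Exclusive'>Text 2</span></div>
  </div>
  <div class='inner-div'>
    <div class='author'><span data-author='Gamma'>...</span></div>
    <div class='summary'><span data-summary='Exclusive'>Text 3</span></div>
  </div>
  <div class='inner-div'>
    <div class='author'><span data-author='Zeta'>...</span></div>
    <div class='summary'><span data-summary='Exclusive'>Text 5</span></div>
  </div>
</div>
HTML;

// Parse the fragment; libxml warnings about the missing doctype are suppressed.
libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($doc);

// Keep only inner-div blocks whose author is not "Alpha", then take their
// "Exclusive" summary spans; (...)[1] picks the first match in document order.
$query = "(//div[@class='inner-div']"
       . "[div[@class='author']/span[@data-author!='Alpha']]"
       . "/div[@class='summary']/span[@data-summary='Exclusive'])[1]";

$nodes = $xpath->query($query);
$firstSummary = $nodes->length > 0 ? trim($nodes->item(0)->textContent) : null;

echo $firstSummary, PHP_EOL; // prints "Text 3"
```

The predicate `[div[@class='author']/span[@data-author!='Alpha']]` is what stands in for `:has`. With Simple HTML DOM itself, the same filtering would presumably have to be done by hand: loop over the `inner-div` elements, read each block's `data-author` and `data-summary` attributes, and stop at the first block that satisfies both conditions.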
【Comments】:
Tags: php html dom xml-parsing simple-html-dom