【Posted】: 2014-01-30 08:45:08
【Question】:
We have the following RSS feed:
<title>THIS IS THE TITLE</title>
<link>http://www.website.com/....</link>
<description>
<div class="primary-image">
<img typeof="foaf:Image" src="http://website.com/" alt="Drink driving" title="Drink driving" />
</div>
<div class="field-group-format group_meta field-group-div group-meta speed-fast effect-none">
<span class="field field-name-field-published-date field-type-datetime field-label-hidden">
<span class="field-item even">
<span class="date-display-single" property="dc:date" datatype="xsd:dateTime" content="2014-01-29T17:43:00+00:00">29 Jan, 2014 5:43pm</span>
</span>
</span>
<span class="field field-name-field-author field-type-node-reference field-label-hidden">
<span class="field-item even"><a href="/authors/joe-finnerty">Joe Finnerty</a></span>
</span>
</div>
<p class="short-desc">TEXT THAT I WANT TO EXTRACT FROM HERE</p>
</description>
I am trying to extract <p class="short-desc">TEXT THAT I WANT TO EXTRACT FROM HERE</p> using this script. I have checked several questions here but found no actual answer.
I tried adding
$htmlStr = $node->getElementsByTagName('description')->item(0)->nodeValue;
$html = new DOMDocument();
$html->loadHTML($htmlStr);
$xpath = new DOMXPath($html);
$desc = $xpath->query("//*[contains(concat(' ', normalize-space(@class), ' '), ' short-desc')]");
before $item = array ( , inside the foreach loop, but it did not do the job. Also, in the output
&lt; is replacing <,
&quot; is replacing ", and
&gt; is replacing >.
Please help. I have been searching for an answer for days without finding one.
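For reference, here is a minimal sketch of the extraction described above. It is not taken from the script in question. It assumes the <description> content arrives entity-escaped (as the &lt; / &gt; output suggests), so that nodeValue yields an HTML string that DOMDocument::loadHTML can parse. The sample feed and the helper name extractShortDesc are hypothetical and only serve the illustration.

```php
<?php
// Hypothetical sample feed: one <item> whose <description> holds
// entity-escaped HTML. The real feed's markup may differ.
$rss = <<<'XML'
<rss version="2.0"><channel><item>
<title>THIS IS THE TITLE</title>
<description>&lt;div class="primary-image"&gt;&lt;img src="http://website.com/" alt="Drink driving" /&gt;&lt;/div&gt;
&lt;p class="short-desc"&gt;TEXT THAT I WANT TO EXTRACT FROM HERE&lt;/p&gt;</description>
</item></channel></rss>
XML;

// Hypothetical helper: returns the text of the first element carrying the
// short-desc class inside the item's description, or null if none is found.
function extractShortDesc(DOMElement $item): ?string
{
    $descNode = $item->getElementsByTagName('description')->item(0);
    if ($descNode === null) {
        return null;
    }

    // nodeValue returns the decoded text, so &lt;p&gt; is already <p> here.
    $htmlStr = $descNode->nodeValue;
    if (trim($htmlStr) === '') {
        return null;
    }

    // Parse the HTML fragment, keeping libxml warnings out of the output.
    $previous = libxml_use_internal_errors(true);
    $html = new DOMDocument();
    $html->loadHTML($htmlStr);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Match the class as a whole word (note the space on both sides).
    $xpath = new DOMXPath($html);
    $nodes = $xpath->query(
        "//*[contains(concat(' ', normalize-space(@class), ' '), ' short-desc ')]"
    );
    if ($nodes === false || $nodes->length === 0) {
        return null;
    }

    // textContent gives the text only, without the surrounding <p> tag.
    return trim($nodes->item(0)->textContent);
}

$doc = new DOMDocument();
$doc->loadXML($rss);
foreach ($doc->getElementsByTagName('item') as $node) {
    echo extractShortDesc($node), PHP_EOL;
}
// prints: TEXT THAT I WANT TO EXTRACT FROM HERE
```

Note that DOMXPath::query returns a node list, so the text has to be read from one of its items (for example via textContent). If the feed instead embeds the markup as real child elements of <description>, nodeValue would return only the concatenated text and this approach would need to query the XML document directly.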
【Comments】:
- Do you need only the text, or the text together with the <p> tag?
- Only the TEXT THAT I WANT TO EXTRACT FROM HERE from that specific p class tag.