【Posted】: 2020-12-18 22:27:08
【Question】:
I can't work out an XPath expression that grabs the text between the <br> tags. Any help would be much appreciated:
Web page:
<hr>
<center>
<a href="http://www.somewebsite.html" target="_blank">Description</a>
</center>
<br>
SEC 10 TWP 20 RGE 30
<br>
PLAT BOOK A PAGE 200
<br>
BLK 10 LOTS 1.4.5.6.7.8. EX
<br>
POSSIBLY MORE TEXT...
<br>
<hr>
XPath copied from the browser for the <a href> link:
//*[@id="prc"]/table/tbody/tr/td/center[6]/a
JavaScript:
var Xpath = 'substring-after(//*[@id="prc"]/table/tbody/tr/td/center[6]/a, "Description")';
// Parse the page source (html is assumed to hold the fetched markup)
var parser = new DOMParser();
var doc = parser.parseFromString(html, 'text/html');
// Property description
var result = doc.evaluate(Xpath, doc, null, XPathResult.STRING_TYPE, null);
var Description = result.stringValue;
// Display message
alert("Description Search Results: " + Description);
Desired result in the displayed message:
SEC 10 TWP 20 RGE 30
PLAT BOOK A PAGE 200
BLK 10 LOTS 1.4.5.6.7.8. EX
POSSIBLY MORE TEXT...
【Comments】:
-
Perhaps something like
//*[@id="prc"]/table/tbody/tr/td/br/following::text(). You will probably need some post-processing to clean up the resulting string.
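A possible way to put that suggestion into practice is sketched below. The original expression returns an empty string because the string value of the <a> element is just "Description", so substring-after() has nothing left to return; the wanted lines are text nodes that are siblings of the <center> element, not descendants of the link. Also, the following:: axis runs to the end of the document, so this sketch walks the following siblings instead and stops at the next <hr>. The helper names cleanLines and extractDescription are made up for illustration, and the code assumes only one <center> on the page wraps a link whose text is exactly "Description".

```javascript
// Hypothetical helper: collapse whitespace in each text chunk, drop blank
// chunks, and join the rest with newlines.
function cleanLines(chunks) {
  return chunks
    .map(function (s) { return s.replace(/\s+/g, ' ').trim(); })
    .filter(function (s) { return s.length > 0; })
    .join('\n');
}

// Hypothetical helper: collect the text nodes that follow the <center>
// wrapping the "Description" link, stopping at the next <hr>.
// Assumes doc is an HTML document (for example from DOMParser).
function extractDescription(doc) {
  var xpath = '//center[a[normalize-space(.)="Description"]]' +
              '/following-sibling::node()';
  var nodes = doc.evaluate(xpath, doc, null,
                           XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);
  var chunks = [];
  for (var i = 0; i < nodes.snapshotLength; i++) {
    var node = nodes.snapshotItem(i);
    // Element node named hr: the description block ends here.
    if (node.nodeType === 1 && node.nodeName.toLowerCase() === 'hr') {
      break;
    }
    // Text node: keep its raw value for post-processing.
    if (node.nodeType === 3) {
      chunks.push(node.nodeValue);
    }
  }
  return cleanLines(chunks);
}

// Usage in a browser, where DOMParser and XPathResult are available.
// The sample markup is a shortened copy of the snippet in the question.
if (typeof DOMParser !== 'undefined') {
  var sampleHtml =
    '<hr><center><a href="#">Description</a></center>' +
    '<br>SEC 10 TWP 20 RGE 30<br>PLAT BOOK A PAGE 200<br><hr>';
  var sampleDoc = new DOMParser().parseFromString(sampleHtml, 'text/html');
  console.log(extractDescription(sampleDoc));
}
```

Anchoring on the link text rather than on center[6] is a design choice here: the positional index breaks as soon as the page gains or loses a <center> block, while the link text is less likely to change.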
Tags: javascript html dom xpath