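[Posted]: 2016-08-14 12:27:52
[Question]:
I am trying to scrape the permissions table from the following site: https://register.fca.org.uk/ShPo_FirmDetailsPage?id=001b000000MfaDiAAJ
I am trying to find out whether XPath can target a particular class by its text, such as the text below (note that the IDs are random, so they cannot be used for targeting, and every table has the same class):
Advising on a home purchase plan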
<div id="a2Nb000000035ohEAA" class="collapse DisciplineDetails PassportDetails PermDesc">
<h3 class="PermissionsListHeader">Advising on a home purchase plan</h3>
<br>
<br>
</div>
<ul class="PermissionConditionsList">
<li class="PermissionsConditionsItem">
Customer Type
<ul class="PermCondsLimitationsList">
<li style="list-style: none"><span id="j_id0:j_id1:j_id110:regActTable:0:j_id531:0:j_id533:0:j_id535:0:j_id538"></span></li>
<li class="PermCondsLimitationsItem Popover">Customer</li>
</ul>
</li>
</ul>
<ul class="PermissionConditionsList">
<li class="PermissionsConditionsItem">
Investment Type
<ul class="PermCondsLimitationsList">
<li style="list-style: none"><span id="j_id0:j_id1:j_id110:regActTable:0:j_id531:1:j_id533:0:j_id535:0:j_id538"></span></li>
<li class="PermCondsLimitationsItem Popover">Home purchase plans</li>
</ul>
</li>
</ul>
</div>
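[Comments]:
-
Could you explain your requirement in more detail? Do you want the class name of the element that matches the text, or the div that matches the text?
-
Hi Maheeka, thanks for your help. I am trying to extract the table: the XPath should find the table whose "PermissionsListHeader" element has text matching "Advising on a home purchase plan", and then extract the Customer Type (in this case "Customer", but there may be several).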
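The lookup described in this comment can be sketched in Python with only the standard library. This is a minimal sketch under assumptions, not a tested scraper for the live page: `SAMPLE` is a simplified, well-formed copy of the markup from the question, the outer `<div class="row">` is hypothetical (the question shows only its closing tag), and `limitations` is a helper name made up for illustration. `xml.etree.ElementTree` supports only a subset of XPath, so the match on the condition name is done in Python rather than in the path expression.

```python
import xml.etree.ElementTree as ET

# Simplified, well-formed copy of the markup from the question.
# ASSUMPTION: the outer <div class="row"> is hypothetical; the question
# shows only the closing tag of the real container.
SAMPLE = """
<div class="row">
  <div id="a2Nb000000035ohEAA" class="collapse DisciplineDetails PassportDetails PermDesc">
    <h3 class="PermissionsListHeader">Advising on a home purchase plan</h3>
  </div>
  <ul class="PermissionConditionsList">
    <li class="PermissionsConditionsItem">
      Customer Type
      <ul class="PermCondsLimitationsList">
        <li style="list-style: none"><span/></li>
        <li class="PermCondsLimitationsItem Popover">Customer</li>
      </ul>
    </li>
  </ul>
  <ul class="PermissionConditionsList">
    <li class="PermissionsConditionsItem">
      Investment Type
      <ul class="PermCondsLimitationsList">
        <li style="list-style: none"><span/></li>
        <li class="PermCondsLimitationsItem Popover">Home purchase plans</li>
      </ul>
    </li>
  </ul>
</div>
"""


def limitations(root, permission, condition):
    """Return the limitation items listed under `condition` for `permission`.

    `permission` is matched against the full text of an <h3>, so it must not
    contain a single quote (it is embedded in the path expression).
    """
    results = []
    # Select each <div> that has an <h3> child with exactly this text,
    # then step up to its parent, which also holds the condition lists.
    for container in root.findall(f".//div[h3='{permission}']/.."):
        for item in container.findall("ul/li[@class='PermissionsConditionsItem']"):
            # The condition name is the <li>'s own leading text node.
            if (item.text or "").strip() != condition:
                continue
            for leaf in item.findall("ul/li[@class='PermCondsLimitationsItem Popover']"):
                results.append((leaf.text or "").strip())
    return results


root = ET.fromstring("<page>" + SAMPLE + "</page>")
print(limitations(root, "Advising on a home purchase plan", "Customer Type"))
# -> ['Customer']
```

In a tool that evaluates full XPath 1.0, the same idea might be written as a single expression, shown here untested against the live page: `//div[h3[@class='PermissionsListHeader'][normalize-space()='Advising on a home purchase plan']]/following-sibling::ul/li[starts-with(normalize-space(), 'Customer Type')]//li[contains(@class, 'PermCondsLimitationsItem')]`. Note that `following-sibling::ul` would also pick up lists belonging to later permissions if they share the same parent element.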
Tags: xpath web-scraping import.io