【发布时间】:2016-03-14 11:46:54
【问题描述】:
我正在使用 Wordpress 插件“WP Web Scraper”来轻松地从网站收集一些数据。函数 wpws_get_content 返回结果“Raised(number%)”我想把它清理成数字。我设法用以下代码返回(90%)
<p id="number1"><?php echo wpws_get_content('http://my.sportrelief.com/sponsor/sachazarb', '#totalizer_percent', array( 'replace_query' => '/Raised/', 'replace_query_type' => 'regex', 'replace_with' => ' ', ) ); ?></p>
我无法进一步解决。奇怪的是,它还返回重复数据,即
<!--
Start of web scrap (created by wp-web-scraper)
Source URL: http://my.sportrelief.com/sponsor/sachazarb
Query: #totalizer_percent (cssselector)
Other options: Array
(
[headers] =>
[cache] => 60
[useragent] => WPWS bot (http://windreeladprint.com)
[timeout] => 2
[on_error] => error_show
[output] => html
[glue] =>
[eq] =>
[gt] =>
[lt] =>
[query_type] => cssselector
[remove_query] =>
[remove_query_type] => cssselector
[replace_query] => /Raised/
[replace_query_type] => regex
[replace_with] =>
[basehref] => 1
[a_target] =>
[callback_raw] =>
[callback] =>
[debug] => 1
[charset] => UTF-8
)
--><span id="totalizer_percent" class="percent"> (90%)</span><span id="totalizer_percent" class="percent"> (90%)</span><!--
End of web scrap
WPWS Cache Control: Remote-fetch via WP_Http
Computing time: 1.306 seconds
-->
【问题讨论】:
标签: php regex wordpress web-scraping