【Posted】: 2020-10-11 03:55:17
【Question】:
I am scraping data from a web page. A div contains li elements, and on the page they look like this:
Job Description:
• Developing application programming interfaces (APIs) to support mobile functionality
• Keeping up to date with the terminology, concepts and best practices for coding mobile apps
• Using and adapting existing web applications for apps
• Working closely with colleagues to constantly innovate app functionality and design
Here is the part of my scraping code that handles this section (job and jobTitle are JSON arrays):
Elements ele3 = doc.select("div.job-sections div[itemprop=description] section#st-jobDescription");
for (Element element3 : ele3.select("div[itemprop=responsibilities] ul")) {
    // Elements.text() returns the combined text of every matched li as one string
    String job_description = element3.select("li").text();
    job.put(jobTitle.put(new JSONObject().put("description", job_description)));
}
This produces the following output:
{"description" : "Developing application programming interfaces (APIs) to support mobile functionality Keeping up to date with the terminology, concepts and best practices for coding mobile apps Using and adapting existing web applications for apps Working closely with colleagues to constantly innovate app functionality and design"}
But I want each li element to be a separate, comma-separated entry, so the output should look like this:
{"description" : ["Developing application programming interfaces (APIs) to support mobile functionality", "Keeping up to date with the terminology, concepts and best practices for coding mobile apps", "Using and adapting existing web applications for apps", "Working closely with colleagues to constantly innovate app functionality and design"]}
How can I solve this? Can anyone help? Thanks.
【Discussion】:
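One possible approach, offered as a minimal sketch rather than a definitive answer: calling text() on the whole set of li elements joins their text into a single space-separated string, so the individual items are lost before they reach the JSON. Iterating over the li elements and putting each one into an org.json JSONArray should yield the array form shown in the desired output. The class name DescriptionToArray, the method describe, and the sample HTML are hypothetical and exist only for illustration; the selector is shortened from the one in the question and assumes the same page structure.

```java
import org.json.JSONArray;
import org.json.JSONObject;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class DescriptionToArray {

    // Hypothetical helper: collects the text of each li as its own array entry.
    static JSONObject describe(Document doc) {
        JSONArray items = new JSONArray();
        // Select the li elements directly so each one is handled individually.
        for (Element li : doc.select("div[itemprop=responsibilities] ul li")) {
            items.put(li.text());
        }
        // "description" now maps to a JSON array instead of one long string.
        return new JSONObject().put("description", items);
    }

    public static void main(String[] args) {
        // Assumed sample markup, reduced to the part the selector needs.
        String html = "<div itemprop=\"responsibilities\"><ul>"
                + "<li>Using and adapting existing web applications for apps</li>"
                + "<li>Working closely with colleagues to constantly innovate app functionality and design</li>"
                + "</ul></div>";
        Document doc = Jsoup.parse(html);
        System.out.println(describe(doc));
    }
}
```

In the original loop, the resulting object could then be passed to jobTitle.put(...) in place of the one built from the single string. Because each item is stored as its own array element, commas inside an item (as in the second bullet on the page) do not split it.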
Tags: java json web-scraping jsoup org.json