【Posted】: 2011-04-22 05:35:36
【Question】:
Why is it that when I fetch this page, I get HTML text:
http://itunes.apple.com/us/app/mobile/id381057839
but when I fetch this one, I get garbled junk?
http://itunes.apple.com/us/app/mobile/id375562663
I use the same download() function in Python for both URLs:
import socket
import urllib2

def download(source_url):
    # Fetch source_url with browser-like headers and return the raw response body.
    try:
        socket.setdefaulttimeout(10)
        agent = "Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.2.10) Gecko/20100914 AlexaToolbar/alxf-1.54 Firefox/3.6.10 GTB7.1"
        ree = urllib2.Request(source_url)
        ree.add_header('User-Agent', agent)
        ree.add_header("Accept", "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8")
        ree.add_header("Accept-Language", "en-us,en;q=0.5")
        ree.add_header("Accept-Charset", "ISO-8859-1,utf-8;q=0.7,*;q=0.7")
        # Tells the server that compressed (gzip/deflate) responses are acceptable.
        ree.add_header("Accept-Encoding", "gzip,deflate")
        ree.add_header("Host", "itunes.apple.com")
        resp = urllib2.urlopen(ree)
        htmlSource = resp.read()
        return htmlSource
    except Exception, e:
        # On any failure the error is printed and the function returns None.
        print e
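
One plausible cause, which the post itself does not confirm, is that the second response arrives compressed: the request advertises "Accept-Encoding: gzip,deflate", and urllib2 returns the body exactly as received. Below is a minimal sketch of decompressing a body according to its Content-Encoding value. The helper name decode_body is hypothetical and not part of the original code; it relies only on the standard zlib module.

```python
import zlib


def decode_body(raw, content_encoding):
    """Return the body bytes, decompressed according to a Content-Encoding value.

    decode_body is an illustrative helper, not part of the original download().
    """
    encoding = (content_encoding or "").strip().lower()
    if encoding == "gzip":
        # 16 + MAX_WBITS tells zlib to expect a gzip header and trailer.
        return zlib.decompress(raw, 16 + zlib.MAX_WBITS)
    if encoding == "deflate":
        try:
            # Deflate data wrapped in the usual zlib header.
            return zlib.decompress(raw)
        except zlib.error:
            # Some servers send a raw deflate stream without the zlib wrapper.
            return zlib.decompress(raw, -zlib.MAX_WBITS)
    # Missing or unrecognized encoding: return the bytes unchanged.
    return raw
```

Inside download(), this could be applied as decode_body(resp.read(), resp.info().get("Content-Encoding")), assuming the server sets that header when it compresses the response. Dropping the Accept-Encoding header from the request may also avoid the issue, though that depends on how the server behaves.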
【Discussion】:
Tags: python http api rest urllib2