【发布时间】:2017-09-12 07:02:21
【问题描述】:
我试过这段代码:
src1 = "https://hms.harvard.edu/"<br/>
src = response.css('div.person-line > div >
img::attr("src")').extract_first()<br/>
src = sites/default/files/hms-faculty-emails/BX0UVXkP.jpg <br/>
import urlparse <br/>
urlparse.urljoin(src1, src)<br/>
https://hms.harvard.edu/sites/default/files/hms-faculty-emails/BX0UVXkP.jpg<br/>
src2 = urlparse.urljoin(src1,src)<br/>
email = pytesseract.image_to_string(Image.open(src2))<br/>
我收到了这个错误
ioerror errno 22 invalid mode ('rb') or filename
如何从文本图像中获取电子邮件文本..有人可以帮忙吗?
【问题讨论】: