17-19,结论:对于unicode字符串,要用unicode正则字符串查找,unicode正则字符串的构成有两个要件,一是字符串本身由unicode字符构成,二是用'u'引导(python2);
20-23,结论:对于str字符串,要用二进制正则字符串查找,用'r'引导,如果用'u'引导,因为并不包含unicode字符,其实际上不起作用,相当于'r';

In [17]: %paste
import re
html = u"abcd 32人喜欢 efgh"
html = re.sub(ur'(\d+)\u4eba\u559c\u6b22', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd <div class="like">32</div> efgh

In [18]: %paste
import re
html = u"abcd 32人喜欢 efgh"
html = re.sub(r'(\d+)\u4eba\u559c\u6b22', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd 32人喜欢 efgh

In [19]: %paste
import re
html = u"abcd 32人喜欢 efgh"
html = re.sub(r'(\d+)\xe4\xba\xba\xe5\x96\x9c\xe6\xac\xa2', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd 32人喜欢 efgh

In [20]: %paste
import re
html = "abcd 32人喜欢 efgh"
html = re.sub(r'(\d+)\u4eba\u559c\u6b22', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd 32浜哄枩娆?efgh

In [21]: %paste
import re
html = "abcd 32人喜欢 efgh"
html = re.sub(ur'(\d+)\u4eba\u559c\u6b22', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd 32浜哄枩娆?efgh

In [22]: %paste
import re
html = "abcd 32人喜欢 efgh"
html = re.sub(r'(\d+)\xe4\xba\xba\xe5\x96\x9c\xe6\xac\xa2', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd <div class="like">32</div> efgh

In [23]: %paste
import re
html = "abcd 32人喜欢 efgh"
html = re.sub(ur'(\d+)\xe4\xba\xba\xe5\x96\x9c\xe6\xac\xa2', r'<div class="like">\1</div>', html)
print html

## -- End pasted text --
abcd <div class="like">32</div> efgh

相关文章:

  • 2021-12-02
  • 2021-12-24
  • 2022-12-23
  • 2022-02-09
  • 2022-12-23
  • 2021-12-24
  • 2021-05-23
  • 2022-02-08
猜你喜欢
  • 2021-10-26
  • 2021-08-04
  • 2022-12-23
  • 2021-07-03
  • 2021-05-02
相关资源
相似解决方案