Bai, IEEE 2019

Mask TextSpotter architecture

  • a feature pyramid network (FPN) as backbone.
  • a region proposal network (RPN) for generating text proposals.
  • a Fast R-CNN for bounding box regression.
  • a mask branch for text instance segmentation, character segmentation, and text sequence recognition.

Data flow

During training, the RPN first generates text proposals. The RoI features of those proposals are then fed to the Fast R-CNN branch and the mask branch, which produce accurate text candidate boxes, text instance segmentation maps, character segmentation maps, and text sequences.
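
The training-time flow can be sketched roughly as follows. This is only an illustration of the order of the stages, not the actual Mask TextSpotter code: every function name here (training_forward, stub_backbone, stub_rpn, and so on) is hypothetical, and the stubs return plain strings and tuples where a real model would use tensors.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


@dataclass
class MaskBranchOutput:
    # Placeholders only; a real model would hold tensors here.
    instance_map: str    # text instance segmentation map
    character_maps: str  # character segmentation maps
    text_sequence: str   # recognized text sequence


def training_forward(
    image: str,
    backbone: Callable[[str], str],
    rpn: Callable[[str], List[Box]],
    roi_features: Callable[[str, Box], str],
    box_head: Callable[[str], Box],
    mask_branch: Callable[[str], MaskBranchOutput],
) -> Tuple[List[Box], List[MaskBranchOutput]]:
    features = backbone(image)                              # FPN backbone
    proposals = rpn(features)                               # RPN text proposals
    rois = [roi_features(features, p) for p in proposals]   # RoI features per proposal
    boxes = [box_head(r) for r in rois]                     # Fast R-CNN box regression
    masks = [mask_branch(r) for r in rois]                  # mask branch outputs
    return boxes, masks


# Hypothetical stand-ins so the sketch runs end to end.
def stub_backbone(image: str) -> str:
    return f"features({image})"


def stub_rpn(features: str) -> List[Box]:
    return [(0.0, 0.0, 10.0, 5.0), (20.0, 8.0, 40.0, 16.0)]


def stub_roi_features(features: str, box: Box) -> str:
    return f"roi({features}, {box})"


def stub_box_head(roi: str) -> Box:
    return (0.0, 0.0, 1.0, 1.0)


def stub_mask_branch(roi: str) -> MaskBranchOutput:
    return MaskBranchOutput("instance", "characters", "text")


boxes, masks = training_forward(
    "img", stub_backbone, stub_rpn, stub_roi_features,
    stub_box_head, stub_mask_branch,
)
print(len(boxes), len(masks))
```

In this sketch the Fast R-CNN head and the mask branch both consume the same RoI features, which matches the training description above; how the two branches are wired at inference time is not covered by these notes.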
