之前的目标检测算法均需要多个步骤实现目标的分类和定位。如RCNN系列,首先需要进行region proposal,RCNN到Faster RCNN模块逐步将其他任务整合到网络,最终将region proposal也用网络来实现,但是仍然是分步骤实现的。分步骤实现的缺点是实现复杂,运行速度慢。
-
INPUT: image=448×448×3。
-
C1: input=448×448×3,filters=64@7×7×3,stride=2,pad=3→Leaky ReLU,output=224×224×64
-
MAX_POOl1: input=224×224×64,window=2×2,stride=2,output=112×112×64
-
C2: input=112×112×64,filters=92@3×3×64→Leaky ReLU,output=112×112×192
-
MAX_POOL2: input=112×112×192,window=2×2,stride=2,output=56×56×192
-
C3: input=56×56×192,filters=128@1×1×192→Leaky ReLU,output=56×56×128
-
C4: input=56×56×128,filters=256@3×3×128→Leaky ReLU,output=56×56×256
-
C5: input=56×56×256,filters=256@1×1×256→Leaky ReLU,output=56×56×256
-
C6: input=56×56×512,filters=512@3×3×256→Leaky ReLU,output=56×56×512
-
MAX_POOL3: input=56×56×512,window=2×2,stride=2,output=28×28×512
-
C7:input=28×28×512,filters=256@1×1×512→Leaky ReLU,output=28×28×256
-
C8:input=28×28×256,filters=512@3×3×256→Leaky ReLU,output=28×28×512
-
C9:input=28×28×512,filters=256@1×1×512→Leaky ReLU,output=28×28×256
-
C10:input=28×28×256,filters=512@3×3×256→Leaky ReLU,output=28×28×512
-
C11: input=28×28×512,filters=256@1×1×512→Leaky ReLU,output=28×28×256
-
C12: input=28×28×256,filters=512@3×3×256→Leaky ReLU,output=28×28×512
-
C13: input=28×28×512,filters=256@1×1×512→Leaky ReLU,output=28×28×256
-
C14: input=28×28×256,filters=512@3×3×256→Leaky ReLU,output=28×28×512
-
C15: input=28×28×512,filters=256@1×1×512→Leaky ReLU,output=28×28×256
-
C16: input=28×28×256,filters=1024@3×3×256→Leaky ReLU,output=28×28×1024
-
MAX_POOL4: input=28×28×1024,window=2×2,stride=2,output=14×14×1024
-
C17: input=14×14×1024,filters=512@1×1×1024→Leaky ReLU,output=14×14×512
-
C18: input=14×14×512,filters=1024@3×3×512→Leaky ReLU,output=14×14×1024
-
C19: input=14×14×1024,filters=512@1×1×1024→Leaky ReLU,output=14×14×512
-
C20: input=14×14×512,filters=1024@3×3×512→Leaky ReLU,output=14×14×1024
-
C21: input=14×14×1024,filters=1024@3×3×1024→Leaky ReLU,output=14×14×1024
-
C22: input=14×14×1024,filters=1024@3×3×1024,stride=2→Leaky ReLU,output=7×7×1024
-
C23: input=7×7×1024,filters=1024@3×3×1024→Leaky ReLU,output=7×7×1024
-
C24: input=7×7×1024,filters=1024@3×3×1024→Leaky ReLU,output=7×7×1024
-
FC1: input=7×7×1024=50176,weight=512×50176→Leaky ReLU,output=512
-
FC2: input=512,weight=4096×512,drop_prob=0.5→Leaky ReLU,output=4096
-
OUTPUT: input=?drop_dims,output=7×7×30