深度学习笔记

首页 > 代码库 > 深度学习笔记

2024-08-17 22:34:51 223人阅读

Assume the output from a layer in CNN is N × N × d dimension, which is the output of d filters for N × N spatial cells. Each spatial cell is computed from a receptive field in the input image.

The receptive fields of all the spatial cells in the input image can highly overlap with each other. The size of one receptive field can be computed layer by layer in CNN. In a convolution (pooling) layer, if the filter (pooling) size is a×a and the stride is s, then T ×T cells in the output of this layer corresponds to [s*(T ? 1) + a] × [s*(T ? 1) + a] cells in the input of this layer. For example, one cell in the CONV5 (the 5th convolutional)layer of CNN model (imagenet-vgg-m) [40] corresponds to a 139 × 139 receptive field in the 224 × 224 input image (cf. Fig. 4).

深度学习笔记

声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉：投诉/举报工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。

联系
我们

首页 > 代码库 > 深度学习笔记

深度学习笔记

看完仍有疑问？有类似问题直接问程序猿