【问题标题】:PyTorch - How to derive manually ResNet?PyTorch - 如何手动派生 ResNet?
【发布时间】:2020-04-14 13:49:28
【问题描述】:

我设法找到了 ResNet-18 的这个实现:

def __init__(self): 
    super(ResNet18,self).__init__()
    self.block1 = nn.Sequential(
        nn.Conv2d(1,64,kernel_size=2,stride=2,padding=3,bias=False),
        nn.BatchNorm2d(64),
        nn.ReLU(True)
    )
    self.block2 = nn.Sequential(
       nn.MaxPool2d(1,1),
       ResidualBlock(64,64),
       ResidualBlock(64,64,2)
    )
    self.block3 = nn.Sequential(
        ResidualBlock(64,128),
        ResidualBlock(128,128,2)
    )
    self.block4 = nn.Sequential(
        ResidualBlock(128,256),
        ResidualBlock(256,256,2)
    )
    self.block5 = nn.Sequential(
        ResidualBlock(256,512),
        ResidualBlock(512,512,2)
    )
    self.avgpool = nn.AvgPool2d(2) 
    # vowel_diacritic 
    self.fc1 = nn.Linear(512,11)
    # grapheme_root
    self.fc2 = nn.Linear(512,168)
    # consonant_diacritic
    self.fc3 = nn.Linear(512,7)

def forward(self,x):
    x = self.block1(x)
    x = self.block2(x)
    x = self.block3(x)
    x = self.block4(x)
    x = self.block5(x)
    x = self.avgpool(x)
    x = x.view(x.size(0),-1)
    x1 = self.fc1(x)
    x2 = self.fc2(x)
    x3 = self.fc3(x)
    return x1,x2,x3

class ResidualBlock(nn.Module):

    def __init__(self,in_channels,out_channels,stride=1,kernel_size=3,padding=1,bias=False):
        super(ResidualBlock,self).__init__()
        self.cnn1 = nn.Sequential(nn.Conv2d(in_channels,out_channels,kernel_size,stride,padding,bias=False), nn.BatchNorm2d(out_channels), nn.ReLU(True) )

        self.cnn2 = nn.Sequential( nn.Conv2d(out_channels,out_channels,kernel_size,1,padding,bias=False), nn.BatchNorm2d(out_channels) ) 

        if stride != 1 or in_channels != out_channels: 
            self.shortcut = nn.Sequential( nn.Conv2d(in_channels,out_channels,kernel_size=1,stride=stride,bias=False), nn.BatchNorm2d(out_channels) ) 
        else: 
            self.shortcut = nn.Sequential() 

    def forward(self,x): 
        residual = x 
        x = self.cnn1(x) 
        x = self.cnn2(x) 
        x += self.shortcut(residual) 
        x = nn.ReLU(True)(x) 
        return x

我还是 PyTorch 的新手,我正在尝试以与上述 ResNet18 类似的格式获取 ResNets 34、50 和 101 的架构。最初,我认为我必须做的唯一更改是根据https://github.com/pytorch/vision/blob/master/torchvision/models/resnet.py 的源 GitHub 代码,每个块的最后一个参数,其中对于 ResNet34,我们有 return _resnet('resnet34', BasicBlock, [3, 4, 6, 3], pretrained, progress, **kwargs)

我做错了吗?我仍在阅读文档并尝试理解这些方法,因此我们将不胜感激一些指导。 :')

【问题讨论】:

    标签: python deep-learning classification pytorch resnet


    【解决方案1】:

    来自具有 FastAi 风格的 TensorFlow 背景:如果您直接从 torchvision 模块导入这些 ResNet 变体,则可以使用它们:

    #Download the pretrained model
    import torch
    import torchvision.models as models
    model = models.resnet18(pretrained = True) #change here to whatever model you want
    
    #Switch device to gpu if available
    device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
    

    【讨论】:

    • 有趣,澄清一下 - 一个愚蠢的问题:导入未经训练的 ResNet 模型,仅仅是models.resnet18(pretrained=False)
    • 是的,完全正确!就这么简单:D
    • 谢谢伙计!好吧,我试过了,但得到了错误:RuntimeError: Given groups=1, weight of size 64 3 7 7, expected input[32, 1, 64, 64] to have 3 channels, but got 1 channels instead 。之前运行上述 ResNet18 模型在训练期间运行良好。你对此有什么看法吗?
    • 是的。 3 vs 1 通道问题通常来自您使用灰度图像而不是 RGB 图像这一事实。所有预训练模型都在 RGB 图像上进行了预训练。确保向模型提供正确的数据。
    • 没问题的朋友,很好,你明白了,这是最重要的。
    猜你喜欢
    • 2018-12-19
    • 2021-10-23
    • 2018-12-22
    • 2021-09-08
    • 2019-09-05
    • 1970-01-01
    • 2023-03-14
    • 2018-08-15
    • 1970-01-01
    相关资源
    最近更新 更多