色图网App下载,97草草在线

本博文主要介紹了VGGNET的網(wǎng)絡(luò)結(jié)構(gòu)，并在cifar10數(shù)據(jù)集上實(shí)現(xiàn)了

VGGNET詳解

??VGG Net由牛津大學(xué)的視覺幾何組（Visual Geometry Group）和 Google DeepMind公司的研究員一起研發(fā)的的深度卷積神經(jīng)網(wǎng)絡(luò)，在 ILSVRC 2014 上取得了第二名的成績，將 Top-5錯誤率降到7.3%。它主要的貢獻(xiàn)是展示出網(wǎng)絡(luò)的深度（depth）是算法優(yōu)良性能的關(guān)鍵部分。
??VGGNET的網(wǎng)絡(luò)結(jié)構(gòu)如下圖所示，VGGNET包含多層網(wǎng)絡(luò)，深度從11層到19層不等，較為常用的是VGG16和VGG19，接下來我們以VGG16為例，即下圖中的D，介紹VGGNET。

VGGNET網(wǎng)絡(luò)結(jié)構(gòu)

輸入尺寸為 $224\times224\times3$ 的圖片，用64個 $3\times3$ 的卷積核作兩次卷積和ReLU,卷積后的尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=224%5Ctimes224%5Ctimes64" alt="224\times224\times64" mathimg="1">。
池化層，使用 $max pooling$ ，池化單元大小為 $2\times2$ ，池化后尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=112%5Ctimes112%5Ctimes64" alt="112\times112\times64" mathimg="1">。
輸入尺寸為 $112\times112\times64$ ，使用128個 $3\times3$ 的卷積核作兩次卷積和ReLU，尺寸改變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=112%5Ctimes112%5Ctimes128" alt="112\times112\times128" mathimg="1">。
池化層，使用 $max pooling$ ，池化單元大小為 $2\times2$ ，池化后尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=56%5Ctimes56%5Ctimes128" alt="56\times56\times128" mathimg="1">。
輸入尺寸為 $56\times56\times128$ ，使用256個 $3\times3$ 的卷積核作三次卷積和ReLU，尺寸改變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=56%5Ctimes56%5Ctimes256" alt="56\times56\times256" mathimg="1">。
池化層，使用 $max pooling$ ，池化單元大小為 $2\times2$ ，池化后尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=28%5Ctimes28%5Ctimes256" alt="28\times28\times256" mathimg="1">。
輸入尺寸為 $28\times28\times256$ ，使用512個 $3\times3$ 的卷積核作三次卷積和ReLU，尺寸改變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=28%5Ctimes28%5Ctimes512" alt="28\times28\times512" mathimg="1">。
池化層，使用 $max pooling$ ，池化單元大小為 $2\times2$ ，池化后尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=14%5Ctimes14%5Ctimes512" alt="14\times14\times512" mathimg="1">。
輸入尺寸為 $14\times14\times512$ ，使用512個 $3\times3$ 的卷積核作三次卷積和ReLU，尺寸改變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=14%5Ctimes14%5Ctimes512" alt="14\times14\times512" mathimg="1">。
池化層，使用 $max pooling$ ，池化單元大小為 $2\times2$ ，池化后尺寸變?yōu)?img class="math-inline" src="https://math.jianshu.com/math?formula=7%5Ctimes7%5Ctimes512" alt="7\times7\times512" mathimg="1">。
與兩層1x1x4096，一層1x1x1000進(jìn)行全連接+ReLU（共三層）。
通過softmax輸出1000個預(yù)測結(jié)果。

VGGNET的特點(diǎn)

VGGNET全部使用 $3\times3$ 的卷積核和 $2\times2$ 的池化核，通過不斷加深網(wǎng)絡(luò)深度來提升性能。作者認(rèn)為，兩個 $3\times3$ 卷積層的串聯(lián)相當(dāng)于1個 $5\times5$ 的卷積層，3個 $3\times3$ 的卷積層串聯(lián)相當(dāng)于1個7*7的卷積層，即3個 $3\times3$ 卷積層的感受野大小相當(dāng)于1個 $7\times7$ 的卷積層。但是3個 $3\times3$ 的卷積層參數(shù)量只有 $7\times7$ 的一半左右，同時前者可以有3個非線性操作，而后者只有1個非線性操作，這樣使得前者對于特征的學(xué)習(xí)能力更強(qiáng)。
VGGNet的卷積層有一個顯著的特點(diǎn)：特征圖的空間分辨率單調(diào)遞減，特征圖的通道數(shù)單調(diào)遞增。

代碼實(shí)現(xiàn)

import torch.nn as nn


cfg = {
    'VGG11': [64, 'M', 128, 'M', 256, 256, 'M', 512, 512, 'M', 512, 512, 'M'],
    'VGG13': [64, 64, 'M', 128, 128, 'M', 256, 256, 'M', 512, 512, 'M', 512, 512, 'M'],
    'VGG16': [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M', 512, 512, 512, 'M', 512, 512, 512, 'M'],
    'VGG19': [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 256, 'M', 512, 512, 512, 512, 'M', 512, 512, 512, 512, 'M'],
}


class VGG(nn.Module):
    def __init__(self, vgg_name):
        super(VGG, self).__init__()
        self.features = self._make_layers(cfg[vgg_name])
        self.classifier = nn.Linear(512, 10)

    def forward(self, x):
        out = self.features(x)
        out = out.view(out.size(0), -1)
        out = self.classifier(out)
        return out

    def _make_layers(self, cfg):
        layers = []
        in_channels = 3
        for x in cfg:
            if x == 'M':
                layers += [nn.MaxPool2d(kernel_size=2, stride=2)]
            else:
                layers += [nn.Conv2d(in_channels, x, kernel_size=3, padding=1),
                           nn.BatchNorm2d(x),
                           nn.ReLU(inplace=True)]
                in_channels = x
        layers += [nn.AvgPool2d(kernel_size=1, stride=1)]
        return nn.Sequential(*layers)


def VGG11():
    return VGG('VGG11')


def VGG13():
    return VGG('VGG13')


def VGG16():
    return VGG('VGG16')


def VGG19():
    return VGG('VGG19')

色偷偷精品伊人,欧洲久久精品,欧美综合婷婷骚逼,国产AV主播,国产最新探花在线,九色在线视频一区,伊人大交九欧美,1769亚洲,黄色成人av

VGGNet詳解

VGGNet詳解

VGGNET詳解

VGGNET的特點(diǎn)

代碼實(shí)現(xiàn)

相關(guān)閱讀更多精彩內(nèi)容

友情鏈接更多精彩內(nèi)容

色偷偷精品伊人,欧洲久久精品,欧美综合婷婷骚逼,国产AV主播,国产最新探花在线,九色在线视频一区,伊人大交九 欧美,1769亚洲,黄色成人av

VGGNet詳解

VGGNET詳解

VGGNET的特點(diǎn)

代碼實(shí)現(xiàn)

相關(guān)閱讀更多精彩內(nèi)容

友情鏈接更多精彩內(nèi)容

色偷偷精品伊人,欧洲久久精品,欧美综合婷婷骚逼,国产AV主播,国产最新探花在线,九色在线视频一区,伊人大交九欧美,1769亚洲,黄色成人av