squeeze-and-excitation networks · 2018-06-27 · squeeze-and-excitation networks (senets) formed...

19
Squeeze-and-Excitation Networks Jie Hu 1,* Li Shen 2,* Gang Sun 1 1 Momenta 2 Department of Engineering Science, University of Oxford

Upload: others

Post on 16-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Squeeze-and-Excitation Networks

Jie Hu1,* Li Shen2,* Gang Sun1

1 Momenta2 Department of Engineering Science,

University of Oxford

Page 2: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Large Scale Visual Recognition Challenge

Convolutional Neural NetworksFeature Engineering

Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification

[Statistics provided by ILSVRC]

SENets

Page 3: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Convolution

A convolutional filter is expected to be an informative combination• Fusing channel-wise and spatial information

• Within local receptive fields

Page 4: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

A Simple CNN

Page 5: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

A Simple CNN

Channel dependencies are:• Implicit: Entangled with the spatial correlation

captured by the filters• Local: Unable to exploit contextual information

outside this region

Page 6: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Exploiting Channel Relationships

Can the representational power of a network be enhanced by channel relationships?

Design a new architectural unit• Explicitly model interdependencies between the

channels of convolutional features

• Feature recalibrationq Selectively emphasise informative features and inhibit less

useful ones q Use global information

Page 7: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Squeeze-and-Excitation Blocks

Given transformation F"#: input X → featuremaps U• Squeeze• Excitation

Page 8: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

• Aggregate feature maps through spatial dimensions using global average pooling

• Generate channel-wise statistics

U can be interpreted as a collection of local descriptors whose statistics are expressive for the whole image.

Squeeze: Global Information Embedding

Page 9: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

• Learn a nonlinear and non-mutually-exclusive relationship between channels

• Employ a self-gating mechanism with sigmoid functionq Input: channel-wise statisticsq Bottleneck configuration with two FC layers around non-linearityq Output: channel-wise activations

Excitation: Adaptive Recalibration

Page 10: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

• Rescale the feature maps U with the channel activationsq Act on the channels of U

q Channel-wise multiplication

SE blocks intrinsically introduce dynamics conditioned on the input.

Excitation: Adaptive Recalibration

Page 11: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Example Models

SE-ResNet Module

+

Global pooling

FC

ReLU

+

ResNet Module

X

X

X

X

Sigmoid

1 × 1 × C

1 × 1 ×C𝑟

1 × 1 × C

1 × 1 × C

Scale

𝐻 ×W× C

𝐻 ×W× C

𝐻 ×W× C

Residual Residual

FC

1 × 1 ×C𝑟

Inception

Global pooling

FC

SE-Inception Module

FC

X

Inception

X

Inception Module

X

X

Sigmoid

Scale

ReLU

𝐻 ×W× C

1 × 1 × C

1 × 1 × C

1 × 1 × C

1 × 1 ×C𝑟

1 × 1 ×C𝑟

𝐻 ×W× C

Page 12: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Object ClassificationExperiments on ImageNet-1k dataset• Benefits at different depths• Incorporation with modern architectures

Page 13: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Benefits at Different DepthsSE blocks consistently improve performance across different depths at minimal additional computational complexity (no more than 0.26%).

ü SE-ResNet-50 exceeds ResNet-50 by 0.86% and approaches the result of ResNet-101.

ü SE-ResNet-101 outperforms ResNet-152.

Page 14: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Incorporation with Modern Architectures

SE blocks can boost the performance of a variety of network architectures on both residual and non-residual settings.

Page 15: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Beyond Object Classification

• Places365-Challenge Scene Classification

• Object Detection on COCO

SE blocks can generalise well on different datasets and tasks.

Page 16: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Role of Excitation

The role at different depths adapts to the needs of the network• Early layers: Excite informative features in a class agnostic manner

SE_2_3 SE_3_4

Page 17: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Role of Excitation

The role at different depths adapts to the needs of the network• Later layers: Respond to different inputs in a highly class-specific manner

SE_4_6 SE_5_1

Page 18: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Code and Models: https://github.com/hujie-frank/SENet

Conclusion

• Designed a novel architectural unit to improve the representational capacity of networks by dynamic channel-wise feature recalibration.

• Provided insights into the limitations of previous CNN architectures in modelling channel dependencies.

• Induced feature importance may be helpful to related fields, e.g. network compression.

Page 19: Squeeze-and-Excitation Networks · 2018-06-27 · Squeeze-and-Excitation Networks (SENets) formed the foundation of our winner entry on ILSVRC 2017 Classification [Statistics provided

Thank you!