[OpenMMLab AI Practical Camp Phase II Notes] Day 8 Semantic Segmentation and MMSegmentation

1. What is semantic segmentation?

1.1 Task:

Segmenting the image into different areas according to the category of the object is equivalent to classifying each pixel .
Insert image description here

1.2 Application:

Self-driving car
Insert image description here
portrait segmentation
, real-time replacement of video background,
Insert image description here
intelligent remote
sensing, and identification of the categories of surface objects. As you can see from the segmented image on the right, the red parts belong to lakes and rivers. Intelligent remote sensing can monitor changes in surface waters in different seasons, thereby assisting agricultural production, predicting droughts and floods, etc.
Insert image description here
Medical image analysis
Insert image description here

1.3 Classification

Semantic segmentation vs instance segmentation vs panoramic segmentation
Insert image description here

2. Basic ideas of semantic segmentation

2.1 Split by color

Prior knowledge: The colors inside objects are similar, and the colors at the interfaces of objects change.
Insert image description here
Problems:
(1) Prior knowledge is not completely accurate;
(2) Different objects may have similar colors, and objects may also contain multiple colors.

2.2 Pixel-by-pixel classification

Insert image description here
Existing problems:
low efficiency, repeated convolution calculations in overlapping areas
Solution:
reuse convolution calculations
Insert image description here
Insert image description here

2.3 Convolution of fully connected layers

Insert image description here

2.4 Upsampling of prediction maps

Insert image description here
Bilinear Interpolation Bilinear Interpolation
Insert image description here
Convolution realizes bilinear interpolation
Insert image description here
Transposed Convolution Transposed Convolution
Insert image description here
Insert image description here

2.5 Prediction and training of fully convolutional network

Insert image description here

2.6 Upsampling based on multi-level features

Insert image description here
Various solutions:
Insert image description here
Insert image description here

2.5 Context information

The content surrounding an image (also called context) can help us make more accurate judgments.
Insert image description here
Insert image description hereHow to use contextual information during segmentation?
Solution: Add a network branch with a larger receptive field and introduce contextual information into local predictions
Insert image description here

2.6 Atrous convolution and Deeplab series

Insert image description here
Dilated convolution solves downsampling problem
Insert image description here
Insert image description here
Insert image description here
DeepLab model
Insert image description here

2.6 Conditional Random Field CRF

Insert image description here
Conditional random field
Insert image description here
energy function meaning
![Insert image description here](https://img-blog.csdnimg.cn/f908254ecc704ba595f171f22a56acea.p

2.6 Spatial Pyramid Pooling ASPP

Insert image description here
Insert image description here
Insert image description here

3. Summary of semantic segmentation algorithms

Insert image description here

4. Semantic segmentation preface algorithm

4.1 SegFormer

Insert image description here

4.2 K-Net

Insert image description here

4.3 Mask Forms

Insert image description here

4.4 Mask2Former

Insert image description here

4.5 SAM

Insert image description here

5. Evaluation of semantic segmentation models

Insert image description here
Insert image description here

Guess you like

Origin blog.csdn.net/qq_41776136/article/details/131192328