🔲 -> ✂️
Traditional approach. First detect bboxes, then segment each box.
|
|
Date |
Mask R-CNN |
|
Mar 2017 |
PANet |
Path Aggregation Network |
Mar 2018 |
HTC |
Hybrid Task Cascade |
Jan 2019 |
Single-Shot
|
|
Date |
YOLACT |
Real-time Instance Segmentation |
Apr 2019 |
SSAP |
Single-Shot Instance Segmentation With Affinity Pyramid |
Sep 2019 |
PolarMask |
Single Shot Instance Segmentation with Polar Representation |
Sep 2019 |
SOLO |
Segmenting Objects by Locations |
Dec 2019 |
YOLACT++ |
Better real-time instance segmentation |
Dec 2019 |
BlendMask |
Top-Down Meets Bottom-Up for Instance Segmentation |
Jan 2020 |
SOLOv2 |
Dynamic and fast instance segmentation |
Mar 2020 |
CenterMask |
Single shot instance segmentation with point representation |
Apr 2020 |
Unet predicting boundaries
- This method works very well for round obects (microscopic cells, cars, …)
- This method works very bad for complex shapes (persons, occussions, etc.)
- Paper Deep Watershed Transform for Instance Segmentation Nov 2016
- https://on-demand.gputechconf.com/gtc/2017/presentation/s7588-min-bai-deep-watershed-transform-for-instance-segmentation.pdf
- https://www.kaggle.com/c/data-science-bowl-2018/discussion/55118
Unet predicting embedding per piexel
Resorces