Analysis of Improving Instance Segmentation Network Based on DCTMask
This paper expounds that Mask R-CNN is a popular network in the task of instance segmentation,but in the past,using high-resolution grid prediction masks has greatly increased the complexity of training Mask R-CNN,and low-resolution grid prediction masks have inadequate detail capture.The introduction of DCT methods allows for compact encoding of high-resolution binary binary grid masks as vectors,which reduces complexity and improves efficiency.When dealing with deep networks in which several commonly used activation functions are frequently applied,it often presents insufficient information preservation problems.Therefore,this chapter introduces Swish activation functions to improve the performance of deeper networks.