<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Deep Learning Digest</title>
    <description>The latest articles on Forem by Deep Learning Digest (@dldigest).</description>
    <link>https://forem.com/dldigest</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Forganization%2Fprofile_image%2F3051%2F7a41eff5-eac1-4346-8f0c-7a784899fa42.png</url>
      <title>Forem: Deep Learning Digest</title>
      <link>https://forem.com/dldigest</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/dldigest"/>
    <language>en</language>
    <item>
      <title>SipMask — New SOTA in Instance Segmentation</title>
      <dc:creator>Mikhail Raevskiy</dc:creator>
      <pubDate>Mon, 09 Nov 2020 19:36:17 +0000</pubDate>
      <link>https://forem.com/dldigest/sipmask-new-sota-in-instance-segmentation-70</link>
      <guid>https://forem.com/dldigest/sipmask-new-sota-in-instance-segmentation-70</guid>
      <description>&lt;p&gt;&lt;strong&gt;SipMask&lt;/strong&gt; is a one-stage neural network for instance segmentation of objects in an image. The model bypasses the previous one-stage state-of-the-art approaches on the &lt;em&gt;COCO test-dev dataset&lt;/em&gt;. Compared to TensorMask, SipMask gives a &lt;strong&gt;1% AP gain&lt;/strong&gt;. Moreover, the model produces predictions &lt;strong&gt;4 times faster&lt;/strong&gt;. The model bypasses &lt;em&gt;YOLACT&lt;/em&gt; by &lt;strong&gt;3% in AP&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--_r9YKueP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/fgc98xqh2sbo37t94dqd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--_r9YKueP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/fgc98xqh2sbo37t94dqd.png" alt="SipMask — New SOTA in Instance Segmentation"&gt;&lt;/a&gt;&lt;/p&gt;
Instance Segmentation with SipMask. Source: https://arxiv.org/pdf/2007.14772v1.pdf



&lt;h1&gt;
  
  
  More about the model
&lt;/h1&gt;

&lt;p&gt;A feature of the neural network architecture is the new spatial preservation (SP) module. The SP module is a feature pooling mechanism in a one-stage segmentation model. The idea of ​​the module is to store spatial information about an object.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--8cQmAH4z--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/mc7e1bghd9r6568gp9fo.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--8cQmAH4z--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/mc7e1bghd9r6568gp9fo.png" alt="SipMask — New SOTA in Instance Segmentation"&gt;&lt;/a&gt;&lt;/p&gt;
The overall architecture of our SipMask comprising fully convolutional mask specialized classification and regression branches. Source: https://arxiv.org/pdf/2007.14772v1.pdf



&lt;p&gt;The model is based on the FCOS architecture. However, the two standard branches of classification and regression have been replaced with mask-specific classification and regression in order to adapt the model for instance segmentation. The classification unit predicts the rates of the classes and assigns spatial coefficients for the regions of the boundaries of objects. These coefficients are then used by the SP to predict the individual masks.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--hjK4JajC--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/so3250kumu1d8u7svjh8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--hjK4JajC--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/so3250kumu1d8u7svjh8.png" alt="Comparison SipMask model with other competing architectures"&gt;&lt;/a&gt;&lt;/p&gt;
Comparison of competing architectures. Source: https://arxiv.org/pdf/2007.14772v1.pdf



&lt;h1&gt;
  
  
  Testing the model
&lt;/h1&gt;

&lt;p&gt;The researchers validated the model on the COCO test dataset. Compared to state-of-the-art one-step approaches for instance segmentation, SipMask produces more accurate predictions.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--PbiqQVYO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/lvt0fp8sz8v3e77qtxq1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--PbiqQVYO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/lvt0fp8sz8v3e77qtxq1.png" alt="SipMask - Benchmarking results for semantic segmantation"&gt;&lt;/a&gt;&lt;/p&gt;
Benchmarking results. Source: https://arxiv.org/pdf/2007.14772v1.pdf



&lt;p&gt;The source code of the project is available &lt;a href="https://github.com/JialeCao001/SipMask"&gt;in the repository on GitHub&lt;/a&gt;.&lt;/p&gt;

&lt;h1&gt;
  
  
  Read More
&lt;/h1&gt;

&lt;p&gt;&lt;em&gt;If you found this article helpful, click the💚 or 👏 button below or share the article on Facebook so your friends can benefit from it too.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://slidetosubscribe.com/raevskymichail/"&gt;https://slidetosubscribe.com/raevskymichail/&lt;/a&gt;&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>machinelearning</category>
      <category>deeplearning</category>
      <category>computerscience</category>
    </item>
  </channel>
</rss>
