Tutorial 5: Clarifying notation with illustration
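
To make the `[3,3,3]` notation touched by this commit concrete, here is a minimal sketch that lists which blocks subsample and at which resolution each block operates. It reuses the `bc == 0 and block_idx > 0` rule from the notebook. The helper name `describe_blocks`, its `input_resolution` parameter, and the assumption that every subsampling block halves the resolution are illustrative and not part of the tutorial code.

```python
def describe_blocks(num_blocks, input_resolution=32):
    """Return one (block_number, group_number, subsample, resolution) tuple per block.

    Hypothetical helper for illustration only. It assumes that each
    subsampling block halves the spatial resolution.
    """
    layout = []
    resolution = input_resolution
    block_number = 0
    for block_idx, block_count in enumerate(num_blocks):
        for bc in range(block_count):
            # Same rule as in the notebook: subsample the first block
            # of each group, except the very first group.
            subsample = (bc == 0 and block_idx > 0)
            if subsample:
                resolution //= 2
            block_number += 1
            layout.append((block_number, block_idx + 1, subsample, resolution))
    return layout


for block_number, group, subsample, resolution in describe_blocks([3, 3, 3]):
    marker = " (downsampling)" if subsample else ""
    print(f"Block {block_number}: group {group}, {resolution}x{resolution}{marker}")
```

Under these assumptions, blocks 4 and 7 are the downsampling blocks, and the three groups run at 32x32, 16x16 and 8x8, matching the added figure.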

phlippe · phlippe · commit b18d9d392308 · 2020-11-03T09:58:38.000+01:00
diff --git a/docs/tutorial_notebooks/tutorial5/Inception_ResNet_DenseNet.ipynb b/docs/tutorial_notebooks/tutorial5/Inception_ResNet_DenseNet.ipynb
@@ -1525,7 +1525,11 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "The overall ResNet architecture consists of stacking multiple ResNet blocks, of which some are downsampling the input. When talking about ResNet blocks in the whole network, we usually group them by the same output shape. Hence, if we say the ResNet has `[3,3,3]` blocks, it means that we have 3 times a group of 3 ResNet blocks, where a subsampling is taking place in the fourth and seventh block. The same notation is used by many other implementations such as in the [torchvision library](https://pytorch.org/docs/stable/_modules/torchvision/models/resnet.html#resnet18) from PyTorch. Our code looks as follows:"
+    "The overall ResNet architecture consists of stacking multiple ResNet blocks, some of which downsample the input. When talking about ResNet blocks in the whole network, we usually group them by output shape. Hence, if we say the ResNet has `[3,3,3]` blocks, it means that we have 3 groups of 3 ResNet blocks each, where subsampling takes place in the fourth and seventh block. The ResNet with `[3,3,3]` blocks on CIFAR10 is visualized below.\n",
+    "\n",
+    "<center width=\"100%\"><img src=\"resnet_notation.svg\" width=\"500px\"></center>\n",
+    "\n",
+    "The three groups operate on the resolutions $32\\times32$, $16\\times16$ and $8\\times8$, respectively. The blocks in orange denote ResNet blocks with downsampling. The same notation is used by many other implementations, such as the [torchvision library](https://pytorch.org/docs/stable/_modules/torchvision/models/resnet.html#resnet18) from PyTorch. Thus, our code looks as follows:"
    ]
   },
   {
@@ -1540,7 +1544,7 @@
     "        \"\"\"\n",
     "        Inputs: \n",
     "            num_classes - Number of classification outputs (10 for CIFAR10)\n",
-    "            num_blocks - List with the number of ResNet blocks to use. The first block of each group uses downsampling, expect the first.\n",
+    "            num_blocks - List with the number of ResNet blocks to use. The first block of each group uses downsampling, except the first.\n",
     "            c_hidden - List with the hidden dimensionalities in the different blocks. Usually multiplied by 2 the deeper we go.\n",
     "            act_fn_name - Name of the activation function to use, looked up in \"act_fn_by_name\"\n",
     "            block_name - Name of the ResNet block, looked up in \"resnet_blocks_by_name\"\n",
@@ -1576,7 +1580,7 @@
     "        blocks = []\n",
     "        for block_idx, block_count in enumerate(self.hparams.num_blocks):\n",
     "            for bc in range(block_count):\n",
-    "                subsample = (bc == 0 and block_idx > 0) # Subsample the first block of each \"super-block\", except the very first one.\n",
+    "                subsample = (bc == 0 and block_idx > 0) # Subsample the first block of each group, except the very first one.\n",
     "                blocks.append(\n",
     "                    self.hparams.block_class(c_in=c_hidden[block_idx if not subsample else (block_idx-1)],\n",
     "                                             act_fn=self.hparams.act_fn,\n",
@@ -2273,7 +2277,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.7.3"
+   "version": "3.7.4"
   }
  },
  "nbformat": 4,
diff --git a/docs/tutorial_notebooks/tutorial5/resnet_notation.svg b/docs/tutorial_notebooks/tutorial5/resnet_notation.svg
@@ -0,0 +1,3 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN" "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
+<svg xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" version="1.1" width="522px" height="162px" viewBox="-0.5 -0.5 522 162"><defs><style type="text/css">@import url(https://fonts.googleapis.com/css?family=Roboto);&#xa;</style></defs><g><path d="M 36 60 L 60.88 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 66.13 60 L 59.13 63.5 L 60.88 60 L 59.13 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="-39" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,21,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 20.999999999999773 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: -38px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 1</div></div></div></foreignObject><text x="21" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 1</text></switch></g><path d="M 97.25 60 L 117.14 60 L 106 60 L 119.63 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 124.88 60 L 117.88 63.5 L 119.63 60 L 117.88 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="22.25" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,82.25,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 
82.24999999999977 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 23px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 2</div></div></div></foreignObject><text x="82" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 2</text></switch></g><path d="M 156 60 L 180.88 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 186.13 60 L 179.13 63.5 L 180.88 60 L 179.13 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="81" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,141,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 140.99999999999977 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 82px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 3</div></div></div></foreignObject><text x="141" y="64" fill="#333333" 
font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 3</text></switch></g><path d="M 217.25 60 L 239.63 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 244.88 60 L 237.88 63.5 L 239.63 60 L 237.88 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="142.25" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#ffe6cc" stroke="#000000" transform="rotate(-90,202.25,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 202.24999999999977 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 143px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 4</div></div></div></foreignObject><text x="202" y="64" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 4</text></switch></g><path d="M 276 60 L 300.88 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 306.13 60 L 299.13 63.5 L 300.88 60 L 299.13 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="201" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,261,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 260.9999999999998 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" 
requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 202px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 5</div></div></div></foreignObject><text x="261" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 5</text></switch></g><path d="M 337.25 60 L 359.63 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 364.88 60 L 357.88 63.5 L 359.63 60 L 357.88 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="262.25" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,322.25,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 322.2499999999998 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 263px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 6</div></div></div></foreignObject><text x="322" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 6</text></switch></g><path d="M 396 60 L 420.88 60" 
fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 426.13 60 L 419.13 63.5 L 420.88 60 L 419.13 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="321" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#ffe6cc" stroke="#000000" transform="rotate(-90,381,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 380.9999999999998 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 322px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 7</div></div></div></foreignObject><text x="381" y="64" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 7</text></switch></g><path d="M 457.25 60 L 477.14 60 L 466 60 L 479.63 60" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 484.88 60 L 477.88 63.5 L 479.63 60 L 477.88 56.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="382.25" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,442.25,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 442.2499999999998 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe 
center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 383px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 8</div></div></div></foreignObject><text x="442" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 8</text></switch></g><rect x="441" y="45" width="120" height="30" rx="4.5" ry="4.5" fill="#f5f5f5" stroke="#666666" transform="rotate(-90,501,60)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 501 60)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 118px; height: 1px; padding-top: 60px; margin-left: 442px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #333333; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">ResNet Block 9</div></div></div></foreignObject><text x="501" y="64" fill="#333333" font-family="Helvetica" font-size="12px" text-anchor="middle">ResNet Block 9</text></switch></g><path d="M 89.75 50 L 84.75 50 Q 79.75 50 79.75 60 L 79.75 120 Q 79.75 130 74.75 130 L 72.25 130 Q 69.75 130 74.75 130 L 77.25 130 Q 79.75 130 79.75 140 L 79.75 200 Q 79.75 210 84.75 210 L 89.75 210" fill="none" stroke="#000000" stroke-miterlimit="10" transform="rotate(-90,79.75,130)" pointer-events="all"/><path d="M 270.5 50 L 265.5 50 Q 260.5 50 260.5 60 L 260.5 120 Q 260.5 130 255.5 130 L 253 130 Q 250.5 130 255.5 130 L 258 130 Q 260.5 130 260.5 140 L 260.5 
200 Q 260.5 210 265.5 210 L 270.5 210" fill="none" stroke="#000000" stroke-miterlimit="10" transform="rotate(-90,260.5,130)" pointer-events="all"/><path d="M 450.25 50 L 445.25 50 Q 440.25 50 440.25 60 L 440.25 120 Q 440.25 130 435.25 130 L 432.75 130 Q 430.25 130 435.25 130 L 437.75 130 Q 440.25 130 440.25 140 L 440.25 200 Q 440.25 210 445.25 210 L 450.25 210" fill="none" stroke="#000000" stroke-miterlimit="10" transform="rotate(-90,440.25,130)" pointer-events="all"/><rect x="26" y="140" width="100" height="20" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 150px; margin-left: 76px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: nowrap; ">Group 1 (32x32)</div></div></div></foreignObject><text x="76" y="154" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">Group 1 (32x32)</text></switch></g><rect x="210.5" y="140" width="100" height="20" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 150px; margin-left: 261px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; 
font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: nowrap; ">Group 2 (16x16)</div></div></div></foreignObject><text x="261" y="154" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">Group 2 (16x16)</text></switch></g><rect x="395.25" y="140" width="90" height="20" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 150px; margin-left: 440px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: nowrap; ">Group 3 (8x8)</div></div></div></foreignObject><text x="440" y="154" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">Group 3 (8x8)</text></switch></g></g><switch><g requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"/><a transform="translate(0,-5)" xlink:href="https://desk.draw.io/support/solutions/articles/16000042487" target="_blank"><text text-anchor="middle" font-size="10px" x="50%" y="100%">Viewer does not support full SVG 1.1</text></a></switch></svg>