@@ -120,16 +120,15 @@ class ToyModel(nn.Module):
        self.lin1 = nn.Linear(3, 3)
        self.relu = nn.ReLU()
        self.lin2 = nn.Linear(3, 2)
-        self.sigmoid = nn.Sigmoid()

        # initialize weights and biases
-        self.lin1.weight = nn.Parameter(torch.arange(0.0, 9.0).view(3, 3))
+        self.lin1.weight = nn.Parameter(torch.arange(-4.0, 5.0).view(3, 3))
        self.lin1.bias = nn.Parameter(torch.zeros(1, 3))
-        self.lin2.weight = nn.Parameter(torch.arange(0.0, 6.0).view(2, 3))
+        self.lin2.weight = nn.Parameter(torch.arange(-3.0, 3.0).view(2, 3))
        self.lin2.bias = nn.Parameter(torch.ones(1, 2))

    def forward(self, input):
-        return self.sigmoid(self.lin2(self.relu(self.lin1(input))))
+        return self.lin2(self.relu(self.lin1(input)))
```

Let's create an instance of our model and set it to eval mode.
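
A minimal sketch of that setup, assuming an illustrative seed, a random input batch, and an all-zero baseline (these exact values are assumptions, not taken from the diff):

```python
import torch

# Illustrative seed so the random input below is reproducible.
torch.manual_seed(123)

model = ToyModel()
model.eval()  # switch off training-time behavior before computing attributions

# A batch of two 3-dimensional inputs and an all-zero reference baseline;
# the attribution calls sketched below reuse these names.
input = torch.rand(2, 3)
baseline = torch.zeros(2, 3)
```
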
@@ -176,9 +175,9 @@ print('IG Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output:
```
-IG Attributions: tensor([[0.0628, 0.1314, 0.0747],
-        [0.0930, 0.0120, 0.1639]])
-Convergence Delta: tensor([0., 0.])
+IG Attributions: tensor([[-0.5922, -1.5497, -1.0067],
+        [ 0.0000, -0.2219, -5.1991]])
+Convergence Delta: tensor([2.3842e-07, -4.7684e-07])
```
The algorithm outputs an attribution score for each input element and a
convergence delta. The lower the absolute value of the convergence delta, the better
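
Because a large absolute delta signals a poor approximation of the path integral, one practical response is to raise `n_steps` and re-check. A sketch using the `model`, `input`, and `baseline` from the setup above (the step count is arbitrary):

```python
import torch
from captum.attr import IntegratedGradients

ig = IntegratedGradients(model)

# More integration steps generally shrink the convergence delta,
# at the cost of extra forward and backward passes.
attributions, delta = ig.attribute(
    input, baseline, target=0, n_steps=200, return_convergence_delta=True
)
print('max |delta|:', torch.abs(delta).max().item())
```
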
@@ -217,9 +216,9 @@ print('GradientShap Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output
```
-GradientShap Attributions: tensor([[ 0.0008, 0.0019, 0.0009],
-        [ 0.1892, -0.0045, 0.2445]])
-Convergence Delta: tensor([-0.2681, -0.2633, -0.2607, -0.2655, -0.2689, -0.2689,  1.4493, -0.2688])
+GradientShap Attributions: tensor([[-0.1542, -1.6229, -1.5835],
+        [-0.3916, -0.2836, -4.6851]])
+Convergence Delta: tensor([ 0.0000, -0.0005, -0.0029, -0.0084, -0.0087, -0.0405,  0.0000, -0.0084])

```
Deltas are computed for each `n_samples * input.shape[0]` example. The user can,
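
The eight deltas above correspond to `n_samples=4` noisy draws times a batch of two, and one natural aggregation is averaging them back to a single delta per example. A sketch, assuming an illustrative baseline distribution and a sample-major layout of the flattened delta tensor:

```python
import torch
from captum.attr import GradientShap

gs = GradientShap(model)

# GradientShap samples baselines from a distribution rather than a single point.
baseline_dist = torch.randn(10, 3) * 0.001

attributions, delta = gs.attribute(
    input, baselines=baseline_dist, target=0, n_samples=4,
    return_convergence_delta=True
)

# One delta per (sample, example) pair; average over the sample axis
# to get a single convergence figure per input example.
deltas_per_example = delta.reshape(4, -1).mean(dim=0)
```
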
@@ -243,8 +242,8 @@ print('DeepLift Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output
```
-DeepLift Attributions: tensor([[0.0628, 0.1314, 0.0747],
-        [0.0930, 0.0120, 0.1639]])
+DeepLift Attributions: tensor([[-0.5922, -1.5497, -1.0067],
+        [ 0.0000, -0.2219, -5.1991]])
Convergence Delta: tensor([0., 0.])
```
DeepLift assigns attribution scores to inputs that are similar to those of Integrated Gradients,
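
The call that produces these numbers mirrors the Integrated Gradients one; a minimal sketch under the same setup:

```python
from captum.attr import DeepLift

dl = DeepLift(model)

# Same input-baseline pair as before; DeepLift needs only a single
# forward/backward pass rather than an integration over many steps.
attributions, delta = dl.attribute(
    input, baseline, target=0, return_convergence_delta=True
)
```
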
@@ -269,12 +268,12 @@ print('DeepLiftSHAP Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output
```
-DeepLiftShap Attributions: tensor([[0.0627, 0.1313, 0.0747],
-        [0.0929, 0.0120, 0.1637]], grad_fn=<MeanBackward1>)
-Convergence Delta: tensor([-2.9802e-08,  0.0000e+00,  0.0000e+00,  0.0000e+00,  0.0000e+00,
-         0.0000e+00,  0.0000e+00,  0.0000e+00,  0.0000e+00,  2.9802e-08,
-         0.0000e+00,  0.0000e+00,  0.0000e+00,  0.0000e+00,  0.0000e+00,
-         0.0000e+00,  0.0000e+00,  2.9802e-08,  0.0000e+00,  2.9802e-08])
+DeepLiftShap Attributions: tensor([[-5.9169e-01, -1.5491e+00, -1.0076e+00],
+        [-4.7101e-03, -2.2300e-01, -5.1926e+00]], grad_fn=<MeanBackward1>)
+Convergence Delta: tensor([-4.6120e-03, -1.6267e-03, -5.1045e-04, -1.4184e-03, -6.8886e-03,
+        -2.2224e-02,  0.0000e+00, -2.8790e-02, -4.1285e-03, -2.7295e-02,
+        -3.2349e-03, -1.6265e-03, -4.7684e-07, -1.4191e-03, -6.8889e-03,
+        -2.2224e-02,  0.0000e+00, -2.4792e-02, -4.1289e-03, -2.7296e-02])
```
`DeepLiftShap` uses `DeepLift` to compute attribution scores for each
input-baseline pair and averages them for each input across all baselines.
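
A sketch of that averaging in code, assuming an illustrative distribution of ten random baselines (consistent with the 20 deltas above: 10 baselines × 2 inputs):

```python
import torch
from captum.attr import DeepLiftShap

dls = DeepLiftShap(model)

# A distribution of baselines; attributions are averaged across all of them,
# and one convergence delta is reported per input-baseline pair.
baseline_dist = torch.randn(10, 3) * 0.001

attributions, delta = dls.attribute(
    input, baseline_dist, target=0, return_convergence_delta=True
)
```
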
@@ -303,10 +302,10 @@ print('IG + SmoothGrad Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output
```
-IG + SmoothGrad Attributions: tensor([[0.0631, 0.1335, 0.0723],
-        [0.0911, 0.0142, 0.1636]])
-Convergence Delta: tensor([ 1.4901e-07, -8.9407e-08,  1.1921e-07,
-         1.4901e-07,  1.1921e-07, -1.7881e-07, -5.9605e-08,  5.9605e-08])
+IG + SmoothGrad Attributions: tensor([[-0.4574, -1.5493, -1.0893],
+        [ 0.0000, -0.2647, -5.1619]])
+Convergence Delta: tensor([ 0.0000e+00,  2.3842e-07,  0.0000e+00, -2.3842e-07,  0.0000e+00,
+        -4.7684e-07,  0.0000e+00, -4.7684e-07])

```
The number of elements in the `delta` tensor is equal to `n_samples * input.shape[0]`.
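
A sketch of the SmoothGrad wrapper that yields those eight deltas (four noisy samples times a batch of two); note that recent Captum releases call the sample-count argument `nt_samples`, while older ones used `n_samples`:

```python
from captum.attr import IntegratedGradients, NoiseTunnel

ig = IntegratedGradients(model)
nt = NoiseTunnel(ig)

# Average IG attributions over 4 Gaussian-perturbed copies of the input.
attributions, delta = nt.attribute(
    input, nt_type='smoothgrad', nt_samples=4, stdevs=0.02,
    baselines=baseline, target=0, return_convergence_delta=True
)
```
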
@@ -334,8 +333,8 @@ print('Neuron Attributions: ', attributions)
```
Output
```
-Neuron Attributions: tensor([[0.0106, 0.0247, 0.0150],
-        [0.0144, 0.0021, 0.0301]])
+Neuron Attributions: tensor([[ 0.0000,  0.0000,  0.0000],
+        [ 1.3358,  0.0000, -1.6811]])
```
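
Those scores come from `NeuronConductance` pointed at a single neuron of a chosen layer. A minimal sketch, assuming neuron 1 of `model.lin1` (older Captum versions name the selector argument `neuron_index` rather than `neuron_selector`):

```python
from captum.attr import NeuronConductance

# Conductance of one hidden neuron with respect to output class 0.
nc = NeuronConductance(model, model.lin1)
attributions = nc.attribute(input, neuron_selector=1, target=0)
```
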

Layer conductance shows the importance of a layer's neurons for a given input.
@@ -351,9 +350,9 @@ print('Layer Attributions: ', attributions, ' Convergence Delta: ', delta)
```
Output
```
-Layer Attributions: tensor([[0.0000, 0.0515, 0.1811],
-        [0.0000, 0.0477, 0.1652]], grad_fn=<SumBackward1>)
-Convergence Delta: tensor([-0.0363, -0.0560])
+Layer Attributions: tensor([[ 0.0000,  0.0000, -3.0856],
+        [ 0.0000, -0.3488, -4.9638]], grad_fn=<SumBackward1>)
+Convergence Delta: tensor([0.0630, 0.1084])
```

Similar to other attribution algorithms that return a convergence delta, LayerConductance
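
The layer attributions and per-example deltas above come from a call along these lines (a sketch, assuming `model.lin1` as the layer of interest):

```python
from captum.attr import LayerConductance

# Conductance of every neuron in the hidden layer; one delta per example.
lc = LayerConductance(model, model.lin1)
attributions, delta = lc.attribute(
    input, baselines=baseline, target=0, return_convergence_delta=True
)
```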