Skip to content

Commit 40d7dcb

Browse files
committed
README
1 parent feefb65 commit 40d7dcb

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/models/sequence/ss/s4_simple/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ python -m train experiment=s4-simple-cifar wandb=null
1313
This code should reach 83-84% val accuracy on CIFAR10.
1414

1515
By default, the kernel ignores the initial state (fusing `b` and `c`), and only trains the `a` parameters (leaving `theta` fixed to the initialization).
16-
You can play with those parameters in the training run:
16+
You can play with these parameters in the training run:
1717
* Adding `use_initial=true` will add a learnable initial state, and learn the `b` and `c` parameters separately.
1818
* Setting `learn_theta=true` will make the `theta` parameters learnable (we usually see a decrease in performance of about 3 points from this).
1919
* Setting `leran_a=false` will make the `a` parameters not learnable. We don't see much of a performance degradation on CIFAR in this case, which speaks to the utility of the Chebyshev initialization!

0 commit comments

Comments
 (0)