Commit 0ce65ad

RoPER (labmlai#126)
1 parent c5d9235 commit 0ce65ad

17 files changed: 3204 additions and 102 deletions

.gitignore

Lines changed: 2 additions & 1 deletion
@@ -15,4 +15,5 @@ logs
 html/
 diagrams/
 .comet.config
-settings.md
+settings.md
+labml_app.log

.labml.yaml

Lines changed: 1 addition & 0 deletions
@@ -19,3 +19,4 @@ indicators:
 name: optim.*
 options:
 comet: false
+web_api: http://localhost:5005/api/v1/track?

docs/experiments/arithmetic_dataset.html

Lines changed: 900 additions & 0 deletions
Large diffs are not rendered by default.

docs/normalization/deep_norm/experiment.html

Lines changed: 2 additions & 1 deletion
@@ -70,7 +70,8 @@
 <a href='#section-0'>#</a>
 </div>
 <h1><a href="index.html">DeepNorm</a> Experiment</h1>
-<p><a href="https://colab.research.google.com/github/labmlai/annotated_deep_learning_paper_implementations/blob/master/labml_nn/normalization/deep_norm/experiment.ipynb"><img alt="Open In Colab" src="https://colab.research.google.com/assets/colab-badge.svg"></a> <a href="https://app.labml.ai/run/ec8e4dacb7f311ec8d1cd37d50b05c3d"><img alt="View Run" src="https://img.shields.io/badge/labml-experiment-brightgreen"></a></p>
+<p><a href="https://colab.research.google.com/github/labmlai/annotated_deep_learning_paper_implementations/blob/master/labml_nn/normalization/deep_norm/experiment.ipynb"><img alt="Open In Colab" src="https://colab.research.google.com/assets/colab-badge.svg"></a> <a href="https://app.labml.ai/run/ec8e4dacb7f311ec8d1cd37d50b05c3d"><img alt="View Run" src="https://img.shields.io/badge/labml-experiment-brightgreen"></a> <a href="https://www.comet.ml/labml/deep-norm/61d817f80ff143c8825fba4aacd431d4?experiment-tab=chart&showOutliers=true&smoothing=0&transformY=smoothing&xAxis=step"><img alt="Open In Comet" src="https://images.labml.ai/images/comet.svg?experiment=deep_norm&file=experiment"></a></p>
+
 </div>
 <div class='code'>
 <div class="highlight"><pre><span class="lineno">15</span><span></span><span class="kn">import</span> <span class="nn">copy</span>

docs/sitemap.xml

Lines changed: 31 additions & 3 deletions
@@ -204,7 +204,7 @@

 <url>
 <loc>https://nn.labml.ai/normalization/deep_norm/index.html</loc>
-<lastmod>2022-04-23T16:30:00+00:00</lastmod>
+<lastmod>2022-05-18T16:30:00+00:00</lastmod>
 <priority>1.00</priority>
 </url>

@@ -244,6 +244,13 @@
 </url>


+<url>
+<loc>https://nn.labml.ai/experiments/arithmetic_dataset.html</loc>
+<lastmod>2022-06-02T16:30:00+00:00</lastmod>
+<priority>1.00</priority>
+</url>
+
+
 <url>
 <loc>https://nn.labml.ai/experiments/index.html</loc>
 <lastmod>2020-12-26T16:30:00+00:00</lastmod>
@@ -603,14 +610,35 @@

 <url>
 <loc>https://nn.labml.ai/transformers/rope/index.html</loc>
-<lastmod>2022-04-05T16:30:00+00:00</lastmod>
+<lastmod>2022-05-31T16:30:00+00:00</lastmod>
+<priority>1.00</priority>
+</url>
+
+
+<url>
+<loc>https://nn.labml.ai/transformers/rope/value_pe/arithmetic_experiment.html</loc>
+<lastmod>2022-06-02T16:30:00+00:00</lastmod>
+<priority>1.00</priority>
+</url>
+
+
+<url>
+<loc>https://nn.labml.ai/transformers/rope/value_pe/index.html</loc>
+<lastmod>2022-06-02T16:30:00+00:00</lastmod>
+<priority>1.00</priority>
+</url>
+
+
+<url>
+<loc>https://nn.labml.ai/transformers/rope/value_pe/experiment.html</loc>
+<lastmod>2022-05-31T16:30:00+00:00</lastmod>
 <priority>1.00</priority>
 </url>


 <url>
 <loc>https://nn.labml.ai/transformers/rope/experiment.html</loc>
-<lastmod>2022-03-12T16:30:00+00:00</lastmod>
+<lastmod>2022-05-31T16:30:00+00:00</lastmod>
 <priority>1.00</priority>
 </url>

616644

docs/transformers/rope/experiment.html

Lines changed: 2 additions & 2 deletions
@@ -92,7 +92,7 @@ <h3>Rotary PE attention</h3>
 <div class='code'>
 <div class="highlight"><pre><span class="lineno">21</span><span class="k">def</span> <span class="nf">_rotary_pe_mha</span><span class="p">(</span><span class="n">c</span><span class="p">:</span> <span class="n">TransformerConfigs</span><span class="p">):</span>
 <span class="lineno">22</span> <span class="kn">from</span> <span class="nn">labml_nn.transformers.rope</span> <span class="kn">import</span> <span class="n">RotaryPEMultiHeadAttention</span>
-<span class="lineno">23</span> <span class="k">return</span> <span class="n">RotaryPEMultiHeadAttention</span><span class="p">(</span><span class="n">c</span><span class="o">.</span><span class="n">n_heads</span><span class="p">,</span> <span class="n">c</span><span class="o">.</span><span class="n">d_model</span><span class="p">)</span></pre></div>
+<span class="lineno">23</span> <span class="k">return</span> <span class="n">RotaryPEMultiHeadAttention</span><span class="p">(</span><span class="n">c</span><span class="o">.</span><span class="n">n_heads</span><span class="p">,</span> <span class="n">c</span><span class="o">.</span><span class="n">d_model</span><span class="p">,</span> <span class="mf">1.</span><span class="p">)</span></pre></div>
 </div>
 </div>
 <div class='section' id='section-2'>
@@ -157,7 +157,7 @@ <h3>Rotary PE attention</h3>

 </div>
 <div class='code'>
-<div class="highlight"><pre><span class="lineno">46</span> <span class="n">experiment</span><span class="o">.</span><span class="n">create</span><span class="p">(</span><span class="n">name</span><span class="o">=</span><span class="s2">&quot;rotary_pe_transformer&quot;</span><span class="p">)</span></pre></div>
+<div class="highlight"><pre><span class="lineno">46</span> <span class="n">experiment</span><span class="o">.</span><span class="n">create</span><span class="p">(</span><span class="n">name</span><span class="o">=</span><span class="s2">&quot;rotary_pe_transformer&quot;</span><span class="p">,</span> <span class="n">writers</span><span class="o">=</span><span class="p">{</span><span class="s1">&#39;screen&#39;</span><span class="p">})</span></pre></div>
 </div>
 </div>
 <div class='section' id='section-7'>
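
Note on the `_rotary_pe_mha` change: the call to `RotaryPEMultiHeadAttention` gains a third positional argument, `1.`. The diff that defines this argument (`docs/transformers/rope/index.html`) is not rendered on this page, so its meaning is an assumption here: it is read as the fraction of each head's features that receive the rotary rotation, with `1.` meaning all of them. The sketch below is a pure-Python illustration of that idea, not the `labml_nn` code. The helper name `rotate_features` and its parameters `rope_fraction` and `base` are hypothetical.

```python
import math
from typing import List


def rotate_features(x: List[float], position: int,
                    rope_fraction: float = 1.0,
                    base: float = 10_000.0) -> List[float]:
    """Apply a rotary position rotation to one token's feature vector.

    Hypothetical helper for illustration only. Only the first
    `rope_fraction` of the features are rotated (an assumed reading of
    the new third argument); the remaining features pass through.
    """
    d = len(x)
    d_rope = int(d * rope_fraction)
    d_rope -= d_rope % 2          # rotations act on pairs of features
    half = d_rope // 2
    out = list(x)
    for i in range(half):
        # Pair feature i with feature i + half and rotate the pair by
        # an angle that grows linearly with the token position.
        theta = base ** (-2.0 * i / d_rope)
        angle = position * theta
        a, b = x[i], x[i + half]
        out[i] = a * math.cos(angle) - b * math.sin(angle)
        out[i + half] = b * math.cos(angle) + a * math.sin(angle)
    return out


def dot(u: List[float], v: List[float]) -> float:
    """Plain dot product, standing in for an attention score."""
    return sum(a * b for a, b in zip(u, v))
```

With this sketch, the score between a rotated query at position `m` and a rotated key at position `n` depends only on `m - n`, which is the property rotary embeddings are used for. Under the stated assumption, a fraction below `1.` would leave the trailing features position-independent.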

docs/transformers/rope/index.html

Lines changed: 148 additions & 69 deletions
Large diffs are not rendered by default.
