💡 [REQUEST] - <GETTING STARTED WITH FULLY SHARDED DATA PARALLEL(FSDP)> #2613

Wesleystormrage · 2023-10-20T09:27:55Z

🚀 Describe the improvement or the new tutorial

After reading the "How FSDP works" section of https://pytorch.org/tutorials/intermediate/FSDP_tutorial.html?highlight=fsdp, I still couldn't figure out what FSDP is, because the tutorial doesn't explain all-gather and reduce-scatter, which I believe are the key concepts in FSDP.
This article helped me: https://engineering.fb.com/2021/07/15/open-source/fsdp/

I believe this is the key passage: "The key insight to unlock full parameter sharding is that we can decompose the all-reduce operations in DDP into separate reduce-scatter and all-gather operations."

I think adding this part to the tutorial would greatly help readers understand FSDP.
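
To make the decomposition concrete, here is a minimal sketch in plain Python. It is not torch.distributed code: the ranks are simulated as a list of lists, and the helper names `all_reduce`, `reduce_scatter`, and `all_gather` are hypothetical stand-ins written only for this illustration. It assumes the vector length divides evenly by the number of ranks. The point it shows is that a reduce-scatter followed by an all-gather gives every rank the same result as a single all-reduce.

```python
# Simulated collectives: grads[r] is the vector held by rank r.
# These helpers are illustrative stand-ins, not torch.distributed APIs.

def all_reduce(grads):
    """Every rank ends up with the elementwise sum over all ranks."""
    total = [sum(vals) for vals in zip(*grads)]
    return [list(total) for _ in grads]

def reduce_scatter(grads):
    """Rank r ends up with only the r-th chunk of the elementwise sum."""
    world_size = len(grads)
    total = [sum(vals) for vals in zip(*grads)]
    chunk = len(total) // world_size  # assumes an even split
    return [total[r * chunk:(r + 1) * chunk] for r in range(world_size)]

def all_gather(shards):
    """Every rank ends up with the concatenation of all ranks' shards."""
    full = [x for shard in shards for x in shard]
    return [list(full) for _ in shards]

# Two simulated ranks, each holding a 4-element gradient vector.
grads = [
    [1.0, 2.0, 3.0, 4.0],
    [10.0, 20.0, 30.0, 40.0],
]

via_all_reduce = all_reduce(grads)
shards = reduce_scatter(grads)          # each rank holds half of the sum
via_decomposition = all_gather(shards)  # each rank holds the full sum again

assert via_all_reduce == via_decomposition
print(shards)                # [[11.0, 22.0], [33.0, 44.0]]
print(via_decomposition[0])  # [11.0, 22.0, 33.0, 44.0]
```

Between the two steps each rank holds only its own shard, which is the state FSDP exploits: as the linked article describes, gradients are reduce-scattered so each rank keeps just its shard, and parameters are all-gathered only when they are needed.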

Existing tutorials on this topic

GETTING STARTED WITH FULLY SHARDED DATA PARALLEL(FSDP)
https://pytorch.org/tutorials/intermediate/FSDP_tutorial.html?highlight=fsdp

Additional context

No response

cc @wconstab @osalpekar @H-Huang @kwen2501 @sekyondaMeta @svekars @carljparker @NicolasHug @kit1980 @subramen

svekars · 2023-10-25T16:22:36Z

@osalpekar @H-Huang @kwen2501 - any thoughts on the above?

ChanBong · 2023-11-01T17:31:36Z

/assigntome

H-Huang · 2023-11-02T20:34:48Z

I think adding that diagram to the introduction of the tutorial (or a link to the article) is fine with me. cc @awgu

awgu · 2023-11-02T21:37:21Z

> I think adding that diagram to the introduction of the tutorial (or a link to the article) is fine with me. cc @awgu

Adding this figure sounds good to me!

svekars added the distributed label Oct 20, 2023

svekars added medium docathon-h2-2023 labels Nov 1, 2023

github-actions bot assigned ChanBong Nov 1, 2023

ChanBong mentioned this issue Nov 4, 2023

Add image for better explanation to FSDP tutorial #2644 (Merged)

svekars closed this as completed in #2644 Nov 13, 2023
