Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Summary: Design doc: https://fb.quip.com/6lgwApu6q46w For row-wise partition, the sequence embeddings pipeline is ``` (T_g, W, B_local, L_bucket x D) same bucket of local batches → permute → (W, T_g, B_local, L_bucket x D) → a2a on bucketized lengths → (W, T_g, B_local, L_bucket x D) all buckets of local batches → debucketize → (T_g, B_local, L_batch x D) ``` To enable this pipeline, we - use `torch.index_select` to permute the embeddings. - produced the permute mapping for bucketize and unbucketize embeddings and its gradients. - added these metadata in the rw partition workflow. Differential Revision: D27570196 fbshipit-source-id: b061c0c92b65f710c598d28d441bc287a74a217c
- Loading branch information