dgl/tutorials/models/4_old_wines at 714f2e1ced1b9fdbfcfe397e6ae4103a92b11fba · ChenAris/dgl

History

Name		Name	Last commit message	Last commit date
parent directory ..
2_capsule.py		2_capsule.py
7_transformer.py		7_transformer.py
README.txt		README.txt

README.txt

.. _tutorials4-index:


Old (new) wines in new bottle
=============================

* **Capsule** `[paper] <https://arxiv.org/abs/1710.09829>`__ `[tutorial]
  <4_old_wines/2_capsule.html>`__ `[code]
  <https://github.com/dmlc/dgl/tree/master/examples/pytorch/capsule>`__:
  this new computer vision model has two key ideas -- enhancing the feature
  representation in a vector form (instead of a scalar) called *capsule*, and
  replacing max-pooling with dynamic routing. The idea of dynamic routing is to
  integrate a lower level capsule to one (or several) of a higher level one
  with non-parametric message-passing. We show how the later can be nicely
  implemented with DGL APIs.


* **Transformer** `[paper] <https://arxiv.org/abs/1706.03762>`__ `[tutorial] <4_old_wines/7_transformer.html>`__ 
  `[code] <https://github.com/dmlc/dgl/tree/master/examples/pytorch/transformer>`__ and **Universal Transformer** 
  `[paper] <https://arxiv.org/abs/1807.03819>`__ `[tutorial] <4_old_wines/7_transformer.html>`__
  `[code] <https://github.com/dmlc/dgl/tree/master/examples/pytorch/transformer/modules/act.py>`__:
  these two models replace RNN with several layers of multi-head attention to
  encode and discover structures among tokens of a sentence. These attention
  mechanisms can similarly formulated as graph operations with
  message-passing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

4_old_wines

4_old_wines

README.txt

Files

4_old_wines

Directory actions

More options

Directory actions

More options

Latest commit

History

4_old_wines

Folders and files

parent directory

README.txt