@@ -9,8 +9,8 @@ This allows one to build very rich architectures:
9
9
* Table Conversion Modules convert between tables and Tensors:
10
10
* [ SplitTable] ( #nn.SplitTable ) : splits a Tensor into a table of Tensors;
11
11
* [ JoinTable] ( #nn.JoinTable ) : joins a table of Tensors into a Tensor;
12
+ * [ MixtureTable] ( #nn.MixtureTable ) : mixture of experts weighted by a gater;
12
13
* [ SelectTable] ( #nn.SelectTable ) : select one element from a table;
13
- * [ MixtureTable] ( #nn.MixtureTable ) : mixture of experts using a gater;
14
14
* Pair Modules compute a measure like distance or similarity from a pair (table) of input Tensors :
15
15
* [ PairwiseDistance] ( #nn.PairwiseDistance ) : outputs the ` p ` -norm. distance between inputs;
16
16
* [ DotProduct] ( #nn.DotProduct ) : outputs the dot product (similarity) between inputs;
@@ -386,8 +386,9 @@ Creates a module that takes a Table `{gater, experts}` as input and outputs
386
386
the mixture of ` experts ` (a Tensor or Table of Tensors) using a
387
387
` gater ` Tensor. When ` dim ` is provided, it specifies the dimension of
388
388
the ` experts ` Tensor that will be interpolated (or mixed). Otherwise,
389
- the ` experts ` are expected to be provided as a Table of Tensors. This
390
- Module is currently only implemented for mini-batches.
389
+ the ` experts ` should take the form of a Table of Tensors. This
390
+ Module works for ` experts ` of dimension 1D or more, and for a
391
+ 1D or 2D ` gater ` , i.e. for single examples or mini-batches.
391
392
392
393
Considering an ` input = {G,E} ` with a single example, then
393
394
the mixture of experts Tensor ` E ` with
@@ -474,7 +475,6 @@ Forwarding a batch of 2 examples gives us something like this:
474
475
475
476
```
476
477
477
-
478
478
<a name =" nn.SelectTable " />
479
479
## SelectTable ##
480
480
0 commit comments