-
Notifications
You must be signed in to change notification settings - Fork 347
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature request: add layer for darts lookup table? #1225
Comments
Is this a tf.text-specific request, or should it be filed against tensorflow? Do you have a link for "darts lookup table"? |
Ah, sorry, I didn't specify the details. It's tensorflow-text specific request. Darts is double array trie and we can use it like lookup table. You can check the basic interface here: https://github.com/s-yata/darts-clone/blob/master/doc/en/Interface.md#dictionary-class. Additionally, tensorflow text already has a dependency of darts-clone (used in wordpiece tokenizer, darts-clone is cloned repository of darts) Lines 37 to 45 in b32645f
Double array trie is performant and efficient data structure to store lots of strings and paired values, so it can be useful to train/serve with lots of vocabs. (like tens of milliions vocabs in the single model. it can be hard to use hash table because of the memory burden) So I'm suggesting implementing the basic methods of the darts-clone's interface. |
In some cases, it can be more efficient and memory-efficient than hashtable in tensorflow.
It should be great if darts lookup table has following methods
The text was updated successfully, but these errors were encountered: