Stars
The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"
Unsupervised text tokenizer for Neural Network-based text generation.
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
A PyTorch library for differentiable submodular minimization.
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
Multi-layer Recurrent Neural Networks (LSTM, RNN) for character-level language models in Python using TensorFlow
The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference…
A KITTI road segmentation model implemented in TensorFlow.
Named Entity Recognition using multilayered bidirectional LSTM
Helpfulness prediction for online product reviews
C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library
gnebehay / OpenTLD
Forked from zk00006/OpenTLD
A C++ implementation of OpenTLD
brat rapid annotation tool (brat) - for all your textual annotation needs
💫 Industrial-strength Natural Language Processing (NLP) in Python
IXA pipes sentence segmenter and tokenizer (http://ixa2.si.ehu.es/ixa-pipes).
IXA pipes Named Entity Tagger (http://ixa2.si.ehu.es/ixa-pipes).
C++11 implementation of the supervised descent optimisation method
Original Caffe version of LightCNN-9. Using the PyTorch version is highly recommended (https://github.com/AlfredXiangWu/LightCNN)
dgk_lost_conv: Chinese dialogue (conversation) corpus