Stars
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
PyTorch Tutorial for Deep Learning Researchers
all kinds of text classification models and more with deep learning
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
中文文本生成(NLG)之文本摘要(text summarization)工具包, 语料数据(corpus data), 抽取式摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。(graph,feature,topic model,summarize to…
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
ACL 2022: BRIO: Bringing Order to Abstractive Summarization
Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Micr…
Code for ACL2020 paper "Heterogeneous Graph Neural Networks for Extractive Document Summarization"
Implementation of different summarization algorithms applied to legal case judgements.
Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)
Code for "Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization"
Long-context pretrained encoder-decoder models
Compositional generalization through meta sequence-to-sequence learning