xvshiting

Follow

🎯

Focusing

will xvshiting

🎯

Focusing

Follow

9 followers · 13 following

Achievements

Achievements

Stars

ThuCCSLab / Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,012 68 Updated Dec 16, 2024

mattmezza / vue-beautiful-chat

A simple and beautiful Vue chat component backend agnostic, fully customisable and extendable.

Vue 1,513 441 Updated Apr 25, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,861 8,688 Updated Dec 1, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,659 4,055 Updated Jul 17, 2024

cyruliu / darksea

LTL verification on lifted binaries.

LLVM 6 Updated Jun 11, 2024

xvshiting / SpellCor

spell correction fully python tools

Python 3 Updated Dec 3, 2019

aied2021TRMRC / AIED_2021_TRMRC_code

Python 5 2 Updated Jul 7, 2021

nlp-uoregon / trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 738 102 Updated Oct 13, 2024

huawei-noah / Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 3,038 628 Updated Jan 22, 2024

xvshiting / Greedy-snake

greedy snake In shell. Writing for fun.

Shell 3 1 Updated Aug 30, 2021

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,673 6,425 Updated Oct 18, 2024

clinc / oos-eval

Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)

204 42 Updated Jun 1, 2021

yuchenlin / rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,680 160 Updated Aug 18, 2024

CharizardAcademy / hanzi_chaizi

Forked from howl-anderson/hanzi_chaizi

汉字拆字库，可以将汉字拆解成偏旁部首，在机器学习中作为汉字的字形特征

Python 4 1 Updated Nov 29, 2018

RasaHQ / rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,069 4,650 Updated Dec 12, 2024

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,155 815 Updated Nov 27, 2024

tal-tech / edu-bert

好未来开源教育领域首个在线教学中文预训练模型TAL-EduBERT

Python 186 41 Updated Jan 26, 2021

grammarly / gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Python 908 216 Updated May 21, 2024

sebastianruder / NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,739 3,622 Updated Jul 28, 2024

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,529 1,548 Updated May 23, 2024

flairNLP / flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 13,989 2,105 Updated Dec 13, 2024

purcell / emacs.d

An Emacs configuration bundle with batteries included

Emacs Lisp 6,882 2,061 Updated Nov 21, 2024

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,368 1,179 Updated Dec 1, 2024

wolfgarbe / SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,167 298 Updated Oct 27, 2024

bakwc / JamSpell

Modern spell checking library - accurate, fast, multi-language

C++ 618 103 Updated Aug 29, 2024

kakaobrain / helo-word

Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task

Python 89 22 Updated Sep 19, 2019

google-research / bert

TensorFlow code and pre-trained models for BERT

Python 38,369 9,624 Updated Jul 23, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,338 27,301 Updated Dec 15, 2024

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,646 3,513 Updated Jun 2, 2023

xvshiting / ebm_code_release

Forked from openai/ebm_code_release

Python 1 Updated Mar 22, 2019