
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,012 68 Updated Dec 16, 2024

A simple and beautiful Vue chat component backend agnostic, fully customisable and extendable.

Vue 1,513 441 Updated Apr 25, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,861 8,688 Updated Dec 1, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,659 4,055 Updated Jul 17, 2024

LTL verification on lifted binaries.

LLVM 6 Updated Jun 11, 2024

Spell correction tools written entirely in Python

Python 3 Updated Dec 3, 2019

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 738 102 Updated Oct 13, 2024

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 3,038 628 Updated Jan 22, 2024

Greedy Snake game in shell. Written for fun.

Shell 3 1 Updated Aug 30, 2021

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,673 6,425 Updated Oct 18, 2024

Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)

204 42 Updated Jun 1, 2021

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,680 160 Updated Aug 18, 2024

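A Chinese character decomposition library that breaks characters down into radicals and components, for use as glyph features of Chinese characters in machine learning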

Python 4 1 Updated Nov 29, 2018

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,069 4,650 Updated Dec 12, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,155 815 Updated Nov 27, 2024

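TAL-EduBERT: the first Chinese pre-trained model for online teaching in the education domain, open-sourced by TAL Education Group (好未来)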

Python 186 41 Updated Jan 26, 2021

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Python 908 216 Updated May 21, 2024

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,739 3,622 Updated Jul 28, 2024

Large Scale Chinese Corpus for NLP

9,529 1,548 Updated May 23, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 13,989 2,105 Updated Dec 13, 2024

An Emacs configuration bundle with batteries included

Emacs Lisp 6,882 2,061 Updated Nov 21, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,368 1,179 Updated Dec 1, 2024

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,167 298 Updated Oct 27, 2024

Modern spell checking library - accurate, fast, multi-language

C++ 618 103 Updated Aug 29, 2024

Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task

Python 89 22 Updated Sep 19, 2019

TensorFlow code and pre-trained models for BERT

Python 38,369 9,624 Updated Jul 23, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,338 27,301 Updated Dec 15, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,646 3,513 Updated Jun 2, 2023
Python 1 Updated Mar 22, 2019