Skip to content
View YanChuan1's full-sized avatar

Block or report YanChuan1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

ICU based universal language tokenizer

Python 30 2 Updated Jan 19, 2022

开源SFT数据集整理,随时补充

475 39 Updated Jun 2, 2023

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,718 4,751 Updated Jan 21, 2025

A PyTorch-based knowledge distillation toolkit for natural language processing

Python 1,618 239 Updated May 8, 2023

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,328 1,497 Updated Jan 15, 2025

📖 A curated list of resources dedicated to Urdu language.

63 13 Updated May 11, 2021

A Python Implementation of Simhash Algorithm

Python 995 224 Updated Mar 24, 2022

An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.

Python 287 42 Updated Jan 4, 2024

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 34,339 10,325 Updated Jan 15, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,497 1,188 Updated Dec 1, 2024
Jupyter Notebook 226 31 Updated Sep 9, 2021

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Jupyter Notebook 1,316 99 Updated Aug 30, 2023

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,289 3,226 Updated Sep 24, 2024

🚀 Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy

JavaScript 1,058 316 Updated Oct 5, 2023

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)

Python 10,490 1,187 Updated Jun 25, 2024

Demo showing how to bypass Cloudflare Challenge page with Turnstile CAPTCHA with puppeteer and 2Captcha

JavaScript 23 5 Updated Oct 23, 2024

Proxy server to bypass Cloudflare protection

Python 8,465 734 Updated Jan 21, 2025

A Python module to bypass Cloudflare's anti-bot page.

Python 4,625 494 Updated Feb 23, 2024

A browser automation framework and ecosystem.

Java 31,369 8,298 Updated Jan 25, 2025

中国大模型

5,814 491 Updated Nov 30, 2024

Library for fast text representation and classification.

HTML 26,024 4,730 Updated Mar 22, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,927 1,263 Updated Apr 7, 2024

鸣潮后台自动刷BOSS声骸

Python 700 68 Updated Aug 18, 2024

PyTorch implementation of adversarial attacks [torchattacks]

Python 1,954 357 Updated Jun 29, 2024

Code for IJCAI 2019 paper "Real-time Adversarial Attack".

Python 20 3 Updated Jul 4, 2020

Analysis of the ISCX VPN-nonVPN Dataset 2016 for Encrypted Network Traffic Classification

Python 82 14 Updated Jan 2, 2024

using deep learning to classify the encrypted network traffic

Python 148 26 Updated Dec 16, 2020

Statsmodels: statistical modeling and econometrics in Python

Python 10,384 3,218 Updated Jan 20, 2025

Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.

Python 900 138 Updated Aug 2, 2023
Next