Starred repositories
ICU-based universal language tokenizer
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A PyTorch-based knowledge distillation toolkit for natural language processing
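This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).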
📖 A curated list of resources dedicated to Urdu language.
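An NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu data in the easiest way possible.
Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing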
Unsupervised text tokenizer for Neural Network-based text generation.
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
🚀 Stealth - Secure, Peer-to-Peer, Private and Automatable Web Browser/Scraper/Proxy
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)
Demo showing how to bypass the Cloudflare Challenge page with Turnstile CAPTCHA using puppeteer and 2Captcha
Proxy server to bypass Cloudflare protection
A Python module to bypass Cloudflare's anti-bot page.
A browser automation framework and ecosystem.
Library for fast text representation and classification.
An annotated implementation of the Transformer paper.
PyTorch implementation of adversarial attacks [torchattacks]
Code for IJCAI 2019 paper "Real-time Adversarial Attack".
Analysis of the ISCX VPN-nonVPN Dataset 2016 for Encrypted Network Traffic Classification
Using deep learning to classify encrypted network traffic
Statsmodels: statistical modeling and econometrics in Python
Official implementation of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.