🏳️‍⚧️
Back to be a programmer
Stars

🍭Data

Data and dataset
34 repositories

Tools to download and cleanup Common Crawl data

Python 975 143 Updated Apr 25, 2023

Site code

JavaScript 237 65 Updated Nov 12, 2024

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 440 27 Updated Sep 22, 2024

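Chinese and English sensitive words, language detection, Chinese and foreign mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English string segmentation, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …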

Python 69,887 14,583 Updated May 10, 2024