RoBERTa Chinese pre-trained model: RoBERTa for Chinese

Python 2,644 410 Updated Jul 22, 2024

A seq2seq model whose encoder is BERT and whose decoder is a Transformer decoder; it can be used for text generation tasks in natural language processing.

Python 72 10 Updated Aug 3, 2019

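In the information age, the internet stores massive amounts of multimodal, multi-type resources. These resources contain a large volume of harmful online data, such as pornography, gambling, and drugs, telecom fraud, online extortion, online rumors, and fake news. The data is carried as text, images, audio, and video, and exists in unstructured or semi-structured form. The task of building a knowledge graph for governing online harms is to perform knowledge extraction on this massive multimodal, multi-type data, extract the relevant entities and relations, and organize and store them in structured form. The resulting knowledge graph provides interfaces for…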

Jupyter Notebook 8 Updated Jul 13, 2022

Fraud script corpus dataset

10 3 Updated Apr 20, 2022

A project that uses a machine learning algorithm to detect fraud in live chat.

1 Updated Oct 15, 2022

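Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English segmentation, various Chinese word vectors, company name list, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …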

Python 70,067 14,597 Updated May 10, 2024

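This project aims to identify sensitive words in long and short texts and to semantically classify whole passages or sentences, for the purpose of text moderation.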

Python 63 23 Updated Feb 21, 2020