
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,118 27,438 Updated Jan 2, 2025

Fess is a very powerful and easily deployable Enterprise Search Server.

Java 1,011 167 Updated Dec 29, 2024

ACHE is a web crawler for domain-specific search.

Java 460 134 Updated Aug 24, 2023
Java 2 1 Updated Apr 17, 2017

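Focuses on several core problems in natural language processing: lexical analysis, syntactic analysis, semantic analysis, language detection, information extraction, text clustering, and text classification. Provides a complete general-purpose design and reference implementation for developers in these fields. Covers a variety of natural language processing algorithms and adapts multiple natural language processing frameworks. Compatible with Lucene/Solr/ElasticSearch plugins.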

Java 112 29 Updated Apr 12, 2023

thulac analysis plugin for elasticsearch

Java 190 27 Updated Sep 18, 2020

BosonNLP Analysis for ElasticSearch

Java 102 23 Updated Apr 17, 2017

Elasticsearch with T5/BERT/other models provided by Hugging Face Transformers.

Python 14 3 Updated Jun 12, 2023

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm series models)

Python 9,771 1,390 Updated Jul 31, 2023

The pkuseg toolkit for multi-domain Chinese word segmentation

Python 6,576 988 Updated Nov 5, 2022

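Chinese and English sensitive words, language detection, Chinese and foreign mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …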

Python 70,280 14,612 Updated May 10, 2024

100+ Chinese Word Vectors: over a hundred kinds of pre-trained Chinese word vectors

Python 11,906 2,322 Updated Oct 30, 2023

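Chinese personal name corpus. Name generator. Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, English names. Useful for Chinese word segmentation and person-name entity recognition.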

4,028 999 Updated Mar 27, 2024

An image similarity search engine built on top of Lire. Images can be filtered by a keyword query (Chinese supported) and are afterwards optically ranked. This engine provides an easy …

Java 7 3 Updated Oct 17, 2023
Java 3 4 Updated Jun 4, 2021

ChainSQL: the collaboration of blockchain and database

C++ 216 76 Updated Jan 12, 2023

WebViewer UI built in React

JavaScript 420 355 Updated Dec 13, 2024

Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.

Python 3,696 215 Updated Dec 31, 2024

PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages

Java 3,503 347 Updated Dec 16, 2024

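Company name corpus. Organization name corpus. Company short names, abbreviations, brand words, enterprise names. Useful for Chinese word segmentation and organization-name entity recognition.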

1,250 373 Updated Mar 27, 2024

An open source engine for license management on the Java Virtual Machine.

Java 345 71 Updated Nov 16, 2022

A reverse image search engine powered by Elasticsearch and TensorFlow

Python 322 50 Updated Apr 3, 2021

Simple image search engine

Python 756 241 Updated Nov 14, 2021

🎇 Quickly search over billions of images

Python 2,948 404 Updated Dec 6, 2022

Face search engine

Python 198 43 Updated Sep 16, 2016

Convert Word documents to simple and clean HTML

Java 257 48 Updated Dec 30, 2024

Instant Message

Java 2 2 Updated Jun 23, 2021

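ansj word segmentation: a true Java implementation of ict. Both segmentation quality and speed exceed the open-source version of ict. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries.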

Java 6,496 2,320 Updated Nov 19, 2023

IK Chinese word segmentation, compatible with solr/lucene 6.6.0, with optimized search for numbers and English

Java 37 19 Updated Nov 1, 2019

Addon to provide a set of common content store implementations and easy-to-use configuration (no Spring config)

Java 44 19 Updated Jun 24, 2024