Skip to content
View faneshion's full-sized avatar

Organizations

@NTMC-Community

Block or report faneshion

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
25 stars written in Python
Clear filter

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 31,104 4,464 Updated Feb 3, 2025

🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.

Python 27,672 2,282 Updated Mar 9, 2025

s1: Simple test-time scaling

Python 5,923 685 Updated Mar 6, 2025

Facilitating the design, comparison and sharing of deep text matching models.

Python 3,852 898 Updated Aug 2, 2024

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

Python 2,423 511 Updated Jan 22, 2022

Fully open data curation for reasoning models

Python 1,494 129 Updated Feb 23, 2025

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,490 69 Updated Mar 11, 2024

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

Python 1,455 195 Updated Jul 22, 2023

Full text geoparsing as a Python library

Python 745 96 Updated Sep 17, 2021

TrustRAG:The RAG Framework within Reliable input,Trusted output

Python 744 81 Updated Mar 11, 2025

Kaggle:Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)

Python 731 261 Updated Dec 20, 2017

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

Python 689 71 Updated Aug 25, 2024

Time-NLP的python3版本 中文时间表达词转换

Python 517 123 Updated Dec 8, 2022

Facilitating the design, comparison and sharing of deep text matching models.

Python 496 106 Updated May 3, 2024

BERT for Coreference Resolution

Python 447 93 Updated Dec 8, 2022

A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and Transferability of Contextual Representations" (NAACL 2019).

Python 210 30 Updated Oct 20, 2021

Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET

Python 188 41 Updated Oct 24, 2019

question answering, reading comprehension toolkit

Python 166 43 Updated Oct 16, 2022

Evaluation tools for Retrieval-augmented Generation (RAG) methods.

Python 149 11 Updated Nov 18, 2024

A simple version of MatchPyramid implement in TensorFlow. Paper https://arxiv.org/abs/1602.06359.

Python 133 23 Updated Jan 27, 2019

A curated list of resources dedicated to retrieval-augmented generation (RAG).

Python 97 7 Updated Feb 11, 2025

Official Repo of paper "QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression".

Python 9 1 Updated Aug 12, 2024

The IR papers rocked the world, including best papers, test-of-time papers, and highly cited papers, published in IR conferences.

Python 4 1 Updated Dec 13, 2024

The matchzoo-doc-template contains the code to generate the API document for the matchzoo project.

Python 2 Updated Jan 12, 2019

When to Retrieve? Teaching LLMs to Utilize Information Retrieval Effectively

Python 1 Updated Aug 16, 2024
25 stars written in Python