Stars
Java implementation of the Aho-Corasick algorithm for efficient string matching
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
A flexible, high-performance framework for large-scale retrieval problems based on TensorFlow.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Google Research
Header-only C++/python library for fast approximate nearest neighbors
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Advanced Retrieval Algorithms for Decomposing Large-Scale Candidate Set into Pieces.
An industrial deep learning framework for high-dimension sparse data
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
most comfortable and dynamic way to process binary data in Java and Android
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
An Open-Source Framework for Prompt-Learning.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)