Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 936 173 Updated Mar 14, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 44,188 5,405 Updated Mar 14, 2025

km1994 / LLMs_interview_notes

该仓库主要记录大模型（LLMs）算法工程师相关的面试题

1,819 125 Updated Dec 26, 2024

Intelligent-Driving-Laboratory / GOPS_DOC

13 Updated Mar 15, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,267 413 Updated Nov 18, 2024

raydac / java-binary-block-parser

most comfortable and dynamic way to process binary data in Java and Android

Java 246 34 Updated Feb 8, 2025

CyC2018 / CS-Notes

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

179,335 51,208 Updated Aug 21, 2024

castorini / docTTTTTquery

docTTTTTquery document expansion model

Python 361 35 Updated Mar 25, 2023

naver / splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 831 90 Updated May 3, 2024

liucongg / ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等

Python 2,724 308 Updated Dec 12, 2023

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,420 4,300 Updated Mar 14, 2025

THUDM / GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,680 603 Updated Jul 25, 2023

THUDM / GLM

GLM (General Language Model)

Python 3,229 326 Updated Nov 3, 2023

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,019 5,227 Updated Jun 27, 2024

thunlp / OpenPrompt

An Open-Source Framework for Prompt-Learning.

Python 4,500 461 Updated Jul 16, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,904 2,609 Updated Mar 4, 2025

Doragd / Algorithm-Practice-in-Industry

搜索、推荐、广告、用增等工业界实践文章收集（来源：知乎、Datafuntalk、技术公众号）

Python 3,115 372 Updated Mar 15, 2025

sfzhou5678 / PolyEncoder

An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)

Python 249 36 Updated Jun 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nankaiming Nankaiming

Block or report Nankaiming

Stars

deepseek-ai / DeepSeek-V3

robert-bor / aho-corasick

fla-org / flash-linear-attention

alibaba / nann

milvus-io / milvus

google-research / google-research

nmslib / hnswlib

state-spaces / mamba

Blealtan / efficient-kan

KindXiaoming / pykan

massquantity / dismember

alibaba / x-deeplearning

facebookresearch / generative-recommenders