Skip to content
View waiteryee127's full-sized avatar
  • NetEase
  • hangzhou,china

Block or report waiteryee127

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,752 480 Updated Aug 6, 2024

大模型基础: 一文了解大模型基础知识

3,365 301 Updated Dec 25, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,571 405 Updated Jan 2, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,893 113 Updated Jul 29, 2024

✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解

Go 33,237 5,737 Updated Dec 11, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,289 1,277 Updated Sep 5, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

17,223 1,640 Updated Sep 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,085 5,037 Updated Jan 3, 2025

Paper List of Pre-trained Foundation Recommender Models

322 27 Updated Aug 12, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,524 239 Updated May 1, 2024

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Python 28,328 2,826 Updated Dec 22, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,151 1,223 Updated Dec 12, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 12,429 1,329 Updated Dec 17, 2024

PyTorch tutorials, examples and some books I found 【不定期更新】整理的PyTorch 最新版教程、例子和书籍

Jupyter Notebook 1,230 294 Updated Nov 23, 2020

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 11,598 3,369 Updated Sep 12, 2024

Grok open release

Python 49,753 8,343 Updated Aug 30, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 10,813 1,565 Updated Aug 8, 2024

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Python 591 43 Updated Apr 30, 2024

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,211 464 Updated Aug 7, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,467 518 Updated Oct 22, 2024

开源社区第一个能下载、能运行的中文 LLaMA2 模型!

Python 2,236 201 Updated Oct 26, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,301 877 Updated Jul 1, 2024

An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)

Python 253 36 Updated Jun 12, 2023

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,161 396 Updated Nov 18, 2024

推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Python 1,455 212 Updated Dec 24, 2024

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 18,284 2,213 Updated Nov 13, 2024

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,786 6,871 Updated Nov 17, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,937 5,236 Updated Jun 27, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,707 4,055 Updated Jul 17, 2024
Next