
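A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization (DPO). The model has 1B parameters and supports Chinese and English.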

Python 263 39 Updated Feb 18, 2025

My learning notes/codes for ML SYS.

Python 1,305 69 Updated Mar 7, 2025

[NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Python 50 4 Updated Nov 25, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,088 623 Updated Feb 10, 2025

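《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, and 50+ mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer bewildering! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀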

Shell 54,482 11,862 Updated Mar 5, 2025
Python 55 16 Updated Jul 26, 2024

Daily updated LLM papers. Subscriptions welcome 👏 If you like it, give it a star 🌟

1,080 47 Updated Jul 31, 2024

Awesome-Paper-list: Visualization meets LLM

32 1 Updated Feb 14, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 910 67 Updated Jan 16, 2025

pdf-translator translates English PDF files into Japanese, preserving the original layout.

Python 310 43 Updated May 7, 2024

Translate PDF files using GPT

Python 116 32 Updated Sep 16, 2024

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,645 574 Updated Mar 6, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool.

Python 27,598 2,125 Updated Mar 7, 2025

WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge

Python 114 14 Updated Nov 11, 2024

Awesome papers about unifying LLMs and KGs

2,244 159 Updated Feb 6, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,864 1,355 Updated Mar 3, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,358 5,310 Updated Mar 7, 2025

🧑‍🚀 Summary of the world's best LLM resources (data processing, model training, model deployment, o1 models, small language models, vision-language models).

3,940 417 Updated Mar 6, 2025

Utilities intended for use with Llama models.

Python 5,886 1,001 Updated Mar 1, 2025

Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents

Python 23 2 Updated May 24, 2022

An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).

Python 48 1 Updated Jun 11, 2024

WavJourney: Compositional Audio Creation with LLMs

Python 531 43 Updated Sep 28, 2023

[T-PAMI] A curated list of self-supervised multimodal learning resources.

246 7 Updated Aug 16, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,638 672 Updated Mar 3, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,719 661 Updated Aug 13, 2024

Yummly Similar Recipe And Similar Ingredient Recommender

Jupyter Notebook 20 6 Updated Mar 15, 2018

🏦 Shared experience and materials for bank written exams and interviews (helps you pass the bank interview and get an amazing bank offer!)

5,284 627 Updated Oct 18, 2024

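This project collects the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical fundamentals every algorithm engineer should know.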

Jupyter Notebook 16,453 4,594 Updated Jun 21, 2022