Skip to content
View thinklis's full-sized avatar

Block or report thinklis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 20 1 Updated Feb 7, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,249 5,294 Updated Mar 6, 2025

🧑‍🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

3,928 417 Updated Mar 6, 2025

Official electron build of draw.io

JavaScript 52,892 5,164 Updated Mar 5, 2025

Hosts a number of bilingual Mayan-Spanish corpora

JavaScript 6 1 Updated Jun 16, 2024

ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages

Python 7 1 Updated Jan 4, 2025

PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.

Python 27 2 Updated Sep 26, 2024

[ACL 2024] code and data for the paper: LogogramNLP

Jupyter Notebook 5 Updated Sep 25, 2024

The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.

293 14 Updated Dec 20, 2024

Collection of training data management explorations for large language models

312 30 Updated Aug 2, 2024

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)

142 12 Updated Oct 14, 2024
Python 7 Updated Feb 26, 2025

A zero-shot faithfulness evaluation metric for text summarization

Python 11 3 Updated Oct 17, 2023

Code and Dataset for EMNLP 2024 Findings Paper

HTML 3 Updated Dec 9, 2024

https://arxiv.org/pdf/2402.18025

Python 30 2 Updated Jan 26, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 65,095 9,720 Updated Feb 12, 2025

[NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Python 68 4 Updated Feb 11, 2025

[ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training

Python 42 4 Updated Mar 14, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 725 42 Updated Aug 5, 2024

Codes for Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM

50 Updated Oct 8, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,609 2,767 Updated Mar 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,512 6,096 Updated Mar 7, 2025

Practice to LLM.

Jupyter Notebook 1,039 153 Updated Mar 6, 2025

本文原文由知名 Hacker Eric S. Raymond 所撰寫,教你如何正確的提出技術問題並獲得你滿意的答案。

JavaScript 32,061 5,695 Updated Jan 1, 2025

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.

1,628 206 Updated Aug 14, 2024
Next