Skip to content
View jlin816's full-sized avatar

Highlights

  • Pro

Block or report jlin816

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine!

Rust 352 10 Updated Dec 30, 2024

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 3,442 407 Updated Feb 8, 2025

EXPERIMENTAL – A library for language models to respond with GUI.

JavaScript 63 4 Updated Oct 7, 2023

s1: Simple test-time scaling

Python 4,535 498 Updated Feb 8, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,890 496 Updated Feb 7, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 9,288 1,212 Updated Feb 1, 2025

Make websites accessible for AI agents

Python 25,877 2,606 Updated Feb 9, 2025

run paligemma in real time

Python 129 14 Updated May 18, 2024

Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.

TypeScript 4,627 523 Updated Feb 9, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 704 53 Updated Jan 24, 2025

Automagically reverse-engineer REST APIs via capturing traffic

HTML 8,600 303 Updated Feb 3, 2025

Run arbitrary commands when files change

C 4,830 109 Updated Feb 6, 2025

A paper list of some recent works about Token Compress for Vit and VLM

302 16 Updated Feb 9, 2025

A fast multimodal LLM for real-time voice

Python 3,442 233 Updated Feb 8, 2025

SOTA Open Source TTS

Python 18,960 1,434 Updated Feb 3, 2025

Local realtime voice AI

Python 2,212 120 Updated Feb 9, 2025

πŸ” An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,911 599 Updated Jan 8, 2025

A toolkit for describing model features and intervening on those features to steer behavior.

Python 155 13 Updated Nov 10, 2024

NanoGPT (124M) in 3 minutes

Python 2,232 230 Updated Feb 10, 2025

OpenAI's Realtime API minus the enterprise bloat

TypeScript 43 5 Updated Nov 21, 2024

A system for agentic LLM-powered data processing and ETL

Python 1,651 150 Updated Feb 9, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,424 239 Updated Jan 27, 2025

Best practices & guides on how to write distributed pytorch training code

Python 347 25 Updated Jan 23, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,072 199 Updated Feb 9, 2025

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 996 61 Updated Jan 7, 2025

πŸš€πŸ€– Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 29,456 2,369 Updated Feb 9, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,848 4,885 Updated Feb 9, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,426 593 Updated Feb 9, 2025
Next