Skip to content
View dalong0514's full-sized avatar

Block or report dalong0514

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Explore the Multimodal “Aha Moment” on 2B Model

Python 362 11 Updated Mar 7, 2025

A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key fr…

Python 616 71 Updated Mar 2, 2025

Public facing notes page

Jupyter Notebook 10,362 4,098 Updated Aug 1, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 14,711 1,634 Updated Feb 23, 2025

100 % FREE, Private (No Internet) DeepSeek’s Advanced RAG: Boost Your RAG Chatbot: Hybrid Retrieval (BM25 + FAISS) + Neural Reranking + HyDe🚀

Python 1,284 147 Updated Feb 9, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,710 6,130 Updated Mar 8, 2025

Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.

Jupyter Notebook 106 7 Updated Feb 28, 2025

This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

Jupyter Notebook 710 143 Updated Mar 7, 2025

Make websites accessible for AI agents

Python 36,645 3,793 Updated Mar 3, 2025

Let AI be your browser operator.

HTML 6,826 365 Updated Mar 7, 2025

Sample apps to help developers get started with Structured Outputs

TypeScript 613 57 Updated Jan 10, 2025

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,045 70 Updated Feb 19, 2025

⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

Go 2,683 138 Updated Mar 4, 2025

A python module to repair invalid JSON from LLMs

Python 1,552 79 Updated Feb 23, 2025

Curated list of datasets and tools for post-training.

2,795 241 Updated Jan 29, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 19,529 1,576 Updated Feb 23, 2025

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 32,969 3,131 Updated Mar 4, 2025

Development repository for the Triton language and compiler

MLIR 14,763 1,845 Updated Mar 8, 2025

Automatic evals for LLMs

HTML 313 33 Updated Mar 8, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,485 821 Updated Mar 7, 2025

Parse PDFs into markdown using Vision LLMs

Python 300 43 Updated Feb 8, 2025

OCR & Document Extraction using vision models

TypeScript 10,190 666 Updated Mar 5, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,966 474 Updated Jan 3, 2025

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 6,331 513 Updated Nov 3, 2024

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 19,866 2,545 Updated Mar 7, 2025

Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.

TypeScript 1,516 139 Updated Mar 5, 2025

Fully local web research and report writing assistant

Python 2,650 355 Updated Feb 24, 2025

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 4,806 576 Updated Feb 23, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 3,241 299 Updated Mar 7, 2025

A simple open-source chat app that uses Exa's API for web search and Deepseek R1 for reasoning

TypeScript 665 67 Updated Jan 30, 2025
Next