Skip to content
View kachiO's full-sized avatar

Block or report kachiO

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)

Python 456 32 Updated Jan 16, 2025

🦙 Integrating LLMs into structured NLP pipelines

Python 1,207 94 Updated Jan 8, 2025

📚 Process PDFs, Word documents and more with spaCy

Python 457 24 Updated Mar 8, 2025

🍬 Confection: the sweetest config system for Python

Python 183 12 Updated May 31, 2024

📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…

TypeScript 12,571 533 Updated Mar 8, 2025

A list of useful Open Source tools and scrapers to gather data for LLMs

214 22 Updated Feb 24, 2025

This is the repository content that contains all of the course code

1 Updated Mar 5, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 3,265 301 Updated Mar 7, 2025

A python module that wraps the pdftoppm utility to convert PDF to PIL Image object

Python 1,734 199 Updated Jul 23, 2024

📝 python package to calculate readability statistics of a text object - paragraphs, sentences, articles.

Python 1,241 168 Updated Mar 8, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 33,533 3,308 Updated Mar 8, 2025

A Python library for dewarping/straightening/reformatting document images and PDFs

Python 9 Updated Feb 27, 2025

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

Python 192 25 Updated Mar 7, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 8,878 575 Updated Mar 7, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 10,879 1,336 Updated Mar 7, 2025

CCCS security control profiles expressed using OSCAL

Python 9 1 Updated Jan 24, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 291,251 48,399 Updated Dec 2, 2024

📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)

581 101 Updated Mar 16, 2023

TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.

Python 20 2 Updated Feb 21, 2025

Bridging LLM and Recommender System.

Jupyter Notebook 725 67 Updated Feb 11, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 705 88 Updated Mar 7, 2025

SmolLM2-135M-Instruct.Q4_1 for LLM

Python 22 2 Updated Feb 7, 2025

RAGChecker: A Fine-grained Framework For Diagnosing RAG

Python 783 68 Updated Dec 13, 2024

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 5,815 638 Updated Mar 7, 2025
Jupyter Notebook 3 Updated Feb 3, 2025

🛒 Simple recommender with matrix factorization, graph, and NLP. Beating the regular collaborative filtering baseline.

Python 141 29 Updated Jul 7, 2024

Embedding Vector Oriented Clustering

Python 132 6 Updated Feb 28, 2025

Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddings recursively. This helps us understand user behaviour on…

TypeScript 1 Updated Jan 20, 2025

Fully open reproduction of DeepSeek-R1

Python 22,366 2,004 Updated Mar 8, 2025
Next