![django logo](https://raw.githubusercontent.com/github/explore/7456fdff59816d37ef383a6c8f32a26ff7332db2/topics/django/django.png)
Starred repositories
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
A high-throughput and memory-efficient inference and serving engine for LLMs
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Retrieval and Retrieval-augmented LLMs
Empowering RAG with a memory-based data interface for all-purpose applications!
Assessing the Utility of Large Language Models for Phenotype-Driven Gene Prioritization in Rare Genetic Disorder Diagnosis
Bi-Directional Equivariant Long-Range DNA Sequence Modeling
Robust recipes to align language models with human and AI preferences
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Extensible, parallel implementations of t-SNE
Integrated Image-based Deep Learning and Language Models for Primary Diabetes Care
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…
A LLM project as a part of CS-5660
Deep Generative Modelling of Patient Timelines using Electronic Health Records
An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.