DTennant

Starred repositories

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 10,420 657 Updated Feb 27, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,153 92 Updated Feb 26, 2025

[ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

Python 28 Updated Feb 17, 2025

A sparse attention kernel supporting mixed sparse patterns

C++ 148 4 Updated Feb 13, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 533 10 Updated Feb 26, 2025

SWE Arena

Python 26 2 Updated Feb 23, 2025
JavaScript 5 1 Updated Sep 9, 2023

Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.

Python 14 1 Updated Feb 11, 2025
Jupyter Notebook 8 Updated Feb 11, 2025

A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from vari…

Python 2,689 267 Updated Dec 14, 2024

HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and mo…

Python 608 51 Updated Feb 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,880 343 Updated Feb 27, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 927 67 Updated Feb 26, 2025

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 94 4 Updated Dec 10, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,748 1,380 Updated Feb 1, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 913 77 Updated Jan 24, 2025

The Campsite monorepo

TypeScript 4,424 664 Updated Feb 24, 2025

PyTorch implementation of PerCo (Towards Image Compression with Perfect Realism at Ultra-Low Bitrates, ICLR 2024)

Python 69 2 Updated Jan 13, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,135 93 Updated Feb 12, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,687 517 Updated Jan 26, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,247 49 Updated Jan 12, 2025

Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers

Python 83 3 Updated Jul 15, 2024

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 12,441 1,152 Updated Feb 26, 2025

A Telegram bot to recommend arXiv papers

Python 247 20 Updated Feb 9, 2025

A family of versatile and state-of-the-art video tokenizers.

Python 345 21 Updated Jan 15, 2025

An Open Source Toolkit For LLM Distillation

Python 514 53 Updated Jan 7, 2025
Python 115 18 Updated Jun 18, 2024

Source code for "Large language models surpass human experts in predicting neuroscience results"

70 10 Updated Nov 29, 2024