- Tsinghua University
- Beijing
- earthring.github.io
Stars
🏛️ A research-friendly codebase for fast experimentation with single-agent reinforcement learning in JAX • End-to-End JAX RL
Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)
Open-Source Implementations of Large Time-Series Models
[CIKM'23] The official implementation code of DiffuASR.
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
This repository demonstrates how to build a Decoder-Only Transformer with Multi-Query Attention in JAX.
A collection of papers on World Models for Autonomous Driving.
A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
Causal Discovery in Python. It also includes (conditional) independence tests and score functions.
A curated list of awesome model based RL resources (continually updated)
[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"
World Model based Autonomous Driving Platform in CARLA 🚗
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Official code for: "SHINE: Shielding Backdoors in Deep Reinforcement Learning"
Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"
Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344
Code release for "HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction", ICML 2024. https://arxiv.org/pdf/2310.10565
Code release for "Transolver: A Fast Transformer Solver for PDEs on General Geometries", ICML 2024 Spotlight. https://arxiv.org/abs/2402.02366
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR 2024)