-
Phd Student @ SCST, Soochow University
-
18:45
(UTC +08:00) - https://zetangforward.github.io/
- https://www.zhihu.com/people/zctang2000
Highlights
- Pro
-
MyRLHF Public
Forked from OpenRLHF/OpenRLHFCopy from OpenRLHF
Jupyter Notebook Apache License 2.0 UpdatedJan 8, 2025 -
-
LCM_Stack Public
Code for paper: Long cOntext aliGnment via efficient preference Optimization
-
Global_Mamba Public
Code for paper: Revealing and Mitigating the Local Pattern Shortcuts of Mamba
-
-
Long-form-reasoning-data Public
Forked from booydar/babilongBABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
Jupyter Notebook Apache License 2.0 UpdatedNov 20, 2024 -
zetangforward.github.io Public
Forked from Xnhyacinth/Xnhyacinth.github.ioI'm here! 😎 Personal Home Page 👋🏠
JavaScript UpdatedNov 17, 2024 -
-
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
MIT License UpdatedOct 25, 2024 -
L-CITEEVAL Public
L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING?
-
CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Main)
-
-
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attentionCustom Ring attention implementation with flash attention
Python MIT License UpdatedSep 20, 2024 -
Long-Context-Data-Engineering Public
Forked from FranxYao/Long-Context-Data-EngineeringImplementation of paper Data Engineering for Scaling Language Models to 128K Context
-
EasyContext Public
Forked from jzhang38/EasyContextMemory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
-
-
zoology Public
Forked from HazyResearch/zoologyUnderstand and test language model architectures on synthetic tasks.
-
mamba Public
Forked from state-spaces/mambaself-implemented mamba
-
mamba-chat Public
Forked from redotvideo/mamba-chatself-implemented mamba training
-
gfn-lm-tuning Public
Forked from GFNOrg/gfn-lm-tuning -
HaluEval Public
Forked from RUCAIBox/HaluEvalThis is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
-
meshgpt-pytorch Public
Forked from lucidrains/meshgpt-pytorchImplementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Python MIT License UpdatedDec 18, 2023 -
encodec-pytorch Public
Forked from ZhikangNiu/encodec-pytorchunofficial implementation of the High Fidelity Neural Audio Compression
Python MIT License UpdatedDec 13, 2023 -
rtdl-num-embeddings Public
Forked from yandex-research/rtdl-num-embeddings(NeurIPS 2022) On Embeddings for Numerical Features in Tabular Deep Learning
Python MIT License UpdatedDec 8, 2023 -
encodec Public
Forked from facebookresearch/encodecState-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Python MIT License UpdatedNov 20, 2023 -
LLaVA Public
Forked from haotian-liu/LLaVA[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Python Apache License 2.0 UpdatedNov 18, 2023 -
VectorFusion-pytorch Public
Forked from ximinng/VectorFusion-pytorch[CVPR 2023] Unofficial implementation for "VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models"
Python MIT License UpdatedOct 17, 2023 -
CSA-GEC Public
This is the official code for ``Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Simple Cycle Self-Augmenting``
-
Causality4NLP_Papers Public
Forked from zhijing-jin/CausalNLP_PapersA reading list for papers on causality for natural language processing (NLP)
1 UpdatedOct 7, 2023 -
ChatSydney-react-img Public
Forked from renqabs/ChatSydney-react-imgJavaScript The Unlicense UpdatedOct 3, 2023