- Nanjing University of Aeronautics and Astronautics
- ntdxyg.github.io
Stars
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
[COLING25] CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
[UNMAINTAINED] A Python script to obfuscate and protect your code by renaming classes, functions, and variables and removing comments and docstrings.
A curated list of awesome Python code formatters
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
Hugging Face RoBERTa with Flash Attention 2
Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking
The supplementary material for the paper "Code Comment Inconsistency Detection and Rectification Using a Large Language Model".
Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"
[LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems
This repository explores the effectiveness of curriculum learning (CL) in improving small code language models.
Direct Preference Optimization from scratch in PyTorch
Efficient Triton Kernels for LLM Training
[ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.
[NAACL 2025 Main] Repository for the paper: Prompt Compression for Large Language Models: A Survey
PyTorch implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024
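This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in practice).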
Official Repo of paper "QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression".