Starred repositories
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
[ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training
VulTrigger is a tool for identifying vulnerability-triggering statements across functions and investigating the effectiveness of function-level vulnerability detectors in detecting inter-procedu…
Sample data and sources of mysql-course.
MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset
Software Vulnerabilities to Weakness Mapping
A transformer-based VS Code extension that enables one to discover vulnerabilities in Java files.
A tutorial for Automatic Text Summarization using the TextRank algorithm.
Learning a Unified Classifier Incrementally via Rebalancing
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
A Transformer-based Line-Level Vulnerability Prediction
A system to detect buffer overflows in source code. It takes C/C++ source code and checks for 8 types of buffer overflow vulnerability.
Code for the paper - Source Code Vulnerability Detection: Combining Code Language Models and Code Property Graph
Vision Transformer-Inspired Automated Vulnerability Repair
CodeSage: Code Representation Learning At Scale (ICLR 2024)
CfExplainer, a rule-based method for explainable defect prediction
Source Code for ICML 2022 paper "Boosting Graph Structure Learning with Dummy Nodes"
DeepBugs is a framework for learning bug detectors from an existing code corpus.