- Core codes when studying in Guangdong Provincial People's Hospital, Guangzhou.
- Personal use & ready to share.
- Most codes are adjusted from tutorials of related packages/softwares/articles.
- Reference of each package/software/method is omitted.
- Notes during studying and code writing.
- R
- ICD10 & OPSC4
- Disease contains IHD, HF...
- R
- 5 times of imputation
- Rubin rules
- R/Python
- In most cases, plasma proteins are risk factors
- Covariates adjustment
- Python
- kmeans, LGBM, RF, SVM, XGBoost, DT, LR...
- SMOTE/Downsampling to deal with imbalance problems
- Bayesian-optimization
- Accuracy, precision, recall...
- ROC AUC curve & PR curve
- Cross-validation (k- fold, External validation...)
- Python
- Youden Index to determine cutoffs for each protein (biomarker effect)
- Numbers at risk & numbers at event
- Python/R
- Gwaslab to wash data into vcf framat in GR37/hg19
- Series of IEU packages to conduct Two-sample MR analysis locally
- R
- Gwasglue to match vcf files in given chrompos
- Conc for colocalization analysis
- sh/R
- Plink2 to perform GWAS analysis
- PRSice-2, LDpred2, and lassosum for PRS calculation
- R
- Pleiotropic effect of a genetic variant on two traits
- Reference: https://github.com/RayDebashree/PLACO