Author | Pengchuan Sun (sunpengchuan) |
[email protected] | |
License | BSD |
WGDI (Whole-Genome Duplication Integrated analysis) is a Python-based command-line tool designed to simplify the analysis of whole-genome duplications (WGD) and cross-species genome alignments. It offers three main workflows that enhance the detection and study of WGD events:
- Identifies and confirms polyploid events with high accuracy.
- Traces the evolutionary history of duplicated regions across species, with a focus on distinguishing subgenomes.
- Reconstructs protochromosomes and traces common chromosomal rearrangements to understand chromosome evolution.
Python package and command line interface (IDLE) for the analysis of whole genome duplications (WGDI). WGDI can be deployed in Windows, Linux, and Mac OS operating systems and can be installed via pip and conda.
conda install -c bioconda wgdi
pip3 install wgdi
Documentation for installation along with a user tutorial, a default parameter file, and test data are provided. please consult the docs at http://wgdi.readthedocs.io/en/latest/.
Here are some videos with simple examples of WGDI.
chatting group QQ : 966612552
If you use wgdi in your work, please cite:
Sun P., Jiao B., Yang Y., Shan L., Li T., Li X., Xi Z., Wang X., and Liu J. (2022). WGDI: A user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes. Mol. Plant. doi: https://doi.org/10.1016/j.molp.2022.10.018.
- Added extract the fusion positions dataset (-fpd).
- Added determine whether these fusion events occur in other genomes (-fd).
- Improved the karyotype_mapping (-km) effect.
- Fixed the problem caused by the Python version, now it is compatible with version 3.12.
- Fixed some issues (-sf).
- Added new tips to avoid some errors.
- Fixed the problem caused by the Python version, now it is compatible with version 3.11.3.
- Fixed some issues (-ks, -sf).
- Added find shared fusions between species (-sf).
- Fixed issue with alignment (-a). Only version 0.6.0 has this bug.
- Fixed issue with improved collinearity (-icl).
- Added a parameter 'tandem_ratio' to blockinfo (-bi).
- Update the improved collinearity (-icl). Faster than before, but lower than MCscanX, JCVI.
- Fixed issue with ancestral karyotype repertoire (-akr).
- Fixed issue with gene names (-ks).
- Fixed issue with chromosome order (-ak).
- Fixed issue with gene names (-ks). This version is not fixed, please install the latest version.
- Add ancestral karyotype (-ak)
- Add ancestral karyotype repertoire (-akr)
- Improved the karyotype_mapping (-km) effect.
- little change (-at).
- Fixed legend issue with (-kf).
- Fixed calculate Ks issue with (-ks).
- Improved the karyotype_mapping (-km) effect.
- Improved the alignmenttrees (-at) effect.
- Fixed some bugs.
- Fixed the error of the command (-conf).
- Improved the karyotype_mapping (-km) effect.
- Added the available data set of alignmenttree (-at). Low copy data set (for example, single-copy_groups.tsv of sonicparanoid2 software).
- The latest version adds karyotype_mapping (-km) and karyotype (-k) display.
- The latest version changes the calculation of extracting pvalue from collinearity (-icl), making this parameter more sensitive. Therefore, it is recommended to set to 0.2 instead of 0.05.
- The latest version has also changed the drawing display of ksfigure (-kf) to make it more beautiful.