Skip to content

Determine linked (and single) mutation counts/frequencies from individual MinION/Illumina reads (currently just for HIV-1)

License

Notifications You must be signed in to change notification settings

owenlancaster/HIV-1_linked_virus_mutations

Repository files navigation

Determine HIV-1 mutation frequency (overall and linked mutations) from raw single reads from MinION


Need to convert MinION reads from fast5 to fasta format for the sample (this example uses poretools):

poretools fasta DIRECTORY_FOR_FAST5 > SAMPLE_NAME.fasta

Run linked mutation python script to count/calculate the total number/frequency of mutations for each virus particle for the sample:

Example to run linked mutation script on fasta file (works with MinION and Illumina data but paired reads for Illumina currently not ready). In this example BLASTx will be initially run and will calculate single and linked mutation counts for each read. The -c option specifies the number of linked mutations to report (1 means that all linked mutations, even those with just one a strand will be outputted)
python HIV-1_linked_mutations_blastx.py -f reads.fasta -c 1
N.B. this step requires NCBI BLAST+ to be installed, and uses the "all" database (included in this respository) which is a HIV-1 RT/PR/INT pre-formatted protein BLAST database created from the Stanford HIV-1 subtype B protein sequences. The output format from the search is in BLAST xml format (outfmt 5) which is then used in the script to work out the mutation counts.
N.B. if you already have the BLAST xml output file and do not want BLAST to be re-run on the fasta file then use the -x option instead of -f and specify the BLAST xml file instead of the fasta file

To get help run:

python HIV-1_linked_mutations_blastx.py -h

TODO: example other options for the script e.g. specifying dictionaries of mutations

About

Determine linked (and single) mutation counts/frequencies from individual MinION/Illumina reads (currently just for HIV-1)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages