-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Models for newer versions of Guppy with sup basecalls #198
Comments
The existing models in NanoSim are very old.
|
Can we use NA12878 data to train the human genome and then use the trained model as a generic human model? |
I am not sure if I got your question correctly. But if you are interested in simulating nanopore reads having the characteristics of the latest basecaller and chemistry, I would suggest you train NanoSim using such reads and then use the simulator to generate reads given a reference genome/transcriptome. Btw, if you are interested in newer basecaller, dorado, we provide a trained model for H. sapiens NA24385 - AshkenazimTrio - Son (hg002) which is sequenced using If you have any issues using the pre-trained models, check the dependencies section for some information and tips. |
I see that you recently added a new model for genomic DNA, trained with a recent version of nanosim in #224. Thanks already for this. Would it be possible to also get newly trained models for transcriptomic and metagenomic DNA? The background of my request is for wrapping nanosim to be easily usable in snakemake workflows. I have recently created such a wrapper, and for testing it relies on the pre-trained models. In combination with the issue regarding |
Hi @dlaehnemann, Sure, we can look at adding those updated pre-trained models! We'll likely use the same data as was used for the older models, but with the most recent NanoSim version. Will keep you updated - but hopefully can have that for you in the next couple of weeks! |
Thanks, that would be great. One TODO to remove from the wrapper... 😅 |
Hi @dlaehnemann, |
HI NanoSim developers,
I am wondering if you have a pre-trained model for simulating genomic reads from the human genomes close to ones basecalled from newer versions of Guppy (e.g. Guppy 6) under the super accuracy mode (R9.4.1 chemistry). Or will the default -c human option already suffice to emulate this kind of data?
The text was updated successfully, but these errors were encountered: