-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DIA-NN Phospho search issues #1032
Comments
What's the amount of RAM in the system? Not enough RAM could be the only reason why it's slow in this case. |
Hi Vadim, we have about 120gb RAM. I am in the process of getting the PC upgraded to more processing power. But is the workflow above the correct way to go with phospho DIA? |
We describe exact recommendations for phospho here now: |
Hi Vadim, we upgraded our PC with higher processing power and also used the latest version of DIA-NN with the settings used as per the recommendations provided (using 'ultrafast' as well). However, the search seems to always be stuck at the step "[16:54] Predicting spectra and IMs". Is it common for this step of the workflow to take long in the generation of the spectral library with 3 variable modifcations? |
On a Ryzen 7950X (16 cores) or 10980XE (18 cores) it takes about 3 minutes per million precursors. That is, a rough time to generate a 50-million precursor library on such a PC would be ~150 minutes - with DIA-NN 1.9. So no, it should not take much longer. Can you please share the full log generated so far using the 'Save log' button? |
heres the log saved so far: Skyline not found diann.exe --lib "" --threads 32 --verbose 1 --out "C:\Users\mproteomics\Desktop\Analysis Data\Sequences Databases\SpecLib_FREE\Mouse_Phos.tsv" --qvalue 0.01 --matrices --out-lib "C:\Users\mproteomics\Desktop\Analysis Data\Sequences Databases\SpecLib_FREE\Mouse_Phos-lib.tsv" --gen-spec-lib --predictor --fasta "C:\Users\mproteomics\Desktop\Analysis Data\Sequences Databases\uniprot-proteome_UP000000589_MOUSE__55,086 (Dec 2023).fasta" --fasta-search --min-fr-mz 200 --max-fr-mz 1800 --met-excision --min-pep-len 7 --max-pep-len 30 --min-pr-mz 300 --max-pr-mz 1800 --min-pr-charge 2 --max-pr-charge 4 --cut K*,R* --missed-cleavages 1 --unimod4 --var-mods 3 --var-mod UniMod:21,79.966331,STY --peptidoforms --reanalyse --relaxed-prot-inf --rt-profiling 0 files will be processed |
Thanks! Still, could it be that the RAM is all full? What is the physical RAM occupied amount shown by the Task Manager? This would be an explanation for it taking a very long time. |
it says its only using 5-10% of memory |
50 million database will be tens of gigabytes, strange. I would try to restart DIA-NN and if takes long see what's the reported RAM consumption. |
i restarted DIANN and changed a few settings:
there has been further progress and is now saying: 0 files will be processed this has been running for about 15 hours now so i am not sure if this is still taking too long. the memory usage is now at around 30% |
This is indeed quite strange. The CPU load in Task Manager does correspond to what you'd expect, based on the number of threads set, and no other high-CPU tasks are run on the machine at the same time? |
Not that i was able to see. we managed to get through the data but it looks to take a lot longer than what others have experienced |
Hi there,
I'm trying to run 4x phospho enriched and 4x global samples in DIA-NN (1.8.2 beta 39) but i am not sure if DIA-NN is bugged or if its still searching and taking a long time.
From what i have read in the forums, the workflow for phospho dia in DIA-NN is briefly as follows:
input phospho and global raw files
add fasta (in this case, mouse)
check both boxes under "precursor ion generation" ('FASTA digest for lib free' and 'deep learning based spectra')
missed cleavages 1
max no. variable mods 3
phospho checked
precursor charge range 2-4
library generation as "IDs, RT and IM profiling"
MBR checked
other settings as default
when this has finished, perform a second DIA-NN search with the spectral library generated with similar settings
However, i am running the first search and the current log for the search is below:
[0:00] Loading FASTA C:\Users\mproteomics\Desktop\Analysis Data\Sequences Databases\uniprot-proteome_UP000000589_MOUSE__55,086 (Dec 2023).fasta
[0:27] Processing FASTA
[4:13] Assembling elution groups
[7:39] 50147387 precursors generated
[7:39] Gene names missing for some isoforms
[7:39] Library contains 54858 proteins, and 22143 genes
[8:56] Encoding peptides for spectra and RTs prediction
[13:04] Predicting spectra and IMs
I have been running this for about 2 days now and i am now sure if this is a DIA-NN bug or if i am doing something wrong
is there a workflow that explains how to perform phospho samples?
thanks in advanced
The text was updated successfully, but these errors were encountered: