-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FASTA format #1029
Comments
Hi TK, Protein sequence IDs should be read correctly from any FASTA. All other information you can always pull out of the FASTA using some FASTA-reading R package, to annotate DIA-NN's output report.
How did it manifest? Best, |
Hi, I'm having the same issue in the library free search. The FASTA header for example looks like this: >P62874,Q3TQ70|TX=10090 OS=Mouse GN=ENSMUSG00000029064.16,Gnb1 TA=NM_001160016.1,ENSMUST00000105616.10,XM_017319977.2,NM_001160017.1,ENSMUST00000030940.14,ENSMUST00000176637.2,ENSMUST00000165335.8,NM_008142.4 PA=ENSMUSP00000030940.8,NP_032168.1,ENSMUSP00000135091.2,XP_017175466.1,ENSMUSP00000101241.4,NP_001153488.1,ENSMUSP00000130123.2,NP_001153489.1,P62874,Q3TQ70 [0:48] Processing FASTA Any idea how to fix this issue? Thanks ! |
Hi Sara, DIA-NN will not correctly extract protein names from this. It should get the IDs OK though, i.e. you can annotate DIA-NN output using some FASTA-reading R package. Best, |
Hi Vadim, I had the same thing than Sara (Library contains 1 proteins, and 1 genes). 10 files will be processed First pass: generating a spectral library from DIA data [418:51] File #1/10 Since it is very long to process .... we will run it on a more powerfull server, it works with linux. Is it the smae command lin ethan with DIANN 1.8 ? Thanks, TK |
Hi TK, I would suggest to try the recommended settings first, which should result in much smaller predicted library & search space. No, I don't recommend using 1.8.1. If you do, please make sure to use the predicted library generated by 1.9. Best, |
Hi Sara, We seems to have both a large amount of precusors, can you tell me what kind of computer or server do you use for your analysis once the library is generated ? Best, |
Hi Tania, What is the amount of RAM? If you wish, I can take a look at the log. Best, |
This was the log of the first step, the library generation: |
Metaproteomics I guess? Yes, can take a very long time. I would also suggest using Peptidoform scoring in this case. |
Seems fine. I would run first with the settings I suggested and ultra-fast mode. In fact, would also regenerate the library with precursors charges restricted to 2-3, and then run ultra-fast mode. After this works, you can explore slower (and potentially more thorough) analysis methods. |
Thank you Vadim, I will try definitely. Best, |
Hi, Does it speed up the process also if I convert all the wiif file from Sciex to .dia prior the analysis ? Thank you again for your help Tania |
Hi Tania, Will save ~2min/file, based on the screenshot, i.e. not worth it in this case. Best, |
Hello,
I have an issue with the FASTA format. It is a FASTA format which was made from the Illumina Sequencing and annotated with KREGG. We have tried a first time wihtout Uniprot annotation and it did not.
Will it work if the FASTA is composed of different annotation uncluded the Uniprot one ? it seems that we can't just have the Uniprot FASTA format.
Thanks in advance
TK
The text was updated successfully, but these errors were encountered: