-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about the filter steps between the main report and the matrix in DIANN 1.9 #1056
Comments
Hi, Please try: Best, |
Thank you for your advice。 I have tried this, but it does not work.It affected the number of precursors but had no effect on the entries in Protein.Group.
Thank you again! |
Is this MBR output? |
Yes, it is MBR output. |
Can you please share both the .parquet and pg_matrix? Best, |
Thank you! |
Works if filter for non-zero quantities too :) |
Thank you very much for your great help. Best wishes! |
Hi, Vadim Thanks for your help yesterday. I have encountered a new question. When I utilized ‘diann_maxlfq’ to estimate protein group quantities, the results appear to differ significantly from those obtained from 'pg_matrix' as well as the 'PG.MaxLFQ' column. Below is the code I employed, which functioned correctly in DIANN 1.8 but has raised some concerns in DIANN 1.9. Do you have any suggestions or advice on this issue? Thank you in advance! |
diann_maxlfq implements a simple MaxLFQ algorithm, different from what DIA-NN uses internally. The results will therefore always differ. |
Thank you. I understand. Another question is about species-specifc precursors. Our samples contain a mixture of human and mouse proteins. When running DIANN 1.9, we used both human and mouse FASTA files and add additional options including '--species-genes' and '--species-ids'. We would like to exclude precursors specific to mouse or shared between both species, and instead focus only on human-specific precursors to quantify their associated proteins. Under these parameter settings, we would like to know if the 'PG.MaxLFQ' value is calculated from human-specific and mouse-specific precursors? Best wishes! |
It's calculated using all precursors matched to the protein group (Protein.Group column). So in this case you'd want to just discard all entries in the .parquet report with Protein.Ids column string containing 'MOUSE'. |
Hi Vadim
Thanks for your work in DiaNN 1.9.
When analyzing the results from version 1.9, I've observed discrepancies between the number of Protein.Group entries filtered by R and those reported in report.pg_matrix. Are there additional filtering steps being applied? I suspect that the "Additional 5% run-specific protein-level FDR filter applied to the protein matrices, use --matrix-spec-q to adjust it" might be impacting the results. However, I'm unsure how to address this issue.
Thank you in advance
The text was updated successfully, but these errors were encountered: