Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A new odgi function wish, odgi synteny, or a syntenywish...? #451

Open
diaspj opened this issue Oct 4, 2022 · 0 comments
Open

A new odgi function wish, odgi synteny, or a syntenywish...? #451

diaspj opened this issue Oct 4, 2022 · 0 comments

Comments

@diaspj
Copy link

diaspj commented Oct 4, 2022

Hi odgi team,

First, I would like to congratulate the odgi team and other related teams (vg, pggb, etc) for the wonderful work you are doing in developing the field of Pangenome Graphs.

I came here to the issues, not to report an error but to report a wish, the development of an extra function to odgi that can handle the identification and report of synteny between organisms.

In short, my wish is a function equivalent to halSynteny (hal toolkit) to be made available in the odgi toolkit!

https://github.com/ComparativeGenomicsToolkit/hal/tree/master/synteny

halSynteny: a fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments
Ksenia Krasheninnikova, Mark Diekhans, Joel Armstrong, Aleksei Dievskii, Benedict Paten, Stephen O’Brien
GigaScience, Volume 9, Issue 6, June 2020, giaa047, https://doi.org/10.1093/gigascience/giaa047

The ability of exporting a file representing genome collinearity would be a great addition to the set of operations that odgi toolkit have made available to researchers, and since the odgi toolkit was developed as a post-processing tool of Pangenome Graphs, odgi seems to be the more adequate platform to include this functionality.

For instance, if we do not construct a pangenome using pggb, but we use only wfmash and seqwish and exclude the smoothxg step, and then use odgi untangle to export a paf file representing the alignments between two genome sequences, a function odgi synteny could allow to recover the list of alignment blocks similar to the one comprised in the original paf file used to construct the gfa file, and not the long list of fragmented alignments that are reported in the paf file created by the function odgi untangle (check the two paf files in attachment).

The use of the hal toolkit is not a possibility since there has not been developed a script allowing the conversion of gfa -> hal (nor vg -> hal, og -> hal, etc), only a script in the opposite direction, hal2vg, is available...

One way around would be the development of a script that would allow to convert gfa -> hal, but because the gfa format (or og or vg) seems to be adopted by the generality of the research community as the preferred exchange format between toolkits/frameworks dedicated to the study of Pangenome Graphs, I think that going in the downstream direction makes more sense...

Best regards,

Paulo Dias

two_paf_files_the_original_and_the_exported_by_odgi_untangle.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant