-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
odgi extract error #516
Comments
Hi @George-du, can you share the input graph and BED files to reproduce the issue? Which kind of information do you want to keep and where? |
The operational process is as follows: Step1: The first question, the program produces the above error, how should I solve it? The second question, how can I keep the fourth column of annotation information(Bolded parts) in the BED file into the Reads ID of the finally extracted FASTA sequence? The BED file format is as follows: The finally extracted FASTA sequence: I'm looking forward for your reply. Thanks. |
Hi, I'm curious if I described the problem clearly and if there are some suggestions about the solution to this problem? |
Hi, I'm concerned about a question and would like to know if there are any suggestions? Is this error being reported because of a haplotype breakpoint region? Thank you in advance. |
Hi @George-du, you have a mc graph as input, I am not sure it will work with our tools, but I am investigating. |
Hi, I'm glad to hear from you. Thank you very much for your efforts in trying to solve this problem. |
hi @subwaystation @AndreaGuarracino , Dear Developer, |
Hi @George-du, sorry for the long wait.
However, I can't download your BED file, since the font is an Asian language and don't know where to click. Regarding your 2nd question: I still can't follow. Could you please prove a tiny step-by-step example of what you want to do and what your expectations are? Thanks! |
Hi @subwaystation @AndreaGuarracino, I'm glad to hear from you. As for the first question, the BED file can be downloaded via this link (https://github.com/George-du/Human_PAN/blob/main/Tissue_All_Non-Coding-region.bed).It is worth noting that some changes were made to the chromosome names in the BED file for matching with the GRAPH file. As for the second question, my expected results are shown in the following steps. Step1: head -n 1 Tissue_All_Non-Coding-region.bed(https://github.com/George-du/Human_PAN/blob/main/Tissue_All_Non-Coding-region.bed)
Step2:
Thank you in advance. Best,Du |
Hi, developers! @subwaystation Has there been any progress on this issue so far? Or some suggestions? I'm looking forward for your reply. Thanks! |
Hi @George-du, |
Hi, developers! @subwaystation, |
Dear @George-du, First question. When running
I am re-running with GDB now. I finally understood your 2nd question. You need this because you have two BED entries which target the same region in the reference, but you annotated them from different samples.
However, |
The full stack trace.
|
Duplicated path ranges are not supported. Just pushed a fix to check them #525 |
Indeed, after removing the duplicated BED entries it works.
|
Dear @subwaystation @AndreaGuarracino, Wow, that sounds good! I'll try it as soon as possible. I will also try the odgi explode method. Thanks! Best ,Du |
Dear @subwaystation @AndreaGuarracino, I would like to ask if extracting the sequence based on intervals (-b BED file) will extract the excess sequence?(#526) When there are more intervals, a large number of sequences will be extracted, how can I de-redundancy these sequences? Is it possible to divide the data into several parts and redundant them separately or is there any other suggestion? Or do you recommend using VCF files directly? |
Dear @George-du, as far as I understood only with I think there are several options for you:
How would like the VCF file to interact with the graph? Or would you like to project the graph into a VCF and then work with it? If you want to take a look at the variants within your ranges, I would rather first project into a VCF using |
Dear @subwaystation @AndreaGuarracino, I will test it as you suggest. |
There is odgi procbed to project path annotations into the final path names
after extract.
…On Fri, Sep 1, 2023, 05:52 George-du ***@***.***> wrote:
Dear @subwaystation <https://github.com/subwaystation> @AndreaGuarracino
<https://github.com/AndreaGuarracino>,
I will test it as you suggest.
Best, Du
—
Reply to this email directly, view it on GitHub
<#516 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AABDQELHRTSHLQ4YKYIE6VDXYG473ANCNFSM6AAAAAAZ4ZCSVU>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Dear @subwaystation @AndreaGuarracino @ekg I tried it as your suggested and found that this is much slower, which seems to extract one region at a time. Best, Du |
@AndreaGuarracino Any ideas how to speed this up? |
Hi,I got the following error with the command(odgi extract -i GRAPH -b BED -o - -c 0 -d 10 --full-range -t 30 -P ), how should I fix it?
Besides, how can I keep more additional information (not just location information) in the BED file to the sequence ID in using the B command?
The text was updated successfully, but these errors were encountered: