-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update final-workflow.cwl to add functional annotations to gff file #141
Comments
@amcooksey - could you run the functional annotation pipeline for Saccharomyces eubayanus GCF_001298625.1, so we have some smaller test data for Tina to work with? |
@ZhiXuanLai here are paths to an example GO and KEGG file on Ceres: |
functional annotation for Saccharomyces eubayanus on CERES: |
Update writeLastLine-genePred.cwl:
to (replace the file names in brackets with the original and processed gff file name inputs) |
@ZhiXuanLai can we include both the original gff file and the processed gff file in the dispatch output? |
@ZhiXuanLai when I run the workflow using
|
Hi @mpoelchau (2) Sure! I added the two gff files to dispatch output. (3) My bad! I fixed the error now. |
@ZhiXuanLai thanks for the updates! I get the following error when I try to run the pipeline with
|
@ZhiXuanLai when I run the program with the table file URL, it completes successfully. The readme file looks good. However, I don't see the unprocessed gff in the analyses directory:
|
For the readme update, we won't need to change |
@mpoelchau Sorry for not fixing the error. I must run the pipeline without saving the change in yaml file. |
Good question! Is it possible to leave that line unchanged? |
Update on how to handle writeLastLine-genePred.cwl.
|
We need to add another process that I forgot about when I described this issue. The functional annotation directory (name is now in the tree variable array) needs to be moved into the analyses directory during the dispatch workflow. We could add a sub-workflow similar to https://github.com/NAL-i5K/Organism_Onboarding/blob/master/flow_dispatch/2other_species/cp_dir.cwl. |
@childers could you take a look at the last comment/update? |
We need to begin adding functional annotation information to the genome browsers. The most straightforward way to do this is via the annotation gff3 file, prior to creating the apollo/jbrowse files. That means we will change some of the first steps of final-workflow.cwl.
perl add_GO-KEGG_to_RefSeq-gff.pl GO-file Kegg-file GFF table-file > output.gff
GFF.annotated.gff
(e.g.GCF_001298625.1_SEUB3.0_genomic.annotated.gff
).The text was updated successfully, but these errors were encountered: