Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
tomasbruna authored Jan 20, 2023
1 parent cd05e91 commit 5b4ab0f
Showing 1 changed file with 11 additions and 0 deletions.
11 changes: 11 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -1 +1,12 @@
# orthodb-clades

Workflow for generating OrthoDB v11 protein sets.

All files are automatically downloaded from OrthoDB and parsed using a [Snakemake](https://snakemake.readthedocs.io/en/stable/index.html) workflow with the following command:

snakemake --cores 1

The resulting protein sets are saved into two different folders:

* `clades` contains clade-specific (e.g., `Arthropoda.fa` or `Viridiplantae.fa`) OrthoDB sets.
* `species` contains species-specific protein sets from which the proteins of the same species or proteins of all species in the same taxonomic order were removed. This is intended for gene prediction experiments as described in, e.g, [BRAKER2 paper](https://academic.oup.com/nargab/article/3/1/lqaa108/6066535).

0 comments on commit 5b4ab0f

Please sign in to comment.