-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLI: Adding subcommand structure and unifying file suffixes #87
Comments
I'm unsure if the library for input parsing I'm using supports double dashed full name arguments. Would need to check. Alternatively we can have it as |
|
On second thought, using 'streaming' instead of 'online' would look better to people working on applications. Actually, it is a streaming (sublinear-space) algorithm in the sense that we use space proportional to # distinct k-mers, not the input length (if implemented in one pass). |
I've incorporated this into the proposal. |
I've incorporated it into the proposal with the following additional change: the user may actually require exporting both basic types of masks - min-one or max-one - right during the MS computation (it will be actually extremely helpful even for us as in many situations we want to have both masks explicitly stored and this will quite significantly reduce the number of commands to execute). |
Thanks, that's actually a pretty good idea. Does it make sense to have a parameter for minimizing ones in mask during the construction, when the default mask already minimizes the number of ones? |
This is an updated version of the ticket #78, incorporating all the associated discussion into the proposal. We need to restructure the CLI to well separate subcommands, as well as make some associated format-related changes. This will be a major update of KmerCamel.
File formats
Consistent suffixes:
.msfa
.s
.m
CLI improvement
The use should be more simple, eg:
Notes
-c
should always be on by default (canonical)-k
should be parsed automatically from the superstring (by default) and this extraction should appear in a well visible message (e.g.,"KMER SIZE EXTRACTED: 31")-k
is provided and is not consistent with the suffix-encoded k, the program should fail (optional checking if user wants)The text was updated successfully, but these errors were encountered: