Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow users to modify the state of error datasets #4021

Open
natefoo opened this issue May 4, 2017 · 6 comments
Open

Allow users to modify the state of error datasets #4021

natefoo opened this issue May 4, 2017 · 6 comments

Comments

@natefoo
Copy link
Member

natefoo commented May 4, 2017

Sometimes it'd be useful to be able to operate on a dataset that Galaxy considers to be in error - especially if the user is confident that the error state was only due to an errant tool message or framework bug that can be worked around. This capability should probably be accompanied with a warning.

@nsoranzo
Copy link
Member

nsoranzo commented May 4, 2017

💯

@hexylena
Copy link
Member

xref #3779

@hexylena
Copy link
Member

image

5/6 worked fine. The 6th invalidated all other uploads.

@yvanlebras
Copy link
Contributor

Hi Nate, Nicola, Helena, is there some manner to fix this kind of thing now ? With @ColineRoyaux we have some datasets with an error state but the content is ok and we search a way to use it...

@hexylena
Copy link
Member

You can always modify the database records, but, outside of that, no way for people to do it.

@mvdbeek
Copy link
Member

mvdbeek commented Jan 21, 2022

due to an errant tool message or framework bug that can be worked around

I think the ambition is always to fix those and make the framework better. It's very complex to assert that a dataset is erroneously in error (is it truncated ? is there an unexpected message in the stderr ? did the tool swallow an error but set the exit code to non-zero ? did the scheduler oom message mean the output is incomplete ? other weird stuff that lead to a dataset that looks almost correct but isn't), and that overwriting the state is correct and won't lead to wrong results. I'd rather have someone waste time and compute on re-doing what they were doing than publish a wrong result. If re-doing what you were doing works, great! If not, there's a bug that needs to be fixed. @hexylena's example for instance lead to re-working the upload mechanism so that now you can have just 1 dataset fail instead of all.

My 2c is that this should be a wontfix and we should close the issue, if you are confident the data is actually fine you can change the state in the database.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants