Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

All MarkDuplicates tasks run out of memory after addition of Xmx flags #512

Closed
michaelgatzen opened this issue Nov 18, 2021 · 6 comments
Closed

Comments

@michaelgatzen
Copy link
Contributor

Probably since the addition of the Xmx flags for Java tasks, all MarkDuplicates tasks run out of memory for WGS samples for me. Unfortunately, the way that they are specified, Cromwell's retry with more memory feature does not work because the value of the Xmx argument is only dependent on an input, not the actual (possibly increased) memory. This relates to #481.

@michaelgatzen
Copy link
Contributor Author

Note that in addition to the failing workflows, this can lead to very high costs for users. If they have the retry with more memory feature enabled, Cromwell will detect that the task runs out of memory and will increase it, but the task will keep failing because the maximum memory that Java can consume won't change. Since this increase is automatic and exponential, Cromwell will end up requesting insane amounts of memory

@jessicaway
Copy link
Member

@michaelgatzen I'll take a look at this. Thanks for pointing it out!

@droazen
Copy link

droazen commented Nov 18, 2021

@michaelgatzen @jessicaway Isn't the fix just to make the -Xmx value relative to the total memory on the instance (eg., instance_memory - 1 GB), like we did recently for the HaplotypeCaller task in the DRAGEN-GATK pipeline?

@michaelgatzen
Copy link
Contributor Author

@jessicaway Just letting you know, CollectRawWgsMetrics also runs out of memory for about 50% of samples for me

@jessicaway
Copy link
Member

jessicaway commented Nov 29, 2021

@michaelgatzen I have changed this back for now. We will try to reintroduce the Xmx for CollectRawWgsMetrics and MarkDuplicates later (with more testing!)

@gbggrant
Copy link
Contributor

We think this is all set, with appropriate follow-up tickets created.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants