-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG]: abp_pcap_detection
pipeline running slowly on AArch64
#2120
Comments
The This might be an issue with a specific stage. |
abp_pcap_detection
pipeline running slowly on AArch64
On the x86_64 side the slowest stage is the preprocessing stage. |
On ARM the preprocessing stage is also the slowest but takes 03m:38s vs 00m:13s. |
The issue appears to be the number of threads being used. The ARM system I've been testing on has 80 cores, resulting in a default thread count of 80. This pipeline contains 12 stages, setting |
Running with |
In testing we found that the performance problems observed happened under Ubuntu 22.04, but did not exist on Ubuntu 24.04. |
…es for DFP (#2162) * Document Arm64 performance issues on older Linux Kernels * Manually install PyTorch in the DFP container allowing Arm64 users to run the pipeline work-around issue #2095 (fix slated for PyTorch 2.6 / Morpheus 25.06). Closes #2120 Closes #2124 ## By Submitting this PR I confirm: - I am familiar with the [Contributing Guidelines](https://github.com/nv-morpheus/Morpheus/blob/main/docs/source/developer_guide/contributing.md). - When the PR is ready for review, new or existing tests cover these changes. - When the PR is ready for review, the documentation is up to date with these changes. Authors: - David Gardner (https://github.com/dagardner-nv) Approvers: - Eli Fajardo (https://github.com/efajardo-nv) - Michael Demoret (https://github.com/mdemoret-nv) URL: #2162
Version
25.02
Which installation method(s) does this occur on?
Source
Describe the bug.
This pipeline typically executes in under a minute (20s on my system), however on an Arm64 system this takes between 5-9 minutes.
On a first run I noticed it took 5m38s, initially suspecting the ONNX conversion was the culprit, however on a second run it took 9m2s.
Minimum reproducible example
Run the `examples/abp_pcap_detection` example workflow as documented on an Arm64 system
Relevant log output
Click here to see error details
Full env printout
Click here to see environment details
Other/Misc.
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: