Skip to content

[ILUVATAR_GPU] Add logic to apply patches to python files in install script && Fix the segment fault that occurred after linking with the NCCL library. #1762

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: develop
Choose a base branch
from

Conversation

tianyuzhou668
Copy link
Contributor

增加了python文件的patch;
解决了链接nccl.so后出现的segment fault问题;

@tianyuzhou668
Copy link
Contributor Author

已将patch中spawn.py给去掉,flash_attention.py暂时保留在patch文件中,先保证CI使能,后续再完善这段关于python文件的patch逻辑。

Copy link
Collaborator

@YqGe585 YqGe585 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Flash attention is a temporary modification. This can be merged for now.

YqGe585

This comment was marked as outdated.

@@ -17,6 +17,7 @@
SCRIPT_DIR=$(dirname "$0")
LEGACY_TEST_PATH="${SCRIPT_DIR}/../../../Paddle/test/legacy_test"
export PYTHONPATH="${LEGACY_TEST_PATH}:${PYTHONPATH}"
export LD_PRELOAD=/usr/local/corex-4.3.0/lib/libcuda.so.1
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

是否可以不使用绝对路径?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已修改,使用环境变量替代绝对路径

@YqGe585
Copy link
Collaborator

YqGe585 commented Jun 25, 2025

CI显示通过,但实际有报错,需解决此问题后合入

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants