Bug: get_amd_offload_arch_flag in llamafile/cuda.c hardcodes "hipInfo" in some log messages #672

Open
tetron opened this issue Jan 9, 2025 · 0 comments

Contact Details

[email protected]

What happened?

get_amd_offload_arch_flag searches for both hipInfo and rocminfo and uses the first one it finds. However, the log messages in this function mention only hipInfo, which is misleading when you are trying to debug why GPU support isn't being built.

Version

llamafile 0.9.0

What operating system are you seeing the problem on?

Linux

Relevant log output

rocminfo is the program that returned a non-zero exit status, but the log messages make it look as though hipInfo is the one that is missing or broken. This sent me down a debugging dead end.

2025-01-09T21:12:42.496150707Z get_rocm_bin_path: note: hipInfo not found on $PATH
2025-01-09T21:12:42.496152791Z get_rocm_bin_path: note: $HIP_PATH/bin/hipInfo does not exist
2025-01-09T21:12:42.496154254Z get_rocm_bin_path: note: /opt/rocm/bin/hipInfo does not exist
2025-01-09T21:12:42.496172849Z llamafile_log_command: /opt/rocm/bin/rocminfo
2025-01-09T21:12:42.523078724Z get_amd_offload_arch_flag: error: hipInfo returned non-zero exit status
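
One possible way to avoid the mismatch is to derive the name in the message from the path of the tool that was actually run, instead of hardcoding "hipInfo". The following is only a minimal sketch and not the actual llamafile code: tool_name and format_exit_error are hypothetical helpers invented for illustration, and the sketch assumes POSIX-style paths separated by '/'.

```c
#include <stdio.h>
#include <string.h>

// Hypothetical helper (not from llamafile): returns the basename of the
// tool path that was actually selected, e.g. "rocminfo" for
// "/opt/rocm/bin/rocminfo". Assumes '/' as the path separator.
static const char *tool_name(const char *path) {
    const char *slash = strrchr(path, '/');
    return slash ? slash + 1 : path;
}

// Hypothetical helper (not from llamafile): formats the error message so
// that it names the program that really ran rather than a fixed string.
// Returns the value of snprintf, i.e. the length the full message needs.
static int format_exit_error(char *buf, size_t size, const char *func,
                             const char *tool_path) {
    return snprintf(buf, size, "%s: error: %s returned non-zero exit status",
                    func, tool_name(tool_path));
}
```

With the path from the log above, format_exit_error(buf, sizeof(buf), "get_amd_offload_arch_flag", "/opt/rocm/bin/rocminfo") would produce a message that names rocminfo, which matches the command that llamafile_log_command reported.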