[BUG]: NemoLLM error with NGC_API_KEY #2153

aadesoba-nv · 2025-01-29T17:24:57Z

Version

25.02

Which installation method(s) does this occur on?

Docker

Describe the bug.

When you set NGC_API_KEY=XYZ, you get a reasonable error message, but if this is changed to the real key by one digit then we get that unhelpful error, not sure if its a Morpheus or a NemoLLM bug.

Minimum reproducible example

the LLM returns a response in an unexpected format, in this case "\n(current age)**0.43\n" rather than "26 ^ 0.43"

Relevant log output

Similar log output:

Exception occurred in pipeline. Rethrowing
Traceback (most recent call last):
File "/opt/conda/envs/morpheus/lib/python3.10/site-packages/morpheus/pipeline/pipeline.py", line 408, in post_start
await executor.join_async()
File "/opt/conda/envs/morpheus/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete
return future.result()
File "/opt/conda/envs/morpheus/lib/python3.10/asyncio/tasks.py", line 650, in _wrap_awaitable
return (yield from awaitable.await())
File "/opt/conda/envs/morpheus/lib/python3.10/site-packages/morpheus_llm/llm/nodes/llm_generate_node.py", line 55, in execute
results = await self._llm_client.generate_batch_async(inputs, return_exceptions=self._return_exceptions)
File "/opt/conda/envs/morpheus/lib/python3.10/site-packages/morpheus_llm/llm/services/nemo_llm_service.py", line 190, in generate_batch_async
results = await asyncio.gather(*futures, return_exceptions=return_exceptions)
File "/opt/conda/envs/morpheus/lib/python3.10/site-packages/morpheus_llm/llm/services/nemo_llm_service.py", line 153, in _process_one_async
raise RuntimeError(
RuntimeError: Failed to generate response for prompt 'What is the capital of France?' after 5 attempts. Errors: ['\n401\n', '\n401\n', '\n401\n', '\n401\n', '\n401\n']

Full env printout

Click here to see environment details

[Paste the results of print_env.sh here, it will be hidden by default]

Other/Misc.

No response

Code of Conduct

I agree to follow Morpheus' Code of Conduct
I have searched the open bugs and have found no duplicates for this bug report

The text was updated successfully, but these errors were encountered:

morpheus-bot-test · 2025-01-29T17:25:14Z

Hi @aadesoba-nv!

Thanks for submitting this issue - our team has been notified and we'll get back to you as soon as we can!
In the meantime, feel free to add any relevant information to this issue.

dagardner-nv · 2025-01-30T00:17:45Z

We need to determine if this is a bug in NemoLLM by reproducing this outside of Morpheus, and thus reporting it to the NemoLLM team, or if it is a Morpheus issue.

dagardner-nv · 2025-01-31T18:16:41Z

This appears to be an issue with running nemollm.NemoLLM.generate in async mode, in blocking mode we always get a good response. The response errors are multi-line errors, in async we are only getting the last line.

import asyncio
import os

import nemollm

def test(nemo_key: str, model: str, prompt: str):
    try:
        con = nemollm.NemoLLM(api_key=nemo_key)
        response = con.generate(model, prompt)
        print(f"Success: {response['text']}")
    except Exception as e:
        print(f"Failure: {e}")

async def test_async(nemo_key: str, model: str, prompt: str):
    con = nemollm.NemoLLM(api_key=nemo_key)
    fut = await asyncio.wrap_future(con.generate(model, prompt, return_type="async"))
    response = nemollm.NemoLLM.post_process_generate_response(fut, return_text_completion_only=False)
    print(response)

async def main():
    model = "gpt-43b-002"
    prompt="What is the minimum nvidia driver version needed for CUDA 12.5?"

    print("Test with real key")
    nemo_key = os.environ['NGC_API_KEY']
    await test_async(nemo_key, model, prompt)

    print("\n----------\n")
    print("Test with a bad key")
    await test_async("bad_key", model, prompt)

    print("\n----------\n")
    print("Test with a bad key one character off from a real key")
    await test_async(nemo_key[0:-1] + '5', model, prompt)

if __name__ == '__main__':
    asyncio.run(main())

Output:

Test with real key
{'text': ' The minimum NVIDIA driver version needed for CUDA 12.5 depends on the specific GPU architecture being used. For example', 'cumlogprobs': -7.698154, 'prompt_labels': [{'class_name': 'nontoxic', 'score': 0.98646855}], 'completion_labels': [{'class_name': 'nontoxic', 'score': 0.98950094}]}

----------

Test with a bad key
{'status': 'fail', 'msg': 'http: named cookie not present\n'}

----------

Test with a bad key one character off from a real key
{'status': 'fail', 'msg': '\n401\n'}

aadesoba-nv added the bug Something isn't working label Jan 29, 2025

github-project-automation bot added this to Morpheus Boards Jan 29, 2025

github-project-automation bot moved this to Todo in Morpheus Boards Jan 29, 2025

morpheus-bot-test bot added Needs Triage Need team to review and classify external This issue was filed by someone outside of the Morpheus team labels Jan 29, 2025

dagardner-nv mentioned this issue Jan 30, 2025

[BUG]: error in completion pipeline - LLM example #2147

Closed

2 tasks

dagardner-nv removed the Needs Triage Need team to review and classify label Jan 30, 2025

dagardner-nv assigned dagardner-nv and unassigned dagardner-nv Jan 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG]: NemoLLM error with NGC_API_KEY #2153

[BUG]: NemoLLM error with NGC_API_KEY #2153

aadesoba-nv commented Jan 29, 2025 •

edited by dagardner-nv

Loading

morpheus-bot-test bot commented Jan 29, 2025

dagardner-nv commented Jan 30, 2025

dagardner-nv commented Jan 31, 2025 •

edited

Loading

[BUG]: NemoLLM error with NGC_API_KEY #2153

[BUG]: NemoLLM error with NGC_API_KEY #2153

Comments

aadesoba-nv commented Jan 29, 2025 • edited by dagardner-nv Loading

Version

Which installation method(s) does this occur on?

Describe the bug.

Minimum reproducible example

Relevant log output

Full env printout

Other/Misc.

Code of Conduct

morpheus-bot-test bot commented Jan 29, 2025

dagardner-nv commented Jan 30, 2025

dagardner-nv commented Jan 31, 2025 • edited Loading

aadesoba-nv commented Jan 29, 2025 •

edited by dagardner-nv

Loading

dagardner-nv commented Jan 31, 2025 •

edited

Loading