fix(ollama): handle incomplete JSON chunks in stream #995
Addresses one of the issues raised in #686
Problem
The Ollama LLM returns the final chunk of a streamed response in multiple parts when the chunk is too long, so the individual deliveries are not complete JSON and fail to parse. I have only noticed this behaviour on the final chunk; I'm not sure whether it happens on other chunks as well.
Solution
Improve the `json_responses_chunk_handler` to gracefully handle cases where a JSON chunk is split across buffer boundaries. If a chunk does not end with `}`, it is considered incomplete and buffered until the next chunk arrives. This prevents JSON parsing errors and ensures all responses are processed correctly.

Part of this solution is taken from this diff: https://github.com/patterns-ai-core/langchainrb/pull/644/files#diff-746ba2cd57580e32b0f013cbe3c8eaf8f1621e112c89f3af07983321dd6846dbL143-L148
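Below is a minimal sketch of the buffering idea, not the exact code in this PR; the proc arity and argument names are assumptions about how the handler is invoked by the HTTP client:

```ruby
require "json"

# Buffering variant of the handler: raw chunks are accumulated until the
# buffered data ends with "}", i.e. until it plausibly contains only complete
# newline-delimited JSON objects, and only then parsed line by line.
def json_responses_chunk_handler(&block)
  buffer = +""

  proc do |chunk, _bytes, _env|
    buffer << chunk

    # An incomplete trailing chunk is kept in the buffer for the next call.
    next unless buffer.rstrip.end_with?("}")

    buffer.each_line do |line|
      next if line.strip.empty?

      block.call(JSON.parse(line))
    end
    buffer = +""
  end
end

# Hypothetical usage: the first delivery is an incomplete JSON fragment and is
# only buffered; the second completes the object, which is then parsed and yielded.
handler = json_responses_chunk_handler { |parsed| p parsed }
handler.call('{"response":"Hel')   # buffered, nothing yielded
handler.call("lo\"}\n")            # buffer now ends with "}", object is parsed
```

The trailing-`}` check is a heuristic rather than a full JSON validator, but it fits this stream because Ollama emits newline-delimited JSON objects, each of which closes with `}`.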