Skip to content

Out of memory when working with Qwen Code in a session with a local Qwen 3.6 model running with llama.cpp under Linux #4351

@sergehuber

Description

@sergehuber

What happened?

Working with Qwen code with a local qwen3.6 model using llama.cpp when I got outofmemory. I resumed my session and tried to continue working and this happened a second time hence my report.

What did you expect to happen?

To not do an OOM and continue working normally :)

Client information

Qwen Code: 0.15.11 (782403d)
Runtime: Node.js v24.14.0 / npm 11.9.0
OS: linux x64 (6.17.0-23-generic)
Auth: API Key - openai
Base URL: http://localhost:8080/v1
Model: qwen-3.5-64k
Fast Model: qwen-3.5-64k
Session ID: d67c09cf-263f-4995-9873-ab5cabc22b2a
Sandbox: no sandbox
Proxy: no proxy
Memory Usage: 315.0 MB

Login information

No response

Anything else we need to know?

I didn't have this issue with the exact same configuration on 0.15.10

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions