Out of memory when working with Qwen Code in a session with a local Qwen 3.6 model running with llama.cpp under Linux

### What happened?

Working with Qwen code with a local qwen3.6 model using llama.cpp when I got outofmemory. I resumed my session and tried to continue working and this happened a second time hence my report.

### What did you expect to happen?

To not do an OOM and continue working normally :) 

### Client information


Qwen Code: 0.15.11 (782403d71)
Runtime: Node.js v24.14.0 / npm 11.9.0
OS: linux x64 (6.17.0-23-generic)
Auth: API Key - openai
Base URL: http://localhost:8080/v1
Model: qwen-3.5-64k
Fast Model: qwen-3.5-64k
Session ID: d67c09cf-263f-4995-9873-ab5cabc22b2a
Sandbox: no sandbox
Proxy: no proxy
Memory Usage: 315.0 MB


### Login information

_No response_

### Anything else we need to know?

I didn't have this issue with the exact same configuration on 0.15.10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Out of memory when working with Qwen Code in a session with a local Qwen 3.6 model running with llama.cpp under Linux #4351

What happened?

What did you expect to happen?

Client information

Login information

Anything else we need to know?

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Out of memory when working with Qwen Code in a session with a local Qwen 3.6 model running with llama.cpp under Linux #4351

Description

What happened?

What did you expect to happen?

Client information

Login information

Anything else we need to know?

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions