Conversation

@danbev danbev (Member) commented Jan 15, 2026

This commit adds write/read support for backend sampling state similar
to how the logits and embedding buffers are handled.

The motivation is to allow the backend sampling state to be saved and
restored along with the rest of the llama_context state.
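
For illustration only, here is a minimal sketch of the size-prefixed pattern that the logits and embedding buffers follow; `state_writer` and `write_sampling_probs` are hypothetical stand-ins, not the actual llama.cpp internals:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical writer standing in for llama.cpp's internal state I/O
// abstraction; the real code uses its own reader/writer interface.
struct state_writer {
    std::vector<uint8_t> buf;

    void write(const void * src, size_t n) {
        const uint8_t * p = static_cast<const uint8_t *>(src);
        buf.insert(buf.end(), p, p + n);
    }
};

// Size-prefixed dump of a sampling buffer, mirroring the pattern used
// for the logits and embedding buffers: element count first, raw data after.
static void write_sampling_probs(state_writer & io, const std::vector<float> & probs) {
    const uint64_t n = probs.size();
    io.write(&n, sizeof(n));
    if (n > 0) {
        io.write(probs.data(), n * sizeof(float));
    }
}
```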


This commit builds upon #18811, which is included as the first commit in this PR. I'll rebase and remove it once it has been reviewed and merged.

This commit updates output_reserve in llama-context.cpp to always
allocate sampling buffers regardless of whether sampling is needed for
the current batch.

The motivation for this is to avoid reallocations and branching based on
the sampling requirements of the batch.
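
As a rough sketch of the allocate-unconditionally approach (the names here are hypothetical, not the actual output_reserve code):

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical sketch: reserve the sampling output buffers once for the
// maximum number of outputs, so later batches never branch or reallocate
// based on their individual sampling requirements.
struct sampling_buffers {
    std::vector<float>   probs;   // per-output sampled probabilities
    std::vector<int32_t> sampled; // per-output sampled token ids

    void reserve(size_t n_outputs_max) {
        // always size for the worst case, even if the current batch
        // does not request sampling
        probs.resize(n_outputs_max);
        sampled.resize(n_outputs_max);
    }
};
```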
@github-actions github-actions bot added the testing Everything test related label Jan 15, 2026
@ggerganov ggerganov (Member) commented:

Initially, I was thinking that since the samplers can now be part of the context state, we should store this information as well. But it would also have to include the sampler states, and that gets very complicated.

But now I am wondering if we should instead remove the output ids, the logits and the embeddings from the state and only store the model info and the memory. I can't think of a meaningful use case for storing this information. And even if it's needed, one can simply run the last token through llama_decode to obtain the necessary logits/embeddings. So maybe this is the better option, as it will simplify the read/write logic.
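
A rough sketch of that workaround against the public llama.cpp API (assuming the current two-argument llama_batch_get_one; KV-cache position handling is elided):

```cpp
#include "llama.h"

// Sketch of the suggested workaround: instead of persisting
// logits/embeddings in the state blob, re-run the last token through
// llama_decode after restoring and read the regenerated outputs.
// Note: in practice the KV-cache entry at this position may need to be
// cleared first so the re-decoded token is not duplicated.
static const float * regen_logits(llama_context * ctx, llama_token last_token) {
    llama_batch batch = llama_batch_get_one(&last_token, 1);
    if (llama_decode(ctx, batch) != 0) {
        return nullptr; // decode failed
    }
    return llama_get_logits(ctx);
}
```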
