Conversation

Namespace: LLama.Batched

A single conversation thread that can be prompted (adding tokens from the user) or inferred (extracting a token from the LLM)

public sealed class Conversation : System.IDisposable

Inheritance Object → Conversation
Implements IDisposable
Attributes NullableContextAttribute, NullableAttribute
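
As an illustrative sketch (not taken from this page), a typical lifecycle interleaves prompting, inference, and sampling. The `executor.Create()` and `executor.Infer()` members and the `promptTokens` variable are assumed from the wider LLamaSharp API and are not documented here:

```csharp
// Hedged sketch: assumes BatchedExecutor.Create() returns a Conversation
// and Infer() runs a batched forward pass (both from the wider LLamaSharp API).
using Conversation conversation = executor.Create();

// Prompt with pre-tokenized input (promptTokens is hypothetical here),
// keeping logits only for the last token.
conversation.Prompt(promptTokens, allLogits: false);

// Run inference on the executor; the conversation cannot be prompted
// or sampled while RequiresInference is true.
await executor.Infer();

// Once RequiresSampling is true, read the logits for the next token.
Span<float> logits = conversation.Sample(offset: 0);
```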

Properties

Executor

The executor which this conversation belongs to

public BatchedExecutor Executor { get; }

Property Value

BatchedExecutor

ConversationId

Unique ID for this conversation

public LLamaSeqId ConversationId { get; }

Property Value

LLamaSeqId

TokenCount

Total number of tokens in this conversation, cannot exceed the context length.

public int TokenCount { get; }

Property Value

Int32

IsDisposed

Indicates if this conversation has been disposed. Nothing can be done with a disposed conversation.

public bool IsDisposed { get; }

Property Value

Boolean

RequiresInference

Indicates if this conversation is waiting for inference to be run on the executor. "Prompt" and "Sample" cannot be called when this is true.

public bool RequiresInference { get; }

Property Value

Boolean

RequiresSampling

Indicates that this conversation should be sampled.

public bool RequiresSampling { get; }

Property Value

Boolean

Methods

Finalize()

Finalizer for Conversation

protected void Finalize()

Dispose()

End this conversation, freeing all resources used by it

public void Dispose()

Exceptions

ObjectDisposedException

Fork()

Create a copy of the current conversation

public Conversation Fork()

Returns

Conversation

Exceptions

ObjectDisposedException

Remarks:

The copy shares internal state, so consumes very little extra memory.
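
For illustration, a hedged sketch of branching a conversation with Fork; `tokenA` and `tokenB` are hypothetical tokens obtained elsewhere:

```csharp
// Hedged sketch: fork a conversation to explore two continuations from
// the same shared prefix. The copy shares internal (KV) state, so the
// fork itself consumes very little extra memory.
using Conversation branchA = conversation.Fork();

// The two conversations can now be prompted independently; tokens added
// to one do not affect the other (tokenA/tokenB are hypothetical).
branchA.Prompt(tokenA);
conversation.Prompt(tokenB);
```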

GetSampleIndex(Int32)

Get the index in the context from which each token can be sampled. The return value of this function can be used to retrieve logits (SafeLLamaContextHandle.GetLogitsIth(Int32)) or to sample a token (SafeLLamaSamplerChainHandle.Sample(SafeLLamaContextHandle, Int32)).

public int GetSampleIndex(int offset)

Parameters

offset Int32
How far from the end of the previous prompt should logits be sampled. Any value other than 0 requires allLogits to have been set during prompting.
For example if 5 tokens were supplied in the last prompt call:

- offset = 0 samples from the position of the last (5th) token

- offset = 1 samples from the position of the 4th token

- offset = 4 samples from the position of the 1st token

Returns

Int32

Exceptions

ObjectDisposedException

CannotSampleRequiresPromptException
Thrown if this conversation was not prompted before the previous call to infer

CannotSampleRequiresInferenceException
Thrown if Infer() must be called on the executor
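
A hedged sketch of using GetSampleIndex with a non-zero offset; `promptTokens`, `samplerChain` (a SafeLLamaSamplerChainHandle), and `context` (a SafeLLamaContextHandle) are assumed from the surrounding API and are not documented on this page:

```csharp
// Hedged sketch: sample the token one position before the end of the
// previous prompt. A non-zero offset requires allLogits: true.
conversation.Prompt(promptTokens, allLogits: true);
await executor.Infer();

int index = conversation.GetSampleIndex(offset: 1);

// The index can then be fed to a sampler chain or used to read logits;
// samplerChain and context are assumptions here.
LLamaToken token = samplerChain.Sample(context, index);
```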

Sample(Int32)

Get the logits from this conversation, ready for sampling

public Span<float> Sample(int offset)

Parameters

offset Int32
How far from the end of the previous prompt should logits be sampled. Any value other than 0 requires allLogits to have been set during prompting

Returns

Span<Single>

Exceptions

ObjectDisposedException

CannotSampleRequiresPromptException
Thrown if this conversation was not prompted before the previous call to infer

CannotSampleRequiresInferenceException
Thrown if Infer() must be called on the executor

Prompt(List<LLamaToken>, Boolean)

Add tokens to this conversation

public void Prompt(List<LLamaToken> tokens, bool allLogits)

Parameters

tokens List<LLamaToken>

allLogits Boolean
If true, generate logits for all tokens. If false, only generate logits for the last token.

Exceptions

ObjectDisposedException

AlreadyPromptedConversationException

Prompt(ReadOnlySpan<LLamaToken>, Boolean)

Add tokens to this conversation

public void Prompt(ReadOnlySpan<LLamaToken> tokens, bool allLogits)

Parameters

tokens ReadOnlySpan<LLamaToken>

allLogits Boolean
If true, generate logits for all tokens. If false, only generate logits for the last token.

Exceptions

ObjectDisposedException

AlreadyPromptedConversationException

Prompt(LLamaToken)

Add a single token to this conversation

public void Prompt(LLamaToken token)

Parameters

token LLamaToken

Exceptions

ObjectDisposedException

AlreadyPromptedConversationException

Prompt(SafeLlavaImageEmbedHandle)

Prompt this conversation with an image embedding

public void Prompt(SafeLlavaImageEmbedHandle embedding)

Parameters

embedding SafeLlavaImageEmbedHandle

Prompt(ReadOnlySpan<Single>)

Prompt this conversation with embeddings

public void Prompt(ReadOnlySpan<float> embeddings)

Parameters

embeddings ReadOnlySpan<Single>
The raw values of the embeddings. The length of this span must be an exact multiple of the embedding size of this model.

Modify(ModifyKvCache)

Directly modify the KV cache of this conversation

public void Modify(ModifyKvCache modifier)

Parameters

modifier ModifyKvCache

Exceptions

CannotModifyWhileRequiresInferenceException
Thrown if this method is called while Conversation.RequiresInference == true
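
The exact shape of the ModifyKvCache delegate is not shown on this page. The sketch below assumes a signature along the lines of (current end position, KV accessor) → new end position, with a Remove operation on the accessor; treat all of these as assumptions rather than the documented API:

```csharp
// Hedged sketch, assuming ModifyKvCache receives the current end position
// and an accessor exposing a Remove(start, count) operation (unverified).
conversation.Modify((end, kv) =>
{
    // Drop the first 16 positions from this conversation's KV cache.
    kv.Remove(start: 0, count: 16);
    return end; // return the new end position of the conversation
});
```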

Save(String)

Save the complete state of this conversation to a file. If the file already exists it will be overwritten.

public void Save(string filepath)

Parameters

filepath String

Exceptions

CannotSaveWhileRequiresInferenceException

Save()

Save the complete state of this conversation in system memory.

public State Save()

Returns

State
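
As a hedged sketch, the two Save overloads can be used as follows; that State is nested inside Conversation is an assumption, as is the file name:

```csharp
// Hedged sketch: both overloads capture the complete conversation state,
// and neither can be called while RequiresInference is true.

// Persist to disk (overwrites the file if it already exists; the path
// is hypothetical).
conversation.Save("conversation.state");

// Or keep a snapshot in memory (State's nesting under Conversation is
// assumed, not confirmed by this page).
Conversation.State snapshot = conversation.Save();
```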
