LLMSessionManager

File: main_logic/core.py (~2300 lines)

The LLMSessionManager is the heart of N.E.K.O. — one instance per character, managing the entire conversation lifecycle.

Responsibilities

WebSocket connection management
LLM session creation and hot-swapping
TTS pipeline coordination
Audio resampling (24kHz → 48kHz)
Agent callback injection
Translation support

Key methods

`start_session(websocket, new, input_mode)`

Initializes a new LLM session:

Creates an OmniRealtimeClient with the character's configuration
Connects to the Realtime API via WebSocket
Starts the TTS worker thread (if voice output is enabled)
Begins background preparation of the next session for hot-swap

`stream_data(message)`

Processes incoming user input:

Audio: Sends PCM audio chunks to the Realtime API client
Text: Sends text messages to the LLM
Screen: Sends screenshots for multi-modal understanding

`handle_new_message()`

Called when the LLM produces output:

Routes text output to the TTS queue (or directly to WebSocket)
Sends emotion labels for expression mapping
Handles agent notifications

`end_session(by_server)`

Closes the current session and triggers hot-swap:

Closes the Realtime API WebSocket
Calls _perform_final_swap_sequence() for seamless transition
Flushes cached audio from the swap period

`cleanup(expected_websocket)`

Releases all resources when the WebSocket disconnects.

`trigger_agent_callbacks()`

Injects pending agent results into the next LLM conversation turn, allowing the character to reference agent findings.

`translate_if_needed(text)`

Translates text when user language differs from character language.

Thread model

Main async loop (FastAPI)
  ├── WebSocket recv loop
  ├── LLM event handlers (on_text_delta, on_audio_delta)
  │
  ├── TTS worker thread (queue consumer)
  │
  └── Background session preparation (hot-swap)

Integration points

WebSocket Router → calls start_session, stream_data, end_session
Agent Event Bridge → delivers results via pending_agent_callbacks
Config Manager → provides character data and API configuration
TTS Client → get_tts_worker() factory creates TTS workers

LLMSessionManager ​

Responsibilities ​

Key methods ​

start_session(websocket, new, input_mode) ​

stream_data(message) ​

handle_new_message() ​

end_session(by_server) ​

cleanup(expected_websocket) ​

trigger_agent_callbacks() ​

translate_if_needed(text) ​

Thread model ​

Integration points ​