UE5 clients cannot call Server RPCs on actors they don't own. NPC actors
are server-owned, causing "No owning connection" errors when remote clients
try to start conversations, send mic audio, or interrupt agents.
Solution: relay all client→NPC RPCs through the InteractionComponent on
the player's pawn (which IS owned by the client). The relay forwards
commands to the NPC's ElevenLabsComponent on the server side.
Changes:
- InteractionComponent: add Server relay RPCs (Start/End conversation,
mic audio, text message, interrupt) and Client relay RPCs
(ConversationStarted/Failed) with GetLifetimeReplicatedProps
- ElevenLabsComponent: implement FindLocalRelayComponent(), route all
client-side calls through relay (StartConversation, EndConversation,
SendTextMessage, InterruptAgent, FeedExternalAudio, mic capture)
- Fix HandleConnected/ServerRequestConversation to route Client RPCs
through the player pawn's relay instead of the NPC (no owning connection)
- Fix StartListening/FeedExternalAudio/StopListening to accept
bNetIsConversing on clients (WebSocket only exists on server)
- Fix EvaluateBestAgent to use NetConversatingPawn instead of
NetConversatingPlayer (NULL on remote clients due to bOnlyRelevantToOwner)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
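A minimal engine-free model of the ownership rule and the relay fix (all names here are illustrative, not the plugin's actual API): a direct Server RPC on a server-owned NPC is dropped, while the same command routed through a component on the client's own pawn reaches the NPC as a plain server-side call.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Engine-free model of the rule behind "No owning connection": a Server
// RPC is only delivered when the calling connection owns the target actor.
struct Actor {
    int OwningConnectionId;                 // -1 = server-owned (no owning connection)
    std::vector<std::string> ReceivedCommands;
};

// Direct Server RPC: dropped unless the caller owns the target actor.
bool CallServerRPC(int callerConnectionId, Actor& target, const std::string& cmd) {
    if (target.OwningConnectionId != callerConnectionId)
        return false;                       // "No owning connection": RPC silently dropped
    target.ReceivedCommands.push_back(cmd);
    return true;
}

// Relay: the client calls an RPC on its OWN pawn; once the call executes
// on the server, forwarding to the NPC is an ordinary function call with
// no ownership check involved.
bool RelayThroughPawn(int callerConnectionId, Actor& ownPawn, Actor& npc,
                      const std::string& cmd) {
    if (!CallServerRPC(callerConnectionId, ownPawn, "relay:" + cmd))
        return false;
    npc.ReceivedCommands.push_back(cmd);    // server-side forward to the NPC component
    return true;
}
```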
PostureComponent starts with bActive=false and waits for
OnConversationConnected, which only fires on the server (WebSocket).
Remote clients never got bActive=true, so CurrentActiveAlpha stayed
at 0 and all head/eye rotation was zeroed out.
Now OnRep_ConversationState also sets Posture->bActive alongside
TargetActor, matching what was already done for FacialExpression
and LipSync components.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Logs NetConversatingPawn, PostureComponent availability, and TargetActor
assignment to diagnose why head tracking doesn't work on remote clients.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Raw PCM is split into 32KB sub-chunks for network transmission.
On the client, these sub-chunks arrive nearly simultaneously,
triggering the "second chunk arrived" fast-path, which cancelled
the pre-buffer after ~10ms instead of the intended 2000ms.
Now the fast-path only applies on Authority (server) where
chunks represent genuine separate TTS batches. Clients always
wait the full pre-buffer duration via TickComponent timer.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
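The gating decision can be sketched engine-free (illustrative names, not the plugin's code): only the authority treats a second chunk as a new TTS batch worth cutting the pre-buffer short for.

```cpp
#include <cassert>

// On the server (authority) a second audio chunk means a genuine new TTS
// batch, so the pre-buffer can be cancelled early; on clients the 32KB
// sub-chunks of one batch arrive back-to-back, so the fast-path must be
// suppressed and the full pre-buffer duration honoured via the tick timer.
bool ShouldCancelPreBufferEarly(bool bHasAuthority, int chunksReceived) {
    if (!bHasAuthority)
        return false;                // clients always wait the full pre-buffer
    return chunksReceived >= 2;      // server: second chunk = real TTS batch
}
```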
PlayerControllers have bOnlyRelevantToOwner=true, so NetConversatingPlayer
was always nullptr on remote clients. Added NetConversatingPawn (APawn*)
which IS replicated to all clients. OnRep_ConversationState now uses
NetConversatingPawn as the posture TargetActor, enabling head/eye
tracking on all clients.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
UE5 limits replicated TArrays to 65535 elements. ElevenLabs sends
audio chunks up to ~72K bytes, exceeding this limit when sent as
raw PCM. Split into 32000-byte sub-chunks (1s of 16kHz 16-bit mono)
before calling MulticastReceiveAgentAudio.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
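The sub-chunking above can be sketched as follows (illustrative, not the plugin's exact code): split raw PCM into 32000-byte pieces (one second of 16kHz 16-bit mono) so every replicated array stays under the 65535-element limit.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Split a raw PCM buffer into fixed-size sub-chunks before multicasting.
std::vector<std::vector<uint8_t>> SplitIntoSubChunks(
    const std::vector<uint8_t>& pcm, std::size_t maxBytes = 32000) {
    std::vector<std::vector<uint8_t>> out;
    for (std::size_t off = 0; off < pcm.size(); off += maxBytes) {
        std::size_t len = std::min(maxBytes, pcm.size() - off);
        auto first = pcm.begin() + static_cast<std::ptrdiff_t>(off);
        out.emplace_back(first, first + static_cast<std::ptrdiff_t>(len));
    }
    return out;
}
```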
UE5.5's FVoiceModule returns NULL encoder/decoder, breaking
Opus-based audio replication. This adds a transparent fallback:
- Server sends raw PCM when OpusEncoder is null (~32KB/s, fine for LAN)
- Client accepts raw PCM when OpusDecoder is null
- Changed MulticastReceiveAgentAudio from Unreliable to Reliable
to handle larger uncompressed payloads without packet loss
- Added OnlineSubsystemNull config for future Opus compatibility
- Removed premature bAgentSpeaking=true from MulticastAgentStartedSpeaking
to fix race condition with audio initialization
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Log at InitOpusCodec: FVoiceModule availability, encoder/decoder creation, net role
- Log at HandleAudioReceived: warn once if encoder is null or role is not Authority
- These fire unconditionally (not behind bDebug) to catch the root cause
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Logs compressed audio size in HandleAudioReceived to diagnose
why MulticastReceiveAgentAudio (Unreliable) never reaches clients.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- OnRep_ConversationState now sets PostureComponent TargetActor from
replicated NetConversatingPlayer so remote clients see head/eyes tracking
- Activate FacialExpressionComponent and LipSyncComponent on remote clients
(OnAgentConnected never fires on clients since WebSocket is server-only)
- Fix audio race condition: MulticastAgentStartedSpeaking no longer sets
bAgentSpeaking prematurely, letting EnqueueAgentAudio handle the full
first-chunk initialization (pre-buffer, Play(), state reset)
- Add diagnostic logging to MulticastReceiveAgentAudio for silent failures
(OpusDecoder invalid, LOD culling, decode failure)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Introduces UPS_AI_ConvAgent_AgentConfig_ElevenLabs data asset to encapsulate
full agent configuration (voice, LLM, prompt, language, emotions) with a
custom Detail Customization providing:
- Voice/TTS Model/LLM/Language pickers with Fetch buttons (ElevenLabs API)
- LLM latency hints in dropdown (~250ms, ~700ms, etc.)
- Create/Update/Fetch Agent buttons for REST API CRUD
- Auto-fetch on editor open, auto-select first voice for new assets
- Prompt fragment management (language, multilingual, emotion tool)
- Smart defaults: gemini-2.5-flash LLM, eleven_turbo_v2_5 TTS, English
- Speed range expanded to 0.7-1.95 (was 0.7-1.2)
- bAutoStartConversation + StartConversationWithSelectedAgent() on InteractionComponent
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Saved/ is not staged in packaged builds, so Content/Certificates/ is the
only reliable location. Simplified code by removing Android-specific
writable fallback (Content/ works on all platforms with NonUFS staging).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Posture, FacialExpression, LipSync: bActive + ActivationBlendDuration for
smooth alpha blend in/out (linear interp via FInterpConstantTo).
- Auto-activation: components bind to OnAgentConnected/OnAgentDisconnected,
starting inactive and blending in when conversation begins.
- Without an agent component, bActive defaults to true (backward compatible).
- Add BlueprintFunctionLibrary with SetPostProcessAnimBlueprint helper
(wraps UE5.5 SetOverridePostProcessAnimBP for per-instance BP setup).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
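The blend above can be sketched engine-free; `InterpConstantTo` mirrors the behaviour of UE5's `FMath::FInterpConstantTo` (constant-rate step toward a target), and the names are illustrative.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Constant-rate interpolation: step toward target by dt * speed, snapping
// exactly to the target once within one step (mirrors FInterpConstantTo).
float InterpConstantTo(float current, float target, float dt, float speed) {
    float step = dt * speed;
    float delta = target - current;
    if (std::fabs(delta) <= step) return target;
    return current + std::copysign(step, delta);
}

// Per-tick alpha update: blend toward 1 when active, 0 when inactive,
// completing in blendDuration seconds (the ActivationBlendDuration role).
float TickActiveAlpha(float alpha, bool bActive, float dt, float blendDuration) {
    float speed = blendDuration > 0.f ? 1.f / blendDuration : 1e9f;
    return InterpConstantTo(alpha, bActive ? 1.f : 0.f, dt, speed);
}
```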
Whitelist Android in .uplugin Runtime module and handle
read-only APK paths for SSL certificate copy on Android
(ProjectSavedDir fallback). No change to Win64 behavior.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Only init, deinit, errors and critical warnings remain unconditional.
All turn-by-turn logs (mic open/close, agent generating/speaking/stopped,
emotion changes, audio chunks, text/viseme processing, latency, bone
resolution, selection, registration) are now gated behind component
bDebug or Settings->bVerboseLogging.
10 files, ~125 lines removed.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Packaged build fixes:
- Use EvaluateCurveData() for emotion and lip sync curves (works with
compressed/cooked data instead of raw FloatCurves)
- Add FMemMark wrapper for game-thread curve evaluation (FBlendedCurve
uses TMemStackAllocator)
- Lazy binding in AnimNodes and LipSyncComponent for packaged build
component initialization order
- SetIsReplicatedByDefault(true) instead of SetIsReplicated(true)
- Load AudioCapture module explicitly in plugin StartupModule
- Bundle cacert.pem for SSL in packaged builds
- Add DirectoriesToAlwaysStageAsNonUFS for certificates
- Add EmotionPoseMap and LipSyncPoseMap .cpp implementations
Body tracking:
- Body tracking now activates on conversation start (HandleAgentResponseStarted)
instead of on selection, creating a natural notice→engage two-step:
eyes+head track on selection, body turns when agent starts responding
- SendTextMessage also enables body tracking for text input
Cleanup:
- Remove all temporary [DIAG] and [BODY] debug logs
- Gate PostureComponent periodic debug log behind bDebug flag
- Remove obsolete editor-time curve caches (CachedCurveData, RebuildCurveCache,
FPS_AI_ConvAgent_CachedEmotionCurves, FPS_AI_ConvAgent_CachedPoseCurves)
from EmotionPoseMap and LipSyncPoseMap — no longer needed since
EvaluateCurveData() reads compressed curves directly at runtime
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Replicate conversation state (bNetIsConversing, NetConversatingPlayer) for exclusive NPC locking
- Opus encode TTS audio on server, multicast to all clients for shared playback
- Replicate emotion state (OnRep) so clients compute facial expressions locally
- Multicast speaking/interrupted/text events so lip sync and posture run locally
- Route mic audio via Server RPC (client→server→ElevenLabs WebSocket)
- LOD: cull audio beyond 30m, skip lip sync beyond 15m for non-speaker clients
- Auto-detect player disconnection and release NPC on authority
- InteractionComponent: skip occupied NPCs, auto-start conversation on selection
- No changes to LipSync, Posture, FacialExpression, MicCapture or AnimNodes
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
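The distance LOD rules above can be sketched as a small pure function (illustrative names; thresholds match the commit, in UE's centimetre units; the conversing player's client is always exempt):

```cpp
#include <cassert>

// Per-client LOD decision for a speaking NPC, based on viewer distance.
struct FAgentLOD { bool bPlayAudio; bool bRunLipSync; };

FAgentLOD EvaluateAgentLOD(float distanceCm, bool bIsSpeakerClient) {
    if (bIsSpeakerClient)
        return {true, true};             // conversing player: full fidelity
    return {distanceCm <= 3000.f,        // audio audible up to 30m
            distanceCm <= 1500.f};       // lip sync runs up to 15m
}
```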
Body tracking is now activated by ElevenLabsComponent directly
in StartListening() and SendTextMessage(), instead of being
managed by InteractionComponent. This ensures the agent turns
its body toward the player on any form of conversation input.
InteractionComponent still disables body tracking on deselection.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
PostureComponent gains bEnableBodyTracking flag. When false, only
head and eyes track the target — body stays frozen.
InteractionComponent now:
- On posture attach: sets TargetActor but disables body tracking
(agent notices player with eyes+head only)
- On StartListening: enables body tracking (agent fully engages)
- On StopListening: disables body tracking
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
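The notice-then-engage two-step can be sketched as follows (illustrative names): with a target selected, head and eyes always track, while the body joins only once body tracking is enabled.

```cpp
#include <cassert>

// Per-layer tracking weights for the posture system.
struct FTrackingWeights { float Head; float Eyes; float Body; };

FTrackingWeights ComputeTrackingWeights(bool bHasTarget, bool bEnableBodyTracking) {
    if (!bHasTarget)
        return {0.f, 0.f, 0.f};          // deselected: everything neutral
    // Selected: eyes+head notice the player; body engages only when enabled.
    return {1.f, 1.f, bEnableBodyTracking ? 1.f : 0.f};
}
```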
Automatically calls StartListening/StopListening on the agent's
ElevenLabsComponent on selection/deselection. Enabled by default.
Disable for manual control (e.g. push-to-talk).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
PostureComponent: body no longer returns to original yaw when
TargetActor is cleared — only head and eyes return to neutral.
InteractionComponent: add AgentEyeLevelOffset (default 150cm) so
the view cone check targets chest height instead of feet, preventing
selection loss at close range.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
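The view-cone fix can be sketched engine-free (illustrative names and math; UE units are centimetres): aim the cone test at chest height (actor location plus AgentEyeLevelOffset up) instead of the actor origin at the feet.

```cpp
#include <cassert>
#include <cmath>

struct FVec { float X, Y, Z; };

// True when the (offset) agent target lies within halfAngleDeg of viewDir.
bool IsInViewCone(FVec viewOrigin, FVec viewDir, FVec agentLocation,
                  float eyeLevelOffsetCm, float halfAngleDeg) {
    FVec target{agentLocation.X, agentLocation.Y,
                agentLocation.Z + eyeLevelOffsetCm};     // chest, not feet
    FVec to{target.X - viewOrigin.X, target.Y - viewOrigin.Y,
            target.Z - viewOrigin.Z};
    float len = std::sqrt(to.X * to.X + to.Y * to.Y + to.Z * to.Z);
    if (len <= 1e-4f) return true;                       // on top of target
    float cosAngle = (to.X * viewDir.X + to.Y * viewDir.Y + to.Z * viewDir.Z) / len;
    return cosAngle >= std::cos(halfAngleDeg * 3.14159265f / 180.f);
}
```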
InteractionComponent now automatically sets/clears the agent's
PostureComponent TargetActor on selection/deselection, with
configurable attach/detach delays and a master toggle for manual control.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Aligns all class/struct/enum/delegate prefixes with the module name
PS_AI_ConvAgent. Removes redundant Conv_ from ElevenLabsComponent
(PS_AI_Agent_Conv_ElevenLabsComponent → PS_AI_ConvAgent_ElevenLabsComponent).
UI strings now use "PS AI ConvAgent". CoreRedirects updated for both
ElevenLabs* and PS_AI_Agent_* legacy references.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Separates generic systems (lip sync, posture, facial expression, mic)
from ElevenLabs-specific ones (conv agent, websocket, settings).
Generic classes now use PS_AI_Agent_ prefix; ElevenLabs-specific keep
_ElevenLabs suffix. CoreRedirects in DefaultEngine.ini ensure existing
.uasset references load correctly. Binaries/Intermediate deleted for
full rebuild.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add BodyDriftCompensation parameter (0→1) to counter-rotate head when
body animation moves the torso (bow, lean). Drift correction applied
AFTER swing-twist on the first chain bone only, preventing the
correction from being stripped by tilt decomposition or compounded
through the chain.
- Add bDrawDebugGaze toggle: draws per-eye debug lines from Face mesh
FACIAL_L/R_Eye bones (cyan = desired direction, green = actual bone
Z-axis gaze) to visually verify eye contact accuracy.
- Cache Face mesh separately from Body mesh for correct eye bone transforms.
- Use eye bone midpoint (Face mesh) as EyeOrigin for pitch calculation
instead of head bone, fixing vertical offset.
- Calibrate ARKit eye ranges: horizontal 30→40, vertical 20→35 to match
MetaHuman actual eye deflection per curve unit.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
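The swing-twist decomposition the node relies on can be sketched in plain C++ (illustrative, engine-free): twist is the rotation component about a given axis, swing is the remainder, and q recomposes as swing * twist. The commit's fix corresponds to passing the bone's actual (reference-pose rotated) tilt axis here instead of a world constant.

```cpp
#include <cassert>
#include <cmath>

struct Quat { float X, Y, Z, W; };

// Hamilton product a * b.
Quat Mul(const Quat& a, const Quat& b) {
    return {a.W * b.X + a.X * b.W + a.Y * b.Z - a.Z * b.Y,
            a.W * b.Y - a.X * b.Z + a.Y * b.W + a.Z * b.X,
            a.W * b.Z + a.X * b.Y - a.Y * b.X + a.Z * b.W,
            a.W * b.W - a.X * b.X - a.Y * b.Y - a.Z * b.Z};
}

Quat Conjugate(const Quat& q) { return {-q.X, -q.Y, -q.Z, q.W}; }

// Decompose unit quaternion q into swing * twist, where twist rotates
// about the (normalised) axis. Project the vector part onto the axis,
// renormalise, and take the remainder as swing.
void SwingTwist(const Quat& q, const float axis[3], Quat& swing, Quat& twist) {
    float d = q.X * axis[0] + q.Y * axis[1] + q.Z * axis[2];
    twist = {axis[0] * d, axis[1] * d, axis[2] * d, q.W};
    float n = std::sqrt(twist.X * twist.X + twist.Y * twist.Y +
                        twist.Z * twist.Z + twist.W * twist.W);
    if (n < 1e-6f) { twist = {0.f, 0.f, 0.f, 1.f}; }   // 180-degree swing edge case
    else { twist.X /= n; twist.Y /= n; twist.Z /= n; twist.W /= n; }
    swing = Mul(q, Conjugate(twist));                  // q == swing * twist
}
```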
- Fix eye compensation to use smooth Lerp blend instead of binary switch.
Uses Curve.Get() to read animation's existing CTRL values, then
Lerp(animCTRL, postureCTRL, Comp) for proportional blending.
- Split AnimationCompensation into HeadAnimationCompensation and
EyeAnimationCompensation for independent control per layer.
- Fix CTRL curve naming: use correct MetaHuman format
(CTRL_expressions_eyeLook{Dir}{L/R}) with proper In/Out→Left/Right mapping.
- Add eye diagnostic modes (disabled) for pipeline debugging.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
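The proportional blend above reduces to a plain Lerp (illustrative name): read the animation's existing CTRL curve value and interpolate toward the posture value by the compensation factor, instead of a binary switch.

```cpp
#include <cassert>
#include <cmath>

// compensation 0 = keep the animation's CTRL value untouched,
// compensation 1 = full posture override, in between = proportional blend.
float BlendEyeCtrl(float animValue, float postureValue, float compensation) {
    return animValue + (postureValue - animValue) * compensation;
}
```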
- Add NeckBoneChain config (FElevenLabsNeckBoneEntry USTRUCT) to distribute
head rotation across multiple bones (e.g. neck_01=0.25, neck_02=0.35,
head=0.40) for more natural neck arc movement.
- Swing-twist now uses bone's actual tilt axis (BoneRot.RotateVector) instead
of hardcoded FVector::RightVector — accounts for MetaHuman ~11.5° reference
pose rotation on head bone.
- Clean up diagnostic logging: reduced to Log level, removed verbose per-frame
quaternion dumps.
- Backward compatible: empty NeckBoneChain falls back to single HeadBoneName.
KNOWN ISSUE: diagonal tilt is more visible with neck chain — needs investigation.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>