Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.medera.info/llms.txt

Use this file to discover all available pages before exploring further.

The Co-Therapy Agent is a silent multimodal partner — it never speaks to the patient. It captures audio, video, and language during the session and fuses the signal into 15 RDoC construct activations on a private clinician channel.

Architecture

   AUDIO + VIDEO  →  Multimodal Engine  →  RDoC Construct Computer


                                          Clinician private channel


                                          Session note + outcomes

Multimodal channels

ChannelEngineOutputs
AudioVocal Acoustic EngineF0, jitter, shimmer, HNR, MFCC, prosodic flatness, depression/anxiety/distress indices
VideoFacial Physiological EngineHR, BP, HRV (SDNN/RMSSD/LF-HF), respiration rate, stress index, affect
LanguageLinguistic Content ExpertTopic, valence, certainty, pronoun shift
These fuse into 15 RDoC construct activations. See RDoC Overview.

Steps

1

Capture consent

Recording requires explicit patient consent per session. Configure your consent flow in the Console.
2

Start the session

POST /api/therapy-sessions/start with the participants and modality. Returns a session_id and a multimodal WebSocket URL.
3

Stream audio + video

Stream PCM audio at 16 kHz and video frames (MediaPipe 468-point landmarks) over the multimodal WebSocket.
4

Receive construct activations

The clinician’s private channel emits rdoc.activation events with feature contributions and confidence.
5

Close the session

POST /api/therapy-sessions/stop finalizes the session. The Co-Therapy Agent drafts the session note with construct-anchored claims.

Co-Therapy Agent

Agent-level documentation.

Facial Engine

Physiological signals from video.

Vocal Engine

Acoustic features from audio.

RDoC Constructs

15 documented constructs.