Requirements
- Target platform
- OpenClaw
- Install method
- Manual import
- Extraction
- Extract archive
- Prerequisites
- OpenClaw
- Primary doc
- SKILL.md
Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees your camera, hears your voice, and responds with its own personality and memory. Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token. Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text → OpenClaw gateway → your configured LLM provider (may be cloud — frames leave the machine if using a cloud LLM). Installs a persistent launchd service (optional). Trigger phrases: "video chat", "voice call", "call me", "视频一下", "语音", "打电话给我", "我要和你视频", "videochat-withme".
Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees your camera, hears your voice, and responds with its own personality and memory. Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token. Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text → OpenClaw gateway → your configured LLM provider (may be cloud — frames leave the machine if using a cloud LLM). Installs a persistent launchd service (optional). Trigger phrases: "video chat", "voice call", "call me", "视频一下", "语音", "打电话给我", "我要和你视频", "videochat-withme".
Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.
I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Then review README.md for any prerequisites, environment setup, or post-install checks. Tell me what you changed and call out any manual steps you could not complete.
I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Then review README.md for any prerequisites, environment setup, or post-install checks. Summarize what changed and any follow-up checks I should run.
Real-time video call with your OpenClaw agent — full personality, memory, and vision.
New users run once after installing the skill: bash skills/videochat-withme/scripts/setup.sh This handles everything: dependencies, Groq API key, SSL certs, launchd service.
macOS (launchd required) Python 3.10+, ffmpeg OpenClaw gateway running with chatCompletions enabled
Get a free key at: https://console.groq.com/keys Save it: mkdir -p ~/.openclaw/secrets echo "your-key-here" > ~/.openclaw/secrets/groq_api_key.txt Or set env var: export GROQ_API_KEY="your-key-here"
Add to ~/.openclaw/openclaw.json: { "gateway": { "http": { "endpoints": { "chatCompletions": { "enabled": true } } } } } Then restart OpenClaw.
When the user requests a video/voice call: Step 1: Check if service is running: curl -sk https://localhost:8766/api/config 2>/dev/null || curl -s http://localhost:8766/api/config 2>/dev/null Step 2: If no response, setup needed: Check Groq key: cat ~/.openclaw/secrets/groq_api_key.txt 2>/dev/null If missing, ask user to get one at https://console.groq.com/keys Save it: echo "key" > ~/.openclaw/secrets/groq_api_key.txt Ask user: "What name should I display for you in the video call?" Run setup: bash skills/videochat-withme/scripts/setup.sh --auto --agent-name "YourName" --user-name "TheirName" Step 3: Initiate the call based on context: Determine how the user is connecting and pick the best method: User is at the computer (message from webchat/desktop): bash skills/videochat-withme/scripts/call.sh This pops up a macOS incoming call notification → user clicks Accept → browser opens. User is on mobile/remote (message from Telegram/phone): Pick the right URL automatically: # Prefer Tailscale IP (works from any network) TS_IP=$(tailscale ip -4 2>/dev/null) # Fallback to local IP (same WiFi only) LOCAL_IP=$(python3 -c "import socket; s=socket.socket(socket.AF_INET,socket.SOCK_DGRAM); s.connect(('8.8.8.8',80)); print(s.getsockname()[0]); s.close()" 2>/dev/null) If Tailscale is available → send https://<tailscale-ip>:8766 (works everywhere) Otherwise → send https://<local-ip>:8766 (same WiFi only) Note: first visit requires tapping "Advanced → Continue" (self-signed cert)
🎤 Voice → Groq Whisper (STT) 📷 Camera → base64 frame ↓ OpenClaw /v1/chat/completions → Your Agent ↓ edge-tts (TTS) → 🔊 Audio playback
Agent runs these automatically: ScriptWhensetup.sh --autoFirst use (service not running)call.shEvery call request User can run manually if needed: ScriptPurposesetup.shInteractive setup (without --auto)start.shStart servicestop.shStop service
VariableDefaultDescriptionGROQ_API_KEY(secrets file)Groq API key for Whisper STTPORT8766Server portAGENT_NAMEAI AssistantDisplay name for the agentUSER_NAMEUserDisplay name for the userSSL_CERT(auto-detect)Path to SSL certificateSSL_KEY(auto-detect)Path to SSL private key
Code helpers, APIs, CLIs, browser automation, testing, and developer operations.
Largest current source with strong distribution and engagement signals.