{
  "schemaVersion": "1.0",
  "item": {
    "slug": "elevenlabs-voice",
    "name": "it will help you to send voice messages to your AI Assistant and also can make it talk",
    "source": "tencent",
    "type": "skill",
    "category": "通讯协作",
    "sourceUrl": "https://clawhub.ai/amreahmed/elevenlabs-voice",
    "canonicalUrl": "https://clawhub.ai/amreahmed/elevenlabs-voice",
    "targetPlatform": "OpenClaw"
  },
  "install": {
    "downloadMode": "redirect",
    "downloadUrl": "/downloads/elevenlabs-voice",
    "sourceDownloadUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=elevenlabs-voice",
    "sourcePlatform": "tencent",
    "targetPlatform": "OpenClaw",
    "installMethod": "Manual import",
    "extraction": "Extract archive",
    "prerequisites": [
      "OpenClaw"
    ],
    "packageFormat": "ZIP package",
    "includedAssets": [
      "SKILL.md",
      "scripts/elevenlabs_scribe.py",
      "scripts/elevenlabs_speech.py"
    ],
    "primaryDoc": "SKILL.md",
    "quickSetup": [
      "Download the package from Yavira.",
      "Extract the archive and review SKILL.md first.",
      "Import or place the package into your OpenClaw setup."
    ],
    "agentAssist": {
      "summary": "Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.",
      "steps": [
        "Download the package from Yavira.",
        "Extract it into a folder your agent can access.",
        "Paste one of the prompts below and point your agent at the extracted folder."
      ],
      "prompts": [
        {
          "label": "New install",
          "body": "I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete."
        },
        {
          "label": "Upgrade existing",
          "body": "I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run."
        }
      ]
    },
    "sourceHealth": {
      "source": "tencent",
      "status": "healthy",
      "reason": "direct_download_ok",
      "recommendedAction": "download",
      "checkedAt": "2026-04-30T16:55:25.780Z",
      "expiresAt": "2026-05-07T16:55:25.780Z",
      "httpStatus": 200,
      "finalUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
      "contentType": "application/zip",
      "probeMethod": "head",
      "details": {
        "probeUrl": "https://wry-manatee-359.convex.site/api/v1/download?slug=network",
        "contentDisposition": "attachment; filename=\"network-1.0.0.zip\"",
        "redirectLocation": null,
        "bodySnippet": null
      },
      "scope": "source",
      "summary": "Source download looks usable.",
      "detail": "Yavira can redirect you to the upstream package for this source.",
      "primaryActionLabel": "Download for OpenClaw",
      "primaryActionHref": "/downloads/elevenlabs-voice"
    },
    "validation": {
      "installChecklist": [
        "Use the Yavira download entry.",
        "Review SKILL.md after the package is downloaded.",
        "Confirm the extracted package contains the expected setup assets."
      ],
      "postInstallChecks": [
        "Confirm the extracted package includes the expected docs or setup files.",
        "Validate the skill or prompts are available in your target agent workspace.",
        "Capture any manual follow-up steps the agent could not complete."
      ]
    },
    "downloadPageUrl": "https://openagent3.xyz/downloads/elevenlabs-voice",
    "agentPageUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent",
    "manifestUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent.md"
  },
  "agentAssist": {
    "summary": "Hand the extracted package to your coding agent with a concrete install brief instead of figuring it out manually.",
    "steps": [
      "Download the package from Yavira.",
      "Extract it into a folder your agent can access.",
      "Paste one of the prompts below and point your agent at the extracted folder."
    ],
    "prompts": [
      {
        "label": "New install",
        "body": "I downloaded a skill package from Yavira. Read SKILL.md from the extracted folder and install it by following the included instructions. Tell me what you changed and call out any manual steps you could not complete."
      },
      {
        "label": "Upgrade existing",
        "body": "I downloaded an updated skill package from Yavira. Read SKILL.md from the extracted folder, compare it with my current installation, and upgrade it while preserving any custom configuration unless the package docs explicitly say otherwise. Summarize what changed and any follow-up checks I should run."
      }
    ]
  },
  "documentation": {
    "source": "clawhub",
    "primaryDoc": "SKILL.md",
    "sections": [
      {
        "title": "ElevenLabs Speech",
        "body": "Complete voice solution — both TTS and STT using one API:\n\nTTS: Text-to-Speech (high-quality voices)\nSTT: Speech-to-Text via Scribe (accurate transcription)"
      },
      {
        "title": "Environment Setup",
        "body": "Set your API key:\n\nexport ELEVENLABS_API_KEY=\"sk_...\"\n\nOr create .env file in workspace root."
      },
      {
        "title": "Text-to-Speech (TTS)",
        "body": "Convert text to natural-sounding speech:\n\npython scripts/elevenlabs_speech.py tts -t \"Hello world\" -o greeting.mp3\n\nWith custom voice:\n\npython scripts/elevenlabs_speech.py tts -t \"Hello\" -v \"voice_id_here\" -o output.mp3"
      },
      {
        "title": "List Available Voices",
        "body": "python scripts/elevenlabs_speech.py voices"
      },
      {
        "title": "Using in Code",
        "body": "from scripts.elevenlabs_speech import ElevenLabsClient\n\nclient = ElevenLabsClient(api_key=\"sk_...\")\n\n# Basic TTS\nresult = client.text_to_speech(\n    text=\"Hello from zerox\",\n    output_path=\"greeting.mp3\"\n)\n\n# With custom settings\nresult = client.text_to_speech(\n    text=\"Your text here\",\n    voice_id=\"21m00Tcm4TlvDq8ikWAM\",  # Rachel\n    stability=0.5,\n    similarity_boost=0.75,\n    output_path=\"output.mp3\"\n)\n\n# Get available voices\nvoices = client.get_voices()\nfor voice in voices['voices']:\n    print(f\"{voice['name']}: {voice['voice_id']}\")"
      },
      {
        "title": "Popular Voices",
        "body": "Voice IDNameDescription21m00Tcm4TlvDq8ikWAMRachelNatural, versatile (default)AZnzlk1XvdvUeBnXmlldDomiStrong, energeticEXAVITQu4vr4xnSDxMaLBellaSoft, soothingErXwobaYiN019PkySvjVAntoniWell-roundedMF3mGyEYCl7XYWbV9V6OElliWarm, friendlyTxGEqnHWrfWFTfGW9XjXJoshDeep, calmVR6AewLTigWG4xSOukaGArnoldAuthoritative"
      },
      {
        "title": "Voice Settings",
        "body": "stability (0-1): Lower = more emotional, Higher = more stable\nsimilarity_boost (0-1): Higher = closer to original voice\n\nDefault: stability=0.5, similarity_boost=0.75"
      },
      {
        "title": "Models",
        "body": "eleven_turbo_v2_5 - Fast, high quality (default)\neleven_multilingual_v2 - Best for non-English\neleven_monolingual_v1 - English only"
      },
      {
        "title": "Integration with Telegram",
        "body": "When user sends text and wants voice reply:\n\n# Generate speech\nresult = client.text_to_speech(text=user_text, output_path=\"reply.mp3\")\n\n# Send via Telegram message tool with media path\nmessage(action=\"send\", media=\"path/to/reply.mp3\", as_voice=True)"
      },
      {
        "title": "Pricing",
        "body": "Check https://elevenlabs.io/pricing for current rates. Free tier available!"
      },
      {
        "title": "Speech-to-Text (STT) with ElevenLabs Scribe",
        "body": "Transcribe voice messages using ElevenLabs Scribe:"
      },
      {
        "title": "Transcribe Audio",
        "body": "python scripts/elevenlabs_scribe.py voice_message.ogg\n\nWith specific language:\n\npython scripts/elevenlabs_scribe.py voice_message.ogg --language ara\n\nWith speaker diarization (multiple speakers):\n\npython scripts/elevenlabs_scribe.py voice_message.ogg --speakers 2"
      },
      {
        "title": "Using in Code",
        "body": "from scripts.elevenlabs_scribe import ElevenLabsScribe\n\nclient = ElevenLabsScribe(api_key=\"sk-...\")\n\n# Basic transcription\nresult = client.transcribe(\"voice_message.ogg\")\nprint(result['text'])\n\n# With language hint (improves accuracy)\nresult = client.transcribe(\"voice_message.ogg\", language_code=\"ara\")\n\n# With speaker detection\nresult = client.transcribe(\"voice_message.ogg\", num_speakers=2)"
      },
      {
        "title": "Supported Formats",
        "body": "mp3, mp4, mpeg, mpga, m4a, wav, webm\nMax file size: 100 MB\nWorks great with Telegram voice messages (.ogg)"
      },
      {
        "title": "Language Support",
        "body": "Scribe supports 99 languages including:\n\nArabic (ara)\nEnglish (eng)\nSpanish (spa)\nFrench (fra)\nAnd many more...\n\nWithout language hint, it auto-detects."
      },
      {
        "title": "Complete Workflow Example",
        "body": "User sends voice message → You reply with voice:\n\nfrom scripts.elevenlabs_scribe import ElevenLabsScribe\nfrom scripts.elevenlabs_speech import ElevenLabsClient\n\n# 1. Transcribe user's voice message\nstt = ElevenLabsScribe()\ntranscription = stt.transcribe(\"user_voice.ogg\")\nuser_text = transcription['text']\n\n# 2. Process/understand the text\n# ... your logic here ...\n\n# 3. Generate response text\nresponse_text = \"Your response here\"\n\n# 4. Convert to speech\ntts = ElevenLabsClient()\ntts.text_to_speech(response_text, output_path=\"reply.mp3\")\n\n# 5. Send voice reply\nmessage(action=\"send\", media=\"reply.mp3\", as_voice=True)"
      },
      {
        "title": "Pricing",
        "body": "Check https://elevenlabs.io/pricing for current rates:\n\nTTS (Text-to-Speech):\n\nFree tier: 10,000 characters/month\nPaid plans available\n\nSTT (Speech-to-Text) - Scribe:\n\nFree tier available\nCheck website for current pricing"
      }
    ],
    "body": "ElevenLabs Speech\n\nComplete voice solution — both TTS and STT using one API:\n\nTTS: Text-to-Speech (high-quality voices)\nSTT: Speech-to-Text via Scribe (accurate transcription)\nQuick Start\nEnvironment Setup\n\nSet your API key:\n\nexport ELEVENLABS_API_KEY=\"sk_...\"\n\n\nOr create .env file in workspace root.\n\nText-to-Speech (TTS)\n\nConvert text to natural-sounding speech:\n\npython scripts/elevenlabs_speech.py tts -t \"Hello world\" -o greeting.mp3\n\n\nWith custom voice:\n\npython scripts/elevenlabs_speech.py tts -t \"Hello\" -v \"voice_id_here\" -o output.mp3\n\nList Available Voices\npython scripts/elevenlabs_speech.py voices\n\nUsing in Code\nfrom scripts.elevenlabs_speech import ElevenLabsClient\n\nclient = ElevenLabsClient(api_key=\"sk_...\")\n\n# Basic TTS\nresult = client.text_to_speech(\n    text=\"Hello from zerox\",\n    output_path=\"greeting.mp3\"\n)\n\n# With custom settings\nresult = client.text_to_speech(\n    text=\"Your text here\",\n    voice_id=\"21m00Tcm4TlvDq8ikWAM\",  # Rachel\n    stability=0.5,\n    similarity_boost=0.75,\n    output_path=\"output.mp3\"\n)\n\n# Get available voices\nvoices = client.get_voices()\nfor voice in voices['voices']:\n    print(f\"{voice['name']}: {voice['voice_id']}\")\n\nPopular Voices\nVoice ID\tName\tDescription\n21m00Tcm4TlvDq8ikWAM\tRachel\tNatural, versatile (default)\nAZnzlk1XvdvUeBnXmlld\tDomi\tStrong, energetic\nEXAVITQu4vr4xnSDxMaL\tBella\tSoft, soothing\nErXwobaYiN019PkySvjV\tAntoni\tWell-rounded\nMF3mGyEYCl7XYWbV9V6O\tElli\tWarm, friendly\nTxGEqnHWrfWFTfGW9XjX\tJosh\tDeep, calm\nVR6AewLTigWG4xSOukaG\tArnold\tAuthoritative\nVoice Settings\nstability (0-1): Lower = more emotional, Higher = more stable\nsimilarity_boost (0-1): Higher = closer to original voice\n\nDefault: stability=0.5, similarity_boost=0.75\n\nModels\neleven_turbo_v2_5 - Fast, high quality (default)\neleven_multilingual_v2 - Best for non-English\neleven_monolingual_v1 - English only\nIntegration with Telegram\n\nWhen user sends text and wants voice reply:\n\n# Generate speech\nresult = client.text_to_speech(text=user_text, output_path=\"reply.mp3\")\n\n# Send via Telegram message tool with media path\nmessage(action=\"send\", media=\"path/to/reply.mp3\", as_voice=True)\n\nPricing\n\nCheck https://elevenlabs.io/pricing for current rates. Free tier available!\n\nSpeech-to-Text (STT) with ElevenLabs Scribe\n\nTranscribe voice messages using ElevenLabs Scribe:\n\nTranscribe Audio\npython scripts/elevenlabs_scribe.py voice_message.ogg\n\n\nWith specific language:\n\npython scripts/elevenlabs_scribe.py voice_message.ogg --language ara\n\n\nWith speaker diarization (multiple speakers):\n\npython scripts/elevenlabs_scribe.py voice_message.ogg --speakers 2\n\nUsing in Code\nfrom scripts.elevenlabs_scribe import ElevenLabsScribe\n\nclient = ElevenLabsScribe(api_key=\"sk-...\")\n\n# Basic transcription\nresult = client.transcribe(\"voice_message.ogg\")\nprint(result['text'])\n\n# With language hint (improves accuracy)\nresult = client.transcribe(\"voice_message.ogg\", language_code=\"ara\")\n\n# With speaker detection\nresult = client.transcribe(\"voice_message.ogg\", num_speakers=2)\n\nSupported Formats\nmp3, mp4, mpeg, mpga, m4a, wav, webm\nMax file size: 100 MB\nWorks great with Telegram voice messages (.ogg)\nLanguage Support\n\nScribe supports 99 languages including:\n\nArabic (ara)\nEnglish (eng)\nSpanish (spa)\nFrench (fra)\nAnd many more...\n\nWithout language hint, it auto-detects.\n\nComplete Workflow Example\n\nUser sends voice message → You reply with voice:\n\nfrom scripts.elevenlabs_scribe import ElevenLabsScribe\nfrom scripts.elevenlabs_speech import ElevenLabsClient\n\n# 1. Transcribe user's voice message\nstt = ElevenLabsScribe()\ntranscription = stt.transcribe(\"user_voice.ogg\")\nuser_text = transcription['text']\n\n# 2. Process/understand the text\n# ... your logic here ...\n\n# 3. Generate response text\nresponse_text = \"Your response here\"\n\n# 4. Convert to speech\ntts = ElevenLabsClient()\ntts.text_to_speech(response_text, output_path=\"reply.mp3\")\n\n# 5. Send voice reply\nmessage(action=\"send\", media=\"reply.mp3\", as_voice=True)\n\nPricing\n\nCheck https://elevenlabs.io/pricing for current rates:\n\nTTS (Text-to-Speech):\n\nFree tier: 10,000 characters/month\nPaid plans available\n\nSTT (Speech-to-Text) - Scribe:\n\nFree tier available\nCheck website for current pricing"
  },
  "trust": {
    "sourceLabel": "tencent",
    "provenanceUrl": "https://clawhub.ai/amreahmed/elevenlabs-voice",
    "publisherUrl": "https://clawhub.ai/amreahmed/elevenlabs-voice",
    "owner": "amreahmed",
    "version": "1.0.0",
    "license": null,
    "verificationStatus": "Indexed source record"
  },
  "links": {
    "detailUrl": "https://openagent3.xyz/skills/elevenlabs-voice",
    "downloadUrl": "https://openagent3.xyz/downloads/elevenlabs-voice",
    "agentUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent",
    "manifestUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent.json",
    "briefUrl": "https://openagent3.xyz/skills/elevenlabs-voice/agent.md"
  }
}