Skip to content

{{ community_contribution_banner }}

strands-deepgram is a production-ready speech and audio processing tool powered by Deepgram’s AI platform with 30+ language support.

Terminal window
pip install strands-deepgram
from strands import Agent
from strands_deepgram import deepgram
agent = Agent(tools=[deepgram])
# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")
# Text-to-speech
agent("convert this text to speech: Hello world")
# Audio intelligence
agent("analyze sentiment in call.wav")
  • Speech-to-Text: 30+ language support and speaker diarization
  • Text-to-Speech: Natural-sounding voices (Aura series)
  • Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
  • Speaker Diarization: Identify and separate different speakers
  • Multi-format Support: WAV, MP3, M4A, FLAC, and more
  • Real-time Processing: Streaming capabilities for live audio
Terminal window
DEEPGRAM_API_KEY=your_deepgram_api_key # Required
DEEPGRAM_DEFAULT_MODEL=nova-3 # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en # Optional

Get your API key at: console.deepgram.com