Skip to content

community
- integrations
- model-providers
- session-managers
- tools
examples
- cdk
  - deploy_to_ec2
  - deploy_to_fargate
  - deploy_to_lambda
- deploy_to_eks
- python
  - multi_agent_example
- typescript
  - deploy_to_bedrock_agentcore
user-guide
- concepts
  - agents
  - bidirectional-streaming
    
    models
  - experimental
  - model-providers
  - multi-agent
  - streaming
  - tools
- deploy
  - deploy_to_bedrock_agentcore
  - deploy_to_docker
- evals-sdk
  - evaluators
  - how-to
  - simulators
- observability-evaluation
- quickstart
- safety-security

strands-deepgram

{{ community_contribution_banner }}

strands-deepgram is a production-ready speech and audio processing tool powered by Deepgram’s AI platform with 30+ language support.

Installation

pip install strands-deepgram

Usage

from strands import Agent
from strands_deepgram import deepgram

agent = Agent(tools=[deepgram])

# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")

# Text-to-speech
agent("convert this text to speech: Hello world")

# Audio intelligence
agent("analyze sentiment in call.wav")

Key Features

Speech-to-Text: 30+ language support and speaker diarization
Text-to-Speech: Natural-sounding voices (Aura series)
Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
Speaker Diarization: Identify and separate different speakers
Multi-format Support: WAV, MP3, M4A, FLAC, and more
Real-time Processing: Streaming capabilities for live audio

Configuration

DEEPGRAM_API_KEY=your_deepgram_api_key    # Required
DEEPGRAM_DEFAULT_MODEL=nova-3             # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en              # Optional

Get your API key at: console.deepgram.com

Resources