Deepgram CLI (`dg`)

What is Deepgram CLI (`dg`)?

Deepgram CLI is a command-line tool that brings Deepgram's speech AI capabilities directly to your terminal. It allows you to transcribe audio files, stream live audio for real-time transcription, convert text to speech, and interact with the Deepgram API without leaving your command line. The tool works on macOS, Linux, and Windows with a straightforward installation process. It's designed for developers, engineers, and technical users who prefer working in terminal environments or need to integrate speech processing into scripts and automation workflows.

Key Features

  • Audio file transcription: convert recorded audio files to text using Deepgram's speech recognition models
  • Live streaming transcription: process real-time audio input and receive transcriptions as speech is captured
  • Text-to-speech synthesis: generate spoken audio from text strings via the command line
  • Direct API access: call Deepgram API endpoints directly from the terminal for customised requests
  • Cross-platform support: runs on macOS, Linux, and Windows with minimal setup
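
To give a rough idea of what a file-transcription request involves, here is a minimal Python sketch. It talks to Deepgram's REST API directly and does not use the `dg` CLI, whose commands are not shown on this page. It assumes the pre-recorded endpoint `https://api.deepgram.com/v1/listen`, a `Token` authorization header, and a JSON response with the transcript under `results.channels[0].alternatives[0].transcript`. The helper names `build_listen_request` and `extract_transcript` are hypothetical and exist only for illustration.

```python
import json
import urllib.request

# Assumption: Deepgram's hosted endpoint for pre-recorded audio.
LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_request(
    audio: bytes, api_key: str, content_type: str = "audio/wav"
) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying raw audio bytes."""
    return urllib.request.Request(
        LISTEN_URL,
        data=audio,
        method="POST",
        headers={
            # Assumption: API keys are sent as "Token <key>".
            "Authorization": f"Token {api_key}",
            "Content-Type": content_type,
        },
    )


def extract_transcript(response_body: bytes) -> str:
    """Return the first alternative's transcript from a JSON response body."""
    payload = json.loads(response_body)
    # Assumption: the response nests the text as shown below.
    return payload["results"]["channels"][0]["alternatives"][0]["transcript"]
```

Sending the request is then a matter of passing it to `urllib.request.urlopen` and handing the bytes from `resp.read()` to `extract_transcript`. Keeping request construction separate from the network call makes the pieces easy to check without an API key.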

Pros & Cons

Advantages

  • Quick installation with one-line setup commands
  • Integrated into terminal workflows, useful for scripting and automation
  • Access to multiple Deepgram capabilities from a single tool
  • Freemium pricing model allows testing before committing to paid usage

Limitations

  • Limited to command-line interface; not suitable for users who prefer graphical applications
  • Requires some familiarity with terminal commands and API concepts

Use Cases

  • Batch processing audio files for transcription in automated workflows
  • Integrating speech recognition into shell scripts and development pipelines
  • Testing Deepgram's speech models before building full applications
  • Real-time transcription of live audio streams for monitoring or logging purposes
  • Quick text-to-speech generation for notifications or accessibility features
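
The batch-processing use case can be sketched in Python as a loop over a folder of audio files. This is an illustration under stated assumptions, not part of the `dg` tool: the function name `transcribe_directory`, the suffix list, and the `transcribe` callable are all hypothetical. The callable stands in for whatever actually produces the text, such as a wrapper around the CLI or an API call.

```python
from pathlib import Path
from typing import Callable, List

# Illustrative choice of extensions; adjust to the formats you actually use.
AUDIO_SUFFIXES = {".wav", ".mp3", ".flac", ".m4a"}


def transcribe_directory(
    folder: Path, transcribe: Callable[[Path], str]
) -> List[Path]:
    """Transcribe each audio file in `folder`, writing <name>.txt beside it.

    Returns the transcript files written during this run.
    """
    written: List[Path] = []
    for audio_path in sorted(folder.iterdir()):
        if not audio_path.is_file():
            continue
        if audio_path.suffix.lower() not in AUDIO_SUFFIXES:
            continue
        out_path = audio_path.with_suffix(".txt")
        if out_path.exists():
            continue  # already transcribed, so re-runs skip finished files
        out_path.write_text(transcribe(audio_path), encoding="utf-8")
        written.append(out_path)
    return written
```

Skipping files that already have a transcript means an interrupted batch can be restarted without paying for the same audio twice. Passing the transcriber in as a parameter also lets the loop be tested with a stub instead of a live service.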