Agents

Models

Resources

Pricing

Contact Sales

Integrations

Pipecat

Pipecat x Smallest AI Voice

Voice-enable your pipelines with Pulse speech-to-text and Lightning text-to-speech

Documentation

Get your API key

OVERVIEW

Real-time speech, native to Pipecat.

Pipecat is the open-source Python framework for building real-time voice and multimodal AI agents. Smallest AI as first-class STT and TTS services, so your agent hears and speaks through models built for low-latency production conversation — no custom wrappers, no glue code. Pulse handles what the caller says: real-time speech-to-text transcription over a WebSocket integration with the Waves API, streaming audio continuously and returning interim and final results with low latency. Lightning handles what the agent says back: real-time synthesis over WebSocket streaming, with configurable voice parameters and multiple languages, reconnecting cleanly to handle interruptions.

HOW IT WORKS

Up and running in five steps.

RESOURCES

Everything else lives here.

Pipecat TTS documentation

SmallestTTSService — installation, parameters, and usage on docs.pipecat.ai

Pipecat STT documentation

SmallestSTTService — Pulse streaming transcription on docs.pipecat.ai

Smallest AI docs

API reference, guides, and the full integrations catalog

Code example

Runnable Pipecat voice-agent example with Smallest AI STT + TTS

Related integrations

View all integrations

Building voice agents elsewhere?

LiveKit

Voice-Agent Framework

Real-time Pulse STT and Lightning TTS inside the LiveKit Agents pipeline.

Vapi

Voice-Agent Platform

Lightning is a built-in voice option in Vapi's assistant configuration.

n8n

Automation

TTS, transcription, and voice cloning in no-code workflows — verified by n8n.

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Initiatives

Startup Grants

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Initiatives

Startup Grants

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Initiatives

Startup Grants

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant