Agents

Models

Resources

Pricing

Contact Sales

July 20, 2026

Introducing Smallest AI Agent Skills: Give Your Coding Agent Voice API Expertise

Harshita Jain

Book a demo

Start building

TABLE OF CONTENT

Agent Workflows

AI-Powered Solutions

Revolutionizing Industries

Summarize with AI

Automate your Contact Centers with Us

Experience fast latency, strong security, and unlimited speech generation.

Contact sales

How agencies can sell AI receptionist services to local businesses

Most of the time, an AI coding agent feels like a superpower. You describe what you want, and code appears in seconds. So when you ask one to add text-to-speech to your app, you expect the same. It hands back a clean function, you run it, and you get a 404. You spend the next two hours assuming you misconfigured something, before realizing the agent confidently called an API route that no longer exists. The code was never going to work, and the agent had no way of knowing that. Understanding why means understanding something about how these agents learn.

The problem your coding agent has with voice APIs

When you ask Claude Code or Cursor to add text-to-speech to your application, it usually writes something that looks correct.

Then you run it and get a 404.

Why?

Because the endpoint it used was deprecated six months ago.

AI coding agents are trained on snapshots of the internet, and APIs evolve much faster than model training cycles. Smallest AI's unified /waves/v1/tts endpoint replaced older per-model routes. Model names changed. Recommended sample rates changed. Best practices changed.

An agent relying on stale training data often generates code that was never going to work. You end up spending your afternoon debugging generated code instead of building your product.

There's a better way.

Introducing Smallest AI Agent Skills

Today, we're open-sourcing the Smallest AI Agent Skills collection:

GitHub Repository: https://github.com/smallest-inc/skills

Agent Skills is an open standard originally developed by Anthropic and now supported by Claude Code, Cursor, GitHub Copilot, Gemini CLI, and dozens of other AI coding tools.

Skills provide coding agents with specialized, up-to-date knowledge for a specific domain. Each skill is packaged as a SKILL.md file. When your agent detects a relevant task, it automatically loads the corresponding skill into context.

The result:

No hallucinated endpoints
No deprecated model names
No outdated examples
No API guesswork

Just working code.

Install in one line

Install the complete skill collection:

Or install only the skills you need:

Works with Claude Code, Cursor, GitHub Copilot (VS Code), Gemini CLI, Kiro, and any agent that supports the Agent Skills standard.

Six skills, everything covered

setup-api-key

Get your API key configured and verified. Start here.

Use this skill to:

Configure authentication
Verify API connectivity
Validate credentials
Troubleshoot setup issues

Example prompt:

"Set up my Smallest AI API key and verify that I can successfully make a request."

The agent loads the setup skill and walks through authentication using the latest onboarding flow.

text-to-speech

Everything you need for Lightning v3.1 and Lightning v3.1 Pro (Text-to-Speech models).

Includes:

HTTP generation
SSE streaming
WebSocket streaming
Voice cloning
Voice/model compatibility rules
Supported output formats
Sample rate recommendations

Example prompt:

"Generate speech from 'Your order has been confirmed.' using Smallest AI. Use the Magnus voice, stream it as WAV at 24000 Hz, and save it to output.wav."

The agent loads the Text-to-Speech skill and generates code using the correct endpoint, parameters, authentication headers, and streaming configuration.

speech-to-text

Everything you need for Pulse and Pulse Pro (Speech-to-Text models).

Includes:

Pre-recorded audio transcription
HTTP uploads
Real-time WebSocket streaming
Word-level timestamps
Speaker diarization
Automatic language detection

Example prompt:

"Transcribe recording.mp3 using Smallest AI. Return word-level timestamps and identify speaker changes."

The agent loads the Speech-to-Text skill and generates the correct transcription workflow using the latest API patterns.

speech-to-speech

Build real-time voice experiences with Hydra (Speech-to-Speech model).

Includes:

WebSocket session management
Turn detection
Barge-in handling
Tool calling
End-to-end voice pipeline examples

Example prompt:

"Create a real-time customer support voice agent using Hydra that can interrupt itself when the user starts speaking and call external tools when needed."

The agent loads the Speech-to-Speech skill and generates a complete real-time voice pipeline with the correct session and turn-handling logic.

llm-electron

Build with Electron using an OpenAI-compatible interface.

Includes:

Chat completions
Streaming responses
Function calling
Migration guidance from OpenAI models

Example prompt:

"Replace my OpenAI GPT-4o call with Smallest AI's Electron model. Keep the same messages array and streaming setup, just swap the base URL and model name."

The agent loads the Electron skill and performs the migration using the latest model names and API configuration.

voice-agents

Build production voice agents on Atoms.

Includes:

Agent creation
Agent crews
Outbound calling
Campaign management
Knowledge base integration
Call analytics

Example prompt:

"Create an outbound calling campaign for appointment reminders and give me code to monitor call outcomes and analytics."

The agent loads the Voice Agents skill and generates the correct Atoms API integration.

Getting Started

Detailed installation instructions, skill documentation, and example prompts are available in the documentation.

The skills repository is fully open source. If you find an issue, want additional examples, or would like to contribute improvements, pull requests are welcome.

Build faster. Ship working integrations. Let your coding agent stay up to date.

Resources:

Frequently asked questions

How do I add or install skills in Claude Code?

How do I use skills in Claude Code?

Can I create my own custom skills?

Why do AI coding agents generate broken or deprecated API code?

Summarize with AI

Related Blogposts

View all

How To Build An AI Voice Agent Using Atoms API

June 25, 2026

Smallest AI Now Available in Dograh: Build Open-Source Voice Agents with Lightning TTS and Pulse STT

June 23, 2026

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Press kit

Initiatives

Startup Grants

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Press kit

Initiatives

Startup Grants

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant

Build the future of voice agent orchestration

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Dictionary

Press kit

Initiatives

Startup Grants

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are

SOC 2,

GDPR, and

HIPAA, Compliant