Introducing Smallest AI Agent Skills: Give Your Coding Agent Voice API Expertise

Harshita Jain

TABLE OF CONTENT

Agent Workflows

AI-Powered Solutions

Revolutionizing Industries

Automate your Contact Centers with Us

Experience fast latency, strong security, and unlimited speech generation.

Best Speech Recognition Software in 2026
Best Speech Recognition Software in 2026

Most of the time, an AI coding agent feels like a superpower. You describe what you want, and code appears in seconds. So when you ask one to add text-to-speech to your app, you expect the same. It hands back a clean function, you run it, and you get a 404. You spend the next two hours assuming you misconfigured something, before realizing the agent confidently called an API route that no longer exists. The code was never going to work, and the agent had no way of knowing that. Understanding why means understanding something about how these agents learn.

The problem your coding agent has with voice APIs

When you ask Claude Code or Cursor to add text-to-speech to your application, it usually writes something that looks correct.

Then you run it and get a 404.

Why?

Because the endpoint it used was deprecated six months ago.

AI coding agents are trained on snapshots of the internet, and APIs evolve much faster than model training cycles. Smallest AI's unified /waves/v1/tts endpoint replaced older per-model routes. Model names changed. Recommended sample rates changed. Best practices changed.

An agent relying on stale training data often generates code that was never going to work. You end up spending your afternoon debugging generated code instead of building your product.

There's a better way.

Introducing Smallest AI Agent Skills

Today, we're open-sourcing the Smallest AI Agent Skills collection:

GitHub Repository: https://github.com/smallest-inc/skills

Agent Skills is an open standard originally developed by Anthropic and now supported by Claude Code, Cursor, GitHub Copilot, Gemini CLI, and dozens of other AI coding tools.

Skills provide coding agents with specialized, up-to-date knowledge for a specific domain. Each skill is packaged as a SKILL.md file. When your agent detects a relevant task, it automatically loads the corresponding skill into context.

The result:

  • No hallucinated endpoints

  • No deprecated model names

  • No outdated examples

  • No API guesswork

Just working code.

Install in one line

Install the complete skill collection:


Or install only the skills you need:





Works with Claude Code, Cursor, GitHub Copilot (VS Code), Gemini CLI, Kiro, and any agent that supports the Agent Skills standard.

Six skills, everything covered

setup-api-key

Get your API key configured and verified. Start here.

Use this skill to:

  • Configure authentication

  • Verify API connectivity

  • Validate credentials

  • Troubleshoot setup issues

Example prompt:

"Set up my Smallest AI API key and verify that I can successfully make a request."

The agent loads the setup skill and walks through authentication using the latest onboarding flow.

text-to-speech

Everything you need for Lightning v3.1 and Lightning v3.1 Pro (Text-to-Speech models).

Includes:

  • HTTP generation

  • SSE streaming

  • WebSocket streaming

  • Voice cloning

  • Voice/model compatibility rules

  • Supported output formats

  • Sample rate recommendations

Example prompt:

"Generate speech from 'Your order has been confirmed.' using Smallest AI. Use the Magnus voice, stream it as WAV at 24000 Hz, and save it to output.wav."

The agent loads the Text-to-Speech skill and generates code using the correct endpoint, parameters, authentication headers, and streaming configuration.

speech-to-text

Everything you need for Pulse and Pulse Pro (Speech-to-Text models).

Includes:

  • Pre-recorded audio transcription

  • HTTP uploads

  • Real-time WebSocket streaming

  • Word-level timestamps

  • Speaker diarization

  • Automatic language detection

Example prompt:

"Transcribe recording.mp3 using Smallest AI. Return word-level timestamps and identify speaker changes."

The agent loads the Speech-to-Text skill and generates the correct transcription workflow using the latest API patterns.

speech-to-speech

Build real-time voice experiences with Hydra (Speech-to-Speech model).

Includes:

  • WebSocket session management

  • Turn detection

  • Barge-in handling

  • Tool calling

  • End-to-end voice pipeline examples

Example prompt:

"Create a real-time customer support voice agent using Hydra that can interrupt itself when the user starts speaking and call external tools when needed."

The agent loads the Speech-to-Speech skill and generates a complete real-time voice pipeline with the correct session and turn-handling logic.

llm-electron

Build with Electron using an OpenAI-compatible interface.

Includes:

  • Chat completions

  • Streaming responses

  • Function calling

  • Migration guidance from OpenAI models

Example prompt:

"Replace my OpenAI GPT-4o call with Smallest AI's Electron model. Keep the same messages array and streaming setup, just swap the base URL and model name."

The agent loads the Electron skill and performs the migration using the latest model names and API configuration.

voice-agents

Build production voice agents on Atoms.

Includes:

  • Agent creation

  • Agent crews

  • Outbound calling

  • Campaign management

  • Knowledge base integration

  • Call analytics

Example prompt:

"Create an outbound calling campaign for appointment reminders and give me code to monitor call outcomes and analytics."

The agent loads the Voice Agents skill and generates the correct Atoms API integration.

Getting Started

Detailed installation instructions, skill documentation, and example prompts are available in the documentation.

The skills repository is fully open source. If you find an issue, want additional examples, or would like to contribute improvements, pull requests are welcome.

Build faster. Ship working integrations. Let your coding agent stay up to date.

Resources:

Frequently asked questions

Frequently asked questions

How do I add or install skills in Claude Code?

How do I use skills in Claude Code?

Can I create my own custom skills?

Why do AI coding agents generate broken or deprecated API code?