Most Accurate Mandarin Speech-to-Text

Transcribe Mandarin audio with the industry's lowest word error rate

Mandarin Transcription Benchmarks

Model              Streaming   Batch
Smallest           6.2%        5.5%
Scribe v2          10.5%       9.2%
Deepgram           15.2%       13.4%
Whisper large v3   9.3%        8.2%

Mandarin Transcription Benchmark: WER % (lower is better).
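
For readers unfamiliar with the metric, word error rate is the number of substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the reference length. The sketch below illustrates that calculation only. The page does not describe its test set or how it tokenizes Mandarin, so the helper names (`editDistance`, `errorRate`, `byWord`, `byChar`) are hypothetical, and the character-level variant is an assumption based on written Mandarin having no spaces between words.

```javascript
// Levenshtein edit distance between two token arrays (one rolling row).
function editDistance(ref, hyp) {
  // prev[j] = distance between the ref tokens seen so far and the first j hyp tokens
  let prev = Array.from({ length: hyp.length + 1 }, (_, j) => j);
  for (let i = 1; i <= ref.length; i++) {
    const curr = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const cost = ref[i - 1] === hyp[j - 1] ? 0 : 1;
      curr[j] = Math.min(
        prev[j] + 1,        // deletion
        curr[j - 1] + 1,    // insertion
        prev[j - 1] + cost  // match or substitution
      );
    }
    prev = curr;
  }
  return prev[hyp.length];
}

// Error rate = edit distance / number of reference tokens.
function errorRate(reference, hypothesis, tokenize) {
  const ref = tokenize(reference);
  const hyp = tokenize(hypothesis);
  if (ref.length === 0) throw new Error('reference must not be empty');
  return editDistance(ref, hyp) / ref.length;
}

// Hypothetical tokenizers: whitespace-separated words, or single characters.
const byWord = (s) => s.trim().split(/\s+/).filter(Boolean);
const byChar = (s) => Array.from(s.replace(/\s+/g, ''));

console.log(errorRate('我 喜欢 喝 茶', '我 喜欢 喝 咖啡', byWord)); // 0.25
console.log(errorRate('我喜欢喝茶', '我喜欢喝咖啡', byChar));       // 0.4
```

The two calls score the same pair of sentences differently because the token unit changes, which is why published figures are only comparable when the tokenization is the same.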

World’s Most Advanced Speech Intelligence

Go beyond text with automated speaker labeling, real-time sentiment analysis, and intelligent language identification for global production workloads.

Industry-Leading Accuracy and Speed
Outperforms the competition with the lowest WER across 30+ languages and sub-70ms latency
Emotion Recognition
Detects user emotions in Mandarin speech to make conversations more empathetic
Speaker Diarization for Clarity
Identify transitions between Mandarin speakers and accurately label each contribution
Mandarin and Adaptive
38 languages. Automatic detection. Seamless Mandarin-English code-mixing mid-sentence.
PII / PCI Redaction
Built-in redaction of personal and payment data, across streaming and non-streaming
Noise Reduction
Background-noise handling built into the model—no preprocessing required

Language overview

Mandarin language overview

About

Mandarin is the most widely spoken variety of Chinese and an official language of China, Taiwan, and Singapore. It is a tonal language written in logographic characters, with four tones that distinguish meaning.

Speakers: 920 million

Official language: China, Taiwan, Singapore

Accents: Beijing (Standard), Cantonese-inflected, Shanghainese-inflected, Taiwanese Mandarin

Spoken language demographic: China, Taiwan, Singapore, and global Chinese diaspora

For Developers

Automate. Orchestrate. Dominate with code

Build real-time speech pipelines with Node and Python SDKs. Audio in, transcription out — no middleware, no complexity.

// Pulse speech-to-text endpoint
const url = 'https://api.smallest.ai/waves/v1/pulse/get_text';
const options = {
  method: 'POST',
  headers: {
    // Replace <BearerAuth> with your API key
    Authorization: 'Bearer <BearerAuth>',
    'Content-Type': 'application/octet-stream'
  }
};

// Top-level await requires an ES module context
try {
  const response = await fetch(url, options);
  const data = await response.json();
  console.log(data);
} catch (error) {
  console.error(error);
}
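
The snippet above sets `Content-Type: application/octet-stream` but never attaches audio. The following is a minimal sketch of how the request might be completed, assuming the endpoint accepts the raw audio bytes as the request body. That is an inference from the content type, not something this page states. `buildTranscriptionRequest` and `transcribe` are hypothetical helper names, and the shape of the JSON response is not assumed.

```javascript
const PULSE_URL = 'https://api.smallest.ai/waves/v1/pulse/get_text';

// Build the URL and fetch options for one transcription request.
function buildTranscriptionRequest(audioBytes, apiKey) {
  if (!apiKey) throw new Error('apiKey is required');
  return {
    url: PULSE_URL,
    options: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/octet-stream',
      },
      // Assumption: the API reads the raw audio bytes from the request body.
      body: audioBytes,
    },
  };
}

// Send the request and return the parsed JSON body.
// fetchImpl is injectable so the function can be exercised without a network.
async function transcribe(audioBytes, apiKey, fetchImpl = fetch) {
  const { url, options } = buildTranscriptionRequest(audioBytes, apiKey);
  const response = await fetchImpl(url, options);
  if (!response.ok) {
    throw new Error(`Transcription request failed with HTTP ${response.status}`);
  }
  return response.json();
}

// Example usage in Node 18+ (file name and environment variable are placeholders):
//   const { readFile } = require('node:fs/promises');
//   readFile('sample.wav')
//     .then((audio) => transcribe(audio, process.env.SMALLEST_API_KEY))
//     .then(console.log)
//     .catch(console.error);
```

Checking `response.ok` before parsing surfaces HTTP errors such as an invalid key as exceptions instead of logging an error payload as if it were a transcript.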

Certified & Compliant

Guarding your data with enterprise security

ISO 27001

SOC 2 Type 2

GDPR Compliant

HIPAA Compliant

Proactive Defense

Anticipating threats before they emerge, thanks to our advanced monitoring.

Frequently asked questions

Does Pulse handle Cantonese and Mandarin dialects?

Does Pulse handle Cantonese-English code-switching?

What's the latency?

Is speaker diarization built in?

What makes Pulse different from Deepgram or AssemblyAI?

Can I use Pulse for real-time voice agents?

Languages

From your city to Timbuktu, we hear you.

The speech to text API your product needs

Contact sales

311 California Street, Suite 320
San Francisco, CA 94104

Models

Text to Speech

Speech to Text

Speech to Speech

Voice cloning

Agents

Overview

On Prem

Industries

Debt Collection

Healthcare

Real Estate

Small business

E-commerce

Documentation

For Agents

For Models

Resources

Pricing

Blogs

Research

Careers

Voice AI apps

Integrations

Initiatives

Startup Grants

Legals

Privacy notice

Terms and conditions

Data processing

User Policy

TCPA compliance

Twitter

Instagram

Youtube

Discord

Substack

Medium

System status operational

We are SOC 2, GDPR, and HIPAA compliant
