Return Zero Inc. - The future of next-generation speech AI

The Most Accurate and Fastest Speech Recognition in Korea

Try RTZR STT today!

RTZR STT is a powerful speech AI that works robustly in any scenario, including finance, call centers, mobile apps, and video processing. It supports both Cloud and On-Premise deployment.

Accuracy

35% - 46% lower error rate compared to competitors

Speed

2.5x - 30x faster conversion

Price

1.5x - 3x lower per-second pricing

The voice AI that has processed the most Korean data

We have learned from an unprecedented volume of real, free-form Korean speech data. That sets us apart as the only provider to do so while also offering VITO, an app with over 1 million downloads that automatically converts calls into text.

Total length of speech processed: 15 million hours+
(Approximately 1,712 years, #1 in Korea)

Total training data set: 200,000 hours+
(Refined from a total of 8 million hours)

Number of calls transcribed: 200 million+

RTZR STT Offered Models

We offer everything from returnzero's own model, which includes up to 10 hours for free, to custom models for businesses.

Sommers

#Try it now #Fast #Accurate #VITO engine

returnzero offers the most accurate, fastest, and most affordable speech recognition engine. It's over 2.5 times faster and more than 2 times cheaper than competitors.

Whisper

#Multilingual support #Lighter #More accurate

We provide the latest Whisper model from OpenAI, finely tuned for fast and accurate integration into your service.

Custom

#Additional training #Top accuracy #Customer needs

A custom model trained on your customer's data to deliver top performance in a specific domain or service. Offered to enterprise customers demanding the industry's best STT quality.

Setting a new standard in AI speech recognition: RTZR STT for Korean and Japanese speech recognition

Does your business need the highest possible speech recognition accuracy? RTZR STT is the fastest and most efficient AI engine, perfectly suited for business use.

Benchmark comparison (columns in their original order; provider names are not given; "-" means no value is listed)

Metric                           Column 1      Column 2      Column 3      Column 4
Accuracy (Error Rate)            4.66%         9.09%         14.11%        17.27%
Batch processing speed (1 hour)  38 seconds    90 seconds    1440 seconds  158 seconds
Real-time response processing    < 300 ms      -             800-2000 ms   -
Speaker isolation                ~10 speakers  ~10 speakers  ~6 speakers   -

Additional comparison rows, listed without values: Custom vocabulary recognition, Additional learning, Language model incremental learning, Sentence punctuation, Profanity filter.

We used an hour-long, unbiased audio dataset of Korean conversational speech. You can test it anytime with the RTZR STT API.
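
To put the batch processing figures in perspective, the short sketch below converts each quoted processing time for one hour of audio into a speed-up over real time. The "column" labels are placeholders introduced here, because the comparison does not name the providers, and the helper function name is likewise just for illustration.

```python
# Batch processing times (in seconds) for one hour of audio, copied from the
# comparison above in its original order. The "column N" labels are
# placeholders: the comparison does not name the providers.
AUDIO_SECONDS = 60 * 60
batch_seconds = {
    "column 1": 38,
    "column 2": 90,
    "column 3": 1440,
    "column 4": 158,
}


def speedup_over_real_time(audio_seconds: float, processing_seconds: float) -> float:
    """Return how many seconds of audio are processed per second of wall time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


speedups = {
    name: speedup_over_real_time(AUDIO_SECONDS, seconds)
    for name, seconds in batch_seconds.items()
}

for name, factor in speedups.items():
    print(f"{name}: about {factor:.1f}x faster than real time")
```

Under this reading, the 38-second figure corresponds to roughly 95 times faster than real time, and the 1440-second figure to 2.5 times.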

Finance

Shinhan Bank AICC (#1 bank in Korea)

Next-generation AI call center with RTZR STT

CS / B2B SaaS

ChannelTalk sales team

Sales meetings will tell you your future revenue

Fire Dept / Government

Gwangju Fire Department

The most reliable and accurate AI chosen by the fire department.

AICC

MindWareWorks

Perfectly handles Japanese AICC, including complex addresses and numbers

Counseling / Divination

Fortune counseling transcription for Chun-Myung (fortune-telling app)

Users' purchase conversion rate tripled after they tried the free transcription add-on service

All Features Needed for Enterprise Speech Recognition

Ultra-fast batch processing: Quickly processes large volumes of audio files by segmenting them into batches. Ideal for handling recorded audio data.

Real-Time Processing: Supports real-time speech recognition with low latency. Suitable when immediate feedback and transcription are needed, such as in AICC and call center assistants.

Supports all file formats: MP3, WAV, FLAC, MP4, and all other audio/video formats are supported.

Speaker isolation: Predicts the number of speakers and provides speech recognition results separated by speaker. Essential for audio with multiple speakers, such as calls and meeting records.

Multi-channel separation: Even if voices from different speakers overlap, audio files with two or more channels are separated by channel, providing accurate speech recognition results by speaker.

Multilingual Support: Provides high-accuracy speech recognition for 97 languages based on the Sommers Japanese model and Whisper.

Keyword Boosting: Improves the recognition rate of predefined word lists frequently used in specific services or industries.

Custom additional training: Customizes the Sommers model by adding actual customer audio data, resulting in extremely high accuracy.

Add sentence punctuation: Automatically adds sentence punctuation, enhancing readability and providing easy-to-understand text conversion.

Filler words removal: Automatically removes meaningless words like "um," "uh," and "well," providing concise text conversion.

Profanity Filter: Optionally filters profanity and inappropriate words automatically.

Number-to-English auto conversion: Converts numbers, English, units, and special characters into the most suitable format, maximizing readability.

Paragraph division: Divides converted sentences into appropriate-length paragraphs to enhance readability.

Word timestamps: Provides timestamps to make it easy to process the text conversion results for various services.
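
As one illustration of how word timestamps can feed a downstream service, the sketch below turns a list of timed words into SRT subtitle cues. The `Word` structure, its `start_ms` and `end_ms` fields, and the grouping thresholds are assumptions made for this example; they are not the RTZR STT response format, which may look quite different.

```python
from dataclasses import dataclass


@dataclass
class Word:
    # Hypothetical shape of one timed word; not the RTZR STT response schema.
    text: str
    start_ms: int
    end_ms: int


def format_timestamp(ms: int) -> str:
    """Format milliseconds as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rest = divmod(ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def group_into_cues(
    words: list[Word], max_gap_ms: int = 700, max_words: int = 8
) -> list[list[Word]]:
    """Start a new cue after a long pause or once the current cue is full."""
    cues: list[list[Word]] = []
    for word in words:
        current = cues[-1] if cues else None
        if (
            current is None
            or len(current) >= max_words
            or word.start_ms - current[-1].end_ms > max_gap_ms
        ):
            cues.append([word])
        else:
            current.append(word)
    return cues


def to_srt(words: list[Word]) -> str:
    """Render timed words as SRT text, one numbered block per cue."""
    blocks = []
    for index, cue in enumerate(group_into_cues(words), start=1):
        start = format_timestamp(cue[0].start_ms)
        end = format_timestamp(cue[-1].end_ms)
        text = " ".join(w.text for w in cue)
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(blocks)


# Made-up sample data: two short phrases separated by a 1.4-second pause.
sample = [
    Word("Hello", 0, 400),
    Word("and", 450, 600),
    Word("welcome", 620, 1100),
    Word("Thanks", 2500, 2900),
    Word("for", 2950, 3100),
    Word("calling", 3120, 3600),
]
print(to_srt(sample))
```

With the sample data, the pause between "welcome" and "Thanks" exceeds the assumed 700 ms threshold, so the words are split into two cues. The same grouping idea could be adapted for captions, call summaries, or search indexes that need to jump to a moment in the audio.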