← Back Try live API →

Coming soon

ChindaTTS

Production-grade Thai-English text-to-speech. Natural prosody, correct numbers and dates, clean long-form, and voice cloning.

3 voices8 tones<1s to first sound6-7× faster than real time3.2% CER

By the numbers

Measured quality

Character accuracy

97% 3.2% error rate (CER)

Naturalness

82% PESQ 3.68 (max 4.5)

Intelligibility

99% STOI 0.99

Voice-clone match

92% matches the target voice

Measured on standard Thai test sets. Higher is better.

Live demo

Main voices

KaitomFLAGSHIP

สวัสดีครับ ยินดีต้อนรับสู่บริการแปลงข้อความเป็นเสียงพูด ภาษาไทยและภาษาอังกฤษ

KaidangSoon

สวัสดีครับ นี่คือตัวอย่างเสียงผู้ชาย เหมาะสำหรับงานประกาศและผู้ช่วยเสียง

KaimookFEMALE

สวัสดีค่ะ นี่คือตัวอย่างเสียงผู้หญิง น้ำเสียงสดใสและเป็นมิตร

Speaking styles

Neutral

วันนี้เป็นวันที่อากาศดี เหมาะแก่การออกไปทำกิจกรรมต่างๆ นอกบ้าน

Friendly

สวัสดีครับ ยินดีที่ได้รู้จักนะครับ มีอะไรให้ช่วยบอกได้เลยนะครับ

Cheerful

ข่าวดีครับ วันนี้เรามีโปรโมชั่นพิเศษ ลดราคาสูงสุดถึงห้าสิบเปอร์เซ็นต์เลยนะครับ

Calm

ค่อยๆ หายใจเข้าลึกๆ แล้วผ่อนลมหายใจออกช้าๆ ปล่อยให้ร่างกายผ่อนคลายลง

Serious

โปรดทราบ ระบบจะปิดปรับปรุงในเวลาเที่ยงคืน กรุณาบันทึกข้อมูลของท่านให้เรียบร้อย

Sad

เราเสียใจอย่างสุดซึ้งต่อการสูญเสียในครั้งนี้ ขอแสดงความเสียใจกับครอบครัวด้วยครับ

Excited

สุดยอดไปเลย เราเพิ่งคว้ารางวัลชนะเลิศมาได้สำเร็จ ทุกคนดีใจกันมากจริงๆ

Empathetic

เราเข้าใจความรู้สึกของคุณดีนะครับ ไม่ต้องกังวลไป เราจะคอยอยู่ช่วยเหลือเสมอ

Thai & English

Mixed Thai-EnglishTH + EN

วันนี้เราจะมา demo ฟีเจอร์ใหม่ของ ChindaTTS ที่รองรับทั้งภาษาไทยและ English ครับ

All EnglishEN

Thank you for trying our demo. Our voices sound clear and natural in both languages.

Numbers, dates & money

Numbers & money

วันนี้เวลา 9 นาฬิกา 30 นาที อุณหภูมิ 28 องศา ยอดสั่งซื้อของคุณคือ 1,250 บาท ลดพิเศษ 15 เปอร์เซ็นต์

Voice cloning

Reference (original speaker)ORIGINAL

The ~14-second sample the voice was cloned from.

ChindaTTS cloneCHINDATTS

ChindaTTS reproducing that speaker. Hear the match.

Where teams use it

Conversational voice assistants and chatbots with spoken replies.
IVR and call-center prompts that read out balances and order numbers.
Public announcements, notifications and alerts.
Article, document and e-learning narration.

In plain words

LanguagesThai and English, mixed in one sentence

Voices3 voices, 8 tones, any combination

ClarityAs clear as a real human voice3.2% character error rate

SoundClean and natural, no robotic buzzPESQ 3.68, STOI 0.99

NumbersPrices, dates and times read correctly

ConsistencyDetects a garbled take and regenerates it automatically, so expressive styles stay clean6.7% bad-take rate

Voice cloneClone a new voice from just ~15 seconds of reference audio92% match

SpeedFirst sound in under 1 second even for long text (streaming); full audio at 6-7× real time

LengthUp to ~100 seconds of speech per request

To runOne standard GPU server, on-prem or cloud