Fon AI Studio

Record voice

Speak Fon (or French). 5–60 s works best.

Tap to record

00:00

Upload an audio file

wav · mp3 · m4a · ogg · flac · webm · mp4 — up to 50 MB / 5 min

📂

Drop audio here or click to browse

Synthesize Fon speech

Type Fon text and hear it spoken with facebook/mms-tts-fon. Diacritics (á à â ǎ ɛ́ ɔ́ ɖ ŋ) are respected.

0 / 1500 chars

Synthesis details

Quick examples

About the voice

Model

facebook/mms-tts-fon

Architecture

VITS (Meta MMS)

Speakers

Single (monolithic)

Sample rate

16 000 Hz

Max input

1 500 chars

Multi-speaker / voice-cloning is on the roadmap. Today the voice and tone are fixed.

🎯 High-quality mode

Synthesize 5 candidates and auto-pick the one with the lowest transcription error.

Validated at 9.98 % WER vs the stock model's 17.76 % on a 200-prompt eval (~44 % fewer errors), at the cost of ~5× latency. Affects every Speak button in the app.

Use high-quality TTS everywhere

Speech ↔ Speech conversation

Talk in Fon or French. The pipeline transcribes → reasons → speaks the reply back. One tap, full loop.

Tap the mic and speak.

00:00

1Record

→

2ASR (Fon)

→

3Fon → FR

→

4Reason (FR)

→

5FR → Fon

→

6TTS (Fon)

Auto-play reply Keep dialog memory

Conversation

Your spoken conversation will appear here. Each turn shows what was heard and what the assistant said back.

Record voice

Upload an audio file

Transcript

Synthesize Fon speech

Quick examples

Speech ↔ Speech conversation

Conversation

Chat assistant