Multilingual audio indexing | Voxsigma software

The VoxSigma software suite offers advanced language technologies such as speech-to-text transcription, language identification and speaker diarization to transform raw audio data into structured and searchable XML documents. Designed for professional users needing to process large quantities of audio and video documents such as broadcast data, it includes adaptive features allowing the transcription of noisy speech, such as speech over background music. The speech transcription software enables users to access content in video documents in an analogous manner to searching in text documents.

Supported languages: Arabic, Cantonese, Czech, Dutch, English (US, UK), Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian, Urdu [more languages coming soon]

Other applications: Transcription of Speeches, Teleconference Transcription, Subtitling, Telephone Speech Analytics, Avionics, Audio communication analysis for tactical situational awareness, Speaker Diarization.