Applications: broadcast monitoring, audio visual archive indexing
The VoxSigma software suite offers advanced language technologies such as
speech-to-text transcription, language identification and speaker diarization to
transform raw audio data into structured and searchable XML documents. Designed
for professional users needing to process large quantities of audio and video
documents such as broadcast data, it includes adaptive features allowing the
transcription of noisy speech, such as speech over background music.
The speech transcription software enables users to access content in video
documents in an analogous manner to searching in text documents.
Supported languages: Arabic, Cantonese, Czech, Dutch, English (US, UK), Finnish, French, German,
Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian,
Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian, Urdu [more
languages coming soon]
|