
VoxSigma® Speech to Text Software Suite

VoxSigma is a suite of language-specific speech-to-text transcription software products offered by Vocapia Research for Linux x86, x86-64 and ARM platforms. VoxSigma is also available as a Web service.

Features

The VoxSigma software suite offers large-vocabulary speech-to-text capabilities in multiple languages. It includes adaptive features allowing the transcription of noisy speech, such as speech over background music. The software suite has been designed for professional users who need to transcribe large quantities of audio and video documents, such as broadcast data, either in real time or in batch mode. Versions of the software can also be used to transcribe call-center data.

The full speech-to-text conversion process (also called voice-to-text conversion) is done in three steps. The software first identifies the audio segments containing speech, then identifies the language being spoken if it is not known a priori, and finally converts the speech segments to text.
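The control flow of the three steps above can be sketched in a few lines of Python. This is only an illustration of the ordering and of the optional language-identification step, not VoxSigma code: the energy threshold used for speech detection is a deliberately simple stand-in, the function names are hypothetical, and running language identification once per segment is an assumption.

```python
from typing import Callable, List, Optional, Tuple

Segment = Tuple[int, int]  # (start_frame, end_frame), end exclusive


def find_speech_segments(energies: List[float], threshold: float) -> List[Segment]:
    """Step 1 stand-in: group consecutive frames whose energy exceeds a threshold."""
    segments: List[Segment] = []
    start: Optional[int] = None
    for i, energy in enumerate(energies):
        if energy > threshold and start is None:
            start = i  # a speech run begins
        elif energy <= threshold and start is not None:
            segments.append((start, i))  # the run ended on the previous frame
            start = None
    if start is not None:
        segments.append((start, len(energies)))  # run reaches the end of the audio
    return segments


def transcribe(
    energies: List[float],
    threshold: float,
    identify_language: Callable[[Segment], str],
    recognize: Callable[[Segment, str], str],
    language: Optional[str] = None,
) -> List[Tuple[Segment, str, str]]:
    """Run the three steps in order; step 2 is skipped when the language is known."""
    results = []
    for segment in find_speech_segments(energies, threshold):
        # Step 2 (hypothetical callable): only needed if no language was given.
        seg_lang = language if language is not None else identify_language(segment)
        # Step 3 (hypothetical callable): convert the speech segment to text.
        results.append((segment, seg_lang, recognize(segment, seg_lang)))
    return results
```

Passing `language="fre"` bypasses the identification callable entirely, which mirrors the "if it is not known a priori" condition in the description.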

The speech-to-text processing result is a fully annotated XML document including labels for speech and non-speech segments, speaker labels, words with time codes and high-quality confidence scores. This XML file can be indexed directly by a search engine, or alternatively converted into plain text with capitalization and punctuation.
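As a rough illustration of how such an annotated XML result might be consumed, the Python sketch below converts a small sample into speaker-labelled plain text and lists low-confidence words with their time codes. The element and attribute names (`AudioDoc`, `SpeechSegment`, `Word`, `spkid`, `stime`, `dur`, `conf`) and the sample itself are assumptions made for this example; the actual VoxSigma output schema should be checked in the product documentation.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

# Hypothetical sample; the real output schema may differ.
SAMPLE = """<AudioDoc>
  <SpeechSegment spkid="spk1" stime="0.00" etime="1.40">
    <Word stime="0.10" dur="0.30" conf="0.98">Hello</Word>
    <Word stime="0.45" dur="0.40" conf="0.41">wurld</Word>
  </SpeechSegment>
  <SpeechSegment spkid="spk2" stime="1.40" etime="2.50">
    <Word stime="1.50" dur="0.35" conf="0.95">Good</Word>
    <Word stime="1.90" dur="0.50" conf="0.93">morning.</Word>
  </SpeechSegment>
</AudioDoc>"""


def to_plain_text(xml_text: str) -> List[str]:
    """Return one 'speaker: words' line per speech segment."""
    root = ET.fromstring(xml_text)
    lines = []
    for segment in root.iter("SpeechSegment"):
        words = [(w.text or "").strip() for w in segment.iter("Word")]
        lines.append(f"{segment.get('spkid')}: {' '.join(words)}")
    return lines


def low_confidence_words(xml_text: str, min_conf: float) -> List[Tuple[str, float, float]]:
    """Return (word, start_time, confidence) for words scoring below min_conf."""
    root = ET.fromstring(xml_text)
    flagged = []
    for w in root.iter("Word"):
        conf = float(w.get("conf"))
        if conf < min_conf:
            flagged.append(((w.text or "").strip(), float(w.get("stime")), conf))
    return flagged


if __name__ == "__main__":
    print("\n".join(to_plain_text(SAMPLE)))
    print(low_confidence_words(SAMPLE, 0.5))
```

Keeping the time codes alongside flagged words makes it possible to jump back to the corresponding audio for manual review, which is one common use of per-word confidence scores.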

Technical characteristics

Platforms: Linux x86, x86_64, ARM (OpenSuse, Debian, Fedora, CentOS, Ubuntu, SuSE, Red Hat, ...)
API: command line tools, C++ library, REST
Audio: studio (e.g. broadcast) and telephone bandwidths
Key functions: audio segmentation, speaker segmentation, language identification, spoken word transcription (speech-to-text)
Operating modes: batch, real-time, single or multi-threaded, low footprint version
Outputs: XML with speaker diarization, language identification tags, word transcription, punctuation, confidence measures, numeral entities and other specific entities
Supported languages: Arabic, Cantonese, Czech, Dutch, English (US, UK), Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian, Urdu (other language options are under development; contact us for more information)

The VoxSigma speech recognition software is also available as a Web service via a REST API, allowing customers to quickly reap the benefits of regular improvements to our technology and take advantage of additional features offered by the online environment. Our speech-to-text service is available 24/7/365 with failover servers and geographic redundancy.

For more information about the VoxSigma software suite, or to get a price quote, you may fill out our online request form.

 
Thursday November 21, 2024

© Vocapia Research SAS,
2006-2023. All rights reserved.
