Vocapia Research and LIMSI-CNRS have partnered to provide automatic
transcriptions of audiovideo data for two MediaEval 2012 tasks: Search and
Hyperlinking task and TheTagging Task. This is the third year that Vocapia and
LIMSI team together to provide automatic transcripts. Updated language
identification and speech-to-text transcription systems have been used to
process a multilingual collection of Internet video collected from blip.tv. A
total of 3500 hours of audio were processed this year. As in previous years,
most of the data is in English with about 10% of the files in the Dutch, French,
German, Italian and Spanish languages. All videos were shared by their owners
under a Creative Commons license.
MediaEval2012
Vocapia Research, founded in July 2000, is an R&D company and
software publisher developing and providing leading edge speech
technologies and solutions for many languages, including most major
European languages as well as Arabic, Mandarin, and Russian. The
Vocapia Research VoxSigma
® software suite uses advanced
language technologies such as language identification, speech
recognition, and speaker identification to transform raw audio and
audiovisual data into structured and searchable XML documents. This
technology relies on over 25 years of research at LIMSI-CNRS, with
which there is a priviledged partnership. Joint systems developed
with LIMSI have achieved top ranks in national and international
challenges of speech-to-text transcription. The most common
applications of the VoxSigma software suite are audio and audiovisual
data mining (broadcast data, podcasts, call center data), media
monitoring, and media asset management. Vocapia Research is located in
the scientific pole of the Saclay Plateau, France. Readers who wish to
get more information about Vocapia Research are invited to check out
the Vocapia Research website or use the contact information page
http://www.vocapia.com/contact.
The Computer Sciences Laboratory for Mechanics and Engineering Sciences
(LIMSI) is a CNRS (French National Agency for Scientific Research) laboratory
associated with Paris VI and Paris XI Universities, being located on the Orsay
campus of the latter. The laboratory undertakes multidisciplinary research in
mechanical and chemical engineering and in sciences and technologies for
information and communication. The research fields cover a wide spectrum from
thermodynamics to cognition, encompassing fluid mechanics, energetics,
acoustics and voice synthesis, spoken and written language processing, vision,
human interaction and virtual reality. To know more, please check out the web
site:
http://www.limsi.fr.