Vocapia Research in collaboration with
LIMSI-CNRS has
provided automatic transcriptions of multilingual audiovideo data for
TRECVID 2011.
The TREC Video Retrieval Evaluation (http://trecvid.nist.gov/) aims to
use open, metrics-based evaluation in foster progress in content-based
analysis of and retrieval from digital video. Most of the speech in
the over 230 hour audio corpus is in English, with some of the speech
identified as and transcribed in other languages (Arabic, Dutch,
French, Italian, and Spanish). Vocapia Research and LIMSI-CNRS have
provided ASR transcripts for TRECVID since 2002.
Vocapia Research, founded in July 2000, is an R&D company and
software publisher developing and providing leading edge speech
technologies and solutions for many languages, including most major
European languages as well as Arabic, Mandarin, and Russian. The
Vocapia Research VoxSigma
® software suite uses advanced
language technologies such as language identification, speech
recognition, and speaker identification to transform raw audio and
audiovisual data into structured and searchable XML documents. This
technology relies on over 25 years of research at LIMSI-CNRS, with
which there is a priviledged partnership. Joint systems developed
with LIMSI have achieved top ranks in national and international
challenges of speech-to-text transcription. The most common
applications of the VoxSigma software suite are audio and audiovisual
data mining (broadcast data, podcasts, call center data), media
monitoring, and media asset management. Vocapia Research is located in
the scientific pole of the Saclay Plateau, France. Readers who wish to
get more information about Vocapia Research are invited to check out
the Vocapia Research website or use the contact information page
http://www.vocapia.com/contact.
The Computer Sciences Laboratory for Mechanics and Engineering Sciences
(LIMSI) is a CNRS (French National Agency for Scientific Research) laboratory
associated with Paris VI and Paris XI Universities, being located on the Orsay
campus of the latter. The laboratory undertakes multidisciplinary research in
mechanical and chemical engineering and in sciences and technologies for
information and communication. The research fields cover a wide spectrum from
thermodynamics to cognition, encompassing fluid mechanics, energetics,
acoustics and voice synthesis, spoken and written language processing, vision,
human interaction and virtual reality. To know more, please check out the web
site:
http://www.limsi.fr.