
Publication:

TED-LIUM: an Automatic Speech Recognition dedicated corpus

Anthony Rousseau, P. Deléglise, Y. Estève • International Conference on Language Resources and Evaluation • 01 May 2012

TLDR: Describes the content of the corpus, how the data was collected and processed, how it will be made publicly available, and how an ASR system built on this data reached a WER of 17.4%.

Citations: 251

Abstract: This paper presents the corpus developed by the LIUM for Automatic Speech Recognition (ASR), based on the TED Talks. This corpus was built during the IWSLT 2011 Evaluation Campaign, and is composed of 118 hours of speech with its accompanying automatically aligned transcripts. We describe the content of the corpus, how the data was collected and processed, how it will be publicly available and how we built an ASR system using this data leading to a WER score of 17.4 %. The official results we obtained at the IWSLT 2011 evaluation campaign are also discussed.
