Automatic Speech Recognition / Machine Translation systems by UPV
Friday 21 February 2025 -
10:00
Monday 17 February 2025
Tuesday 18 February 2025
Wednesday 19 February 2025
Thursday 20 February 2025
Friday 21 February 2025
10:00
Automatic Speech Recognition / Machine Translation systems by UPV
-
Jorge Iranzo
(
UPV
)
Joan Albert Silvestre
(
UPV
)
Jaume Santamaria
(
UPV
)
Automatic Speech Recognition / Machine Translation systems by UPV
Jorge Iranzo
(
UPV
)
Joan Albert Silvestre
(
UPV
)
Jaume Santamaria
(
UPV
)
10:00 - 11:00
Room: 31/3-004 - IT Amphitheatre
The Universitat Politècnica de València (UPV) as a CERN contractor has provisioned an on-premises Automated Speech Recognition (ASR) and Machine Translation (MT) solution from February 2022 to August 2024. More precisely, the Machine Learning and Language Processing (MLLP) research group of the UPV, with proven expertise in the development of language processing technologies for high quality media transcription and translation, developed English and French ASR and MT systems adapted to the particle-physics domain for offline and streaming multimedia content. MLLP's transcription and translation services and documentation were provided to the CERN via an on-premises installation of the MLLP's Transcription and Translation Platform (TTP). MLLP's ASR and MT services once adapted to the particle-physics domain thanks to the tight collaboration with CERN Media Services proved to supersede the performance of publicly available large pretrained ASR and MT models. Finally, CERN was provided with an auto-training procedure to automatically enhance ASR systems on a regular basis by taking advantage of newly produced resources and materials at CERN. This talk describes the development, evaluation and deployment of these particle-physics adapted ASR and MT systems now available to the CERN community.