Blog(2)
Initializing parameters by a pretrained masked language model (LM) [1] is a knowledge transfer method widely applied to natural language processing tasks. Following its success, pretrained neural machine translation (NMT) models have attracted more and more research interest [2,3,4,5].
In today’s world of virtual meetings, conferences, and multi-media, automatic speech translation offers a wide variety of applications. Traditional offline speech translation models used a cascade of speech recognition and text translation. In our prior works [1], we developed efficient techniques for end-to-end speech translation which outperforms traditional cascaded approaches.
Research Areas(0)
Publications(23)
Cross-Modal Decision Regularization for Simultaneous Speech Translation
AuthorMohd Abbas Zaidi, Beomseok Lee, Sangha Kim, Chanwoo Kim
PublishedAnnual Conference of the International Speech Communication Association (INTERSPEECH)
Date2022-09-18
Deep Multivariate Domain Translation for Device Invariant Pulmonary Patient Identification from Cough and Speech Sounds
AuthorMohsin Ahmed,Korosh Vatanparvar,Jilong Kuang,Alex Gao
PublishedEngineering in Medicine and Biology Conference (EMBC)
Date2022-07-11
Language Model Augmented Monotonic Attention for Simultaneous Translation
AuthorSathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim
PublishedNorth American Chapter of the Association for Computational Linguistics (NAACL)
Date2022-07-10
News(11)
Galaxy AI now supports 16 languages, helping more people to lower language barriers with real-time and on-device translation.
As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? This series examines the challenges of working with mobile AI and how we overcame them.
Globally, at least 2.2 billion people have near or distant vision impairment. It significantly limits their access to TV, movies, and other video content. Subtitling is a very popular feature for audiovisual translation and multimedia localization. Reading subtitles aloud (voice-over) makes producing the audio track in different languages possible without dubbing.
Others(1)
Seoul, September 12 ~ 13 Samsung AI Forum is an academic event for the foremost minds of today’s world to come together and share their views on the current and future development of Artificial Intelligence technology. saif videoPlay Registration * Registration has been closed early due to the maximum capacity. Participants Anyone with an interest in AI technology including undergraduate and graduate students, university faculty members, researchers, and members of ...