Taipei, Wednesday, Jul 17, 2019, 03:13

News

MOST Launches “AI Voice Data Set” to Assist Chinese AI Language Technology

By Korbin Lan
Published: Jun 26,2019

TAIPEI, Taiwan - The Ministry of Science and Technology (MOST) on June 25 at The National Taiwan University of Science and Technology (Taiwan Tech) held the a press conference regarding the online launch of the “AI Voice Data Set.” The first batch of 400 hours of voice testing data is being made available to the public, and in addition to self-recorded data, the data collection also combines voice data related to police and educational broadcasts. Following the release of this data set to the public, Taiwanese enterprises and academic circles will have authorization to freely access the data.

More on This

MOST Calls for Startups for CES 2020

TAIPEI, Taiwan - Taiwan Tech Arena (TTA) is going to lead 80 startups to CES 2020, and hope to catch attentions from international by having its national pavilion...

Taiwan’s NTU Discovers A New Catalyst to Fight Against Global Warming

TAIPEI, Taiwan - Through long-term support provided by the Ministry of Science and Technology and the National Synchrotron Radiation Research Center (NSRRC)...

MOST Deputy Minister Yu-Chin Hsu stated that he anticipates having the capability to accelerate the development of core technologies for Chinese voice dialogues. Regarding AI training, he explained that the most important component is accurate data. MOST established an AI voice digital database and invited Taiwan Tech to assist in the labeling of the voice data. Meanwhile, MOST has provided data that is convenient for everybody to access in order to accelerate the pace of AI training and improvement.

The release of this “AI Voice Data Set,” which contains 400 hours of recordings, will have the capacity to dramatically reduce the labor and time required for data labeling during the process of developing voice applications, and it will reduce the developmental barriers for related technologies.

At National Applied Research Laboratories (NARLabs), National Center for High-Performance Computing (NCHC), the NCHC DATA MARKET has been launched and is expected to attract more investment from enterprises and academic institutions in order to jointly promote a leap forward in the advancement Taiwan's voice AI technology and assist Taiwanese companies in undergoing digital transformations.

(TR/ Phil Sweeney)

CTIMES loves to interact with the global technology related companies and individuals, you can deliver your products information or share industrial intelligence. Please email us to en@ctimes.com.tw

503 viewed

comments powered by Disqus