MOST Launches “AI Voice Data Set” to Assist Chinese AI Language Technology

By Korbin Lan
Published: Jun 26, 2019

TAIPEI, Taiwan - On June 25, the Ministry of Science and Technology (MOST) held a press conference at the National Taiwan University of Science and Technology (Taiwan Tech) on the online launch of the “AI Voice Data Set.” The first batch of 400 hours of voice testing data is being made available to the public. In addition to self-recorded data, the collection includes voice data related to police and educational broadcasts. Following the release, Taiwanese enterprises and academic institutions will be authorized to access the data freely.

MOST Deputy Minister Yu-Chin Hsu said he expects to be able to accelerate the development of core technologies for Chinese voice dialogue. The most important component of AI training, he explained, is accurate data. MOST established an AI voice digital database and invited Taiwan Tech to assist in labeling the voice data. MOST has also made the data easy for everyone to access in order to speed up AI training and improvement.

The release of this “AI Voice Data Set,” which contains 400 hours of recordings, can dramatically reduce the labor and time required for data labeling when developing voice applications, and it will lower the barriers to developing related technologies.

The NCHC DATA MARKET has been launched at the National Center for High-Performance Computing (NCHC), part of the National Applied Research Laboratories (NARLabs). It is expected to attract more investment from enterprises and academic institutions in order to jointly promote a leap forward in Taiwan's voice AI technology and to assist Taiwanese companies in undergoing digital transformation.

(TR/ Phil Sweeney)
