2

participants

Topic provider


Computational Intelligence Technology Center (CITC) is the first big data R&D center to bridge the big data technology across industries in Taiwan. CITC aims to develop intelligent analytics technologies to boost Artificial Intelligence and big data core capabilities for local IT software companies, also to apply intelligent analytics to help the related industries to enhance their productivity and create new business opportunities. The cross-disciplinary approach enables CITC’s core technology capabilities to provide intelligent analytics and machine learning algorithms that industries need. CITC develops an open Artificial Intelligence and big data platform to create solutions that facilitate the innovative application for service design and business model for industries.

Introduction

Making machines understand the meanings of human speech has always been a common goal for industries and the academia. The techniques of speech recognition has been studied for a few decades, and has kept gaining momentum as the innovation in artificial intelligence grew significantly. Will there be more breakthroughs in speech recognition of Mandarin Chinese? Come participate in this challenge and show that you are an expert in speech recognition of Mandarin Chinese!

The data set of this topic is selected from “artificial intelligent voice data” released by the Ministry of Science and Technology (MOST). It contains four Classic Chinese Novels: Dream of the Red Chamber, Romance of the Three Kingdoms, Journey to the West and Water Margin, as well as traffic reports by the Police Broadcasting Service (Pbs), the news broadcasted by National Education Radio, etc. There are 8 categories, and the participants should utilize speech recognition techniques to determine the category of each file.

Source of the materials: Artificial intelligent voice data


Activity time

Ask HR for details.


Evaluation Criteria

After classification models of defect prediction are offered by researchers participating in the issue, the back-end of the system would process them in bathces regularly to calculate the score. Evaluations are conducted by calculating the corresponding accuracy rate of the actual value.

The following is the formula:

$$Accuracy = {\text{Number of correct predictions} \over \text{Number of total predictions}}$$

Rules

  • The evaluation will be based on the final uploaded result.
  • The maximum number of uploading times is 5 times per day.
  • This topic does not offer a team-up option, only one account per person is allowed, and each person can only participate once. If any violations are found, people who are involved would be forced to withdraw the activity.
  • All the data, techniques and source codes that are used belong to the participants. If any third-party claims their intellectual property rights or other rights and interests are being violated, the participant will need to handle the disputes personally. If any participants violate intellectual property rights, they will be disqualified, and shall bear legal responsibilities.
  • All the achievements and their IPRs (intellectual property rights) belong to the participants, and the Copyright License Agreements, patent applications, technology transfer and equity distributions of them, should be in accordance with the relevant Laws and Regulations.
  • The organizers reserve the right to enquire the results or take any related actions.
  • There should be no answer discussions between different accounts, or it will be considered as cheating.
  • If there is cheating or fraud during the activity, the participant that cheats will be disqualified from the activity.
  • After uploading, the answers of test data would be divided into two parts to calculate the score:
    • Before the deadline of the activity: The system will examine and calculate the score refer to the Ground Truth of approximately 40% of the entire test data. The result will be posted on the Public Leaderboard.
    • After the deadline of the activity: The system will examine and calculate the score referring to the Ground Truth of the remaining test data (approximately 60%). The result will be posted on the Private Leaderboard.
  • Any artificial marking is forbidden.
  • Should there be disputes, the organizers reserves the right to the final decision.
  • The organizers reserve the right to modify any details regarding the contest when needed.