Topic provider

Expo Union, which was established in 2003, is actively collaborating with government agencies, associations, and private sectors to organize professional exhibitions related to ecotechnologies, such as “Circular Economy Taiwan” and “Taiwan Water.” Besides, Expo Union hosted some International Medical Conferences and governmental conferences, such as FinTech Taipei 2018(Institute for Information Industry,III), Green Energy and Frontier products Expo 2018 (Academia Sinica),World Congress on Information Technology 2017 (Institute for Information Industry,III), Asia Pacific Academy of Ophthalmology Congress 2016(The Ophthalmological Society of Taiwan). Moreover, it has organized the Marathon Expo for the masses in recent years.

A successful Expo not only requires manpower from all fields with a variety of professional abilities, but also time and effort. Site planning, Exhibition affairs, tender process are equally important. Moreover, it is essential to provide customized and a high-quality design based on customers’ needs. Expo Union has profound experience in organizing international exhibitions and has the most professional team which pays attention to every detail. Last but not least, the company aims to creates maximum values by making all-out effort, professional work division, quality communication, and careful control on cost and time to organize an excellent campaign.

Computational Intelligence Technology Center (CITC) is the first big data R&D center to bridge the big data technology across industries in Taiwan. CITC aims to develop intelligent analytics technologies to boost Artificial Intelligence and big data core capabilities for local IT software companies, also to apply intelligent analytics to help the related industries to enhance their productivity and create new business opportunities. The cross-disciplinary approach enables CITC’s core technology capabilities to provide intelligent analytics and machine learning algorithms that industries need. CITC develops an open Artificial Intelligence and big data platform to create solutions that facilitate the innovative application for service design and business model for industries.


Crowd route is one of the most considerable part in the study of crowd behavior. It is not only the main reference on marketing events, but also a long-term concern of shops, department stores, exhibitions, and all kinds of campaigns.

By collecting and analyzing crowd routes, we can understand the preference and the staying time of the visitors, then make the best use of limited space and provide a more proper content for everyone. Based on the analysis result, we can optimize and customize campaigns by considering the preferences of certain target audiences. In practical applications scenarios, crowd route analysis can also be used to identify potential visitors and send push notifications.

The data of this topic is a random sampling of 2018 Marathon Expo, each sample had been classified into five different visiting route groups. The focus of this challenge is to train and build a favorable strategic model to correctly label visitor type by using such a great amount of route combinations.


First place: 100,000 discount points of hicloud

Second place: 50,000 discount points of hicloud

Third place: 50,000 discount points of hicloud

Honorable mention: 50,000 discount points of hicloud (multiple winners, depends on the final result)


* Rewards were provided by Chunghwa Telecom


1. Hicloud points can redeem service charge. After the discount, the price will be charged at 30% off based on the list price automatically.

2. Chunghwa Telecom deserves the right to make changes to the terms and conditions herein.

3. The example of discount points calculation can be referred to https://aidea-web.tw/computing


Activity time

The activity time will based on Taiwan Standard Time (UTC+8), and the schedule is as follows.

2019/07/16Registration begins
2019/09/30Upload deadline
2019/10/07Awards announcement


Evaluation Criteria

After the participants provide the Category forecast of Marathon Expo visiting route, the system would process them in batches regularly to calculate the score.

Evaluations are conducted by extracting predictive probability that is in the accurate category of selected test samples, thus calculating the sum of the logarithms and the final mean value. 

The formula is as follows,

$$ logloss \; = \; - \frac{1}{N} \; { \sum_{i=1}^{N} \; { \sum_{j=1}^{M} \; { y_{ij} \log({p_{ij}}) }}} $$

N: Number of samples
M: Number of categories
Yij: Whether it is an early-warning category
Pij: The probability forecast of selected samples which is in the accurate category



  • The evaluation will be based on the final uploaded result. If multiple participants get the same evaluation scores, the time of uploading will determine the ranking.
  • There is no limit on the number of uploading for the online ranking system.
    When using external data sets, participants should avoid using future data as the basis of the prediction results, and must state source of data sets in the forum for references.
  • All the data, techniques and source codes are original work of participants or are used by permission complying with laws and regulations. If any third-party claims their intellectual property rights or other rights and interests are being violated, the participant will need to handle the disputes personally. If any participants violate intellectual property rights, they will be disqualified, and shall bear legal responsibilities.
  • All the achievements and their IPRs (intellectual property rights) belong to the participants, and the Copyright License Agreements, patent applications, technology transfer and equity distributions of them, should be in accordance with the relevant Laws and Regulations.
    There should be no answer discussions between different accounts, or it will be considered as cheating.
  • If there is cheating or fraud during the activity, the participant that cheats will be disqualified from the activity and the vacancy would be filled up by other participants in the ranking order.
    After uploading, the answers of test data would be divided into two parts to calculate the score:
    • Before the deadline of the activity: The system only refers to a part of the test data to examine and calculate the score. The result will be posted on Public Leaderboard as reference for the final score and ranking. This data accounts for 60% of the entire test data.
    • After the deadline of the activity: The system refers to the remaining test data (40%) to examine and calculate the score. The result will be posted on Private Leaderboard as reference for the final score and ranking.
  • For those who have signed up for this competition, you acknowledge that you have read, understood, and agreed to be bound by the terms and regulations of this activity.
  • For participants who violate relevant terms or regulations, will be disqualified from the activity, if any of the violators has been awarded, the prize will be revoked, and the participants will need to return the prize money as well as certificate.
  • Should there be disputes, the organizer reserves the right to make the final decision.