203

參加人數

議題提供單位

Pervasive Artificial Intelligence Research (PAIR) Labs, National Chiao Tung University
(NCTU), Taiwan

The Pervasive AI Research (PAIR) Labs, a group of national research labs funded by the
Ministry of Science and Technology, Taiwan, is commissioned to achieve academic
excellence, nurture local AI talents, build international linkage, and develop pragmatic
approaches in the areas of applied AI technologies toward services, products, workflows, and
supply chains innovation and optimization. PAIR is constituted of 18 distinguished research
institutes in Taiwan to conduct research in various of applied AI areas.

Website: https://pairlabs.ai/

Intelligent Vision System (IVS) Lab, National Yang Ming Chiao Tung University (NYCU), Taiwan (NCTU), Taiwan

The Intelligent Vision System (IVS) Lab at National Yang Ming Chiao Tung University is directed by Professor Jiun-In Guo. We are tackling practical open problems in autonomous driving research, which focuses on intelligent vision processing systems, applications, and SoC exploiting deep learning technology.

Website: http://ivs.ee.nctu.edu.tw/ivs/

AI System (AIS) Lab, National Cheng Kung University (NCKU), Taiwan

The AI System (AIS) Lab at National Cheng Kung University is directed by Professor ChiaChi Tsai. We dedicate our passion on the system with AI technology. Our research includes AI accelerator development, AI architecture improvement, and AI-based solutions to multimedia problems.

MediaTek

MediaTek Inc. is a Taiwanese fabless semiconductor company that provides chips for wireless communications, high-definition television, handheld mobile devices like smartphones and tablet computers, navigation systems, consumer multimedia products and digital subscriber line services as well as optical disc drives. MediaTek is known for advances in multimedia, AI and expertise delivering the most power possible – when and where needed. MediaTek’s chipsets are optimized to run cool and super power-efficient to extend battery life. Always a perfect balance of high performance, power-efficiency, and connectivity.

Website: https://www.mediatek.com/

Wistron-NCTU Embedded Artificial Intelligence Research Center

Sponsored by Wistron and founded in 2020 September, Wistron-NCTU Embedded Artificial Intelligence Research Center (E-AI RDC) is a young and enthusiastic research center leaded by Prof. Jiun-In Guo (Institute of Electronics, National Chiao Tung University) aiming at developing the key technology related to embedded AI applications, ranging from AI data acquisition and labeling, AI model development and optimization and AI computing platform development with the help of easy to use AI toolchain (called ezAIT). The target applications cover AIoT, ADAS/ADS, smart transportation, smart manufacturing, smart medical imaging, and emerging communication systems. In addition to developing the above-mentioned technology, E-AI RDC will also collaborate with international partners as well as industrial partners in cultivating the talents in the embedded AI field to further enhance the industrial competitiveness in Taiwan Industry

The A19 Lab

The A19 Lab is a joint research lab sponsored by AU Optronics (AUO) Corp. and College of Artificial Intelligence, National Yangming Chiaotung University (NYCU), in November 2021. The A19 Lab, directed by Professor Ted Kuo of NYCU, is commissioned to explore leading-edge research in optronics and AI technologies with missions to develop innovative human-machine interfaces (HMI) and systems toward total immersive metaverse.

2022/03/22 Award Announcement

Dear Participants, The ICME 2022 competition awardees are: - Champion: okt2077 - 1st Runner-up: asdggg - 3rd-place: ACVLab Special Award - Best INT8 model development Award: without a winner Congratulations!

2022/02/22 Finalist Announcement

Finalists have been announced as follows. 1 okt2077 2 feishen 3 asdggg 4 TonyTTTTT 5 APTX4869 6 LeeC 7 OzHsu 8 UTS_GBDTC_MMLab 9 ACVLab 10 AiPoG 11 project_test 12 TonyStark 13 jerry88277 14 chingweihsu0809 15 Polybahn

2022/02/18 Qualification Competition is about to end.

Dear competitor: The Qualification Competition is about to end. Please remember to upload your result. We will send e-mail for Final Competition information.

2022/02/15 Private testing data has been released!

We have announced “Private Testing Data for Qualification.zip”. Please submit your result (700 images) for qualification. Since the new private testing dataset is different from the previous dataset, the leaderboard is reset. Thank you!

簡介

Object detection in the computer vision area has been extensively studied and making tremendous progress in recent years. Furthermore, image segmentation takes it to a new level by trying to find out accurately the exact boundary of the objects in the image. Semantic segmentation is in pursuit of more than just location of an object, going down to pixel level information. However, due to the heavy computation required in most deep learning-based algorithms, it is hard to run these models on embedded systems, which have limited computing capabilities. In addition, the existing open datasets for traffic scenes applied in ADAS applications usually include main lane, adjacent lanes, different lane marks (i.e. double line, single line, and dashed line) in western countries, which is not quite similar to that in Asian countries like Taiwan with lots of motorcycle riders speeding on city roads, such that the semantic segmentation models training by only using the existing open datasets will require extra technique for segmenting complex scenes in Asian countries.

In this competition, we encourage the participants to design semantic segmentation model that can be applied in Taiwan’s traffic scene with lots of fast speeding motorcycles running on city roads along with vehicles and pedestrians. The developed models not only fit for embedded systems but also achieve high accuracy at the same time.

This competition includes two stages: qualification and final competition.

  • Qualification competition: all participants submit their answers online. A score is calculated. The top 15 teams would be qualified to enter the final round of the competition.
  • Final competition: the final score will be evaluated on new MediaTek platform (Dimensity Series) platform for the final score.

The goal is to design a lightweight deep learning semantic segmentation model suitable for constrained embedded system design to deal with traffic scenes in Asian countries like Taiwan. We focus on segmentation accuracy, power consumption, real-time performance optimization and the deployment on MediaTek’s Dimensity Series platform.

With MediaTek’s Dimensity Series platform and its heterogeneous computing capabilities such as CPUs, GPUs and APUs (AI processing units) embedded into the system-on-chip products, developers are provided the high performance and power efficiency for building the AI features and applications. Developers can target these specific processing units within the system-on-chip or, they can also let MediaTek NeuroPilot SDK intelligently handle the processing allocation for them.

Given the test image dataset, participants are asked to segment each pixel belonging to the following six classes {background, main_lane, alter_lane, double_line, dashed_line, single_line} in each image.


Reference

[1] F. Yu et al., “BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”,
in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern
Recognition (CVPR), 2020.
[2] Google, “Measuring device power : Android Open Source Project,” Android Open Source
Project. [Online]. Available:
https://source.android.com/devices/tech/power/device?hl=en#power-consumption.
[Accessed: 11-Nov-2021].
[3] M. Cordts et al., “The Cityscapes Dataset for Semantic Urban Scene Understanding”, in
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
(CVPR), 2016.


獎項

According to the points of each team in the final evaluation, we select the highest three teams for regular awards.

  1. Champion:            $USD 1500
  2. 1st Runner-up:      $USD 1000
  3. 3rd-place:              $USD 700

Special Award

  1. Best INT8 model development Award: $USD 500
    • Best overall score in the final competition using INT8 model development

All the award winners must agree to submit contest paper and attend the IEEE ICME2022 Grand Challenge PAIR Competition Special Session to present their work.


活動時間

Deadline for Submission(UTC+8):

DateEvent
1/10/2022Qualification Competition Start Date
1/10/2022Date to Release Public Testing Data
2/14/2022Date to Release Private Testing Data for Qualification
2/21/2022 11:59:59 PM UTC+8Qualification Competition End Date
2/22/2022 12:00 PM UTC+8Finalist Announcement
2/22/2022Final Competition Start Date
2/28/2022Date to Release Private Testing Data for Final
3/7/2022 11:59:59 PM UTC+8Final Competition End Date
3/19/2022 12:00 PM UTC+8Award Announcement
3/31/2022Invited Paper Submission Deadline

 

評估標準

Qualification Competition

The grading criteria is the same as used for the Cityscapes [3] Pixel-Level Semantic Labeling Task.

  • The standard Jaccard Index (commonly known as the PASCAL VOC intersectionover-union metric IoU = TP ⁄ (TP+FP+FN) is used to evaluate the semanticsegmentation results.
  • TP, FP, and FN are the numbers of true positive, false positive, and false negativepixels, respectively.
  • Pixels labeled as void do not contribute to the score.

The IoU compares the prediction region with the ground truth region for a class andquantifies this based on the area of overlap between both regions. The IoU is calculatedfor each semantic class in an image and the mean of all class IoU scores makes up the mean Intersection over Union (mIoU) score.

Final Competition

  • Mandatory Criteria
    • Accuracy of final submission cannot be 5% lower (include) than theirsubmitted model of qualification.
    • The summation of Preprocessing & Postprocessing time of final submission cannot be 50% slower (include) than the inference time of the main model. (Evaluated on the host machine)
  • [Host] Accuracy (mIoU)–40%
    The team with the highest accuracy will get the full score (40%) and the team withthe lowest one will get zero. The rest teams will get scores directly proportional tothe mIoU difference.
  • [Device] Power consumption (average current computation on MediaTek DimensitySeries)–30%
    Measured by android battery fuel gauge on MediaTek’s Dimensity Series Platform[2]. The “BATTERY_PROPERTY_CURRENT_AVERAGE” mode is used in the evaluation.
    The team with a single model (w/o Preprocessing & Postprocessing) to completethe semantic segmentation task with the lowest power consumption will get the fullscore (30%) and the team with the largest one will get zero. The rest teams will getscores directly proportional to the average current computation difference.
  • [Device] Speed on MediaTek Dimensity 9000 Series Platform–30%
    The team with a single model (w/o Preprocessing & Postprocessing) to completethe semantic segmentation task in the shortest time will get the full score (30%) andthe team that takes the longest time will get zero score. The rest teams will getscores directly proportional to the execution time difference.
    The evaluation procedure will be toward the overall process from reading the privatetesting dataset in final to completing submission.csv file, including parsing image list,loading images, and any other overhead to conduct the semantic segmentationthrough the testing dataset.

Final Competition

The finalists have to hand in a package that includes SavedModel (should be compatible w/ freeze_graph.py@tensorflow_v1.13.2) and inference script. We will deploy tensorflow model to MediaTek’s Dimensity Series platform and grade the final score by running the model. 

A technical report is required to reveal the model structure, complexity, and execution efficiency, etc.

Submission File

Upload the zip file naming submission.zip containing the following files:

  1. Tensorflow Inference Package
    • An official Docker Image will be released by organizer
    • Tensorflow version are restricted to the following version.
      • Tensorflow 1.13.2
      • Other versions or other frameworks are not allowed.
    • The following files must exist in the submitted inference package
    • Tensorflow SaveModel
      (refer to TF1 saved_model.md@tensorflow_v1.13.2 for more detail)
    • Tensorflow-Lite model
    • run_model.py [image_list.txt] [path of output results]
      It runs your model to detect objects in test images listed in the image_list.txt and creates submission folder in the [path of output results]. The submission folder format is identical to the qualification competition.
    • Source code of your model
      The directory structure of source code shall be illustrated in README.txt
  2. techreport.pdf
    • The technical report that describes the model structure, complexity, execution efficiency, implementation techniques, and experiment results, etc.

 

Coordinator Contacts

Ted Kuo, tkuo@cs.nctu.edu.tw
Jenq-Neng Hwang,hwang@uw.edu
Jiun-In Guo, jiguo@nycu.edu.tw
Marvin Chen, marvin.chen@mediatek.com
Hsien-Kai Kuo, hsienkai.kuo@mediatek.com
Chia-Chi Tsai, cctsai@gs.ncku.edu.tw

規則

  • Team mergers are not allowed in this competition.
  • Each team can consist of a maximum of 6 team members.
  • The task is open to the public. Individuals, institutions of higher education, research
    institutes, enterprises, or other organizations can all sign up for the task.
  • A leaderboard will be set up and make available publicly.
  • Multiple submissions are allowed before the deadline and the last one will be used to
    enter the final qualification consideration.
  • The upload date/time will be used as the tiebreaker.
  • Privately sharing code or data outside of teams is not permitted. It is okay to share
    code if made available to all participants on the forums.
  • Personnel of IVSLAB team are not allowed to participate in the task.
  • A common honor code should be observed. Any violation will be disqualified.