participants
Pervasive Artificial Intelligence Research (PAIR) Labs, National Chiao Tung University
(NCTU), Taiwan
The Pervasive AI Research (PAIR) Labs, a group of national research labs funded by the
Ministry of Science and Technology, Taiwan, is commissioned to achieve academic
excellence, nurture local AI talents, build international linkage, and develop pragmatic
approaches in the areas of applied AI technologies toward services, products, workflows, and
supply chains innovation and optimization. PAIR is constituted of 18 distinguished research
institutes in Taiwan to conduct research in various of applied AI areas.
Website: https://pairlabs.ai/
Intelligent Vision System (IVS) Lab, National Yang Ming Chiao Tung University (NYCU), Taiwan (NCTU), Taiwan
The Intelligent Vision System (IVS) Lab at National Yang Ming Chiao Tung University is directed by Professor Jiun-In Guo. We are tackling practical open problems in autonomous driving research, which focuses on intelligent vision processing systems, applications, and SoC exploiting deep learning technology.
Website: http://ivs.ee.nctu.edu.tw/ivs/
AI System (AIS) Lab, National Cheng Kung University (NCKU), Taiwan
The AI System (AIS) Lab at National Cheng Kung University is directed by Professor ChiaChi Tsai. We dedicate our passion on the system with AI technology. Our research includes AI accelerator development, AI architecture improvement, and AI-based solutions to multimedia problems.
MediaTek
MediaTek Inc. is a Taiwanese fabless semiconductor company that provides chips for wireless communications, high-definition television, handheld mobile devices like smartphones and tablet computers, navigation systems, consumer multimedia products and digital subscriber line services as well as optical disc drives. MediaTek is known for advances in multimedia, AI and expertise delivering the most power possible – when and where needed. MediaTek’s chipsets are optimized to run cool and super power-efficient to extend battery life. Always a perfect balance of high performance, power-efficiency, and connectivity.
Website: https://www.mediatek.com/
Wistron-NCTU Embedded Artificial Intelligence Research Center
Sponsored by Wistron and founded in 2020 September, Wistron-NCTU Embedded Artificial Intelligence Research Center (E-AI RDC) is a young and enthusiastic research center leaded by Prof. Jiun-In Guo (Institute of Electronics, National Chiao Tung University) aiming at developing the key technology related to embedded AI applications, ranging from AI data acquisition and labeling, AI model development and optimization and AI computing platform development with the help of easy to use AI toolchain (called ezAIT). The target applications cover AIoT, ADAS/ADS, smart transportation, smart manufacturing, smart medical imaging, and emerging communication systems. In addition to developing the above-mentioned technology, E-AI RDC will also collaborate with international partners as well as industrial partners in cultivating the talents in the embedded AI field to further enhance the industrial competitiveness in Taiwan Industry
Dear Participants, The ICME 2023 competition awardees are: - Champion: You Only Lowpower Once - 1st Runner-up: Polybahn - 3rd-place: ACVLab Special Award - Best INT8 model development Award: Polybahn Congratulations!
Dear competitor: We have announced Private Testing Data for Qualification Data. Please submit your result for qualification. Since the new private testing dataset is different from the previous dataset, the leaderboard is reset. The final ranking will be based on the score of the private leaderboard. Thank you!
Object detection in the computer vision area has been extensively studied and making tremendous progress in recent years. Furthermore, image segmentation takes it to a new level by trying to find out accurately the exact boundary of the objects in the image. Semantic segmentation is in pursuit of more than just location of an object, going down to pixel level information. However, due to the heavy computation required in most deep learning-based algorithms, it is hard to run these models on embedded systems, which have limited computing capabilities. In addition, the existing open datasets for traffic scenes applied in ADAS applications usually include main lane, adjacent lanes, different lane marks (i.e. double line, single line, and dashed line) in western countries, which is not quite similar to that in Asian countries like Taiwan with lots of motorcycle riders speeding on city roads, such that the semantic segmentation models training by only using the existing open datasets will require extra technique for segmenting complex scenes in Asian countries. Often time, for most of the complicated applications, we are dealing with both object detection and segmentation task. We will have difficulties when accomplish these two tasks in separated models on limited-resources platform.
In this competition, we encourage the participants to design a lightweight single deep learning model to support multi-task functions, including semantic segmentation and object detection, that can be applied in Taiwan’s traffic scene with lots of fast speeding motorcycles running on city roads along with vehicles and pedestrians. The developed models not only fit for embedded systems but also achieve high accuracy at the same time.
This competition includes two stages: qualification and final competition.
The goal is to design a lightweight single deep learning model to support multi-task functions, including semantic segmentation and object detection, which is suitable for constrained embedded system design to deal with traffic scenes in Asian countries like Taiwan. We focus on segmentation/object detection accuracy, power consumption, real-time performance optimization and the deployment on MediaTek’s Dimensity Series platform.
With MediaTek’s Dimensity Series platform and its heterogeneous computing capabilities such as CPUs, GPUs and APUs (AI processing units) embedded into the system-on-chip products, developers are provided the high performance and power efficiency for building the AI features and applications. Developers can target these specific processing units within the system-on-chip or, they can also let MediaTek NeuroPilot SDK intelligently handle the processing allocation for them.
Given the test image dataset, participants are asked to do two tasks in a single model at the same time, which includes object detection and semantic segmentation. For the semantic segmentation task, the model should be able to segment each pixel belonging to the following six classes {background, main_lane, alter_lane, double_line, dashed_line, single_line} in each image. For the object detection task, the same model should be able to detect objects belonging to the following four classes {pedestrian, vehicle, scooter, bicycle} in each image, including class, bounding box, and confidence.
Reference
[1] F. Yu et al., “BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning”,
in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern
Recognition (CVPR), 2020.
[2] Google, “Measuring device power : Android Open Source Project,” Android Open Source
Project. [Online]. Available:
https://source.android.com/devices/tech/power/device?hl=en#power-consumption.
[Accessed: 11-Nov-2021].
[3] M. Cordts et al., “The Cityscapes Dataset for Semantic Urban Scene Understanding”, in
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
(CVPR), 2016.
[4] COCO API: https://github.com/cocodataset/cocoapi
[5] Average Precision (AP): https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval)#Average_precisio n
[6] Intersection over union (IoU): https://en.wikipedia.org/wiki/Jaccard_index
According to the points of each team in the final evaluation, we select the highest three teams for regular awards.
Special Award
All the award winners must agree to submit contest paper and attend the IEEE ICME2023 Grand Challenge PAIR Competition Special Session to present their work. If the paper failed to submit, or the length of the submitted paper is less than 3 pages, the award would be cancelled.
Deadline for Submission(UTC+8):
Date | Event |
---|---|
2/03/2023 | Qualification Competition Start Date |
2/03/2023 | Date to Release Public Testing Data |
3/17/2023 | Date to Release Private Testing Data for Qualification |
3/24/2023 19:59:59 PM UTC+8 | Qualification Competition End Date |
3/25/2023 20:00 PM UTC+8 | Finalist Announcement |
3/26/2023 | Final Competition Start Date |
4/03/2023 | Date to Release Private Testing Data for Final |
4/10/2023 19:59:59 PM UTC+8 | Final Competition End Date |
4/19/2023 20:00 PM UTC+8 | Award Announcement |
4/30/2023 | Invited Paper Submission Deadline |
5/07/2023 | Camera ready form deadline |
Qualification Competition
The grading criteria is divided into two parts. One for the object detection score and the other for semantic segmentation score. Each account for 50% points in the qualification competition. The evaluation metric for these two parts is listed below.
The evaluation metric for object detection is based on MSCOCO object detection rule.
The resulting average precision (AP) of each class will be calculated and the mean AP (mAP) over all classes is evaluated as the key metric.
The evaluation metric for semantic segmentation is based on the Cityscapes [3] Pixel-Level Semantic Labeling Task.
The IoU compares the prediction region with the ground truth region for a class and quantifies this based on the area of overlap between both regions. The IoU is calculated for each semantic class in an image and the mean of all class IoU scores makes up the mean Intersection over Union (mIoU) score.
The total score for qualification competition is listed below.
Final Competition
Final Competition
The finalists have to hand in a package that includes SavedModel (should be compatible w/ freeze_graph.py@tensorflow_v2.0.0~v2.8.0) and inference script. We will deploy tensorflow model to MediaTek’s Dimensity Series platform and grade the final score by running the model.
A technical report is required to reveal the model structure, complexity, and execution efficiency, etc.
Submission File
Upload the zip file naming submission.zip containing the following files: