Vision-based human presence detection pipeline by means of transfer learning approach

Tang, Jin Cheng (2023) Vision-based human presence detection pipeline by means of transfer learning approach. Masters thesis, Universiti Malaysia Pahang (Contributors, Thesis advisor: Ahmad Fakhri, Ab. Nasir).

[img]
Preview
Pdf
ir.Vision-based human presence detection pipeline by means of transfer learning approach.pdf - Accepted Version

Download (403kB) | Preview

Abstract

Over the last century, industrial robots have gained an immense amount of popularity in replacing the human workers due to their highly repetitive nature. It was a twist to the industries when the concept of cooperative robots, known as cobots, has been innovated. Sharing space between the cobots and human workers has considered as the most effective way of utilizing the cobots. Keeping in mind that the safety of the human workers is always the top priority of the cobot applications in the industries, many time and efforts have been invested to improve the safeness of the cobots deployments. Yet, the utilization of deep learning technologies is rarely found in accordance with human detection in the field of research, especially the transfer learning approach, providing that the visual perception has shown to be a unique sense that still cannot be replaced by other. Hence, this thesis aimed to leverage the transfer learning approach to fine-tune the deep learning-based object detection models in the human detection task. In relation to this main goal, the objectives of the study were as follows: establish an image dataset for cobot environment from the surveillance cameras in TT Vision Holdings Berhad, formulate deep learning-based object detection models by using the transfer learning approach, and evaluate the performance of various transfer learning models in detecting the presence of human workers with relevant evaluation metrics. Image dataset has acquired from the surveillance system of TT Vision Holdings Berhad and annotated accordingly. The variations of the dataset have been considered thoroughly to ensure the models can be well-trained on the distinct features of the human workers. TensorFlow Object Detection API was used in the study to perform the fine-tuning of the one-stage object detectors. Among all the transfer learning strategies, fine-tuning has chosen since it suits the study well after the interpretation on the size-similarity matrix. A total of four EfficientDet models, two SSD models, three RetinaNet models, and four CenterNet models were deployed in the present work. As a result, SSD-MobileNetV2-FPN model has achieved 81.1% AP with 32.82 FPS, which is proposed as the best well-balanced fine-tuned model between accuracy and speed. In other case where the consideration is taken solely on either accuracy or inference speed, SSD_MobileNetV1-FPN model has attained 87.2% AP with 28.28 FPS and CenterNet-ResNet50-V1-FPN has achieved 78.0% AP with 46.52 FPS, which is proposed to be the model with best accuracy and inference speed, respectively. As a whole, it could be deduced that the transfer learning models can handle the human detection task well via the fine-tuning on the COCO-pretrained weights.

Item Type: Thesis (Masters)
Additional Information: Thesis (Master of Science) -- Universiti Malaysia Pahang – 2023, SV: Dr. Ahmad Fakhri Ab. Nasir, NO.CD: 13401
Uncontrolled Keywords: cobots, deep learning technologies
Subjects: T Technology > TA Engineering (General). Civil engineering (General)
T Technology > TS Manufactures
Faculty/Division: Institute of Postgraduate Studies
Faculty of Manufacturing and Mechatronic Engineering Technology
Depositing User: Mr. Nik Ahmad Nasyrun Nik Abd Malik
Date Deposited: 14 Nov 2023 02:42
Last Modified: 14 Nov 2023 02:42
URI: http://umpir.ump.edu.my/id/eprint/39286
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item