Vis enkel innførsel

dc.contributor.advisorBrastein, Ole Magnus
dc.contributor.advisorSkeie, Nils-Olav
dc.contributor.authorIlling, Eirik
dc.date.accessioned2023-06-28T16:41:36Z
dc.date.available2023-06-28T16:41:36Z
dc.date.issued2023
dc.identifierno.usn:wiseflow:6838201:54569106
dc.identifier.urihttps://hdl.handle.net/11250/3074098
dc.description.abstractOperator interface display images, often referred to as HMI, contains large amounts of information that can be valuable to obtain. If access to the source code or design files are limited, modern frameworks for object detection and text extraction can be used to obtain this information directly from images. However, obtaining data and training such modern solutions is time consuming, and require a lot of manual work to get started. In this project, traditional computer vision methods have been used to extract objects from images, separated the objects into training data and transferred learned a ResNet model to do multi-label image classification of individual objects. This model, in combination with methods such as sliding window, pyramid scaling and NMS gave the foundation for creating a semi-automated annotation tool that generates training data for more optimized object detection methods, in this case YOLO object detector. The semi- automated annotation tool provides a starting point for engineers to do manual touchup on the training data, and finally export state of the art training images for YOLO. The YOLO model is transfer learned on the annotated data, achieving a satisfying mAP50 score of 95.5%. A third-party library for OCR is used to obtain text information from preprocessed images, postprocessing the text by filtering tag data only, and an algorithm is used to link objects and tags together. The final solution is hosted in a software developed to focus on optimized user interaction, resulting in a excel formatted analysis document available for export to the end user.
dc.languageeng
dc.publisherUniversity of South-Eastern Norway
dc.titleObject detection, information extraction and analysis of operator interface images using computer vision and machine learning
dc.typeMaster thesis


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel