Search | VHL Regional Portal

Data Fusion for Cross-Domain Real-Time Object Detection on the Edge.

Kovalenko, Mykyta; Przewozny, David; Eisert, Peter; Bosse, Sebastian; Chojecki, Paul.

Sensors (Basel) ; 23(13)2023 Jul 04.

Article in English | MEDLINE | ID: mdl-37447986

ABSTRACT

We investigate an edge-computing scenario for robot control, where two similar neural networks are running on one computational node. We test the feasibility of using a single object-detection model (YOLOv5) with the benefit of reduced computational resources against the potentially more accurate independent and specialized models. Our results show that using one single convolutional neural network (for object detection and hand-gesture classification) instead of two separate ones can reduce resource usage by almost 50%. For many classes, we observed an increase in accuracy when using the model trained with more labels. For small datasets (a few hundred instances per label), we found that it is advisable to add labels with many instances from another dataset to increase detection accuracy.

Subject(s)

Gestures , Running , Hand , Neural Networks, Computer , Upper Extremity

Assessing the Value of Multimodal Interfaces: A Study on Human-Machine Interaction in Weld Inspection Workstations.

Chojecki, Paul; Strazdas, Dominykas; Przewozny, David; Gard, Niklas; Runde, Detlef; Hoerner, Niklas; Al-Hamadi, Ayoub; Eisert, Peter; Bosse, Sebastian.

Sensors (Basel) ; 23(11)2023 May 24.

Article in English | MEDLINE | ID: mdl-37299770

ABSTRACT

Multimodal user interfaces promise natural and intuitive human-machine interactions. However, is the extra effort for the development of a complex multisensor system justified, or can users also be satisfied with only one input modality? This study investigates interactions in an industrial weld inspection workstation. Three unimodal interfaces, including spatial interaction with buttons augmented on a workpiece or a worktable, and speech commands, were tested individually and in a multimodal combination. Within the unimodal conditions, users preferred the augmented worktable, but overall, the interindividual usage of all input technologies in the multimodal condition was ranked best. Our findings indicate that the implementation and the use of multiple input modalities is valuable and that it is difficult to predict the usability of individual input modalities for complex systems.

Subject(s)

Technology , User-Computer Interface , Humans , Speech

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL