Motion prediction and object detection for image-based visual servoing systems using deep learning

Date published

2024-09-02

Free to read from

2024-09-18

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

MDPI

Department

Type

Article

ISSN

1450-5843

Format

Citation

Hao Z, Zhang D, Honarvar Shakibaei Asli B. (2024) Motion prediction and object detection for image-based visual servoing systems using deep learning. Electronics, Volume 13, Issue 17, August 2024, Article number 3487

Abstract

This study primarily investigates advanced object detection and time series prediction methods in image-based visual servoing systems, aiming to capture targets better and predict the motion trajectory of robotic arms in advance, thereby enhancing the system’s performance and reliability. The research first implements object detection on the VOC2007 dataset using the Detection Transformer (DETR) and achieves ideal detection scores. The particle swarm optimization algorithm and 3-5-3 polynomial interpolation methods were utilized for trajectory planning, creating a unique dataset through simulation. This dataset contains randomly generated trajectories within the workspace, fully simulating actual working conditions. Significantly, the Bidirectional Long Short-Term Memory (BILSTM) model was improved by substituting its traditional Multilayer Perceptron (MLP) components with Kolmogorov–Arnold Networks (KANs). KANs, inspired by the K-A theorem, improve the network representation ability by placing learnable activation functions on fixed node activation functions. By implementing KANs, the model enhances parameter efficiency and interpretability, thus addressing the typical challenges of MLPs, such as the high parameter count and lack of transparency. The experiments achieved favorable predictive results, indicating that the KAN not only reduces the complexity of the model but also improves learning efficiency and prediction accuracy in dynamic visual servoing environments. Finally, Gazebo software was used in ROS to model and simulate the robotic arm, verify the effectiveness of the algorithm, and achieve visual servoing.

Description

Software Description

Software Language

Github

Keywords

40 Engineering, 4009 Electronics, Sensors and Digital Hardware, Machine Learning and Artificial Intelligence, 4009 Electronics, sensors and digital hardware, visual servoing, detection transformer, particle swarm optimization, bidirectional long short-term memory, Kolmogorov–Arnold network

DOI

Rights

Attribution 4.0 International

Relationships

Relationships

Resources

Funder/s