
Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions

 Arunkumar Rathinam*, Leo Pauly*, Abd El Rahman Shabayek, Wassim Rharbaoui
 Anis Kacem, Vincent Gaudillière and Djamila Aouada
* – equal contribution
IEEE ROBOTICS AND AUTOMATION LETTERS (2024)

Visualization of experimental results on the KAIST dataset (successful detections): (a) dual modality, (b) RGB blackout, (c) thermal blackout, (d) sides blackout (RGB-Thermal), (e) sides blackout (Thermal-RGB), (f) surrounding blackout.


Abstract

Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adverse illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. This assumption often does not hold in real-world applications, where the images may overlap only partially because of the sensor configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm that is robust against the partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We evaluated the proposed method by simulating various partial overlap and sensor failure scenarios. The results demonstrate that our approach outperforms state-of-the-art methods in these scenarios.
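The partial overlap and sensor failure scenarios mentioned in the abstract and named in the figure caption can be illustrated with a small NumPy sketch. This is not the authors' evaluation code: the function name `simulate_blackout`, the `fraction` parameter, and the exact geometry of the "sides" and "surrounding" cases (which image loses which strip, and that the thermal image is the one cropped to its center) are assumptions made only for illustration.

```python
import numpy as np

# Scenario names follow the figure caption; their exact geometry is assumed.
SCENARIOS = ("dual", "rgb_blackout", "thermal_blackout",
             "sides_rgb_thermal", "sides_thermal_rgb", "surrounding")


def simulate_blackout(rgb, thermal, scenario, fraction=0.25):
    """Return copies of an aligned (rgb, thermal) pair with regions zeroed.

    `fraction` (hypothetical parameter) is the share of the width, and for
    "surrounding" also of the height, that is blacked out on each side.
    """
    if scenario not in SCENARIOS:
        raise ValueError(f"unknown scenario: {scenario!r}")
    if rgb.shape[:2] != thermal.shape[:2]:
        raise ValueError("rgb and thermal must have the same height and width")
    if not 0.0 < fraction < 0.5:
        raise ValueError("fraction must lie strictly between 0 and 0.5")

    rgb, thermal = rgb.copy(), thermal.copy()  # never modify the inputs
    h, w = rgb.shape[:2]
    dx, dy = int(round(w * fraction)), int(round(h * fraction))

    if scenario == "rgb_blackout":            # RGB sensor failure
        rgb[...] = 0
    elif scenario == "thermal_blackout":      # thermal sensor failure
        thermal[...] = 0
    elif scenario == "sides_rgb_thermal":     # assumed: RGB sees left, thermal sees right
        rgb[:, w - dx:] = 0
        thermal[:, :dx] = 0
    elif scenario == "sides_thermal_rgb":     # assumed: mirrored arrangement
        rgb[:, :dx] = 0
        thermal[:, w - dx:] = 0
    elif scenario == "surrounding":           # assumed: thermal keeps only its center
        inner = thermal[dy:h - dy, dx:w - dx].copy()
        thermal[...] = 0
        thermal[dy:h - dy, dx:w - dx] = inner
    # "dual" leaves both modalities untouched.
    return rgb, thermal


if __name__ == "__main__":
    rgb = np.ones((4, 8, 3), dtype=np.uint8)
    thermal = np.ones((4, 8, 1), dtype=np.uint8)
    for name in SCENARIOS:
        r, t = simulate_blackout(rgb, thermal, name)
        print(name, int(r.sum()), int(t.sum()))
```

Zeroing pixels is the simplest way to mimic a region that only one sensor observes. A detector evaluated on such pairs has to rely on whichever modality still carries information in each region, which is the situation the abstract describes.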

 

Downloads

PDF
Code
Video
Dataset

Bibliography

@ARTICLE{rath2024,
  author={Rathinam, Arunkumar and Pauly, Leo and Shabayek, Abd El Rahman and Rharbaoui, Wassim and Kacem, Anis and Gaudillière, Vincent and Aouada, Djamila},
  journal={IEEE Robotics and Automation Letters},
  title={Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions},
  year={2024},
  keywords={Multi-Modal Perception for HRI; Deep Learning for Visual Perception; Sensor Fusion; Object Detection, Segmentation and Categorization}
}

Acknowledgement

This work was supported by the Luxembourg National Research Fund (FNR) under project reference BRIDGES2020/IS/14755859/MEET-A/Aouada.