This paper introduces a novel wearable assistive device designed to enhance the daily lives of visually impaired people (VIP) by providing continuous, real-time object finding. The proposed device integrates a miniature camera and a system on module (SoM) computing unit. Deep learning-based algorithms, specifically Faster R-CNNs, run on the SoM targeting to detect and recognize pertinent objects in the images provided by the camera. An integrated speaker conveys audio sentences letting the user know the presence of object(s) in the visual space. Experimental results conducted in real-world conditions show an 86% mean average precision for object recognition and a 215 ms mean computing time for real-time processing. The proposed image-to-speech device offers a practical and efficient solution to assist VIP in finding everyday objects.

Real-Time Object Finding for the Visually Impaired Using an Image-to-Speech Wearable Device

R. de Fazio;P. Visconti
Ultimo
2026-01-01

Abstract

This paper introduces a novel wearable assistive device designed to enhance the daily lives of visually impaired people (VIP) by providing continuous, real-time object finding. The proposed device integrates a miniature camera and a system on module (SoM) computing unit. Deep learning-based algorithms, specifically Faster R-CNNs, run on the SoM targeting to detect and recognize pertinent objects in the images provided by the camera. An integrated speaker conveys audio sentences letting the user know the presence of object(s) in the visual space. Experimental results conducted in real-world conditions show an 86% mean average precision for object recognition and a 215 ms mean computing time for real-time processing. The proposed image-to-speech device offers a practical and efficient solution to assist VIP in finding everyday objects.
2026
978-3-032-10861-6
978-3-032-10862-3
File in questo prodotto:
File Dimensione Formato  
MuSMe 2025-Research article_Mechanisms and Machine Science Springer_Velazquez-Visconti et al.pdf

solo utenti autorizzati

Descrizione: Atto di convegno
Tipologia: Versione editoriale
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 2.98 MB
Formato Adobe PDF
2.98 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11587/565366
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact