Generative AI-Aided Navigation for the Visually Impaired and Blind

Description
Navigation for the visually impaired and blind remains to be a major barrier to independence. Existing assistive tools like guide dogs or white mobility canes provide limited, immediate information within a range of about 5 feet. Alternatively, assistive applications for navigation only provide

Navigation for the visually impaired and blind remains to be a major barrier to independence. Existing assistive tools like guide dogs or white mobility canes provide limited, immediate information within a range of about 5 feet. Alternatively, assistive applications for navigation only provide static, generalizable information about a broader area that could be a few hundred feet radius to miles. Currently, no solution effectively covers the 5 to 20 feet range, leaving users without crucial information about their surroundings in this mid-distance area. This project explores the potential of state-of-the-art vision-language models (VLMs) to provide new navigation solutions for the visually impaired and blind that bridge the aforementioned gap in information about the environment. VLMs prove capable of identifying key objects and reasoning from corresponding text and images in real time, making them the ideal candidate for assistive technology. Leveraging these capabilities, these models may be integrated into wearable or extendable devices that allow users to receive continuous support in unfamiliar environments, improving their independence and maintaining safety. This project investigates the practical application of VLMs in real-world scenarios, with an emphasis on ease of use and reliability. This work has the potential to expand the role of assistive technology in daily life and complement existing solutions for more intuitive and responsive understanding.

Details

Contributors
Date Created
2024-12

Additional Information

English
Series
  • Academic Year 2024-2025
Extent
  • 100 pages
Open Access
Peer-reviewed

Breathe: A Short Story Featuring Childhood Apraxia of Speech

1 month 2 weeks ago

Exploring the Relationship of Sleep with Behavioral and Emot ...

1 month 2 weeks ago

Strategic Positioning of the Republic of Serbia in Relation ...

1 month 4 weeks ago

Social Media's Role in Normalizing Vaping and Desensiti ...

2 months 2 weeks ago

¿Cuénteme más?: Hermeneutics and Complexity of Identity

6 months ago

Lost in Translation? A Comparative Analysis of Dubbing for I ...

6 months 3 weeks ago

Unlocking Linguistic Equality: Advocating for Bilingualism i ...

6 months 3 weeks ago

The Utilization of Cryptotephra to Refine the Age Model at A ...

6 months 2 weeks ago

Unraveling the Nexus of Glioblastoma Multiforme and Type II ...

6 months 2 weeks ago