[UIST 2024] WorldScribe: Towards Context-Aware Live Visual Descriptions (30s Preview)

Опубликовано: 12 Август 2024
на канале: Anhong Guo
143
4

WorldScribe: Towards Context-Aware Live Visual Descriptions

UIST 2024, 30s Preview

Authors:
Ruei-Che Chang, Yuxuan Liu, Anhong Guo

Abstract:
Automated live visual descriptions can aid blind people in understanding their surroundings with autonomy and independence. However, providing descriptions that are rich, contextual, and just-in-time has been a long-standing challenge in accessibility. In this work, we develop WorldScribe, a system that generates automated live real-world visual descriptions that are customizable and adaptive to users' contexts: (i) WorldScribe's descriptions are tailored to users' intents and prioritized based on semantic relevance. (ii) WorldScribe is adaptive to visual contexts, e.g., providing consecutively succinct descriptions for dynamic scenes, while presenting longer and detailed ones for stable settings. (iii) WorldScribe is adaptive to sound contexts, e.g., increasing volume in noisy environments, or pausing when conversations start. Powered by a suite of vision, language, and sound recognition models, WorldScribe introduces a description generation pipeline that balances the tradeoffs between their richness and latency to support real-time use. The design of WorldScribe is informed by prior work on providing visual descriptions and a formative study with blind participants. Our user study and subsequent pipeline evaluation show that WorldScribe can provide real-time and fairly accurate visual descriptions to facilitate environment understanding that is adaptive and customized to users' contexts. Finally, we discuss the implications and further steps toward making live visual descriptions more context-aware and humanized.

The project video is available at:    • WorldScribe: Towards Context-Aware Li...  
Paper available at: https://guoanhong.com/papers/UIST24-W...


Смотрите видео [UIST 2024] WorldScribe: Towards Context-Aware Live Visual Descriptions (30s Preview) онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь Anhong Guo 12 Август 2024, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 143 раз и оно понравилось 4 людям.