EchoLens: Image-to-Audio Accessibility Tool
A Chrome extension designed to make the web more accessible for visually impaired users by providing real-time, AI-generated audio descriptions of images. Powered by Llama 3.2 Vision for analysis and Google TTS for natural voice output.
Problem
Visually impaired users often face significant barriers when navigating the web, as many images lack proper alt text or descriptions, leaving a large portion of digital content inaccessible.
Approach
Developed a seamless Chrome extension that integrates a JavaScript frontend with a Flask backend. The system leverages Groq's Llama 3.2 Vision model to analyze images in real-time and converts the descriptions into speech using Google TTS. User preferences are securely managed via a Microsoft SQL database.
Impact
Significantly improved web accessibility by enabling visually impaired users to 'hear' images, providing instant, detailed audio descriptions for any visual content on the web, thereby bridging the digital divide.
Key Metrics
Technologies
Links
My Role
Initiated and developed the entire solution, from Chrome extension to backend API and database integration.
Team Size: 1 person