Slidecho: Flexible Non-Visual Exploration ofPresentation Videos

Carnegie Mellon University | Human-Computer Interaction Institute


A screenshot of the Slidecho interface, includes a (A) video player augmented with optional audio notifications for new slides, and undescribed element (displayed video: Effective learning outcomes are SMART is the slide title shown in the video, the slide also features the list: specific, measurable, attainable, relevant, time-bounded on one side and an icon of a dart on the left side), the (B) video timeline that lets viewers play/pause and navigate via slide boundaries (the timeline appears as one bar to show progression, and dots along the timeline to indicate where audio descriptions will play), (C) the undescribed slide elements pane with the header “slide 6, (undescribed regions)” then under it features 'Attainable', 'Image:Arrow', (D) the slides pane that has all slide elements represented as HTML elements it reads Slide 6 and then has the slide title and the entire acronym Specific, Measurable, Attainable, Relevant, and Time-Bound, Image: Arrow. A control bar at the top lets you turn off/on the audio notifications, next/prev slide, edit, and toggle sync.

Abstract

We present Slidecho, a system that enables non-visual access of the slide content in a presentation video on-demand. Slidecho automatically extracts slides and their text and image elements from the presentation video and aligns these elements to the presenter’s speech. When listening to the video, Slidecho provides learners with audio notifications about slide changes and slide elements that are not described by the presenter. The learner can pause the video and browse the entire slide, or only the undescribed slide elements, to gain information. A technical evaluation with presentation videos in-the-wild shows that compared to the presenter’s speech alone, Slidecho provides access to an additional 20% of total text elements and 30% of total image elements that were previously not described. Blind and visually impaired participants in our user study reported that it was easier to locate undescribed slide elements with Slidecho’s synchronized interface than when browsing the video and extracted slides separately, and using Slidecho they read fewer slides that were fully redundant with the speech.

Links

[HTML Paper] [PDF] [ACM DL] [Presentation] [Demo (interactive sample without backend connection)]

Full talk video (7 minutes 50 seconds)

BibTex

@inproceedings{peng2021slidecho,
  title={Slidecho: Flexible Non-Visual Exploration of Presentation Videos},
  author={Peng, Yi-Hao and Bigham, Jeffrey P and Pavel, Amy},
  booktitle={Proceedings of the 23rd International ACM SIGACCESS Conference on Computers and Accessibility},
  year={2021}
}