Speech Summarization Datasets

Comprehensive collection of datasets for speech summarization research

Summarizing Speech: A Comprehensive Survey

Authors: Fabian Retkowski1, Maike ZΓΌfle1, Andreas Sudmann2, Dinah Pfau3, Shinji Watanabe4, Jan Niehues1, Alexander Waibel1,4
1KIT, 2University of Bonn, 3Deutsches Museum, 4CMU

This interactive table accompanies our comprehensive survey of speech summarization, a field at the intersection of speech recognition, text summarization, and domain-specific applications. Our work synthesizes recent developments from traditional cascaded systems to end-to-end approaches, while highlighting ongoing challenges in evaluation benchmarks, multilingual datasets, and long-context handling. We will continuously update the table below. Feel free to reach out to us using the green 'Submit Dataset' button for feedback and request for additions.

All Summary Types
All Languages
All Licenses
Name ↕ Domain ↕ Summary Type ↕ Lang. ↕ Size ↕ Modalities ↕ License ↕ Status ↕
Loading datasets...