SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos

SSLectures: Abstractive Summaries and Topics Segments of Lecture Videos   SSLectures is a dataset containing abstractive summaries of lecture videos from AK Lectures website and MIT OCW repository. It also contains topic segments (chapters) for the MIT lectures. The dataset was scraped from free pub...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Alesh, Yaser Haitham, Abdulghani, Osama, Al Ali, Omar Ibrahim, Aoudia, Meriem, Abu Talib, Dr. Manar
Format: Dataset
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Alesh, Yaser Haitham
Abdulghani, Osama
Al Ali, Omar Ibrahim
Aoudia, Meriem
Abu Talib, Dr. Manar
description SSLectures: Abstractive Summaries and Topics Segments of Lecture Videos   SSLectures is a dataset containing abstractive summaries of lecture videos from AK Lectures website and MIT OCW repository. It also contains topic segments (chapters) for the MIT lectures. The dataset was scraped from free publicly available material and is published under a Creative Commons License that allows re-distribution and re-use.   The dataset is split into 3 files explained below: mit_chapters_summarized.csv: Contains the transcript and other details of 14.8K chapters (segments) from the MIT lectures along with abstractive summaries generated with GPT-3.5. Each row is one chapter from one lecture video.  Suitable to train summarization to summarize parts of lecture videos. (Not full lectures). ak_lectures_summarized.csv: Contains the transcript and other details of 1.8k lecture videos from aklectures.com. Each lecture video comes with the abstractive summary that was published on the website. Most videos of this dataset are short, between 5-15 minutes on average. Suitable to train summarization models to summarize full short lecture videos. (~ 15 min. in length for most) mit_videos_all_courses_segmentations.csv: Contains details of the chaptering (segmentation) of each lecture video from MIT. Each row is for one lecture video, and comes with the timing (end times) and titles of each chapter in the video.  Suitable to train and/or evaluate segmentation algorithms and models for both short and long lecture videos. Please cite this page if you use this dataset in your research or in other projects.  Copyright Notice: All rights of the lecture videos, the transcripts the have been scraped, the chapters and titles, the human-written summaries and all other related details belong to the respective owners of the MIT OCW or the AK Lectures websites. Our work here is for research and educational purposes. 
doi_str_mv 10.5281/zenodo.10498679
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5281_zenodo_10498679</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5281_zenodo_10498679</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5281_zenodo_104986793</originalsourceid><addsrcrecordid>eNpjYBA3NNAzNbIw1K9KzctPydczNDCxtDAzt-RkcAsO9klNLiktSi22UnBMKi4pSkwuySxLVQguzc1NLMpMLVZIzEtRCMkvyExWCE5Nz03NKylWyE9TgOpSCMtMSc0v5mFgTUvMKU7lhdLcDPpuriHOHropiSWJyZklqfEFRZlA8yrjDQ3iQS6Jh7gkHuYSY9J1AAA6z0OT</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos</title><source>DataCite</source><creator>Alesh, Yaser Haitham ; Abdulghani, Osama ; Al Ali, Omar Ibrahim ; Aoudia, Meriem ; Abu Talib, Dr. Manar</creator><creatorcontrib>Alesh, Yaser Haitham ; Abdulghani, Osama ; Al Ali, Omar Ibrahim ; Aoudia, Meriem ; Abu Talib, Dr. Manar</creatorcontrib><description>SSLectures: Abstractive Summaries and Topics Segments of Lecture Videos   SSLectures is a dataset containing abstractive summaries of lecture videos from AK Lectures website and MIT OCW repository. It also contains topic segments (chapters) for the MIT lectures. The dataset was scraped from free publicly available material and is published under a Creative Commons License that allows re-distribution and re-use.   The dataset is split into 3 files explained below: mit_chapters_summarized.csv: Contains the transcript and other details of 14.8K chapters (segments) from the MIT lectures along with abstractive summaries generated with GPT-3.5. Each row is one chapter from one lecture video.  Suitable to train summarization to summarize parts of lecture videos. (Not full lectures). ak_lectures_summarized.csv: Contains the transcript and other details of 1.8k lecture videos from aklectures.com. Each lecture video comes with the abstractive summary that was published on the website. Most videos of this dataset are short, between 5-15 minutes on average. Suitable to train summarization models to summarize full short lecture videos. (~ 15 min. in length for most) mit_videos_all_courses_segmentations.csv: Contains details of the chaptering (segmentation) of each lecture video from MIT. Each row is for one lecture video, and comes with the timing (end times) and titles of each chapter in the video.  Suitable to train and/or evaluate segmentation algorithms and models for both short and long lecture videos. Please cite this page if you use this dataset in your research or in other projects.  Copyright Notice: All rights of the lecture videos, the transcripts the have been scraped, the chapters and titles, the human-written summaries and all other related details belong to the respective owners of the MIT OCW or the AK Lectures websites. Our work here is for research and educational purposes. </description><identifier>DOI: 10.5281/zenodo.10498679</identifier><language>eng</language><publisher>Zenodo</publisher><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1894</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5281/zenodo.10498679$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Alesh, Yaser Haitham</creatorcontrib><creatorcontrib>Abdulghani, Osama</creatorcontrib><creatorcontrib>Al Ali, Omar Ibrahim</creatorcontrib><creatorcontrib>Aoudia, Meriem</creatorcontrib><creatorcontrib>Abu Talib, Dr. Manar</creatorcontrib><title>SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos</title><description>SSLectures: Abstractive Summaries and Topics Segments of Lecture Videos   SSLectures is a dataset containing abstractive summaries of lecture videos from AK Lectures website and MIT OCW repository. It also contains topic segments (chapters) for the MIT lectures. The dataset was scraped from free publicly available material and is published under a Creative Commons License that allows re-distribution and re-use.   The dataset is split into 3 files explained below: mit_chapters_summarized.csv: Contains the transcript and other details of 14.8K chapters (segments) from the MIT lectures along with abstractive summaries generated with GPT-3.5. Each row is one chapter from one lecture video.  Suitable to train summarization to summarize parts of lecture videos. (Not full lectures). ak_lectures_summarized.csv: Contains the transcript and other details of 1.8k lecture videos from aklectures.com. Each lecture video comes with the abstractive summary that was published on the website. Most videos of this dataset are short, between 5-15 minutes on average. Suitable to train summarization models to summarize full short lecture videos. (~ 15 min. in length for most) mit_videos_all_courses_segmentations.csv: Contains details of the chaptering (segmentation) of each lecture video from MIT. Each row is for one lecture video, and comes with the timing (end times) and titles of each chapter in the video.  Suitable to train and/or evaluate segmentation algorithms and models for both short and long lecture videos. Please cite this page if you use this dataset in your research or in other projects.  Copyright Notice: All rights of the lecture videos, the transcripts the have been scraped, the chapters and titles, the human-written summaries and all other related details belong to the respective owners of the MIT OCW or the AK Lectures websites. Our work here is for research and educational purposes. </description><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2024</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNpjYBA3NNAzNbIw1K9KzctPydczNDCxtDAzt-RkcAsO9klNLiktSi22UnBMKi4pSkwuySxLVQguzc1NLMpMLVZIzEtRCMkvyExWCE5Nz03NKylWyE9TgOpSCMtMSc0v5mFgTUvMKU7lhdLcDPpuriHOHropiSWJyZklqfEFRZlA8yrjDQ3iQS6Jh7gkHuYSY9J1AAA6z0OT</recordid><startdate>20240112</startdate><enddate>20240112</enddate><creator>Alesh, Yaser Haitham</creator><creator>Abdulghani, Osama</creator><creator>Al Ali, Omar Ibrahim</creator><creator>Aoudia, Meriem</creator><creator>Abu Talib, Dr. Manar</creator><general>Zenodo</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>20240112</creationdate><title>SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos</title><author>Alesh, Yaser Haitham ; Abdulghani, Osama ; Al Ali, Omar Ibrahim ; Aoudia, Meriem ; Abu Talib, Dr. Manar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5281_zenodo_104986793</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Alesh, Yaser Haitham</creatorcontrib><creatorcontrib>Abdulghani, Osama</creatorcontrib><creatorcontrib>Al Ali, Omar Ibrahim</creatorcontrib><creatorcontrib>Aoudia, Meriem</creatorcontrib><creatorcontrib>Abu Talib, Dr. Manar</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Alesh, Yaser Haitham</au><au>Abdulghani, Osama</au><au>Al Ali, Omar Ibrahim</au><au>Aoudia, Meriem</au><au>Abu Talib, Dr. Manar</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos</title><date>2024-01-12</date><risdate>2024</risdate><abstract>SSLectures: Abstractive Summaries and Topics Segments of Lecture Videos   SSLectures is a dataset containing abstractive summaries of lecture videos from AK Lectures website and MIT OCW repository. It also contains topic segments (chapters) for the MIT lectures. The dataset was scraped from free publicly available material and is published under a Creative Commons License that allows re-distribution and re-use.   The dataset is split into 3 files explained below: mit_chapters_summarized.csv: Contains the transcript and other details of 14.8K chapters (segments) from the MIT lectures along with abstractive summaries generated with GPT-3.5. Each row is one chapter from one lecture video.  Suitable to train summarization to summarize parts of lecture videos. (Not full lectures). ak_lectures_summarized.csv: Contains the transcript and other details of 1.8k lecture videos from aklectures.com. Each lecture video comes with the abstractive summary that was published on the website. Most videos of this dataset are short, between 5-15 minutes on average. Suitable to train summarization models to summarize full short lecture videos. (~ 15 min. in length for most) mit_videos_all_courses_segmentations.csv: Contains details of the chaptering (segmentation) of each lecture video from MIT. Each row is for one lecture video, and comes with the timing (end times) and titles of each chapter in the video.  Suitable to train and/or evaluate segmentation algorithms and models for both short and long lecture videos. Please cite this page if you use this dataset in your research or in other projects.  Copyright Notice: All rights of the lecture videos, the transcripts the have been scraped, the chapters and titles, the human-written summaries and all other related details belong to the respective owners of the MIT OCW or the AK Lectures websites. Our work here is for research and educational purposes. </abstract><pub>Zenodo</pub><doi>10.5281/zenodo.10498679</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5281/zenodo.10498679
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5281_zenodo_10498679
source DataCite
title SSLectures: Abstractive Summaries and Topic Segments of Lecture Videos
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A27%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Alesh,%20Yaser%20Haitham&rft.date=2024-01-12&rft_id=info:doi/10.5281/zenodo.10498679&rft_dat=%3Cdatacite_PQ8%3E10_5281_zenodo_10498679%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true