Learn Before
Dataset

LectureBank Dataset

LectureBank is a publicly released corpus of English lecture slide files collected from university courses in NLP, ML, AI, deep learning, and information retrieval. The original release contains 1{,}352 lecture files and 208208 manually annotated prerequisite-relation topic pairs. Each topic is classified against a fixed taxonomy, and each ordered pair of topics is labeled as a prerequisite link or not. The dataset is the canonical resource for prerequisite-chain learning in NLP education.

0

1

Updated 2026-05-16

Contributors are:

Who are from:

Tags

Science

Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls

Related