This page introduces the anonymised Open University Learning Analytics Dataset (OULAD). It contains data about courses, students and their interactions with Virtual Learning Environment (VLE) for seven selected courses (called modules). Presentations of courses start in February and October - they are marked by "B" and "J" respectively. The dataset consists of tables connected using unique identifiers. All tables are stored in the csv format.
You can download the latest version of the OULAD here:
* You can check integrity of downloaded zip file using the MD5 checksum.
The two-day event was held at the University of British Columbia, Canada. Over 100 participants dove into our dataset and experimented with it. Interesting projects in the area of social comparison and visualisation have been developed.
The principal aim of Hack@LAK18 was to enable multi-disciplinary thinking over key open challenges in Learning Analytics based on a problem-oriented, pragmatic approach. OULAD was one of the recommended datasets by the organisers.
File contains the list of all available modules and their presentations. The columns are:
The structure of B and J presentations may differ and therefore it is good practice to analyse the B and J presentations separately. Nevertheless, for some presentations the corresponding previous B/J presentation do not exist and therefore the J presentation must be used to inform the B presentation or vice versa. In the dataset this is the case of CCC, EEE and GGG modules.
This file contains information about assessments in module-presentations. Usually, every presentation has a number of assessments followed by the final exam. CSV contains columns:
If the information about the final exam date is missing, it is at the end of the last presentation week.
The csv file contains information about the available materials in the VLE. Typically, these
are html pages, pdf files, etc. Students have access to these materials online and their interactions
with the materials are recorded.
The vle.csv file contains the following columns:
This file contains demographic information about the students together with their results. File contains the following columns:
This file contains information about the time when the student registered for the module presentation. For students who unregistered the date of unregistration is also recorded. File contains five columns:
This file contains the results of students' assessments. If the student does not submit the
assessment, no result is recorded. The final exam submissions is missing, if the result of the
assessments is not stored in the system.
This file contains the following columns:
The studentVle.csv file contains information about each student's interactions with the materials in the VLE.
This file contains the following columns:
This dataset is released under CC-BY 4.0 license.
When citing the dataset please use the following reference:
Kuzilek J., Hlosta M., Zdrahal Z.
Open University Learning Analytics dataset
Sci. Data 4:170171 doi: 10.1038/sdata.2017.171 (2017).