The 2010 KDD Cup Competition Dataset: Engaging the machine learning community in predictive learning analytics
DOI:
https://doi.org/10.18608/jla.2016.32.16Abstract
In the spring of 2010, the Association for Computing Machinery (ACM) Special Interest Group on Knowledge Discovery and Data-mining (KDD) selected a dataset from an educational technology for its annual competition. The competition, titled “Educational Data Mining Challenge”, tasked participants with predicting the correctness of student answers to questions within an Intelligent Tutoring System (ITS) from The Cognitive Tutors suite of tutors. This challenge was hosted by the PSLC DataShop, and included data provided by the Carnegie Learning Inc., producers of The Cognitive Tutors. Consisting of over 9GB of student data this was the largest KDD Cup dataset up to that point in time. The competition brought in 655 competitors submitting 3,400 solutions. Five years later, we believe the competition dataset has been the most often cited from an educational technology platform.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 Journal of Learning Analytics

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons License, Attribution - NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0) license that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).