As part of a broader effort to provide tools for enabling research and practice in the space of collaborative and Discussion based learning, DiscourseDB is an NSF funded data infrustructure project designed to bridge data sources from multiple platforms for hosting those learning experiences. Our vision is to provide a common data model designed to accommodate data from diverse sources including but not limited to Chat, Threaded Discussions, Blogs, Twitter, Wikis, and Text messaging.

We will make available analytics components related to constructs including role taking, help exchange, collaborative knowledge construction, showing openness, taking an authoritative stance, attitudes, confusion, alliance and opposition. In enabling application of such metrics across datasets from multiple platforms, research questions related to the mediating and moderating effect of these process and state measures on information transfer, learning, and attrition can be conducted, building on the earlier research of our team.

Current Capabilities

We have one publically available dataset, consisting of online discussion of bugs and features in a set of related open source software projects, OpenFL. Other datasets are available to researchers by request, subject to IRB approval.

These datasets can be viewed in the Data browser. Researchers can create their own annotations on this data using an integrated installation of the Brat annotation tool, and apply machine learning techniques to generalize these labels using LightSide.

Next Steps

This month (Nov 2017) we are working to allow integration of DiscouseDB data into Learnsphere workflows, allowing researchers to apply its growing infrastructure of analyses to discourse data, and perform combined analyses with other data products under the Learnsphere umbrella.

Research and Development Team

Carolyn P. Rose, Carnegie Mellon University
Chris Bogart, Carnegie Mellon University
Oliver Ferschke, Carnegie Mellon University


How to annotate data
How to generalize annotations
DiscourseDB Wiki


The National Science Foundation