
OSU Quake 2004 Dialog Corpus:
A publicly available corpus of collaborative dialog in a virtual world

This small corpus is intended for researchers who want to study collaboration between humans working on a situated task. In this case, two human partners performed a treasure-hunt task in a graphically rendered virtual world displayed on their computer monitors. The partners spoke to each other to coordinate their activity on the task. The problem domain was chosen to simulate search and rescue.

Here's a little sample of the interaction in this domain.

The corpus has the following properties:

The technical report OSU-CISRC-8/05-TR57 describes the data collection conditions, subject instructions, etc. If you publish results that include the use of this corpus, please cite:

Donna K. Byron and Eric Fosler-Lussier. "The OSU Quake 2004 corpus of two-party situated problem-solving dialogs." In Proceedings of the 5th Language Resources and Evaluation Conference (LREC'06), 2006.

The corpus

To date, we have transcribed and annotated 5 problem-solving sessions. The data is available to any researcher who wants to use it, subject to the terms of use.

Links to tools we used

If you would like to perform a similar experiment, here are some of the tools you will need:

Local Resources

Guidelines for transcribing the audio

SLaTe: Speech and LAnguage TEchnologies Lab

Dept. of Computer Science & Engineering
580 Dreese Labs
The Ohio State University

All content on this page is Copyright © 2005, THE OHIO STATE UNIVERSITY