OSU Quake 2004 Dialog Corpus:
A publicly available corpus of collaborative dialog in a virtual world

This small corpus is intended for researchers who want to study collaboration between humans working on a situated task. In this case, two human partners perform a treasure-hunt task in a graphically-rendered virtual world that is portrayed on their computer monitors. The partners spoke to each other in order to coordinate their activity on the task. The problem domain was chosen to simulate the search-and-rescue domain.

Here's a little sample of the interaction in this domain.

The corpus has the following properties:

The technical report OSU-CISRC-8/05-TR57 (available on the main CSE tech report site) describes the data collection conditions, subject instructions, etc.

The corpus

To date we have transcribed and annotated 5 problem-solving sessions. More will be posted on the web site as it becomes available. The data is available to any researchers who want to use it it subject to the terms of use, but I would like to keep a distribution list of people who are currently using the corpus. So, to get access to the movies, audio files, and transcripts, please email Donna Byron to get a password to the corpus webpage.

Links to tools we used

If you would like to perform a similar experiment, here are some of the tools you will need:

Local Resources

Guidelines for transcribing the audio

SLaTe: Speech and Technologies Lab

Dept. of Computer Science & Engineering
580 Dreese Labs
The Ohio State University

All content on this page is Copyright © 2005, THE OHIO STATE UNIVERSITY