SCARE: A Situated Corpus with Annotated Referring Expressions
This multimodal corpus was collected as part of our ongoing work on
multi-agent collaboration in situated tasks. Each session in this
corpus records the joint problem-solving of a pair of human partners
working through a treasure-hunt style task in a virtual world. The corpus includes
time-aligned transcriptions, audio files, quicktime movies showing the
action as it unfolded in the virtual world, positional information
from the virtual world's log, and state-change information on items that could be manipulated in the virtual world.
The corpus is freely available for research and educational use only, according to
this User Agreement. It could be used for a wide variety of investigations, and is especially suited for investigating task-oriented dialog or deictic
expressions in English.
Please email us your contact information if you plan to use the corpus, so that you can be notified of any future updates.
(illustration by Mariana Sharp)
Files are compressed with bzip. To extract files from the tar archives, use option -xjf
Key Publications describing the corpus:
- Technical Report describing the hardware and software set-up for data collection (pdf)
- LREC paper describing the data collection. Please cite this
paper if you are using the SCARE corpus in your research (pdf)BibTeX
- Laura Stoia's dissertation, which describes a system designed using
models developed from the corpus. It includes an appendix with the instructions and
questionnaires completed by the subjects who provided data for the corpus. (pdf)
Published Papers using the SCARE corpus:
If you publish research that references the SCARE corpus, please tell us so we can advertise your research here.
- Laura Stoia, Darla Magdalene Shockley, Donna K. Byron, and Eric Fosler-Lussier.
Noun phrase generation for situated dialogs.
In Proceedings of the Fourth International Natural Language
Generation Conference, pages 81-88, Sydney, Australia, July 2006.
Association for Computational Linguistics.
- Laura Stoia, Donna K. Byron, Darla Shockley, and Eric Fosler-Lussier.
Sentence planning for realtime navigational instruction.
In Proceedings of the Human Language Technology Conference of
the NAACL, Companion Volume: Short Papers, pages 157-160, New York City,
USA, June 2006. Association for Computational Linguistics.
Please read the License Agreement before using the data.
If you have any questions about using the corpus, please send an email!.
SLaTe: Speech and Language Technologies Lab
Dept. of Computer Science & Engineering
580 Dreese Labs
The Ohio State University
All content on this page is Copyright © 2007, THE OHIO STATE UNIVERSITY