BOBC |
Resource type: Book Chapter Language: en: English DOI: 10.1109/ICDAR.2017.286 BibTeX citation key: Dunst2017 Email resource to friend View all bibliographic details |
Categories: General Keywords: Cognition, Comics research, Digitalization Creators: Bilof, Dunst, Hartel, Laubrock Publisher: The Institute of Electrical and Electronics Engineers (Piscataway) Collection: 1st International Workshop on Computational Document Forensics. IWCDF 2017 |
Views: 3/1154
|
Attachments |
Abstract |
Developed for an interdisciplinary DH project, the Graphic Narrative Corpus (GNC) is the first digital corpus of graphic novels, memoirs, and non-fiction written in English. It currently includes 160 book-length titles and will grow to around 250 graphic narratives by 2018. In contrast to collections such as Manga109, the eBDtheque, and the Iyyer corpus, the GNC was conceived to serve both the research needs of humanities and social science scholars and as a data set for computational analysis. The GNC has been constructed as a stratified monitor corpus that balances different historical periods, geographical origin, literary genres, and the gender and ethnic background of authors. Based on an extension of John Walsh’s XML-dialect CBML and editor software developed for the corpus, annotation combines a focus on the first ten pages of each title and sample annotation of full-length books. XML-annotation currently includes visual objects, as well as word-image and character relations (panels, characters, balloons, captions, text, interaction types). In addition, we also provide eye-tracking data for annotated titles. Information about the corpus and sample visualizations can be found at: https://groups.uni-paderborn.de/graphic-literature/gncorpus/corpus.php.
|