<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:identifier>doi:10.18061/emr.v18i1.8903</dc:identifier>
  <dc:title xml:lang="eng">An annotated corpus of tonal piano music from the long 19th century</dc:title>
  <dc:type xml:lang="eng">Text</dc:type>
  <dc:type xml:lang="eng">journal article</dc:type>
  <dc:rights>http://creativecommons.org/licenses/by-nc/4.0/</dc:rights>
  <dc:format>application/pdf</dc:format>
  <dc:date>2024-01-12</dc:date>
  <dc:language>eng</dc:language>
  <dc:source xml:lang="deu">Empirical Musicology Review</dc:source>
  <dc:type xml:lang="deu">Text</dc:type>
  <dc:type xml:lang="deu">Wissenschaftlicher Artikel</dc:type>
  <dc:type xml:lang="ita">Testo</dc:type>
  <dc:type xml:lang="ita">Articolo di rivista</dc:type>
  <dc:description xml:lang="eng">We present a dataset of 264 annotated piano pieces of nine composers, composed in the long 19th century (https://doi.org/10.5281/zenodo.7483349). Annotations adhere to the DCML harmony annotation standard and include Roman numerals, phrase boundaries, and cadence types. The scores are encoded in the XML-based MuseScore 3 format. Annotations are embedded within the MuseScore files. In addition, all harmony information, alongside key features of the encoded measure and note objects, is provided in the form of plaintext TSV-formatted tables for increased interoperability with other datasets and analysis tools. Annotations were collaboratively created and reviewed by a pool of trained music theorists. Collaboration took place asynchronously online via a semi-automated GitHub-based workflow designed for quality assurance, allowing cycles of revisions and reviews until consensus is reached. The full revision history is retained, providing data for further empirical research on inter-annotator agreement and related topics. We also present descriptive statistics about the nine corpora and the dataset as a whole, including comparisons of pitch-class contents, phrase lengths, modulations, and cadence types. We conclude with a discussion of our musicological principles for corpus building and considerations of representability.</dc:description>
  <dc:creator>Johannes Hentschel</dc:creator>
  <dc:creator>Yannis Rammos</dc:creator>
  <dc:creator>Fabian C. Moss</dc:creator>
  <dc:creator>Markus Neuwirth</dc:creator>
  <dc:creator>Martin Rohrmeier</dc:creator>
  <dc:subject xml:lang="deu">corpora</dc:subject>
  <dc:subject xml:lang="deu">harmony</dc:subject>
  <dc:subject xml:lang="deu">phrase</dc:subject>
  <dc:subject xml:lang="deu">cadence</dc:subject>
  <dc:subject xml:lang="deu">piano</dc:subject>
  <dc:subject xml:lang="deu">19th century</dc:subject>
  <dc:identifier>https://phaidra.bruckneruni.at/o:3910</dc:identifier>
</oai_dc:dc>