Parlameter – a Corpus of Contemporary Slovene Parliamentary Proceedings

  • Darja Fišer, Department of Translation, Faculty of Arts, University of Ljubljana
  • Nikola Ljubešić, Jožef Stefan Institute
  • Tomaž Erjavec, Department of Knowledge Technologies, Jožef Stefan Institute
Keywords: parliamentary proceedings, corpus construction, language technology, corpus analysis


The paper presents Parlameter, a corpus of contemporary Slovene parliamentary proceedings covering the seventh mandate of the Slovene Parliament (2014-2018). The corpus offers rich speaker metadata (gender, age, education, party affiliation) and is linguistically annotated (lemmatization, tagging), which supports research in several digital humanities and social science disciplines. We demonstrate the potential of corpus analysis techniques for investigating political debates. The corpus architecture allows for regular extensions with additional Slovene data, as well as data from other parliaments, starting with Croatian.



