Turku Onchyko Corpus

Keywords: literary language, journalistic language, academic language

The Turku Onchyko Corpus contains texts from the journal Onchyko ('Forward'), published in Yoshkar-Ola.

The number of word tokens in the corpus is ca. 2,263,000.

The 856 texts of the corpus are from the years 1996–1999. Contents from the following issues of Onchyko are represented in the corpus:

  • 1996: 4–12
  • 1997: 1–6, 10–12
  • 1998: 1–12
  • 1999: 1–12

The corpus is accessible through Finno-Ugric Corpora portal.

Details about the resource

Content
  • Language: Meadow Mari
  • Form: written language
  • Genre: fiction, journalistic texts, poetry, scientific texts
  • Dataset size: 856 texts, 2,263,000 word tokens
  • Timescale: 1996–1999
Authors
Jorma Luutonencoordinator
Availability

Contact person

Jussi Ylikoskivolgaserver *at* utu.fi

Detailed corpus description (pdf)