Diachronic Corpus of Literary Meadow Mari

Keywords: literary language, journalistic language

The Diachronic Corpus of Literary Meadow Mari contains newspaper articles from different periods of the development of Mari literary language. The oldest texts are from the year 1909, and the newest from 2008.

Articles are divided into periods: 

  • 1900s–1910s
  • 1920s–1937
  • 1940s–1950s
  • 1960s–1980s
  • 1990s
  • 2000s

They are also classified according to their content: 

  • politics and society (I)
  • economics (II)
  • culture and education (III)
  • fiction(IV)
  • miscellaneous (V).

The corpus includes 575 texts, containing ca. 336,000 word tokens.

The materials can be used to study changes in Meadow Mari literary language.

The corpus is accessible through Finno-Ugric Corpora portal.

Details about the resource

Content
  • Language: Meadow Mari
  • Form: written language
  • Genre: journalism
  • Timescale: 1909–2008
Authors
Jorma Luutonencoordinator
Oleg Sergejevcoordinator
Valeri Maksimovcoordinator
Availability

Contact person

Jussi Ylikoskivolgaserver *at* utu.fi