Electronic Word Lists: Mari, Mordvin, Udmurt, Komi, Chuvash, Tatar

Keywords: word lists

Word lists: Mari 54,000 lexemes Mordvin 75,000 lexemes Udmurt 49,000 lexemes Komi 70,000 lexemes Chuvash 31,000 lexemes Tatar 46,000 lexemes

In the files, the material is arranged in four columns:

  1. the word
  2. language
  3. word class
  4. sources

The meanings of the words are not given in the word list. Technically, the files are plain text Comma Separated Value (CSV) files. This simply means that a comma character (,) separates the fields for different types of information (word, language, word class, sources) in each line of the file.

For full description, see: https://www.sgr.fi/fi/items/show/404

Details about the resource

Content
  • Language: Mari, Mordvin, Udmurt, Komi, Chuvash, Tatar
  • Form: word lists
  • Dataset size: Mari 54,000 lexemes, Mordvin 75,000 lexemes, Udmurt 49,000 lexemes, Komi 70,000 lexemes, Chuvash 31,000 lexemes, Tatar 46,000 lexemes
Annotations
  • word class
Authors
Jorma Luutonen et al.coordinator
Availability

Contant person

Jussi Ylikoskivolgaserver *at* utu.fi