The Advanced Finnish Learners’ Corpus

Keywords: academic language, longitudinal corpus

The Corpus of Advanced Learners of Finnish contains written data produced by advanced learners of Finnish in various academic settings. Additionally, the corpus contains reference material produced by native speakers of Finnish. All the material has been morphologically and syntactically annotated.

Ivaska, Ilmari. 2014. The Corpus of Advanced Learner Finnish (LAS2): Database and toolkit to study academic learner Finnish. Apples – Journal of Applied Language Studies 8(3). 21–38. http://apples.jyu.fi/article/abstract/317

Details about the resource

Content
  • Language: Finnish
  • Form: written language
  • Genre: academic texts
  • Size: 400 texts
  • Timescale: 2007–2019
Annotations
  • lemmatisation
  • part of speech
  • morphology
  • syntax

The tagsets used for parts of speech, morphology and syntactic functions are all from the Syntax Archive. The corpus is not dependency annotated. 

Authors
Ilmari Ivaskafounder and principal investigator (PI)
Availability

Available at

https://www.kielipankki.fi/corpora/LAS2/ 

Contact person

Ilmari Ivaskaitivas *at* utu.fi

Usage license

CC BY-NC-ND
Referring

Permanent Address of Dataset

Reference instructions

University of Turku, School of Languages and Translation Studies (2012). The Advanced Finnish Learners’ Corpus [data set]. Kielipankki. http://urn.fi/urn:nbn:fi:lb-201407167

Ivaska, Ilmari. 2014. The Corpus of Advanced Learner Finnish (LAS2): Database and toolkit to study academic learner Finnish. Apples – Journal of Applied Language Studies 8(3). 21–38. http://apples.jyu.fi/article/abstract/317