Veronika Laippala
Professor, Digital Language Studies, Chinese, French, German, Italian, Spanish
Areas of expertise
Computational linguistics
Text linguistics
Corpus linguistics
Digital discourse analysis
Biography
I am a linguist who likes computers. My main research topics include language variation across different communicative situations and the development of automatic tools that let us make better use of large, web-crawled corpora.
My ongoing projects include "A piece of news, an opinion or something else? Different texts and their detection from the multilingual Internet", funded by the Emil Aaltonen Foundation, and "Massively multilingual modeling of registers in web-scale data", funded by the Academy of Finland.
For more information, please have a look at our lab website at https://turkunlp.github.io/
Publications
Register identification from the unrestricted open Web using the Corpus of Online Registers of English (2022)
Language Resources and Evaluation
(Peer-reviewed original article or data article in a scientific journal (A1))
Exploring the role of lexis and grammar for the stable identification of register in an unrestricted corpus of web documents (2021)
Language Resources and Evaluation
(Peer-reviewed original article or data article in a scientific journal (A1))
Multilingual and Zero-Shot is Closing in on Monolingual Web Register Classification (2021)
Nordic Conference on Computational Linguistics, Linköping Electronic Conference Proceedings
(Peer-reviewed article in conference proceedings (A4))
Beyond the English web: Zero-shot cross-lingual and lightweight monolingual classification of registers (2021)
European Chapter of the Association for Computational Linguistics
(Peer-reviewed article in conference proceedings (A4))
Affectivity in the #jesuisCharlie Twitter discussion (2020)
Pragmatics
(Peer-reviewed original article or data article in a scientific journal (A1))
From Web Crawl to Clean Register-Annotated Corpora (2020)
Web as Corpus Workshop
(Peer-reviewed article in conference proceedings (A4))
Korpusaineistot (2020)
(Peer-reviewed article in an edited volume (A3))
Commenting on poverty online: A corpus-assisted discourse study of the Suomi24 forum (2020)
SKY Journal of Linguistics
(Peer-reviewed original article or data article in a scientific journal (A1))
A broad-coverage corpus for Finnish named entity recognition (2020)
International Conference on Language Resources and Evaluation
(Peer-reviewed article in conference proceedings (A4))