Veronika
Laippala
Professor, Digital Language Studies, Chinese, French, German, Italian, Spanish
Areas of expertise
Computational linguistics
text linguistics
corpus linguistics
digital discourse analysis.
Biography
I am a linguist who likes computers. My main research topics include language variation across different communicative situations and the development of automatic tools so that we could better benefit from large, web-crawled corpora.
My ongoing projects include "A piece of news, an opinion or something else? Different texts and their detection from the multilingual Internet" funded by Emil Aaltonen foundation and "Massively multilingual modeling of registers in web-scale data" funded by Academy of Finland.
For more information, please have a look at our lab website at https://turkunlp.github.io/
Publications
Selkosten Proust taipuu moneen - Iijoki-korpus ja digitaalisen tekstilouhinnan mahdollisuudet (2022)
(Vertaisarvioitu artikkeli kokoomateoksessa (A3))Towards better structured and less noisy Web data: Oscar with Register annotations (2022)
International Conference on Computational Linguistics, International Conference on Computational Linguistics
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))
Beyond the English web: Zero-shot cross-lingual and lightweight monolingual classification of registers (2021)
European Chapter of the Association for Computational Linguistics
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))
Exploring the role of lexis and grammar for the stable identification of register in an unrestricted corpus of web documents (2021)
Language Resources and Evaluation
(Vertaisarvioitu alkuperäisartikkeli tai data-artikkeli tieteellisessä aikakauslehdessä (A1))
Multilingual and Zero-Shot is Closing in on Monolingual Web Register Classification (2021)
Nordic Conference on Computational Linguistics, Linköping Electronic Conference Proceedings
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))
Commenting on poverty online: A corpus-assisted discourse study of the Suomi24 forum (2020)
SKY Journal of Linguistics
(Vertaisarvioitu alkuperäisartikkeli tai data-artikkeli tieteellisessä aikakauslehdessä (A1))
A broad-coverage corpus for finnish named entity recognition (2020)
International Conference on Language Resources and Evaluation
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))
Määrällinen korpuslingvistiikka (2020)
(Vertaisarvioitu artikkeli kokoomateoksessa (A3))Affectivity in the #jesuisCharlie Twitter discussion (2020)
Pragmatics
(Vertaisarvioitu alkuperäisartikkeli tai data-artikkeli tieteellisessä aikakauslehdessä (A1))
From Web Crawl to Clean Register-Annotated Corpora (2020)
Web as Corpus Workshop
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))