Recent Submissions

  • Frequency dictionary of the Phonetic Corpus (Foneetikakorpuse sagedussõnastik) 

    Lippus, Pärtel (2019-06-20)
    The frequency dictionary of the Phonetic Corpus of Estonian Spontaneous Speech was compiled from corpus version v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L), at which point 685 750 words had been annotated in the corpus (89 hours and ...
  • Paleopathological lesions in infant human remains 

    Morrone, Alessandra; Oras, Ester; Tõrv, Mari (2019-05)
    This study reports the paleopathological findings of six anomalous child burials discovered in the Medieval and Early Modern cemetery of St Jacob (Tartu, Estonia). Specific fetal-neonatal disease patterns are identified ...
  • Pretrained word and multi-sense embeddings for Estonian 

    Aedmaa, Eleri (2019)
    Word and multi-sense embeddings for Estonian, trained on the lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec. Sense embeddings are trained with SenseGram. The sense inventory is induced ...
  • Inari Saami geminates 

    Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire (2018-11-08)
    Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al. (2018), "The Acoustic Correlates of Quantity in Inari Saami", Journal of Phonetics. Target words ...
  • (Non-)Literalness ratings for Estonian particle verbs 

    Aedmaa, Eleri (2018-06)
    (Non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. Sentences are evaluated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle ...
