Recent Submissions

  • Foneetikakorpuse sagedussõnastik 

    Lippus, Pärtel (2019-06-20)
    Eesti keele spontaanse kõne foneetilise korpuse sagedussõnastik on koostatud korpuse v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L) versiooni põhjal, kui korpuses oli märgendatud 685 750 sõna (89 tundi ja ...
  • Pretrained word and multi-sense embeddings for Estonian 

    Aedmaa, Eleri (2019)
    Word and multi-sense embedding for Estonian trained on lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec. Sense embeddings are trained with SenseGram. Sense inventory is induced ...
  • Inari Saami geminates 

    Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire (2018-11-08)
    Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al (2018). The Acoustic Correlates of Quantity in Inari Saami. Journal of Phonetics. Target words ...
  • (Non-)Literalness ratings for Estonian particle verbs 

    Aedmaa, Eleri (2018-06)
    (Non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. Sentences are evaluated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle ...
  • Context-dependent articulation of consonant gemination in Estonian (data) 

    Türk, Helen; Lippus, Pärtel; Šimko, Juraj (2017)
    This dataset is collected from 4 native Estonian speakers with Carstens AG-500 electromagnetic articulograph articluating the 27 combinations of disyllabic words for the purpose of studying gemination in the Estonian ...

View more