What's new
corpus
Description:
A small subset of phonetically annotated data has been derived from LATE-sarunas and LATE-media. The phonetic annotation is available at two levels: (1) the dictionary or standard pronunciation of a word or segment, ...
This item contains no files.
corpus
Description:
The corpus contains audio recordings of media broadcasts and their orthographic transcripts. The data are transcribed in Standard Latvian orthography and also follow its punctuation principles.
This item contains no files.
corpus
Description:
The corpus contains recordings of informal conversations, interviews, and public speeches, along with their orthographic transcripts. Metadata has been added to each audio recording: gender and age group of the speaker, ...
This item contains no files.
Most viewed items
Most popular in the last week
lexicalConceptualResource
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
This item contains 2 files (348.56 MB).
Publicly Available
corpus
Description:
The Balanced Corpus of Modern Latvian contains unique texts not yet included in the previously developed balanced corpora (LVK2013 and LVK2018). The corpus is primarily based on the design principles of previous ...
This item contains no files.
toolService
Description:
A neural model for Latvian text-to-speech (TTS) synthesis, trained using VITS on a 25-hour speech corpus of audiobooks read in a male voice. Available for academic and non-commercial purposes via an API. To get access ...
This item contains 1 file (376.17 MB).
Restricted Use