A machine-readable pronunciation dictionary of the medical domain derived from a large text corpus of historical medical records. Consists of 109k entries in the CSV format: first column - a wordform; second column - its pronunciation in the IPA encoding. The dictionary contains Latvian words and terms used in the medical domain, as well as abbreviations, Latin terms and frequent English named entities (e.g. drug names).
Activities of CLARIN Latvia are supported by project of the European Regional Development Fund Nr. 22.214.171.124/18/I/016 University of Latvia and institutes in the European Research Area - Excellency, activity, mobility, capacity.