Show simple item record

 
dc.contributor.author Goško, Didzis
dc.contributor.author Bārzdiņš, Guntis
dc.date.accessioned 2024-02-09T11:46:06Z
dc.date.available 2024-02-09T11:46:06Z
dc.date.issued 2024-02
dc.identifier.uri http://hdl.handle.net/20.500.12574/97
dc.description The SELMA Open-Source Software (OSS) offers effective means to test and compare the performance of various language models used in multilingual media monitoring and content production. The SELMA OSS Platform (also referred to as Use Case 0, UC0, or The Basic Testing and Configuration Interface) provides: * automatic speech recognition (ASR) from audio/video files, * punctuation and capitalization of the transcribed text, * machine translation (MT) into a target language, * text-to-speech synthesis (TTS) and voice-over generation. To provide this functionality, the demonstrator release uses these multilingual open source models: OpenAI Whisper (ASR), Meta MMS (TTS, ASR), Meta M2M-100 (MT). Thus, it facilitates easy access to such open large language models. The SELMA Platform can be used not only by developers in order to combine and test alternative language models before they are integrated into the end-user applications – it can also be used as an entry-level application by journalists and media producers themselves to transcribe their recordings, generate subtitles and voice-over, or to generate a podcast from an input text. The demonstrator of the SELMA OSS Platform does not require registration and authentication nor does it store any content, original or generated, after the session is closed by the user.
dc.publisher AiLab IMCS UL
dc.relation.isreferencedby https://selma-project.eu/2023/10/18/the-selma-open-source-platform/
dc.relation.isreferencedby https://github.com/SELMA-project/UC0-OpenSource
dc.source.uri https://selma-project.eu
dc.subject ASR
dc.subject TTS
dc.subject MT
dc.subject multilingual content production
dc.subject multilingual media monitoring
dc.subject LLM
dc.title SELMA Open Source Platform (UC0)
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType platform
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
has.files no
branding CLARIN Centre of Latvian language resources and tools
demo.uri https://selma.ailab.lv
contact.person Guntis Bārzdiņš guntis.barzdins@lumii.lv IMCS at University of Latvia
sponsor European Commission 957017 SELMA – Stream Learning for Multilingual Knowledge Transfer euFunds
files.size 0
files.count 0


Show simple item record