The book presents research from the project Terminology Databanks as the Bodies of Knowledge, supported by a critical review of selected existing terminological dictionaries for Slovene. The argumentation clearly shows the advantages of a corpus approach and modern language technologies in building terminological databases, and also provides a model for further terminographical work. The weight and applicability of the research for Slovenian terminography and terminology come in particular from the fact that the methodological innovations are systematically tested and evaluated in the actual process of making a terminological database.
COBISS.SI-ID: 268600576
The aim of the research project Terminology Databanks as the Bodies of Knowledge: The Model for the Systematisation of Terminologies is to compile a dictionary of Slovenian public relations terminology with 2000 entries. The terms will be explained and translated into English, and accompanied by typical contexts and examples. From July 2013, the dictionary will be publicly available on www.termania.net.
COBISS.SI-ID: 32193373
This paper describes the process of compiling an online dictionary of terminology within the TERMIS project. The compilation began from a morphosyntactically tagged synchronic LSP corpus and involved automatic term recognition for single- and multi-word terms with the LUIZ term extraction system, and the automatic extraction of lexical information from the corpus via the Sketch Engine tool. The information obtained, along with the results of the GDEX system, was imported into the dictionary editing system of the Termania web portal. A free online terminological database of the public relations field, comprising 2000 entries, has been publicly available since July 2013.
COBISS.SI-ID: 32364125
The article describes an analysis of the results of automatic term recognition performed for single- and multi-word terms with the LUIZ term extraction system. The target application of the results is a terminology database of public relations, and the main resource is the KoRP Public Relations Corpus. Our analysis focuses on two segments: (a) single-word noun term candidates, which we compare with the frequency list of nouns from KoRP and whose termhood we evaluate on the basis of the judgements of two domain experts, and (b) multi-word term candidates with a verb or a noun as the headword. To better assess the performance of the system and the soundness of our approach, we also performed an analysis of recall. Our results show that the terminological relevance of extracted nouns is indeed higher than that of merely frequent nouns, and that verbal phrases only rarely count as proper terms. The most productive patterns of multi-word terms with a noun as the headword have the following structure: [adjective + noun], [adjective + and + adjective + noun] and [adjective + adjective + noun]. The analysis of recall shows low inter-annotator agreement, but nevertheless very satisfactory recall levels.
COBISS.SI-ID: 36636973
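The three noun-headed patterns and the recall analysis mentioned in the abstract above can be illustrated with a minimal sketch. It is not the LUIZ implementation: the coarse tags (`ADJ`, `CONJ`, `NOUN`), the function names and the English sample tokens are assumptions made for illustration only, and the tagset actually used for KoRP is not stated here.

```python
from typing import Iterable, List, Tuple

Token = Tuple[str, str]  # (word form, coarse part-of-speech tag)

# Hypothetical coarse tags standing in for the three patterns named in the
# abstract. "CONJ" is a simplification: it matches any conjunction, whereas
# the abstract's pattern specifically names "and".
PATTERNS = {
    "adjective + noun": ["ADJ", "NOUN"],
    "adjective + and + adjective + noun": ["ADJ", "CONJ", "ADJ", "NOUN"],
    "adjective + adjective + noun": ["ADJ", "ADJ", "NOUN"],
}


def match_patterns(tokens: List[Token]) -> List[Tuple[str, str]]:
    """Return (pattern name, matched phrase) for every pattern occurrence."""
    tags = [tag for _, tag in tokens]
    hits = []
    for name, pattern in PATTERNS.items():
        n = len(pattern)
        # Slide a window of the pattern's length over the tag sequence.
        for i in range(len(tags) - n + 1):
            if tags[i:i + n] == pattern:
                phrase = " ".join(word for word, _ in tokens[i:i + n])
                hits.append((name, phrase))
    return hits


def recall(extracted: Iterable[str], gold: Iterable[str]) -> float:
    """Share of gold-standard terms that also appear among the extracted ones."""
    gold_set = set(gold)
    if not gold_set:
        return 0.0
    return len(gold_set & set(extracted)) / len(gold_set)


if __name__ == "__main__":
    # Illustrative English tokens, not taken from KoRP.
    sample = [("strategic", "ADJ"), ("corporate", "ADJ"), ("communication", "NOUN")]
    for name, phrase in match_patterns(sample):
        print(f"[{name}] {phrase}")
```

A real term extractor would additionally rank the matched candidates by a termhood score; this sketch only shows the structural filtering step and how recall against an expert-annotated list could be computed.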
The paper proceeds from two starting points: a) the development of the Slovenian (and Croatian) language in science and the professions is being increasingly limited by a narrow understanding of the internationalisation of higher education and science; b) in the digital age, languages that are not sufficiently supported by language technology are losing domains of use. Using the case of Slovenian public relations terminology, we show that both problems can be addressed. Working within the TERMIS project from the previously compiled corpus of professional texts, KoRP, which is linguistically annotated and freely available online, we first extracted single- and multi-word term candidates with the LUIZ programme, and then automatically acquired typical contexts and the best dictionary examples by means of the Sketch Engine tool and its Word Sketches application, together with the GDEX tool. The infrastructure formed during the project will be freely available after the conclusion of the project (June 2013), and the dictionary, which will include 2,000 public relations terms, may also serve as a model for the creation of modern terminological dictionaries for other professions.
COBISS.SI-ID: 32194397