Slovenščina 2.0: empirical, applied and interdisciplinary research is an online linguistic journal, established to fill the gap between theoretical and interdisciplinary research on the Slovene language, especially research involving language technologies. The same gap appears when research on Slovene is compared with, connected to, or applied to other languages.
C.06 Editorial board membership
In this paper we describe the existing cooperation between Tokyo University of Technology in Japan and the Faculty of Arts, University of Ljubljana in Slovenia, highlight important results of the research and teaching cooperation, and acknowledge the people who have contributed to it.
F.01 Acquisition of new practical knowledge, information and skills
COBISS.SI-ID: 54313314
This paper presents the annotation and construction of JpTenTen, a new large-scale Japanese web corpus of ten billion words, and introduces an example of profiling vocabulary and grammatical information for Japanese using the corpus. By applying part-of-speech and inflection information, using the two unit lengths in the UniDic dictionary (short-unit and long-unit words), and drawing on various Japanese grammatical relations, it became possible to extract information on word behavior. The results are expected to be useful for Japanese language studies, contrastive linguistics, Japanese lexicography, Japanese language education, Japanese language processing, psychology and other fields.
B.03 Paper at an international scientific conference
COBISS.SI-ID: 54313314