Претрага
22 items
-
Српски језик у дигиталном добу -- The Serbian Language in the Digital Age
Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević (2012)... of Novi Sad. e AlfaNum company has a con- siderable number of users among Serbian companies. e first corpus of contemporary Serbian, an electronic morphological dictionary of Serbian, aligned French- Serbian and English-Serbian corpora of literary texts, as well as different soware tools were developed ...
... next section, we de- scribe the role of Serbian in European information soci- ety and assess the current state of language technology for the Serbian language. 47 3 THE SERBIAN LANGUAGE IN THE EUROPEAN INFORMATION SOCIETY 3.1 GENERAL FACTS Standard Serbian is the standard national language of Serbs ...
... handle Serbian. 72 English-based systems for syntactic analysis of sentence structure are in general unsuitable for Serbian texts. e work on language processing for Serbian has been con- centrated so far on the development of resources and tools that comply with the specific features of Serbian (in ...Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević. "Српски језик у дигиталном добу -- The Serbian Language in the Digital Age" in META-NET White Paper Series, G. Rehm, H. Uszkoreit (eds.), Springer (2012)
-
WS4LR - a Worksation for Lexical Resources
... 1693 Serbian where two alphabets, Cyrillic and Latin, are used, and lexical and textual resources must exist for both. To that end the HLT group produces resources for Serbian in a special encoding that uses the ASCII character set and that can be unambiguously transformed into Serbian Latin or ...
... wordnets, aligned texts and transducers equally and has already proved very useful for various tasks. Although it has so far been used mainly for Serbian, WS4LR is not language dependent and can be successfully used for resources in other languages provided that they follow the described formats and ...
... languages, including French, English, Greek, Portuguese, Russian, Thai, Korean, Italian, Spanish, Norwegian, Arabic, German, Polish, Bulgarian, and Serbian. The Intex2 , Unitex3 and Nooj4 systems for natural language processing based on linguistic resources provide for text processing using this type ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006)
-
An Italian-Serbian Sentence Aligned Parallel Literary Corpus
This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...... resources, dictionary, Serbian language 1 Introduction The first volume of the Dictionary of the Serbo-Croatian Standard and Vernacular Language (re- ferred to as the Dictionary of Serbian Academy or DSA), prepared and compiled by the Institute for the Serbian Language of the Serbian Academy of Sciences ...
... 941Lexicography in gLobaL contexts The Dictionary of the Serbian Academy: from the Text to the Lexical Database Ranka Stanković1, Rada Stijović2, Duško Vitas1, Cvetana Krstev1, Olga Sabo2 1University of Belgrade, 2Institute for Serbian Language, Serbian Academy of Sciences and Arts E-mail: ranka.stankovic@rgf ...
... 03:32:16 The Dictionary of the Serbian Academy: from the Text to the Lexical Database Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Dictionary of the Serbian Academy: from the Text to ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
Vebran Web Services for Corpus Query Expansion
Ranka Stanković, Miloš Utvić (2020)U ovom radu se govori o razvoju veb usluga Vebran i njihovoj primeni u poboljšanju pretraživanja korpusa. Veb-servisi Vebran koriste se za konsultovanje spoljnih leksičkih izvora za srpski jezik (uglavnom elektronski morfološki rečnici i srpski Vordnet) i proširivanje korisničkih upita radi dobijanja relevantnijih rezultata iz srpskih korpusa.... Contemporary Serbian (versions SrpKor2003 and SrpKor2013)1 and its subset SrpLemKor2; – SrpEngKor3, aligned English-Serbian corpus including subcorpus SELFEH (Serbian-English Law Finance Education and Health) with documents on finance, health, law and education; – SrpFranKor4, aligned French-Serbian corpus; ...
... lexical resources for Ser- bian (mainly electronic morphological dictio- naries and Serbian Wordnet) and expand user queries to retrieve more relevant results from Serbian corpora. KEYWORDS: corpus search, web service, Serbian lexical resources, query expansion. PAPER SUBMITTED: 18 October 2019 PAPER ACCEPTED: ...
... developed for Serbian by the HLT Group at the University of Belgrade and JeRTeh Society: – System of Serbian morphological electronic dictionaries (Unitex DELA format); 7 https://unitexgramlab.org/ 102 Infotheca Vol. 19, No. 2, December 2019 Scientific paper – Semantic network WordNet for Serbian; – T ...Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...... taggers, and the annotation schema alignment between Serbian morphological dictionaries, MULTEXT-East and Universal Part-of-Speech tagset. The trained models will be used to publish the new version of the Corpus of Contemporary Serbian as well as the Serbian literary corpus. The performance of developed taggers ...
... version of TreeTag- ger for Serbian (Utvić, 2011), and discussed in Section 4. The paper ends with concluding remarks and an outline of future work in Section 5. 2. Resources The main resources used for the production of the new tag- ger model for Serbian are: (a) Serbian morphological dic- tionaries ...
... Tagging for Serbian Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian | Ranka Stanković ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
Automatic construction of a morphological dictionary of multi-word units
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multiwordn units, noun phrases, query expansion... University of Belgrade Abstract. The development of a comprehensive morphological dictio- nary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictio- nary proved to be extremely time-consuming. In this paper we present ...
... is used for other languages. Key words: electronic dictionary, Serbian, morphology, inflection, multi- word units, noun phrases, query expansion 1 Introduction We have been developing morphological electronic dictionaries of Serbian for natural language processing for many years now. Our e-dictionaries ...
... of settlements in Serbia (236), MWUs extracted from a log file of a Serbian professional journal that deals with economic issues (162), from Verne’s novel Around the world in 80 days (114), from the explanatory dictionary of Serbian, under the letter R (604). As the analysis in the first step indicated ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić. "Automatic construction of a morphological dictionary of multi-word units" in Lecture Notes in Computer Science 6233, Advances in Natural Language Processing, Proceedings of the 7thInternational Conference on NLP, IceTAL 2010, Reykjavik, Iceland, August 2010, Springer (2010): 226-237. https://doi.org/10.1007/978-3-642-14770-8_26
-
Frequency and Length of Syllables in Serbian
Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian is presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.... laterals (l, lj), a vibrant (r) and semivowels (v, j). The Serbian language uses two alphabets: Latin and Cyrillic. Serbian graphemes are presented in Table 1, first Latin ones, then, in brackets, their Cyrillic equivalents14. Every phoneme in Serbian can be presented by a grapheme or by a digraph, in ...
... Length of Syllables in Serbian Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Frequency and Length of Syllables in Serbian | Marija Radojičić, Biljana ...
... organized as follows. The syllabification algorithm is described in Section 2. Section 3 presents some properties of Serbian phonology that are relevant for syllabification, and the Serbian alphabets (both Latin and Cyrillic). Then, the language material used is introduced. In Section 4, mathematical models ...Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019)
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansion... of e-dictionaries for various languages while Unitex distribution contains large samples of e-dictionaries, including one for Serbian which covers a sample text, the Serbian translation of Voltaire’s Candide. Some multi-word compounds composed of two or more contiguous graphical words that show some ...
... scratch. Serbian is, however, like all Slavic languages a highly inflectional language and such a shortcut procedure cannot be applied. We will illustrate this with one example. The nominal MWU petokraka zvezda ‘five-pointed star’ consists of an adjective followed by a noun, which in Serbian is the ...
... of lemmas for e-dictionaries of multi-word units. Development of morphological dictionaries of MWUs is a tedious task, especially in the case of Serbian and other languages featuring complex morphological structures. After realizing that the development of such a dictionary manually is an extremely ...Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... low-resource languages such as Serbian. The main contribution of this work is the creation of the AbCoSER, the first abusive speech corpus in Serbian, that will, together with abusive speech lexicon, enable the development of automatic abusive speech detection systems for the Serbian language. In the course ...
... language column, in the majority of cases language of Serbian tweets was marked as und – unidentified – since Twitter cannot reliably recognize the Serbian language. For example, out of 150,000 tweets, only 8,000 were marked as tweets in Serbian, while 120,000 were marked as und. Therefore, we could ...
... globalex-1.1.pdf 13:16 Building Language Resources for Abusive Language Detection in Serbian 19 Ivana Krstić. Report on the use of hate speech in Serbian media, 2020. URL: https: //rm.coe.int/hf25-hate-speech-serbian-media-eng/1680a2278e. 20 K. Kumaresan and K. Vidanage. Hatesense: Tackling ambiguity ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names
In this paper we present a rule- and lexicon-based system for the recognition of Named Entities (NE) in Serbian news paper texts that was used to prepare a gold standard annotated with personal names. It was further used to prepare training sets for four different levels of annota tion, which were further used to train two Named Entity Recognition (NER) sys tems: Stanford and spaCy. All obtained models, together with a rule- and lexicon based system were evaluated on ...... Portuguese, French, Italian, Dutch and Serbian; StanfordNER module provides Named Entity annotation using STANFORD NER models (Manning et al., 2014), which are available for Serbian, English and German with dif- ferent levels of details, e.g. number of NE classes. Serbian model is developed withing presented ...
... Recognition Systems for Serbian - The Case of Personal Names Branislava Šandrih, Cvetana Krstev, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal ...
... uk/ 2OpenNLP, https://opennlp.apache.org/ 3spaCy, https://spacy.io/ 1061 2 SRPNER and the Gold Standard 2.1 Rule-Based NER for Serbian The first NER system for Serbian was a rule- and lexicon-based system developed several years ago. It has been designed to recognize the main classes of NEs: 1) numerical ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names" in Proceedings - Natural Language Processing in a Deep Learning World, Incoma Ltd., Shoumen, Bulgaria (2019). https://doi.org/10.26615/978-954-452-056-4_122
-
Witness of the history: A hundred years old the geological hammer of Jovan Zujovic
Ljupko Rundić (2021)Током обележавања тридесет година постојања и рада Српског геолошког друштва 10. фебруара 1921. године, у знак великог поштовања према академику Јовану Жујовићу, председнику и оснивачу Српског геолошког друштва и српске геолошке школе, чланови СГД поклонили су му јединствени геолошки чекић са угравираном посветом и својим потписима. Током протеклих стотину година, многе генерације геолога проналазиле су инспирацију гледајући чекић и делећи ову причу с великим пијететом. Данас, када геолози посећују Спомен собу геологије (Рударско-геолошки факултет, ул. Каменичка бр. 6), где се пажљиво ...Ljupko Rundić. "Witness of the history: A hundred years old the geological hammer of Jovan Zujovic" in Annales Geologiques de la Peninsule Balkanique, National Library of Serbia (2021). https://doi.org/10.2298/GABP210607004R
-
Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians
Nemanja Krstekanić, Liviu Matenco, Marinko Toljić, Oleg Mandić, Uroš Stojadinović, Ernst Willingshofer (2020)... Dinarides orogenic systems and in terms of strain partitioning during oroclinal bending. 2. Tectonic evolution of the Serbian Carpathians and associated sedimentary basins The Serbian segment of the Carpathians–Balkanides forms part of the larger Europe-derived Dacia tectonic mega-unit (Fig. 1b). This ...
... Global and Planetary Change 195 (2020) 103361 7 Serbian Carpathians and amphibolite to sub-greenschist facies in the South Carpathians during Variscan times (Figs. 3, 4; Iancu et al., 2005b; Kalenić et al., 1980). In the Serbian Carpathians, these rocks are overlain by a transgressive- ...
... rotations have been taken up by the Serbian Carpathians after the Cretaceous thrusting (Fig. 5). 4.2. Paleogene to middle Miocene interplay between bi-directional extension and strike-slip Although not directly apparent in the large-scale structure of the studied Serbian Carpathians, our detailed field ...Nemanja Krstekanić, Liviu Matenco, Marinko Toljić, Oleg Mandić, Uroš Stojadinović, Ernst Willingshofer. "Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians" in Global and Planetary Change, Elsevier BV (2020). https://doi.org/10.1016/j.gloplacha.2020.103361
-
Речници у дигиталном добу - информатичка подршка за српски језик
Биљана Рујевић (2022)Морфолошки речници српског језика представљају електронски језички ресурс који има значајну историју развоја и коришћења за потребе обраде природних језика. С обзиром на то да су чувани у облику датотека чији је број нарастао па је самим тим управљање речницима постало отежано јавила се потреба за смештањем информација из речника у облик лексикографске базе. Како би се омогућио симултани рад на развоју речника за више корисника јавила се потреба за веб-апликацијом заснованој на лексикографској бази. Како би се размотриле ...Биљана Рујевић. Речници у дигиталном добу - информатичка подршка за српски језик, Београд : [Б. Рујевић], 2022
-
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model
Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Sofia, Bulgaria, 9-10 September 2024, LREC | COLING (2024)
-
The lost freshwater goby fish fauna (Teleostei, Gobiidae) from the early Miocene of Klinci (Serbia)
Freshwater gobies played an important role in the Miocene paleolakes of central and southeastern Europe. Much data have been gathered from isolated otoliths, but articulated skeletons are relatively rare. Here, we review a rich assemblage of articulated gobies with abundant otoliths in situ from the late early Miocene lake deposits of Klinci in the Valjevo freshwater lake of the Valjevo-Mionica Basin of Serbia. The fauna was originally described by Anđelković in 1978, who noted many different fishes, including one ...Katarina Bradić-Milinović, Harald Ahnelt, Ljupko Rundić, Werner Schwarzhans. "The lost freshwater goby fish fauna (Teleostei, Gobiidae) from the early Miocene of Klinci (Serbia)" in Swiss Journal of Palaeontology, Springer Science and Business Media LLC (2019). https://doi.org/10.1007/s13358-019-00194-4
-
Middle Miocene otoliths of freshwater fishes from the Vracevic Lake (Serbian Lake System)
Katarina Bradić-Milinović, Ljupko Rundić, Werner Schwarzhans. "Middle Miocene otoliths of freshwater fishes from the Vracevic Lake (Serbian Lake System)" in Annales Geologiques de la Peninsule Balkanique, Rudarsko-geološki fakultet, Beograd (2021). https://doi.org/10.2298/GABP210616006B
-
SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian
Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković (2019)У овом раду представљамо модел за избор добрих примера за речник српског језика и развој иницијалних компоненти модела. Метода која се користи заснива се на детаљној анализи различитих лексичких и синтактичких карактеристика у корпусу састављених од примера из пет дигитализованих свезака речника САНУ. Почетни скуп функција био је инспирисан сличним приступом и за друге језике. Дистрибуција карактеристика примера из овог корпуса упоређује се са карактеристиком дистрибуције узорака реченица ексцерпираних из корпуса који садрже различите текстове. Анализа је показала да ...Српски, добри примери из речника, аутоматизација израде речника, издвајање својстава, Машинско учење... of the Serbian language. The approach was motivated by the need for modernization of the dictionary-making process for the dictionary of the Serbian Academy of Sciences and Arts (SASA), a large monolingual thesaurus of Serbian, as well as for the production of new dictionaries of Serbian. The SASA ...
... the GDEX ranking tool for Serbian candidate examples and a supervised machine learning model for classification on standard and non-standard Serbian sentences, for further integration into a solution for present and future dictionary production projects. Keywords: Serbian; good dictionary examples; ...
... Blueprint for the computerized dictionary of the Serbian language [Nacrt za informatizovani rečnik srpskog jezika]. Naučni sastanak slavista u Vukove dane, 44(3), pp. 105–116. (In Serbian, Cyrillic.) Vitas, D. & Krstev, C. (2012). Processing of Corpora of Serbian Using Electronic Dictionaries. Prace F ...Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković. "SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian" in Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference , Lexical Computing CZ, s.r.o. (2019)
-
FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain
U radu se daje kratak prikaz teorije semantike okvira, na kojoj je zasnovana leksička baza Frejmnet. Predstavljena je koncepcija ove mreže, kao i mogućnosti njene primene. Predstavljena je i leksička analiza koja se primenjuje u projektu izrade Frejmneta i ukazano na razlike između analize zasnovane na okviru u odnosu na analizu zasnovanu na reči. Zatim je prikazano nekoliko povezanih okvira koje prizivaju reči iz domena rizika. U radu je predstavljena i platforma NLTК pomoću koje se mogu koristiti ...... research presented above only hints at the possibility of adapting FrameNet to Serbian and aligning that network with the FrameNet data in other languages. Future research intends to align the use of Serbian WordNet and Serbian FrameNet, joining them together. While working toward this aim, we will be following ...
... also included. KEYWORDS: Serbian language, frame semantics, FrameNet, risk scenario, mining corpus, natural language processing. PAPER SUBMITTED: 15 July 2021 PAPER ACCEPTED: 6 September 2021 Aleksandra Marković aleksan- dra.markovic@isj.sanu.ac.rs Institute for Serbian Language, SASA Belgrade ...
... alongside the description of word meaning in the descriptive dictionaries of Serbian for all of the four most common frame-evoking word classes (nouns, adjectives, verbs and adverbs) (Марковић 2017, 34–41). 4. In Serbian lexicographic literature, as well as in syntax papers that explore the relationship ...Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović. "FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.1