Search
98 items
-
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the data obtained from SrpKor and local grammars assists in making a decision between several candidates in cases of ambiguity. The evaluation results reveal that, depending on the text, accuracy ranges from 95.03% to 99.36%, while the precision (average 98.93%) is always higher than the recall (average 94.94%).
... Analytics The main stages of thesaurus-based document processing include: • Tokenization and lemmatization, that is, the transfer of word forms to dictionary forms (lemmas); • Matching with the thesaurus based on the lemma representation of the document. Multiword terms from a thesaurus are matched with ...
... sciences and technology. In Proceedings of Linguistic Resources and Evaluation Conference, pages 1077–1082. Fellbaum, C., Ed. (1998). WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press. Gelfenbeyn, I., Goncharuk, A., Lehelt, V., Lipatov, A., and Shilo, V. (2003). Automatic translation ...Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
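As an illustration of the candidate-then-disambiguate idea described in the abstract above, here is a minimal Python sketch. It is not the authors' procedure: the letter mapping, the toy lexicon and its frequency counts are made-up assumptions standing in for the morphological dictionaries and corpus data, and local grammars are not modeled at all.

```python
from itertools import product

# Hypothetical mapping from degraded Latin letters to the letters they may stand for.
VARIANTS = {"c": "cčć", "s": "sš", "z": "zž"}

# Toy lexicon with invented frequencies (an assumption, standing in for
# dictionary entries combined with corpus counts).
TOY_LEXICON = {"kuća": 120, "kuca": 15, "šuma": 80, "suma": 40, "čaša": 30}


def candidates(word):
    """Return every lexicon word that the degraded form could stand for."""
    # The digraph "dj" is a common degraded spelling of "đ"; try both readings.
    forms = {word, word.replace("dj", "đ")}
    found = set()
    for form in forms:
        options = [VARIANTS.get(ch, ch) for ch in form]
        for combo in product(*options):
            candidate = "".join(combo)
            if candidate in TOY_LEXICON:
                found.add(candidate)
    return found


def restore(word):
    """Pick the most frequent candidate; leave unknown words unchanged."""
    cands = candidates(word)
    if not cands:
        return word
    return max(cands, key=lambda w: TOY_LEXICON[w])


print(restore("kuca"))  # kuća
print(restore("voda"))  # voda (not in the toy lexicon, so left as-is)
```

Picking the most frequent candidate is the simplest possible disambiguation rule; it only marks the place where corpus data and context rules would be applied.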
-
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
This paper presents a new language resource for searching and exploring verbal aspectual pairs in BCS (Bosnian, Croatian and Serbian), created using the principles of Linguistic Linked Open Data (LLOD). Since there is no resource that helps learners of Bosnian, Croatian and Serbian as foreign languages recognize the aspect of a verb or its pairs, we created a new resource that gives users information about aspect, as well as a link to the aspectual pairs of a verb. The resource also contains external links to monolingual dictionaries, WordNet and BabelNet. ...Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Advantages and challenges in presenting mathematical content using EDX platform
... suitable dictionary for mathematical terms. An additional option in Termi is the possibility of creating an export link to a term, which can be embedded in an html page. This provides for establishing a connection between a term within the course and its definition in Termi dictionary. From the ...Marija Radojičić, Ivan Obradović, Ranka Stanković, Olivera Kitanović, Roberto Linzalone. "Advantages and challenges in presenting mathematical content using EDX platform" in The Seventh International Conference on e-Learning (eLearning-2016), Belgrade : Metropolitan University (2016)
-
Novel cerium and praseodymium doped phosphate tungsten bronzes: Synthesis, characterization, the behavior in the Briggs-Rauscher reaction and photoluminescence properties
Tijana Maksimović, Pavle Tančić, Jelena Maksimović, Dimitrije Mara, Marija Ilić, Rik Van Deun, Ljubinka Joksović, Maja Pagnacco (2023)
Due to their interesting and potentially useful properties, phosphate tungsten bronzes are constantly being studied and attract a lot of attention. In the present work, two different metallic elements belonging to the group of rare-earth metals, cerium and praseodymium, were used as dopants for phosphate tungsten bronzes. Novel cerium and praseodymium doped phosphate tungsten bronzes were successfully synthesized and further characterized by thermal analyses, Fourier-transform infrared spectroscopy, X-ray powder diffraction, scanning electron microscopy with energy-dispersive X-ray spectrometer and ...
Electrical and Electronic Engineering, Atomic and Molecular Physics and Optics, Electronic, Optical and Magnetic Materials, Inorganic Chemistry, Organic Chemistry, Physical and Theoretical Chemistry, Spectroscopy
Tijana Maksimović, Pavle Tančić, Jelena Maksimović, Dimitrije Mara, Marija Ilić, Rik Van Deun, Ljubinka Joksović, Maja Pagnacco. "Novel cerium and praseodymium doped phosphate tungsten bronzes: Synthesis, characterization, the behavior in the Briggs-Rauscher reaction and photoluminescence properties" in Optical Materials, Elsevier BV (2023). https://doi.org/10.1016/j.optmat.2023.114125
-
Frequency and Length of Syllables in Serbian
Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)
Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian are presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.
... more complicated consonant clusters. The second approach was chosen e.g. by Best (2011, 2013), who refers to a prestigious German pronunciation dictionary (which suggests also syllabification rules). The approach according to which only those syllable onsets exist that are observable word-initially ...
... the best known theoretical framework. A more detailed description can be found in Pulgram (1970). However, this approach requires a comprehensive dictionary that contains practically all words used in a language. Lehfeldt (1971) presented a modification, distinguishing between marginal (rarely occurring ...
... Cyrillic alphabet. In: Daniels, P.T., Bright, W. (eds.), The World’s Writing Systems: 700-726. Oxford: Oxford University Press. Crystal, D. (2008). A Dictionary of Linguistics and Phonetics. Oxford: Blackwell. Eddington, D., Treiman, R., Elzinga, D. (2013a). Syllabification of American English: Evidence ...Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019)
-
Development of A Business Intelligence Tool For Accident Analysis in Mines
... with numerical data is olap_povreda, whereas table dimensions are organizational structure (organizaciona struktura - OJ) and the RudOnto data dictionary (s_koncept) which can be related to the main table on basis of different fields, depending on the dimension we want to follow. The BI model for ...
... Figure 3 depicts three ways of presenting data on the number of injuries categorized by body parts. Due to the hierarchical structure of the RudOnto dictionary the analysis can be performed on several levels of detail. Figure 2 The BI development environment Thus we can observe injuries ...Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Uroš Pantelić. "Development of A Business Intelligence Tool For Accident Analysis in Mines" in Proceedings of the 5th International Symposium Mining And Environmental Protection, June 10-13, 2015, Vrdnik, Serbia, Belgrade : Faculty of Mining and Geology (2015)
-
Keyword Extraction from Parallel Abstracts of Scientific Publications
... Belgrade Human Language Technology Group [17]. Morphological electronic dictionaries of Serbian for NLP have been developing for many years now. In the dictionary of lemmas (DELAS) each lemma is described in full detail so that the dictionary of forms containing all the necessary grammatical information ...
... is the process of grouping together the inflected forms of a word so they can be analyzed as a single item, identified by the word’s lemma, or a dictionary form. Stemming usually refers to a process that chops off the ends of words in the hope of achieving this goal correctly most of the time. Serbian ...Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić. "Keyword Extraction from Parallel Abstracts of Scientific Publications" in Semantic Keyword-Based Search on Structured Data Sources - Third International KEYSTONE Conference, IKC 2017 Gdańsk, Poland, September 11–12, 2017 Revised Selected Papers and COST Action IC1302 Reports, Springer (2017)
-
A bilingual digital library for academic and entrepreneurial knowledge management
A generic knowledge management process of organization, storage and retrieval of knowledge can suitably be fitted in a digital library. In the digital and knowledge age digital libraries can be used in knowledge management to handle intellectual assets and support knowledge creation. A multilingual digital library either stores content in more than one language or provides multilingual query access to monolingual content. In Serbia 18 of 308 scientific journals regularly published are bi-lingual, with papers simultaneously being in English ...
... purpose semantic networks (English and Serbian WordNets) and domain terminological bases and ontologies. A Dictionary of Library and Information Sciences is a terminology dictionary that covers the library and information sciences and related disciplines in Serbian, English and German developed ...
... language processing, more specifically to language resources development and their application. She has developed the Serbian morphological e-dictionary, and was one of the key contributors to the development of Serbian wordnet, aligned multilingual corpus, and many other language resources. She ...
... same – 123, which means that these Serbian terms have no other English equivalent in our DL other than distance learning. Finally, we also add electronic learning to our query, but we obtain the same result. As a result of our research, we can conclude that the English term “e-learning” has two ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Dalibor Vorkapić. "A bilingual digital library for academic and entrepreneurial knowledge management" in Proceeding of 10th International Forum on Knowledge Asset Dynamics — IFKAD 2015: Culture, Innovation and Entrepreneurship: connecting the knowledge dots, Bari, Italy, 10-12 June 2015, Bari : IFKAD (2015)
-
A Tel Platform Blending Academic And Entrepreneurial Knowledge
... system An example of terminological resources that BAEKTEL features is GeolISS term, a thesaurus of geological terms with more than 3000 dictionary entries in Serbian and their English equivalents, developed within the GeolISS project and available for search on the web (http://geoliss ...
... eLearning-2012 (pp. 114-119). [8] Krstev, C., Stanković, R., Obradović, I., Vitas, D., & Utvić, M. (2010). Automatic construction of a morphological dictionary of multi-word units. In Advances in Natural Language Processing (pp. 226-237). Springer Berlin Heidelberg. [9] Daconta, M. C., Obrst, L. J ...
... called "big" languages. In order to offer support in expert terminology within the multilingual approach, the BAEKTEL platform provides electronic terminological resources, parallel (multilingual) corpora of lessons and texts in written form, and functionalities for searching and browsing ...Ivan Obradović, Ranka Stanković, Jelena Prodanović, Olivera Kitanović. "A Tel Platform Blending Academic And Entrepreneurial Knowledge" in Proceedings of the Fourth International Conference on e-Learning (eLearning-2013), September 2013, Belgrade, Serbia, Belgrade, Serbia : Belgrade Metropolitan University (2013)
-
Size–strain line-broadening analysis of anatase/brookite (TiO2)-based nanocomposites with carbon (C): XRPD and Raman spectroscopic analysis
Aleksandar Kremenović, Mirjana Grujić-Brojčin, Nataša Tomić, Vladimir Lazović, Danica Bajuk-Bogdanović, Jugoslav Krstić, Maja Šćepanović (2022)
A size-strain line-broadening analysis of the XRPD patterns and Raman spectra for two anatase/brookite (TiO2)-based nanocomposites with carbon (C) was carried out and the results compared with those of a similar sample free of carbon. The crystal structures and microstructures of anatase and brookite, as well as their relative abundance ratio, have been refined from XRPD data by the Rietveld method (the low amount of carbon is neglected). The XRPD size-strain analysis resulted in reliable structure and microstructure results ...
Materials Chemistry, Metals and Alloys, Atomic and Molecular Physics, Optics, Electronic, Optical and Magnetic Materials
Aleksandar Kremenović, Mirjana Grujić-Brojčin, Nataša Tomić, Vladimir Lazović, Danica Bajuk-Bogdanović, Jugoslav Krstić, Maja Šćepanović. "Size–strain line-broadening analysis of anatase/brookite (TiO2)-based nanocomposites with carbon (C): XRPD and Raman spectroscopic analysis" in Acta Crystallographica Section B Structural Science, Crystal Engineering and Materials, International Union of Crystallography (IUCr) (2022). https://doi.org/10.1107/S2052520622001731
-
Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface
Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos (2024)
We present ongoing activities on defining a lexicon-corpus interface that will serve as a reference for representing multiword expressions (of various types: nominal, verbal, etc.) in dedicated lexica and for linking these entries to their occurrences in corpora. The final aim is to use such resources for the automatic identification of multiword expressions in text. The involvement of several natural languages is intended to ensure the universality of a solution that is not centered on a particular language, as well as accommodation of idiosyncrasies. Challenges in the lexicographic description of multiword ...Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos. "Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
-
Using Metadata For Content Indexing Within An OER Network
Ranka Stanković, Olivera Kitanović, Ivan Obradović, Roberto Linzalone, Giovanni Schiuma, Daniela Carlucci (2014)
... application. In the terminological dictionary, a definition is given for each term, with its synonyms and translation in English, Russian and other languages [16]. If the resource is HTML based, an additional possibility is to link key terms in the text with dictionary entries via web services, thus ...
... address). Format is the layout of the resource in terms of how the information contained in the resource is organized. It indicates whether it is an electronic document, paper only document, slide(s), website, cd-rom/dvd, audio, or video. Educational data, taken from the LOM standard, suggest the auditorium ...Ranka Stanković, Olivera Kitanović, Ivan Obradović, Roberto Linzalone, Giovanni Schiuma, Daniela Carlucci. "Using Metadata For Content Indexing Within An OER Network" in Proceedings of the Fifth International Conference on e-Learning, eLearning 2014, September 2014, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2014)
-
Development of the Cost-Based Model for Monitoring the Lifetime of the Earth Moving Machines
This paper presents the model for the identification of reference points on the lifetime curve of engineering systems. This curve commonly represents the increase and decline of failure in relation to time. A direct correlation between failure rate and costs is assumed in this paper, therefore, statistical and empirical analysis of costs provided reference points. This approach is used for positioning stages of the engineering system’s lifetime with a minimal number of failures and costs, regardless of whether these ...
Electrical and Electronic Engineering, Industrial and Manufacturing Engineering, Control and Optimization, Mechanical Engineering, Computer Science (miscellaneous), Control and Systems Engineering
Ugljesa Bugarić, Milos Tanasijević, Stevan Djenadić, Dragan Ignjatović, Ivan Janković. "Development of the Cost-Based Model for Monitoring the Lifetime of the Earth Moving Machines" in Machines, MDPI AG (2022). https://doi.org/10.3390/machines10110995
-
A WebGIS Decision Support System for Management of Abandoned Mines
Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis (2016)
... preferences data, and editors, who have the option of editing data related to mine sites and companies, but not data related to user accounts and data dictionary. 2.2. Web Application for Non-Spatial (Attribute) Data For the non-spatial data, authorized users can use the protected part of the application ...
... max_500 m is maximal population at a 500 m distance from a site in the dataset. For the qualitative variables the score is mapped using a metadata dictionary, like for example for groundwater vulnerability: very high 100, high 80, medium high 60, medium 40, low 20, very low 0. The complete list of ...Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis. "A WebGIS Decision Support System for Management of Abandoned Mines" in Energies 9 no. 7 (2016): 567. https://doi.org/10.3390/en9070567
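The qualitative-to-score mapping quoted in the snippet above can be sketched as a plain lookup table. This is only an illustrative Python sketch: the six label/score pairs come from the snippet, while the names `GROUNDWATER_VULNERABILITY_SCORE` and `qualitative_score` are hypothetical and do not come from the paper or its system.

```python
# Scores for groundwater vulnerability as listed in the snippet above.
# The dictionary name and the function below are hypothetical illustrations.
GROUNDWATER_VULNERABILITY_SCORE = {
    "very high": 100,
    "high": 80,
    "medium high": 60,
    "medium": 40,
    "low": 20,
    "very low": 0,
}


def qualitative_score(label, table=GROUNDWATER_VULNERABILITY_SCORE):
    """Map a qualitative label to its numeric score, ignoring case and outer spaces."""
    key = label.strip().lower()
    if key not in table:
        raise KeyError(f"unknown qualitative label: {label!r}")
    return table[key]


print(qualitative_score("Medium high"))  # 60
```

Raising an error on an unknown label, rather than returning a default score, keeps a misspelled category from silently skewing a site's ranking.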
-
Proširivanje upita zasnovano na leksičkim resursima -- Query Expansion Based on Lexical Resources
The paper describes how lexical resources for Serbian and software tools developed within the Language Technology Group of the University of Belgrade can be used to improve query formulation. Search results can be significantly improved by using various lexical resources, such as morphological dictionaries and semantic networks. The presented approach can also be applied in the System of Scientific, Technological and Business Information, because efficient searching of this valuable resource, given its heterogeneity and size as well as its predominantly textual content, ...
... Information Science and Technology, Bucureşti, Publishing house of the Romanian academy. [3] Vitas D., Krstev C. (2007), „Extending Serbian E dictionary by the Use of the Lexical Transducers“, Proc. of the 6th and 7th INTEX/Nooj Workshop, S. Koeva, D. Maurel, M. Silberztein (eds.), Formaliser ...
... langues avec l'ordinateur: De INTEX à NooJ, Presses Universitaires de Franche Compte, Paris, 2007. [4] Fellbaum C. (ed.) (1998) WordNet: An Electronic Lexical Database, The MIT Press. [5] Maurel D., Vitas D., Krstev S., Koeva S., (2007) „Prolex: a lexical model for translation of proper names ...Ranka Stanković, Ivan Obradović, Cvetana Krstev. "Proširivanje upita zasnovano na leksičkim resursima" in SNTPI 09 - Naučno-stručni skup Sistem naučnih, tehnoloških i poslovnih informacija, Beograd 19. i 20. jun 2009, Beograd : Fakultet informacionih tehnologija (2009)
-
Српски језик у дигиталном добу -- The Serbian Language in the Digital Age
Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević (2012)
... resources: theory, description and applications”. Within the scope of these projects, the development of the electronic dictionary of simple words was finalised, the development of a dictionary of compounds was initiated. Aligned French-Serbian and English-Serbian corpora of literary texts were devel- ...
... Novi Sad. The AlfaNum company has a considerable number of users among Serbian companies. The first corpus of contemporary Serbian, an electronic morphological dictionary of Serbian, aligned French-Serbian and English-Serbian corpora of literary texts, as well as different software tools were developed ...
... currently having more than 300 registered users from different Serbian and foreign universities and institutes. Development of an electronic morphological dictionary of Serbian following the so-called LADL format was also initiated within the scope of this project [48]. The project was later continued ...Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević. "Српски језик у дигиталном добу -- The Serbian Language in the Digital Age" in META-NET White Paper Series, G. Rehm, H. Uszkoreit (eds.), Springer (2012)
-
Development of terminological resources for expert knowledge: a case study in mining
Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Aleksandar Cvjetić. "Development of terminological resources for expert knowledge: a case study in mining" in Knowledge Management Research & Practice, Palgrave Macmillan (2015). https://doi.org/10.1057/kmrp.2015.10
-
Developing Termbases for Expert Terminology under the TBX Standard
... Fig. 3: Available translations of a selected term and details in Russian 2.3. Morphological E-Dictionary of Mining and Geology The main shortcoming of GeolISSTerm and RudOnto is their lack of lexical information, such as part of speech, grammatical ...
... still lacks domain specific terms for some areas, such as mining and geology, which motivated the language technology team at FMG to build such a dictionary for these areas. For its production and management we used an integrated and easily adjustable tool for language resources, LeXimir, also developed ...
... each particular resource in realizing a task, in this case production of morphological dictionaries [13]. Some examples from the simple word DELAS dictionary of terms related to mining and geology, represented by their lemmas, transducers for their respective inflection classes and semantic markers are: ...Ranka Stanković, Ivan Obradović, and Miloš Utvić. "Developing Termbases for Expert Terminology under the TBX Standard" in Natural Language Processing for Serbian - Resources and Applications, Belgrade : University of Belgrade, Faculty of Mathematics (2014)
-
WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina
Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis (2015)
... data, while editors have the option of editing data related to pits and commercial organizations, but not data related to user accounts and data dictionary. 3.3 Web application for textual (attribute) data Authorized users can use the protected part of the application on the following pages: ...Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis. "WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina" in Proceedings of the 5th International Symposium Mining And Environmental Protection, June 10-13, 2015, Vrdnik, Serbia, Belgrade : Faculty of Mining and Geology (2015)
-
Integrisanje heterogenih leksičkih resursa -- Integration of Heterogeneous Lexical Resources
The main activity of the Natural Language Processing Group at the Faculty of Mathematics, University of Belgrade, is directed at developing various resources for processing Serbian. Particularly important among them are the system of morphological dictionaries of Serbian developed within the RELEX network [1] and the semantic network (of the wordnet type) for Serbian developed within the international project Balkanet. These are two heterogeneous lexical resources, developed on the basis of entirely different models, which therefore contain different kinds of lexical information. By integrating these resources, information ...
... environment in order to search all inflected forms of the words in a synset. The requirements described in Section 2 are implemented through the tab page “Update with Intex dictionary”, which enables morphosyntactic information from the Serbian morphological dictionaries (MR) to be included in the working synset. The element, which is part of the element ... Ranka Stanković, Cvetana Krstev, Duško Vitas, Ivan Obradović, Gordana Pavlović-Lažetić. "Integrisanje heterogenih leksičkih resursa" in Festivalski katalog 11. Festivala informatičkih dostignuća INFOFEST 2004, 26th September - 2nd October, 2004, Budva, Montenegro, INFOFEST (2004)