Претрага
2182 items
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...... which covers the field of Library and Information Sciences, is one of the few journals that publish all articles both in Serbian and in English, and was thus an almost perfect resource for testing our tool. The interest in collections of aligned texts and tools tailored for their search is ...
... sentences or segments) in one of the TMX document languages, where the text in the first TUV is usually in the source language, and the texts in the remaining TUVs are in one or more target languages. Although the order of languages is the same in each TU, there is a TUV attribute xml:lang that ...
... full-text and metadata search tool, to a useful translator’s aid, which could be of assistance both in reviewing terminology used in context and in refining the multilingual resources used within the system. Keywords: multilingual digital libraries, query expansion, TMX 1. Motivation In this ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++
Branislava Šandrih, Ranka Stanković (2020)U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...... thoroughly explained in Section 4. Results and a discussion are given in Section 5. A Web application that implements the proposed technique is presented in Section 6. Finally, conclusions and directions for future work are given in Section 7. 2 Related Work Over the past years, in order to compile bilingual ...
... a word “reči” is a noun, has feminine gender, is in plural and is in nominative case. A lemma for any noun is singular, and is nominative case, namely “reč” for this case. The words “rečnika” and “rečniku” are also both nouns, but in genitive and da- tive case, respectively. After single-word le ...
... extraction and alignment that differ in method- ology, resources used, languages involved and purpose for which they were built. Bilingual lexica were compiled for different language pairs: En- glish/French (Bouamor et al., 2012; Hamon and Grabar, 2016; Hazem and Morin, 2016; Hakami and Bollegala, 2017; ...Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
-
Asbestos-Based Pottery from Corsica: The First Fiber-Reinforced Ceramic Matrix Composite
Asbestos-containing pottery shards collected in the northeast of Corsica (Cap Corse) and dating from the 19th century, or earlier, have been analyzed by SEM-EDS, XRPD, FTIR and Raman microspectroscopy. Blue (crocidolite) and white (chrysotile) asbestos fiber bundles are observed in cross-sections. Most of the asbestos is partly or totally dehydroxylated, and some transformation to forsterite is observed to occur, indicative of a firing above 800 C. Examination of freshly fractured pieces shows a nonbrittle fracture with fiber pull-out, consistent with ...... orthopyroxene in sample c. Actinolite and quartz are also found in all samples. Antigorite and chlorite-serpentine are found in samples a, b and d, while albite is found only in samples a and b. Crocidolite is found in samples b and d, while tremolite is found in samples b and c. Diopside and orthopyroxene ...
... actinolite and/or tremolite (actinolite and tremolite are difficult to distinguish in complex XRPD patterns like this one); and sample d is primarily composed of actinolite and/or crocidolite (actinolite and crocidolite are hard to distinguish in complex XRPD patterns like this one), antigorite and ch ...
... Romans for cremation and that asbestos textiles were made. Hulthćn [9] reported that artefacts very rich in asbestos (50% to 90% volume) have been made in Finland and North Scandinavia from ca. 3900 BC to 200 AD and 1500 AD, respectively, for metal production (crucibles and molds). It is also reported ...Philippe Colomban, Aleksandar Kremenović. "Asbestos-Based Pottery from Corsica: The First Fiber-Reinforced Ceramic Matrix Composite" in Materials, MDPI AG (2020). https://doi.org/10.3390/ma13163597
-
Spatial assessment of the areas sensitive to degradation in the rural area of the municipality Čukarica
Nature and Landscape Conservation, Soil Science, Agronomy and Crop Science, Water Science and TechnologyNatalija Momirović, Ratko Kadović, Veljko Perović, Miloš Marjanović, Aleksandar Baumgertel. "Spatial assessment of the areas sensitive to degradation in the rural area of the municipality Čukarica" in International Soil and Water Conservation Research, Elsevier BV (2019). https://doi.org/10.1016/j.iswcr.2018.12.004
-
Dehydroicetexanes in sediments and crude oils: Possible markers for Cupressoideae
Hans Peter Nytoft, Geir Kildahl-Andersen, Sofie Lindström, Frode Rise, Achim Bechtel, Danica Mitrović, Nataša Đoković, Dragana Životić, Ksenija A. Stojanović (2019)Two previously unidentified dehydroabietane isomers were isolated from Miocene Serbian lignite and Rhaetian (Late Triassic) coaly mudstones from South Sweden and characterized using NMRspectroscopy as cis- and trans-dehydroicetexane. Both have a 9(10?20)-abeo-abietane or icetexane skeleton, consisting of a 6-7-6 tricyclic framework with seven carbons in ring B instead of the usual six in common diterpanes of the abietane-type. Dehydroicetexanes can be detected using GC-MS-MS in m/z 270?146 chromatograms without interference from dehydroabietane or other isomers. Dehydroicetexanes are often abundant ...Hans Peter Nytoft, Geir Kildahl-Andersen, Sofie Lindström, Frode Rise, Achim Bechtel, Danica Mitrović, Nataša Đoković, Dragana Životić, Ksenija A. Stojanović. "Dehydroicetexanes in sediments and crude oils: Possible markers for Cupressoideae" in Organic Geochemistry, Elsevier BV (2019). https://doi.org/10.1016/j.orggeochem.2019.01.001
-
INVENTS: a hybrid system for subsurface ventilation analysis
Ventilation system analysis is a complex process based on the calculation and analysis of numerous parameters. These problems can be successfully solved by the SimVent numerical package, but a full understanding and use of the obtained results require the involvement of an experienced specialist in the ventilation field. The solution was found in the creation of a hybrid system INVENTS, whose knowledge base represents a formalization of the expert knowledge in the mine ventilation field. In this paper we ...... understanding and use of the obtained results require the involvement of an experienced specialist in the ventilation field. The solution was found in the creation of a hybrid system INVENTS, whose knowledge base represents a formalization of the expert knowledge in the mine ventilation field. In this paper ...
... introduction to the mine design process. In the planning process key relations are defined that have to be taken into account in the mine design phase. The initial step in the mine ventilation planing and design process is the establishment of a basic or initial network and an appropriate database related ...
... maintenance, in order to secure the highest possible level of system use and effectiveness. The final phase in the outlined design process is state evaluation and modification. All parameters of mine ventilation obtained by monitoring have to be compared with designed parameters and when differences ...Nikola Lilić, Ranka Stanković, Ivan Obradović. "INVENTS: a hybrid system for subsurface ventilation analysis" in Proc. of International Scientific Conference of FME, September 2000, Ostrava, FME (2000)
-
Laboratory Testing of Nanosilica- Reinforced Silicate and Polyacrylamide Gels
Irina Zahirovic, Dušan Danilović, Milica Šuput Vranjin, Miloš Tripković. "Laboratory Testing of Nanosilica- Reinforced Silicate and Polyacrylamide Gels" in SPE Journal (2023)
-
Definition of Circulation Conditions and Groundwater Genesis of the Complex Krupaja Hydrogeological Karst System (Eastern Serbia)
Management, Monitoring, Policy and Law,Renewable Energy, Sustainability and the Environment, Geography, Planning and Development, Building and Construction... the karst aquifer in which rapid circulation and water exchange takes place, the part of the aquifer that depends strongly on recharge. The results of this research have been processed and published several times in scientific journals and chapters of monographs [15-18], and this paper will only ...
... channels and indicated that they may function as part of an aquifer that is completely dependent on recharge and whose circulation takes place through large and well-developed karst channels. This model of karst channels, its spatial position and dependence in relation to recharge and discharge ...
... uniform water temperatures and other physico-chemical parameters observed in the Krupaja Spring, groundwater in all seasons. The tritium content in the borehole water (Table 4), as well as in the thermal spring groundwater, is significantly lower, amounting to 3.9 TJ and only 1.7 TJ. These contents ...Ljiljana Vasić, Saša Milanović, Tina Dašić. "Definition of Circulation Conditions and Groundwater Genesis of the Complex Krupaja Hydrogeological Karst System (Eastern Serbia)" in Sustainability, MDPI AG (2023). https://doi.org/10.3390/su151411146
-
Groundwater resources for drinking water supply in Serbia´s Southeast Pannonian basin
Dušan Polomčić, Bojan Hajdin, Marina Ćuk, Petar Papić, Zoran Stevanović. "Groundwater resources for drinking water supply in Serbia´s Southeast Pannonian basin" in Carpathian Journal of Earth and Environmental Sciences (2014)
-
Results of Recent Monitoring Activities on Landslide Umka, Belgrade, Serbia—IPL 181
Biljana Abolmasov, Uroš Đurić, Jovan Popović, Marko Pejić, Mileva Samardžić Petrović, Nenad Brodić (2021)... even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate ...
... systems and its implementation in Structure from Motion (SfM) photogrammetry is found very practical for landslide surface modeling and monitoring. Many advantages of UAV-based remote sensing for landslide characterization and monitoring were disccusesed in Colomina and Molina (2014), Balek and Blahut ...
... renovac Highway (E-763), and for the Umka urban plans and regulations. A summary of the geotechnical investigations results until 1995 can be found in Ćorić et al. (1996), while the summary of investigations and monitoring results until 2005 can be found in Mitrović and Jelisavac (2006). Geometry ...Biljana Abolmasov, Uroš Đurić, Jovan Popović, Marko Pejić, Mileva Samardžić Petrović, Nenad Brodić. "Results of Recent Monitoring Activities on Landslide Umka, Belgrade, Serbia—IPL 181" in Understanding and Reducing Landslide Disaster Risk. WLF 2020. ICL Contribution to Landslide Disaster Risk Reduction, Springer, Cham (2021). https://doi.org/10.1007/978-3-030-60196-6_14
-
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis
U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...... Resources and Basic Tools ”, in Workshop on Balkan Language Resources and Tools, 21 Novembar 2003, Thessaloniki, Greece, eds, S. Piperidis and V. Karkaletsis, pp. 97- 104, 2003. 5. Miljana Mladenović, Jelena Mitrović, Cvetana Krstev, “Developing and Maintaining a WordNet: Procedures and Tools”, In The ...
... types classification and syntax and semantic analysis of texts written in a natural language. Various texts are subject of the study: Acts of Parliament (or other law-making body), private wills, court judgements and summonses and the statutes of the bodies such as States and government departments ...
... sources in the network and aims to: simplicity in the creation and maintenance so that each user could make a set of descriptive statements understandable semantics to facilitate searches across the global network to all who are in need of information; localization: originally implemented in English ...Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
-
Groundwater management by riverbank filtration and an infiltration channel, the case of Obrenovac, Serbia
Dušan Polomčić, Bojan Hajdin, Zoran Stevanović, Dragoljub Bajić, Katarina Hajdin. "Groundwater management by riverbank filtration and an infiltration channel, the case of Obrenovac, Serbia" in Hydrogeology Journal, Berlin, Heidelberg : Springer, International Association of Hydrogeologists (2013). https://doi.org/10.1007/s10040-013-1025-9
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...... including simple- and multi-word units (MWUs), proper names, general- and domain-oriented lexica. Its basic tagset is similar to the one used by the Serbian TreeTag- ger models built in 2011 (TT11) and 2019 (TT19) and it generally corresponds to the traditional notion of Part- of-Speech in Serbian. These ...
... tagset that includes grammatical gender and adjective comparative degree. The texts used in this research are shown in Table 2. The text 1984, Serbian translation of Orwell’s novel, was anno- tated according to the MULTEXT-East specification and in- cluded in MULTEXT-East resources (version 3) (Krstev ...
... devetnaesti and the noun vek. • Same word tokens were assigned different lemmas in different texts. The reason for this was that texts were tagged with SMD, which evolved in time and many entries were enhanced and corrected. For instance, numerous lemmas of adjectives were represented in SMD using ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
Strain partitioning in a large intracontinental strike-slip system accommodating backarc-convex orocline formation: The Circum-Moesian Fault System of the Carpatho-Balkanides
Nemanja Krstekanić, Liviu Matenco, Uroš Stojadinović, Ernst Willingshofer, Marinko Toljić, Daan Tamminga (2022)oroklini, raspodela deformacija, transkurentna kretanja, Karpato-balkanidi, Cirkum-mezijski rasedni sistem... Global and Planetary Change 208 (2022) 103714 3 Carpatho-Balkanides orocline of south-eastern Europe. This double 180◦ curved orogenic system is comprised of a foreland-convex segment in the north and east, and a backarc-convex segment in the south and west (Fig. 1a). In our definition, ...
... its offset to thrusting southwards in the Balkanides (e.g., Schmid et al., 2020, and references therein) and to transtension and extension in the frontal part of the South Carpathians (e.g., Rabăgia and Maţenco, 1999). Further away from the Moesian Platform, in the hinge of the backarc-convex orocline ...
... of faults and fault zones, foliations within fault gouge and cataclasite, faulting-related cleavage and observations of tilting and rotations. Slickenside kinematic indicators (including calcite slickenfibres, grooves and other brittle indicators), Riedel shears and brittle shear bands in fault gouge ...Nemanja Krstekanić, Liviu Matenco, Uroš Stojadinović, Ernst Willingshofer, Marinko Toljić, Daan Tamminga. "Strain partitioning in a large intracontinental strike-slip system accommodating backarc-convex orocline formation: The Circum-Moesian Fault System of the Carpatho-Balkanides" in Global and Planetary Change, Elsevier BV (2022). https://doi.org/10.1016/j.gloplacha.2021.103714
-
Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer
Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić (2011)... management and monitoring at all levels. 4.9.2 Study Area The aquifer system under study is located between the Danube and Tisa (Tisza) rivers and extends to the vicinity of lGskunfelegyhaza on the Hungarian side (north) and to Vrbas in Serbia (souih). The main groundwater consumers are the cities and industries ...
... recreation and medical purposes in several spas in both countries, while geothermal energy is more efficiently used in Hungary. The number of wells that tap deeper aquifer layers with thermal waters is over 100 on the Hungarian side, while in Serbia there are some 15-20 such wells. In the Hungarian ...
... industries of Szeged, I(skunhalas, Baja, Tompa, H6dmez5vdsdrhely (csongr6d and Bdcs-Ifiskun counties in Hungary) and subotica, sombor, Baika Topola, vrbas, I(ula (in total 16 municipalities in Serbia, vojvodina province, Baika region). The study area is populated by over 800 000 inhabitants, about 407o ...Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić. "Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer" in Transboundary Water Resources Management - A Multidisciplinary Approach, Weinheim, Germany : Wiley-VCH (2011): 143-149
-
Effects of Energy Production and Consumption on Air Pollution in Serbia
Marija Živković (2019)Energy production and combustion, mostly from unregulated or inefficient fuel combustion, are the single most important anthropogenic sources of air pollutant emissions. Energy sector in Serbia is highly fossil fuel intensive: 87.88% of energy consumed in Serbia is related to fossil fuels, while almost 95% of energy sources are combusted. In 2017. energy related carbon dioxide emission in Serbia is was 47.95 million tones. The main source of carbon dioxide emission was coal, responsible for almost 70% of energy ...... consumption in terms of carbon dioxide emission and firewood and oil derivates in terms of carbon monoxide and nitrogen oxides. Actions should sublimate both types of actions: avoid and reduce Effect or some measures that could be option for mitigating air pollution in Serbia are explored in 11.Emissions ...
... carbon dioxide emission in the country. Currently the transport sector is the main source of pollution in urban areas in Serbia, while the main sources of air pollution are: biomass (firewood) for carbon monoxide and NOx from the buildings sector and oil derivates for carbon monoxide and NOx from the transport ...
... to economic and social development of each community and improved quality of life of citizens. Energy demand in the world has a constant growth, which is driven by increased number of inhabitants and growth of life standard 1. Much of the world's energy is currently produced and consumed in ways that could ...Marija Živković. "Effects of Energy Production and Consumption on Air Pollution in Serbia" in Mining and Environmental Protection-MEP 2019, University of Belgrade, Faculty of Mining and Geology (2019)
-
Study of the Synergetic Effect of Co-Pyrolysis of Lignite and High-Density Polyethylene Aiming to Improve Utilization of Low-Rank Coal
Ivan Kojić, Achim Bechtel, Nikoleta Aleksić, Dragana Životić, Snežana Trifunović, Gordana Gajica, Ksenija Stojanović (2021)... δ13C values of mid- and long-chain homologues are more negative than in both lignite pyrolysate and HDPE pyrolysate. In lignite/HDPE pyrolysate at 500 ◦C, this Polymers 2021, 13, 759 18 of 25 effect of enrichment of normal hydrocarbons in 12C isotope (in comparison with lignite and HDPE pyrolysate) ...
... individual n-alkanes and n-alkenes, being enriched in 12C, in lignite/HDPE liquid co-pyrolysates in relation to both lignite and HDPE liquid pyrolysates unambiguously confirmed the synergetic effect at 450 ◦C and particularly at 500 ◦C, which promotes degradation of both HDPE and kerogen, associated with ...
... amount of low-rank coal in addition to thermal power plants is used in households (for cooking and heating), particularly in Serbia, India, and China [13,14]. Numerous studies were performed on lignite pyrolysis as a method for its improved utilization [15–17]. A recent strategy in bioplastics development ...Ivan Kojić, Achim Bechtel, Nikoleta Aleksić, Dragana Životić, Snežana Trifunović, Gordana Gajica, Ksenija Stojanović. "Study of the Synergetic Effect of Co-Pyrolysis of Lignite and High-Density Polyethylene Aiming to Improve Utilization of Low-Rank Coal" in Polymers, MDPI AG (2021). https://doi.org/10.3390/polym13050759
-
Electronic Dictionaries - from File System to lemon Based Lexical Database
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...... matching and replacement, and the newly constructed lemmas had to (a) exist in dictionaries; and (b) have an inverse marker. For instance, verbs afirmisati and afirmirati are two vari- ants (the first one being preferred today in Serbian) of the same verb ‘to establish’. Similarly, hleb and leb are ...
... for suffix variations in Figure 4 and for affix varia- tions in Figure 5. Similar procedures are produced to connect some deriva- tionally related entries (e.g. verbs and verbal nouns and adjectives) and to produce explicit inverse relations from originally implicit ones (in DELAS format). Figure ...
... 2006), subse- quently upgraded and renamed LeXimir (Stanković, Ranka and Krstev, Cvetana, 2016), was designed and implemented for the purpose of further development and management of morphological electronic dictionaries of Serbian (SMD), presented in more details in Section 3.. However, with the ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
Determination of the groundwater-leakage mechanism (binary mixing) in a karstic dam site using thermometry and isotope approach (HPP Visegrad, Bosnia, and Herzegovina)
Earth-Surface Processes,Geology, Pollution, Soil Science, Water Science and Technology, Environmental Chemistry, Global and Planetary ChangeLjiljana Vasić, Saša Milanović, Anita Puskás-Preszner, Laszlo Palcsu. "Determination of the groundwater-leakage mechanism (binary mixing) in a karstic dam site using thermometry and isotope approach (HPP Visegrad, Bosnia, and Herzegovina)" in Environmental Earth Sciences, Springer Science and Business Media LLC (2020). https://doi.org/10.1007/s12665-020-08910-x
-
Indexing of textual databases based on lexical resources: A case study for Serbian
In this paper we describe an approach to improvement of information retrieval results for large textual databases by pre-indexing documents using bag-of-words and Named Entity Recognition. The approach was applied on a database of geological projects financed by the Republic of Serbia in the last half century. Each document within this database is described by metadata, consisting of several fields such as title, domain, keywords, abstract, geographical location and the like. A bag of words was produced from these ...... geologic data, and provide a modern and effective in- formation basis for carrying out all activities related to planning, design and decision-making in the field of geology. Within the project a web portal3 was established that allows quick and easy access to geological data and information in the field ...
... ungrammatical words — in our case nouns, adjectives, adverbs and acronyms — followed by their frequencies. Thus, the text is lemmatized and lemmas (simple and multi-word) are extracted and their frequency is calculated. In that way 12,204 simple lemmas (with 450,418 occurences) and 271 MWUs (with 6,525 ...
... organization, persons — are entered in the index. Figure 1 represents one document from our collection in which recognized NEs are highlighted: locations in blue, persons in pink, and organizations in light green. Determination of term weights is a complex process and there are numerous models, the most ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Indexing of textual databases based on lexical resources: A case study for Serbian" in Semantic Keyword-based Search on Structured Data Sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers, Springer (2015). https://doi.org/10.1007/978-3-319-27932-9_15