Претрага
554 items
-
Using Lexical Resources for Irony and Sarcasm Classification
The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...... of this set of word forms. For that purpose, we used a hybrid system for Serbian that combines three NLP tasks: PoS tagging, compound and named-entity recognition [10] (step 5 in Fig. 1) that was trained on various annotated texts – literary, newspaper and textbooks. Tagging results are represented by ...
... cases a corpus consisting of tweets was used, andwe have developed a similar resource for Serbian which we present in Section 3. A sys- tem for recognition and tagging of ironic tweets based on the SWN ontology and other language resources is presented in Section 4. The results of the evaluation of the ...
... leading to irony detection precision of 85.4% and 68.3%, respectively. In research described in [29] five linguistic patterns were suggested for recognition of ironic statements in a corpus of tweets in Chinese, while authors in [32] used the pattern “about as * as *” as a query sent to the Google Search ...Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
-
Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC
OntoLex, dominantni standard zajednice za mašinski čitljive leksičke resurse u kontekstu RDF-a, Linked Data i tehnologija Semantičkog veba, trenutno se proširuje sa posebnim modulom za Frekvencije, Primere i Informacije zasnovane na Korpusu (OntoLex-FrAC). Predlažemo novi komponent za OntoLex-FrAC, koji se bavi inkorporacijom korpusnih upita za (a) povezivanje rečnika sa korpusnim mašinama, (b) omogućavanje RDF baziranih web servisa da dinamički razmenjuju korpusne upite i podatke odgovora, i (c) korišćenje konvencionalnih upitačkih jezika za formalizaciju unutrašnje strukture kolokacija, skica reči i ...standardizacija, digitalna leksikografija, OntoLex, upiti korpusa, povezani podaci, Lingvistički povezani otvoreni podaciChristian Chiarcos, Ranka Stanković, Maxim Ionov, Gilles Sérasset. "Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, 20-25 May 2024, LREC (2024)
-
Application of machine learning for diagnosing the operation of a deep well pump in oil production
Maja Trikić (2024)This paper will thoroughly examine how machine learning can improve the diagnosis of deep well pumps by analyzing the role and function of the pumps, dynamograms, sensor technologies, and diagnostic methods.Our analysis will provide insights into modern techniques and approaches for enhancing the performance and reliability of oil production systems, targeting cost reduction and increased operational efficiency.deep well pump, dynamograms, machine learning, diagnostics of operating coditions,Random Forest, XGBoost... typically requiring human intelligence, such as speech recognition, visual perception, and decision-making. By applying machine learning and Al, it is possible to automate and enhance processes, improve prediction accuracy, and optimize operational systems across various industries, including oil production ...
... Therefore, developing algorithms for automatic recognition of significant events is crucial for improving asset management. Machine Learning (ML) can process vast amounts of information in real- time and convert it into actionable insights.Pattern recognition on dynamometric cards 44 is not a novel ...
... compared various pattern recognition methods. Over the years, different methods and algorithms, including neural networks and deep learning, have been used to analyze and classify cards.Recent studies have shown that the latest algorithms achieve high accuracy in pattern recognition. However, it remains ...Maja Trikić. Application of machine learning for diagnosing the operation of a deep well pump in oil production, 2024
-
Named Entity Recognition for Distant Reading in ELTeC
Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos, Ranka Stanković (2020)Akcija COST „Udaljeno čitanje za evropsku književnu istoriju“, koja je počela 2017. godine, ima među svojim glavnim ciljevima stvaranje višejezične zbirke evropskih književnih tekstova (ELTeC) otvorenog koda. U ovom radu predstavljamo rad koji je obavljen na ručnom označavanju selekcije ELTeC kolekcije za imenovane entitete, kao i na proceni postojećih alata za prepoznavanje imenovanih entiteta u pogledu njihove sposobnosti da automatski urade takve anotacije. U poslednjem paragrafu se razmatraju zajedničke tačke između ove inicijative i CLARIN-a.... 2023-10-14 04:19:44 Named Entity Recognition for Distant Reading in ELTeC Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Named Entity Recognition for Distant Reading ...
... access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Annotation and Visualization Tools 37 Named Entity Recognition for Distant Reading in ELTeC Francesca Frontini P ra x il in g C N R S U n iv e rs i te P a u l-V a le ry M o n tp e llie r 3 Carmen ...Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos, Ranka Stanković. "Named Entity Recognition for Distant Reading in ELTeC" in CLARIN Annual Conference 2020, Oct 2020, Virtual Event, France, CLARIN (2020)
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... Belgrade, Serbia 1 Text mending – introduction to problems Text mending is one of the simplest text transformation problems, when compared to speech recognition and generation, text summarization and machine translation. It is also one of the first problems posed to computers that did not involve calculation ...
... approaches were developed for many languages. (Krstev et al., 2018). Errors produced during machine text input, for instance by Optical Char- acter Recognition (OCR), are of a different type and different solutions were developed for detecting and correcting such errors. As early as in the late 1950s, Bledsoe ...
... context or more complex structures. 2 Correction of OCR errors In the process of digitization printed books are scanned and then optical character recognition (OCR) is applied. A text that fully corresponds to the original is rarely obtained since OCR is prone to errors. The quality of the resulting text ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
-
An intelligent hybrid system for surface coal mine safety analysis
Nikola Lilić, Ivan Obradović, Aleksandar Cvjetić. "An intelligent hybrid system for surface coal mine safety analysis" in Engineering Applications of Artificial Intelligence (2010)
-
Razvoj ARCGIS geobaze površinskog kopa korišćenjem UML CASE alata
... geodatabase represents a collection of interrelated data, namely: attributes (data describing a geographic entity numerically or textually), geometry (data defining the shape and size of an entity and its position in space) and topology (data defining relations between different geographic entities) ...
... built-in system for storing and indexing both alphanumeric and geometric data. The Irish company ESRI2 created one of the most complex GIS platforms named ArcGIS®. This integrated software family provides all functions necessary for developing a geographic information system. ArcGIS encompasses a palette ...
... Tomašević, Ljiljana Kolonja, Ivan Obradović, Ranka Stanković, Olivera Kitanović1 ABSTRACT Opportunities offered by geographic information systems are very modestly exploited in Serbian mining industry. It is the authors’ wish to bring closer to the mining public at least a small part of the ...Aleksandra Tomašević, Ljiljana Kolonja, Ivan Obradović, Ranka Stanković, Olivera Kitanović. "Razvoj ARCGIS geobaze površinskog kopa korišćenjem UML CASE alata" in Podzemni radovi, Beograd : Univerzitet u Beogradu - Rudarsko-geološki fakultet (2012)
-
Српски језик у дигиталном добу -- The Serbian Language in the Digital Age
Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević (2012)... of companies, the system also needs to recognise a par- ticular string of words in a document represents a com- pany name, using a process called named entity recogni- tion. A more demanding challenge is matching a query in one language with documents in another language. Cross-lingual information retrieval ...
... many appli- cations for text summarisation. Within the aforementioned areas, highly successful ex- periments for Serbian are underway related to named entity extraction as a part of the information extrac- tion problem. A speedy development of IE and QA is expected, given the extent of developed morphological ...
... technology include interfaces to car navigation systems and the use of spoken language as an alternative to the graphical or touch-screen inter- faces in smartphones. Speech interaction technology comprises four technologies: 1. Automatic speech recognition (ASR) determines which words are actually spoken ...Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević. "Српски језик у дигиталном добу -- The Serbian Language in the Digital Age" in META-NET White Paper Series, G. Rehm, H. Uszkoreit (eds.), Springer (2012)
-
Развој геолошког терминолошког речника ГеолИССТерм
... class Entitet (Entity) comprises instances of all spatial and classes of attributes and also their subclasses, namely sub- types. Among the metadata provided by the rela- tionship class SvojstvoEntiteta (EntityProperty) is the domain (Figure4). The instances of the class Entitet (Entity) are for ...
... such a way so as to display equivalence, homography, hierarchy and associa- tion relations among terms in a clear manner and allow their easy recognition through standard in- dicators. The primary role of a thesaurus is to fa- cilitate finding documents and achieve consistency in the indexing of ...
... disciplines by using geolISS. The development of the electronic dictionary of geologic terms intensified in 2008 as part of a separate geolISS project named The Develop- ment of Geologic Terminology and Nomencla- ture for the Geologic Database of Serbia. The goal of this project was to add information ...Ranka Stanković, Branislav Trivić, Olivera Kitanović, Branislav Blagojević, Velizar Nikolić. "Развој геолошког терминолошког речника ГеолИССТерм" in INFOteka: časopis za informatiku i bibliotekarstvo, Beograd : Zajednica biblioteka univerziteta u Srbiji (2011)
-
Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons
Mihailo Škorić (2017)The goal of this paper is to draw attention to the possibility of using emoticon-riddled text on the web in language-neutral sentiment analysis. It introduces several innovations in the existing framework of research and tests their effectiveness. It also presents a software tool especially made for that purpose, explains how it builds a database with sentimental value of terms and offers the user manual. Finally, it presents a software tool that tests the new database and gives some examples ...... pp. 67–91 well-formed XML document. Entity reference lt;(character & that marks the begining of the entity reference was replaced with a whitespace during preprocessing step) is replaced with < character, so that the emoticon <3 cound be found in the text. Entity reference 039 is replaced with ’ charac- ...
... information access, Vol. 19, 321–327. Citeseer, 2005. Neviarouskaya, Alena, Helmut Prendinger, and Mitsuru Ishizuka. Compo- sitionality principle in recognition of fine-grained emotions from text. In Proceedings of the Third International ICWSM Conferenc, 278–281. The AAAI Press, 2009. Ptaszynski, Michael ...
... of the meaning of the text, often limited to one or a small number of areas. This type of software is predominantly used for text classification. Systems based on sentiment analysis assign sentiment values to text using multiple parameters, where a greater number of parameters means much greater complexity ...Mihailo Škorić. "Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons" in Infotheca, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2017.17.1.4
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... with other resources for linguistic text pro- cessing; 2.5 Repeated linguistic preprocessing with ex- panded dictionaries for verification of recognition of new lemmas. 3. MWUs extraction 3.1. Application of syntactic graphs to extract MWUs with different syntactic structures from the same text ...
... integrating them with other resources for linguistic text processing; 5.3. Linguistic pre-processing with expanded dictionaries for verification of recognition of new MWU lemmas. Figure 1: Diagram of terminology acquisition using lexical resources and local grammars The newly acquired terms, both ...
... Due to high homography of word forms it may happen that the same sequence of words is recognized by two or more graphs; naturally, only one recognition may be correct. For in- stance if the MWU bager kašikar (case 6, NXN) is detected in the analyzed text in the genitive case bagera kašikara it ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
GIS Application Improvement with Multilingual Lexical and Terminological Resources
... geodatabase on MS SQL server. The logical framework of GeolISS implementation is based on five packages of classes: concept, observation, spatial entity, description and metadata (Blagojević et al., 2008). Concept represents the core of GeolISS, and is implemented as an aggregation of geological ...
... Observation implements field data records and measurements, the basis for classification, interpretation and modeling of geological features. Spatial entity is treated as observation location and mapped/interpreted geological occurrence, and implemented in the geodatabase geometrically by points, ...
... annotation is the text or graphics on a map that provide substantial information for the map reader. Annotation may identify or describe a specific map entity, provide general information about an area on the map, or supply information about the map itself. In general, the placement of descriptive text ...Ranka Stanković, Ivan Obradović, Olivera Kitanović. "GIS Application Improvement with Multilingual Lexical and Terminological Resources" in Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2010, Valetta, Malta, May 2010, Valetta, Malta : European Language Resources Association (2010)
-
Why classical sequence stratigraphy doesn't work in Pannonian basin?
Dejan Radivojević (2018)... be absent. They are probably below seismic resolution level, so their recognition was not possible. The early rift cli‐ max systems tract usually ends up with downlap on pre‐Tertiary formations or on the initial rift systems tract. Early rift climax sediments are usually present only in deep parts ...
... flooded this systems tract is very hard to distinguished from early rift climax systems tract. Besides initial rift systems tract also early (S3a), middle (S3b) and late (S3c) rift climax systems tract is alinated. In grabens with low subsidence, one or more of that systems tract could be absent ...
... some places close to the fault it has hummocky geometry. This systems tract is not confirmed in all half‐ graben and could be related to the moment of initial rifting in certain graben. Essentially it represents initial rifting systems tract formed in continental/alluvial environment (Prosser ...Dejan Radivojević. "Why classical sequence stratigraphy doesn't work in Pannonian basin?" in 17th Serbian Geological Congress, Vrnjačka Banja, 17-20 maj 2018, Srpsko geološko društvo (2018)
-
VENTEX: An Expert System for Mine Ventilation Systems Analysis
Analiza sistema za ventilaciju je komleksan proces, koji bazira na proračunu brojnih parametara, koji se odnose na: stanje mreže, provetrenost, stabilnost sistema, gubitke vazduha, klimatske uslove, gasno stanje, ugroženost od požara i opasne prašine. Navedeni problemi se uspešno rešavaju paketom SimVent, ali potpuno razumevanje i korišćenje dobijenih rezultata zahteva angažovanje iskusnog specijaliste iz oblasti ventilacije. Rešenje je nađeno u kreiranju ekspertnog sistema VENTEX, čija je baza znanja formalizacija ekspertskog znanja iz oblasti ventilacije rudnika. U radu je prikazana metodologija ...... using a modification of the Coad- Yourdon object-oriented analysis (OOA) model (Coad and Yourdon, 1991). In the classical model every real world entity is represented by a class (object) consisting of its name, attributes and methods pertaining to the procedures related to the object. In order ...
... VENTEX: An Expert System for Mine Ventilation Systems Analysis Ranka Stanković, Nikola Lilić, Ivan Obradović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] VENTEX: An Expert System for Mine Ventilation Systems Analysis | Ranka Stanković, Nikola Lilić, Ivan ...
... knowledge in software systems. In the late 80’s it has been recognized that expert system methods and techniques can be very useful in solving mine ventilation problems. The early 90’s already brought a number of knowledge-based applications in mine ventilation systems analysis (Ramani, Prasad ...Ranka Stanković, Nikola Lilić, Ivan Obradović. "VENTEX: An Expert System for Mine Ventilation Systems Analysis" in YU Info '96 Brezovica, Društvo za informacione sisteme i računarske mreže (1996)
-
Towards Automatic Definition Extraction for Serbian
U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...... current version contains 31 textbooks and it is expected to grow in the near future. The textbooks were scanned, optical character recognition was performed, and recognition errors were manually corrected (though a certain number of OCR errors remained). The corpus consists of 85,628 sentences, 3,4M tokens ...
... Kitanović et al. 2021), we focused our present research on the extraction of the sentences contained in the definition. The extraction also implies recognition of paradigmatic lexical relations, e.g. synonyms, antonyms, hypernyms, hyponyms. The problem of automatic extraction of definitions from the text ...
... digitized volumes of the SASA dictionary and presents the models developed for noun definitions taking the form of local grammars that can be used for recognition and extraction. These models were applied to a corpus consisting of textbooks and the results achieved in definition extraction are presented in ...Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
-
WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina
Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis (2015)... data are entered on the limits of the mining waste field, on the carrier of exploration and/or exploitation or economic entity that produces mining waste, on the economic entity which is the operator of mining waste, on the characterization and categorization of mining waste landfills within the mining ...
... abandoned mines as well as data concerning the remediation of this terrain were generally available. However databases of geographic information systems were often incoherent and/or incomplete for targeted use. A central repository for regional GIS spatial data was necessary in order to facilitate ...
... for viewing and sharing of geographic objects and maps, as well as interoperability with a large number of commercial and free geoinformation systems and cartographic applications. OGC (Open Geospatial Consortium) is the main international and industrial organization for standardization in ...Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis. "WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina" in Proceedings of the 5th International Symposium Mining And Environmental Protection,June 10-13,2015, Vrdnik, Serbia, Belgrade : Faculty of Mining and Geology (2015)
-
Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer
Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić (2011)... 407o of whom are on the Hungarian side of the border. The Pannonian basin (or the Great Hungarian Basin) represents a geographicai and geological entity that spreads over the territory ofseveral countries. The central | ,,, il i *ti':i !l:i!lr::l llr,:ir',. 'lii l:l' : i::,;!l ritl ::lli;' ;:iiii ...
... Roof Report for 2004 [6], the transboundary acluifer system of Hungary-Serbia was preliminary separated into two parts: one large GW body in Serbia named CS-DU 10, and five in Hungary (P.1. and P.2. groups). The totai area is assumed to cover around 27 0OA1
... vertical section, water-bearing layers with intergranular porosity interfinger with impermeable strata. Sandy-gravel sediments represent major aquifer systems. Their thickness varies from less than 10 m up to 50 m, but is generally of the order of 10 to 20 m. The transmissivity coefficient ranges from 10-s ...Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić. "Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer" in Transboundary Water Resources Management - A Multidisciplinary Approach, Weinheim, Germany : Wiley-VCH (2011): 143-149
-
E-Dictionaries and Finite-State Automata for the Recognition of Named Entities
Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš. "E-Dictionaries and Finite-State Automata for the Recognition of Named Entities" in Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, July 2010, Blois, France, A. Maletti and M. Constant (eds.), :Association for Computational Linguistics (2011): 48-56
-
CEEPUS Network CIII-RS-0038: Recognition of challenges in geological education in South-Eastern Europe and prompt responds
Sibila Borojević Šoštarić, Kristina Šarić. "CEEPUS Network CIII-RS-0038: Recognition of challenges in geological education in South-Eastern Europe and prompt responds" in Knjiga sažetaka 7th Croatian Geological Congress with international participation, Poreč, 2-4 October 2023, Zagreb : Hrvatski geološki institut – Croatian Geological Survey (2023)
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... helps in reducing ambiguity. The lexical resource, consisting of words that could be used as a trigger for recognition of abusive language is built, with an idea that the Serbian system for recognition and normalization of abusive expressions will also take into consideration phrases and figurative speech ...
... extension of the vocabulary with expressions that are not present in any existing lexicons, but evidenced in corpus as having offensive usage. The recognition of the different usages, that can be both offensive and non–offensive will be marked. The additional information about context or sense embeddings ...
... into Cyrillic: diddlei, villainess, ferociousness, carcharodon; 2) foreign (not-translated) words: anguillidae, anguilliformes, animal; 3) irrelevant named entities: Istočni Goti, Abulija, Animalija, Drag kraljica; 4) literal translations that are meaningless in Serbian: jabuka poliranje, javni pogodnost ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)