Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

803 items

Measuring semantic relevance of words in synsets

Obradović Ivan, Krstev Cvetana, Vitas Duško (2010)

Obradović Ivan, Krstev Cvetana, Vitas Duško. "Measuring semantic relevance of words in synsets" in Text and Language, Structures · Functions · Interrelations. Quantitative Perspectives, P. Grzybek, E. Kelih, J. Mačutek (eds.), Wien:Praesens Verlag (2010): 133-144
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources

Branislava Šandrih, Cvetana Krstev, Ranka Stanković (2020)

In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...

Linguistics and Language,Software,Artificial Intelligence,Language and Linguistics

Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
Белешка о дигитализацији речника

Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић (2019)

У раду ће се анализирати ограничења која проистичу из линеарног процеса традиционалне израде речника на примеру Речника САНУ. Начин да се превазиђу ова ограничења се састоји у формирању електронске лексикографске базе која не представља само пуку дигиталну транскрипцију папирног издања речника. Посебно се указује на чињеницу да текст речника може представљати корпус и приказују се одабрани примери анализе таквог корпуса формираног из текстове 1. и 19. тома Речника САНУ.

лексикографија, рачунарска лексикографија, информатика, информациони систем

... 04:07:54 Белешка о дигитализацији речника Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Белешка о дигитализацији речника | Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић | Српски језик и његови ресурси ...
... Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић60 ЛИТЕРАТУРА Витас 2007: Душко Витас, „О проблему не(пре)познате речи у обради текс- това на српском језику”, зборник матице српске за филологију и линг- вистику, L, 111–120. Витас/Крстев 2015: Душко Витас и Цветана Крстев, „Нацрт за информа- ...
... is available at: www.dr.rgf.bg.ac.rs 811.163.41’322.2 004.9:811.163.41’374 https://doi.org/10.18485/msc.2019.48.3.ch3 Душко М. ВИТАС, Цветана Ј. КРСТЕВ, Ранка М. СТАНКОВИЋ Филолошки факултет Универзитета у Београду БЕЛЕШКА О ДИГИТАЛИЗАЦИЈИ РЕЧНИКА У раду ће се анализирати ограничења која ...
Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић. "Белешка о дигитализацији речника" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch3
Using English Baits to Catch Serbian Multi-Word Terminology

Cvetana Krstev, Branislava Šandrih, Ranka Stanković (2018)

In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...

aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inﬂection

... to Catch Serbian Multi-Word Terminology Cvetana Krstev, Branislava Šandrih, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Using English Baits to Catch Serbian Multi-Word Terminology | Cvetana Krstev, Branislava Šandrih, Ranka Stanković | ...
... http://www.meta-net.eu/whitepapers. 8. Language Resource References Cvetana Krstev, Duško Vitas. (2015). Serbian Morpho- logical Dictionary - SMD. University of Belgrade, HLT Group and Jerteh, Lexical resource, 2.0. Cvetana Krstev. (2013). Serbian WordNet. University of Belgrade, HLT Group and Jerteh ...
... of Serbia: Scientific Re- search Department, Lexical database, 3.0. Ranka Stankovic, Cvetana Krstev. (2016). LeXimir. Uni- versity of Belgrade, HLT Group, Software Toolkit, 2.0. Ranka Stanković and Cvetana Krstev and Nikola Vulović and Biljana Lazić. (2014). Biblisha. University of Bel- grade, HLT ...
Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Electronic Dictionaries - from File System to lemon Based Lexical Database

Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić (2018)

In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...

... Lexical Database Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Electronic Dictionaries - from File System to lemon Based Lexical Database | Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić ...
... nformatik, 1:225. 10. Language Resource References Krstev, Cvetana and Vitas, Duško. (2015). Serbian Mor- phological Dictionary - SMD. University of Belgrade, HLT Group and Jerteh, Lexical resource, 2.0. Stanković, Ranka and Krstev, Cvetana. (2016). LeXimir. University of Belgrade, HLT Group ...
... Database Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić {Faculty of Mining and Geology, Faculty of Philology } University of Belgrade {Djušina 7, Studentski trg 3} Belgrade, Serbia {ranka.stankovic, biljana.lazic, mihailo.skoric}@rgf.bg.ac.rs, cvetana@matf.bg.ac.rs Abstract In ...
Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
A Lexical Approach to Acronyms and their Definitions

Cvetana Krstev, Duško Vitas, Ranka Stanković (2015)

In this paper we present a comprehensive approach to acronyms for Natural-Language Processing (NLP) of Serbian texts. The proposed procedure includes extraction of acronyms and their definitions that are usual Multi-Word Units (MWUs), shallow parsing of MWUs that enables MWU lemmatization and production of entries in morphological electronic dictionaries, both for MWU and acronyms, that are provided with grammatical, syntactic, semantic and domain information. This approach enables representation that reflects complex relations between acronyms and their definitions.

... Lexical Approach to Acronyms and their Definitions Cvetana Krstev, Duško Vitas, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Lexical Approach to Acronyms and their Definitions | Cvetana Krstev, Duško Vitas, Ranka Stanković | Proceedings of ...
... Repository is available at: www.dr.rgf.bg.ac.rs A Lexical Approach to Acronyms and their Definitions Cvetana Krstev∗, Duško Vitas∗, Ranka Stanković† University of Belgrade, Belgrade, Serbia, ∗(cvetana|vitas)@matf.bg.ac.rs, †ranka@rgf.bg.ac.rs Abstract In this paper we present a comprehensive approach ...
... 3rd Parseme General Meeting – Poster Session. Krstev, C., R. Stanković, I. Obradović, D. Vitas, and M. Utvić, 2010. Automatic construction of a morpho- logical dictionary of multi-word units. In IceTAL, vol- ume 6233 of LNCS. Springer. Krstev, C. and D. Vitas, 2005. Corpus and Lexicon – Mu- ...
Cvetana Krstev, Duško Vitas, Ranka Stanković. "A Lexical Approach to Acronyms and their Definitions" in Proceedings of the 7th Language & Technology Conference, November 27-29, 2015, Poznań, Poland, Springer (2015)
Distribution of canonical syllable types in Serbian

Obradović Ivan, Obuljen Aljoša, Vitas Duško, Krstev Cvetana, Radulović Vanja (2010)

Obradović Ivan, Obuljen Aljoša, Vitas Duško, Krstev Cvetana, Radulović Vanja. "Distribution of canonical syllable types in Serbian" in Text and Language, Structures · Functions · Interrelations. Quantitative Perspectives, P. Grzybek, E. Kelih, J. Mačutek (eds.), Wien:Praesens Verlag (2010): 145-157
SrpELTeC: A Serbian Literary Corpus for Distant Reading

Ranka Stanković, Cvetana Krstev, Duško Vitas (2024)

U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...

digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analytics

Ranka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
Resource-based WordNet Augmentation and Enrichment

Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev (2018)

In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset deﬁnitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...

WordNet, bilingual resources, term alignment, parallel lists

... Ivan Obradović, Marko Vitas, Cvetana Krstev Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Resource-based WordNet Augmentation and Enrichment | Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev | Proceedings of the Third In ...
... bg.ac.rs Miljana Mladenović College for Preschool Teachers Bujanovac, Serbia ml.miljana@gmail.com Cvetana Krstev and Marko Vitas Faculty of Philology University of Belgrade, Serbia cvetana@matf.bg.ac.rs vitas.marko@gmail.com Abstract In this paper we present an approach to support production ...
... and Mitrović, J. (2014). Natural Language Processing for Serbian – Resources and Application, chapter Semantic Networks for Serbian: New Functionalities of Developing and Maintaining a WordNet Tool. University of Belgrade, Mathematical Faculty. Mladenović, M., Mitrović, J., and Krstev, C. (2014). ...
Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
Keyword-Based Search on Bilingual Digital Libraries

Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović (2017)

This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Biblishsa supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ...

Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
Rule-based Automatic Multi-word Term Extraction and Lemmatization

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac (2016)

In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...

term extraction, terminology, multi-word units, lemmatization, finite-state transducers

... ion Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Rule-based Automatic Multi-word Term Extraction and Lemmatization | Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana ...
... Stanković1, Cvetana Krstev2, Ivan Obradović1, Biljana Lazić1, Aleksandra Trtovac3 1University of Belgrade, Faculty of Mining and Geology 2 University of Belgrade, Faculty of Philology 3 University Library “Svetozar Marković”, Belgrade E-mail: ranka.stankovic@rgf.bg.ac.rs, cvetana@poincare.matf ...
... , pp. 59--66. Krstev, C., Obradović, I., Stanković, R., and Vitas, D. (2013). An Approach to Efficient Processing of Multi-Word Units. In: Przepiórkowski, A., Piasecki, M., Jassem, K., Fuglewicz, P. (Eds.) Computational Linguistics. Berlin: Springer, pp. 109--129. Krstev, C., Stanković R. ...
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration

Cvetana Krstev, Ranka Stanković, Vitas Duško (2010)

In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...

Morphology, Lexicon, lexical database, Standards for LRs

... Feature System Declaration Cvetana Krstev, Ranka Stanković, Vitas Duško Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Description of Morphological Features of Serbian: a Revision using Feature System Declaration | Cvetana Krstev, Ranka Stanković, Vitas Duško ...
... using Feature System Declaration Cvetana Krstev 1 , Ranka Stanković2 , Duško Vitas 3 1 professor, Faculty of Philology, Belgrade, 2 assistant professor, Faculty of Mining and Geology, Belgrade 3 professor, Faculty of Mathematics, Belgrade E-mail: cvetana@matf.bg.ac.rs, ranka@rgf.bg.ac.rs, vitas@matf ...
... T., and Zlatić, L. (2004). R{j}ecnik.com: English-Serbo-Croatian electronic dictionary. In Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries (Geneva, Switzerland, August 29 - 29, 2004). ACL Workshops. ACL, Morristown, NJ, 61-64. Krstev, C., Vitas, D. and Erjavec, T. ...
Cvetana Krstev, Ranka Stanković, Vitas Duško. "A Description of Morphological Features of Serbian: a Revision using Feature System Declaration" in Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2010, Valetta, Malta : European Language Resources Association (2010)
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić (2020)

The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...

Part-of-Speech tagging, lemmatization, corpus, evaluation, Serbian, morphological dictionary

... Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian | Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš ...
... the production of the new tag- ger model for Serbian are: (a) Serbian morphological dic- tionaries (Cvetana Krstev, Duško Vitas, 2015) (SMD); (b) pre-annotated texts (Duško Vitas, Cvetana Krstev, Ranka Stanković, Miloš Utvić, 2019). 2.1. Serbian morphological dictionaries Serbian morphological ...
... 12(2):36a–47a, December. 8. Language Resource References Cvetana Krstev, Duško Vitas. (2015). Serbian Morpho- logical Dictionary - SMD. University of Belgrade, HLT Group and Jerteh, Lexical resource, 2.0. Duško Vitas, Cvetana Krstev, Ranka Stanković, Miloš Utvić. (2019). Sr-Basic: Annotated ...
Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
Production of morphological dictionaries of multi-word units using a multipurpose tool

Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas (2011)

The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...

electronic dictionary, Serbian, morphology, inﬂection, multi-word units, noun phrases, query expansion

... Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Production of morphological dictionaries of multi-word units using a multipurpose tool | Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas | Proceedings ...
... Mining and Geology, Djušina 7, 11000 Belgrade, Serbia Email: {ranka,ivano}@rgf.bg.ac.rs Cvetana Krstev University of Belgrade — Faculty of Philology, Studentski trg 3, 11000 Belgrade, Serbia Email: cvetana@matf.bg.ac.rs Duško Vitas University of Belgrade — Faculty of Mathematics, Studentski trg ...
... Approaches,” Linguistic Issues in Language Technologies, vol. 1, no. 2, 2008. [6] C. Krstev and D. Vitas, “Finite State Transducers for Recognition and Generation of Compound Words,” in IS-LTC 2006, T. Erjavec and J. Žganec Gros, Eds. Ljubljana, Slovenia: Institut “Jožef Stefan”, October 2006, pp. ...
Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities

Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov (2024)

Овај рад представља активности на развоју корпуса ELEXIS-sr, српском додатку вишејезичном анотираном корпусу ELEXIS-а, који се састоји од семантичких анотација и репозиторија значења речи. ELEXIS је паралелни вишејезични анотирани корпус на десет европских језика, који може да се користи као вишејезички репер за евалуацију европских језика са мање и средње развијеним ресурсима. Фокус овог рада је на вишечланим изразима и именованим ентитетима, њиховом препознавању у скупу реченица ELEXIS-sr и поређењу са анотацијама на другим језицима. Разматрају се први кораци ...

полилексемске језинице, именовани ентитет, вишезначност значења речи, складиште смисла, LLOD

Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov. "Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian

Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih (2021)

Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...

uvredljivi jezik, govor mržnje, srpski, tviter, leksikon, korpus

... Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian | Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih | 3rd Conference ...
... Detection in Serbian Danka Jokić # University of Belgrade, Serbia Ranka Stanković # Faculty of Mining and Geology, University of Belgrade, Serbia Cvetana Krstev # ↸ Faculty of Philology, University of Belgrade, Serbia Branislava Šandrih # ↸ Faculty of Philology, University of Belgrade, Serbia Abstract ...
... such as race, color, ethnicity, gender, sexual orientation, nationality, religion, or other characteristic”. © Danka Jokić, Ranka Stanković, Cvetana Krstev, and Branislava Šandrih; licensed under Creative Commons License CC-BY 4.0 3rd Conference on Language, Data and Knowledge (LDK 2021). Editors: ...
Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
Using Lexical Resources for Irony and Sarcasm Classification

Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković (2017)

The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...

... Papers, 644–650. [20] Svetla Koeva, Cvetana Krstev, and Duško Vitas. 2008. Morpho-semantic relations in Wordnet – a case study for two Slavic languages. In Global Wordnet Conference (GWC’08). University of Szeged, Department of Informatics, 239–253. [21] Roger J. Kreuz and Gina M. Caucci. 2007. Lexical ...
... and Sarcasm Classification Full Paper Miljana Mladenović Milenijum III Vranje, Serbia ml.miljana@gmail.com Cvetana Krstev University of Belgrade, Faculty of Philology Belgrade, Serbia cvetana@matf.bg.ac.rs Jelena Mitrović University of Passau, Faculty of Computer Science and Mathematics Passau, Germany ...
... processing; Lexical semantics; KEYWORDS Computational irony, Verbal irony, Verbal Sarcasm, WordNet ACM Reference format: Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, and Ranka Stanković. 2017. Using Lexical Resources for Irony and Sarcasm Classification. In Proceedings of BCI ’17, Skopje, Macedonia ...
Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
Towards Automatic Definition Extraction for Serbian

Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić (2021)

U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...

... for Serbian Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Towards Automatic Definition Extraction for Serbian | Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin ...
... employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Towards Automatic Definition Extraction for Serbian Stanković Ranka1, Krstev Cvetana1, Stijović Rada2, Gočanin Mirjana2, Škorić Mihailo1 1 University of Belgrade, Serbia 2 Institute for the Serbian Language of SASA, Serbia ...
... dictionary production by automatically associating some grammatical information to lemmas, namely, word class, word forms and different types of markers (Krstev 2008). Since the research related to extraction of dictionary examples has shown that information extraction from a corpus can be used to speed up ...
Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
Old or New, We Repair, Adjust and Alter (Texts)

Cvetana Krstev, Ranka Stanković (2020)

U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.

ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja

... 04:19:57 Old or New, We Repair, Adjust and Alter (Texts) Cvetana Krstev, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Old or New, We Repair, Adjust and Alter (Texts) | Cvetana Krstev, Ranka Stanković | Infotheca | 2020 | | 10.18485/infotheca ...
... Language Technology Research, 257–262. Morgan Kaufmann Publishers Inc., 2002 Krstev, Cvetana. Processing of Serbian – Automata, Texts and Electronic dictionaries. Faculty of Philology, University of Belgrade, 2008 Krstev, Cvetana, Ranka Stanković and Duško Vitas. “Knowledge and Rule- Based Diacritic ...
... electronic dictionary, finite-state transducers. PAPER SUBMITTED: 13 October 2019 PAPER ACCEPTED: 08 December 2019 Cvetana Krstev University of Belgrade, Faculty of Mathematics cvetana@matf.bg.ac.rs Ranka Stanković University of Belgrade, Faculty of Mining and Geology ranka.stankovic@rgf.bg.ac.rs ...
Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović (2017)

Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...

... Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources | Ranka Stanković, Cvetana Krstev, Ivan Obradović ...
... Address Belgrade, Serbia Email ranka@rgf.bg.ac.rs Author Family Name Krstev Particle Given Name Cvetana Prefix Suffix Division Faculty of Philology Organization University of Belgrade Address Belgrade, Serbia Email cvetana@matf.bg.ac.rs Author Family Name Obradović Particle Given Name Ivan Prefix ...
... language. CoRR abs/1209.4471 (2012). http:// arxiv.org/abs/1209.4471 18. Mladenović, M., Mitrović, J., Krstev, C., Vitas, D.: Hybrid sentiment analysis framework for a morphologically rich language. J. Intell. Inf. Syst. 1–22, to appear 19. Nadeau, D., Sekine, S.: A survey of named entity recognition ...
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8

Претрага

803 items

Measuring semantic relevance of words in synsets cite

Two approaches to compilation of bilingual multi-word terminology lists from lexical resources cite

Белешка о дигитализацији речника cite

Using English Baits to Catch Serbian Multi-Word Terminology cite

Electronic Dictionaries - from File System to lemon Based Lexical Database cite

A Lexical Approach to Acronyms and their Definitions cite

Distribution of canonical syllable types in Serbian cite

SrpELTeC: A Serbian Literary Corpus for Distant Reading cite

Resource-based WordNet Augmentation and Enrichment cite

Keyword-Based Search on Bilingual Digital Libraries cite

Rule-based Automatic Multi-word Term Extraction and Lemmatization cite

A Description of Morphological Features of Serbian: a Revision using Feature System Declaration cite

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian cite

Production of morphological dictionaries of multi-word units using a multipurpose tool cite

Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities cite

A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian cite

Using Lexical Resources for Irony and Sarcasm Classification cite

Towards Automatic Definition Extraction for Serbian cite

Old or New, We Repair, Adjust and Alter (Texts) cite

Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources cite

Measuring semantic relevance of words in synsets

Two approaches to compilation of bilingual multi-word terminology lists from lexical resources

Белешка о дигитализацији речника

Using English Baits to Catch Serbian Multi-Word Terminology

Electronic Dictionaries - from File System to lemon Based Lexical Database

A Lexical Approach to Acronyms and their Definitions

Distribution of canonical syllable types in Serbian

SrpELTeC: A Serbian Literary Corpus for Distant Reading

Resource-based WordNet Augmentation and Enrichment

Keyword-Based Search on Bilingual Digital Libraries

Rule-based Automatic Multi-word Term Extraction and Lemmatization

A Description of Morphological Features of Serbian: a Revision using Feature System Declaration

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

Production of morphological dictionaries of multi-word units using a multipurpose tool

Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities

A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian

Using Lexical Resources for Irony and Sarcasm Classification

Towards Automatic Definition Extraction for Serbian

Old or New, We Repair, Adjust and Alter (Texts)

Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources