Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

2342 items

From DELA Based Dictionary to Leximirka Lexical Database

Biljana Lazić, Mihailo Škorić (2020)

In this paper, we will present an approach in transforming Serbian language Morphological dictionaries from a DELA text format to a lexical database dubbed Leximirka. Considering the benefits of storing data within a database when compared to storing them in textual documents, we will outline some of the functionality that the database has made possible. We will also show how hand-made rules that use category labels lexical entries are marked with can be used to link lexical entries. ...

Morfološki rečnici, jezički resursi, Leksimirka

... Cvetana Krstev, Biljana Lazić and Mihailo Škorić. “Elec- tronic Dictionaries – from File System to lemon Based Lexical Database”. In Proceedings of the 11th International Conference on Language Re- sources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science ...
... not a language dependent application and there is no obstacle for it to be applied in purpose of maintaining DELA dictionaries of other languages. The Section 3.1 will present more detailed reasons for complex transition to a lexicographic database and to an online application. In Sections 3.2 and 3.3 ...
... recording linguistic in- formation in computer lexicons and is based on the Standard ISO 24613: 2008 (Language Resource Management - Lexical Markup Framework - LMF). LMF is designed for lexicons specially designed for Natural Language Pro- cessing and Machine-Readable Dictionaries. LMF specification is ...
Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Branislava Šandrih, Ranka Stanković (2020)

U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...

ekstrakcija terminologije, validacija terminologije, GIZA++, grafovi, Unitex, klasifikacija teksta

... poster and demonstration sessions, 177–180. Association for Computational Linguistics, 2007 Kontonatsios, G., M. Claudiu, Korkontzelos I., Thompson P and S. Anani- adou. “A Hybrid Approach to Compiling Bilingual Dictionaries of Medical Terms from Parallel Corpora”. Statistical Language and Speech ...
... Natural Language Engineer- ing Vol. 23, no. 1 (2017): 31–51 Hamon, T. and N. Grabar. “Adaptation of Cross-lingual Transfer Methods for the Building of Medical Terminology in Ukrainian”. In Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING2016) ...
... n, 187–197. Association for Computational Linguistics, 2011. Hewavitharana, Sanjika and Stephan Vogel. “Extracting Parallel Phrases from Comparable Data for Machine Translation”. Natural Language En- gineering Vol. 22, no. 4 (2016): 549–573 Irvine, Ann and Chris Callison-Burch. “End-to-end Statistical ...
Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis

Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović (2017)

U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...

Omeka, Wordnet, pretraga punog teksta, morfološka i semantička pretraga teksta, proširenje upita

... (right) APPLICATIONS FOR LINGUISTIC RESOURCES The linguistics and lexical resources used for query expansion and text analysis are depicted in Figure 3 on the left, while on the right are main application components of the language support system. Main lexical resources include morphological ...
... (2008). Forensic Linguistics, Second Edition. London: Continuum ISBN 978-0-8264-6109-4 12. John Olsson (2004). An Introduction to Language Crime and the Law. London: Continuum International Publishing Group 13. C.S. Michell (2013). Investigating the use of forensic stylistic and stylometric techniques ...
... 4 John Olsson (2008).. Forensic Linguistics, Second Edition. London: Continuum ISBN 978-0-8264-6109-4 5 Falzini, Mark W. (9 September 2008). "The Ransom Notes: An Analysis of Their Content & "Signature"" 6 John Olsson (2004). An Introduction to Language Crime and the Law. London: Continuum International ...
Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
Rule-based Automatic Multi-word Term Extraction and Lemmatization

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac (2016)

In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...

term extraction, terminology, multi-word units, lemmatization, finite-state transducers

... 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning -Volume 7, Stroudsburg: Association for Computational Linguistics, pp. 67--72. Smadja, F.(1993). Retrieving Collocations from Text: Xtract, Computational Linguistics, 19(1), pp. 143-177. ...
... Piskorski, J., Sydow, M., and Kupść, A. (2007). Lemmatization of Polish person names. In Proc. of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies, Stroudsburg: Association for Computational Linguistics, pp. 27--34. Savary, A., Zaborowski ...
... of Natural Language Processing. Initially, MWT extraction from domain texts has been tackled mainly using the statistical approach based on different statistical measures, following the seminal work of Kenneth Church and Patrick Hanks (1990; 1991) and Frank Smadja (1993). A language independent ...
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface

Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos (2024)

Predstavljamo trenutne aktivnosti na definisanju interfejsa leksikona i korpusa koji će služiti kao referenca u prikazu polileksemskih jedinica - višečlanih izraza - (različitih tipova - imenskih, glagolskih, itd.) u specijalizovanim leksikonima i povezivanju ovih unosa sa njihovim pojavljivanjima u korpusima. Konačni cilj je korišćenje ovakvih resursa za automatsko identifikovanje višečlanih izraza u tekstu. Uključivanje nekoliko prirodnih jezika ima za cilj univerzalnost rešenja koje nije usredsređeno na određeni jezik, kao i prilagođavanje idiosinkrazijama. Raspravljaju se izazovi u leksikografskom opisu višerečnih ...

multiword expression lexicon, corpus, proof-of-concept lexicon encoding

Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos. "Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević (2024)

Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...

zero-shot, few-shot, sentiment, Serbian, Mistral model

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Sofia, Bulgaria, 9-10 September 2024, LREC | COLING (2024)
Using English Baits to Catch Serbian Multi-Word Terminology

Cvetana Krstev, Branislava Šandrih, Ranka Stanković (2018)

In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...

aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inﬂection

... target language, and a tool for word and chunk alignment. In this first experiment a source language is English, a target language is Serbian, a domain is Library and Information Science for which a bilingual terminological dictionary exists. Our term extractor is based on e-dictionaries and shallow ...
... target language) in various ways: some authors use mor- phosyntactic patterns on lemmatized and POS-tagged texts 2In this paper we will call ‘source’ language a well-resourced language (English), and ‘target’ language a less-resourced lan- guage (Serbian). 2487 (Bouamor et al., 2012; Vintar and Fišer ...
... source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a t ...
Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Resource-based WordNet Augmentation and Enrichment

Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev (2018)

In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset deﬁnitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...

WordNet, bilingual resources, term alignment, parallel lists

... Different methods and resources can be used for alignment. One of the common approaches is to take PWN as the source for alignment, and a bilingual dictionary of English and the target language. There are, however, several other approaches. In (Chugur et al., 2001) a monolingual and a bilingual Sp ...
... Source term, Source language identifier, Target term, Target language identifier. The number of terms differ from language to language, due to varying levels of localization. The Microsoft Terminology Collection is a set of standard technology terms used across Microsoft products, and comprises 13,147 ...
... Mariani, J., Odjik, J., Piperidis, S., Rosner, M., and Tapias, D., Eds., Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC’10), Valletta, Malta, may. European Language Resources Asso- ciation (ELRA). Matuschek, M. and Gurevych, I. (2013). Dijkstra-WSA: A graph-based ...
Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
Multi-word Expressions for Abusive Speech Detection in Serbian

Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev (2020)

Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...

uvredljiv govor, govor mržnje, leksički izvori, višejezični leksikon, izrazi sa više reči

... Patti. 2019. Cross-domain and cross-lingual abusive language detection: A hybrid approach with deep learning and a multilingual lexicon. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 363–370. Endang Wahyu Pamungkas, Alessandra ...
... the Fourth International Confer- ence Computational Linguistics in Bulgaria (CLIB 2020). Institute for Bulgarian Language “Prof. Lyubomir Andreychin”, Bulgarian Academy of Sciences, June. Cvetana Krstev. 2008. Processing of Serbian – Automata, Texts and Electronic Dictionaries. University of Belgrade, ...
... pages 75–86. Association for Computational Linguistics. Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, and Çağrı Çöltekin. 2020. SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval ...
Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić (2022)

In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...

Wikidata, linked data, SPARQL, distant reading, literary corpus, named entity linking, ELTeC

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
Bilingual lexical extraction based on word alignment for improving corpus search

Jelena Andonovski, Branislava Šandrih, Olivera Kitanović (2019)

Library and Information Sciences,Computer Science Applications

Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
English for Geology Students. 2

Lidija Beko (2023)

Engleski jezik

... of clarity and coherence as a way in which they wish to teach the subject of English language and geology. Six thematic units: 1. Landslides 2. Metamorphic rocks 3. Mineral deposits 4. Hydrological cycle and groundwater 5. Surface water and glaciers 6. Geophysics, seismology and prospecting ...
... the same time referring to active learning within the given context. Teaching vocabulary, which is the base of language knowledge, can be continued by creating paper vocabulary cards, and later even electronic cards, which would ensure continuity in vocabulary learning by including modern educational ...
... writing the textbooks. Those are, first and foremost, all my colleagues from the Faculty of Mining and Geology who have provided Foreword me with valuable assistance, relevant literature and insights, and who have pointed out specific geological terms and translations of those terms in order for ...
Lidija Beko. English for Geology Students. 2, Belgrade : The Faculty of Mining and Geology, 2023
Knowledge and Rule-Based Diacritic Restoration in Serbian

Cvetana Krstev, Ranka Stanković, Duško Vitas (2018)

In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the dataobtainedfromSrpKorandlocalgrammarsassistsinmakingadecisionbetween several candidates in cases of ambiguity. The evaluation results reveal that,dependingonthetext,accuracyrangesfrom95.03%to99.36%,whilethe precision (average 98.93%) is always higher than the recall (average 94.94%).

diacritic restoration, morphological dictionary, corpus, word n-grams, local grammars

... available version of thesaurus of russian language ruthes. In Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference Dialogue, Bekasovo, Russia, pages 340–349. Loukachevitch, N., Shevelev, A., Mozharova, V., Dobrov, B., and Pavlov, A. (2017). Ruthes thesaurus ...
... (Loukachevitch and Dobrov, 2014) and created for automatic processing of documents in information- analytical systems and natural language processing. These resources are linguistic ontologies uniting some principles of their organization from WordNet, information-retrieval thesauri and formal ontolo- ...
... RuThes family of Russian thesauri intended for natural language process- ing and information retrieval applications. RuThes-like thesauri include, besides RuThes, Sociopolitical thesaurus, Security Thesaurus, and Ontology on Natural Sciences and Technology. The RuThes format is based on three approaches ...
Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
English for Geology Students. 1

Lidija Beko (2023)

Engleski jezik

... roughly 40 and 60 degrees centigrade, and this, coupled with the high humidity and acidity of the atmosphere, mean it is necessary for anyone visiting the Cave to wear a protective “ice suit”. The Caves have been flooded for much oftheir history, and the unique conditions of the heat and the solute ...
... collected from Mars. Such explorations and studies allow scientists to construct a picture of the similarities and differenc- es between the Earth and other extraterrestrial bod- ies, much as doctors might build up a picture of genetic similarities and differences in a family. While many ...
... s carried into orbit by satellites. This has resulted in an increased ability to test and prove hypotheses, and also 12 Meteorite created new ways of studying the interactions of terrestrial cycles and systems on a truly global scale, leading to a new field of research known as Earth system ...
Lidija Beko. English for Geology Students. 1, Belgrade : The Faculty of Mining and Geology, 2023
Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities

Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov (2024)

Овај рад представља активности на развоју корпуса ELEXIS-sr, српском додатку вишејезичном анотираном корпусу ELEXIS-а, који се састоји од семантичких анотација и репозиторија значења речи. ELEXIS је паралелни вишејезични анотирани корпус на десет европских језика, који може да се користи као вишејезички репер за евалуацију европских језика са мање и средње развијеним ресурсима. Фокус овог рада је на вишечланим изразима и именованим ентитетима, њиховом препознавању у скупу реченица ELEXIS-sr и поређењу са анотацијама на другим језицима. Разматрају се први кораци ...

полилексемске језинице, именовани ентитет, вишезначност значења речи, складиште смисла, LLOD

Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov. "Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
Using Lexical Resources for Irony and Sarcasm Classification

Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković (2017)

The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...

... 2011. Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates. In Proceedings of the 49th Annual Meeting of the ACL: Human Language Technologies: short papers – Volume 2. Association for Computational Linguistics, 564–568. [10] Matthieu Constant, Cvetana Krstev, and Duško Vitas ...
... Cvetana Krstev, and Duško Vitas. 2015. Hy- brid sentiment analysis framework for a morphologically rich language. Journal of Intelligent Information Systems 46, 3 (2015), 599–620. [25] Silvana Neshkovska. 2015. Stylistic Signals of Verbal Irony. International Journal of Language & Linguistics 2, 2 (2015) ...
... Sarcasm. In Proc. of the Workshop on Computational Approaches to Figurative Language (FigLanguages ’07). Association for Computational Linguistics, 1–4. [22] Cvetana Krstev. 2008. Processing of Serbian — Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade ...
Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
An Approach to Development of Bilingual Lexical Resources

Stanković Ranka, Obradović Ivan, Trtovac Aleksandra (2012)

... Ranka, Obradović Ivan, Trtovac Aleksandra | Proceedings of the Fifth Balkan Conference in Informatics BCI 2012, Workshop on Computational Linguistics and Natural Language Processing of Balkan Languages – CLoBL 2012, September 2012 | 2012 | | http://dr.rgf.bg.ac.rs/s/repo/item/0001462 Дигитални репозиторијум ...
... Conference on Language Resources and Evaluation (LREC), (Istanbul, Turkey, May 23-25, 2012). [7] Stanković, R., Obradović, I., Krstev, C., Vitas, D. 2011. Production of Morphological Dictionaries of Multi-Word Units Using a Multipurpose Tool. In: Proceedings of the Computational Linguistics-Applications ...
... collocations from documents and generates multiword keywords. The paper also outlines linguistic criteria used for building language resources for French, Italian, and German, and the use of multi-term descriptors as a means to better identify the content. The Human Language Technology group at the ...
Stanković Ranka, Obradović Ivan, Trtovac Aleksandra. "An Approach to Development of Bilingual Lexical Resources" in Proceedings of the Fifth Balkan Conference in Informatics BCI 2012, Workshop on Computational Linguistics and Natural Language Processing of Balkan Languages – CLoBL 2012, September 2012, Novi Sad : BCI (2012)
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić (2020)

The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...

Part-of-Speech tagging, lemmatization, corpus, evaluation, Serbian, morphological dictionary

... Constant, M., Krstev, C., and Vitas, D. (2018). Lexical analysis of serbian with conditional random fields and large-coverage finite-state resources. In Zygmunt Vetu- lani, et al., editors, Human Language Technology. Chal- lenges for Computer Science and Linguistics. LTC 2015. Lecture Notes in ...
... computational linguistics: system demonstrations, pages 55–60. Petrov, S., Das, D., and McDonald, R. (2012). A Univer- sal Part-of-Speech Tagset. In Nicoletta Calzolari (Con- ference Chair), et al., editors, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) ...
... Pacific Asia Conference on Language, Information and Compu- tation, pages 389–398. Honnibal, M. and Montani, I. (2017). spaCy 2: Natural Language Understanding with Bloom Embeddings, Con- volutional Neural Networks and Incremental Parsing. To appear. Huang, Z., Xu, W., and Yu, K. (2015). Bidirectional ...
Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
English for Geology Students 1 – Dyslexia friendly

Lidija Beko (2023)

Engleski jezik

Lidija Beko. English for Geology Students 1 – Dyslexia friendly, Belgrade : The Faculty of Mining and Geology, 2023
English for Geology Students 2 - Dyslexia friendly

Lidija Beko (2023)

Engleski jezik

Lidija Beko. English for Geology Students 2 - Dyslexia friendly, Belgrade : The Faculty of Mining and Geology, 2023

Претрага

2342 items

From DELA Based Dictionary to Leximirka Lexical Database cite

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++ cite

Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis cite

Rule-based Automatic Multi-word Term Extraction and Lemmatization cite

Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface cite

Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model cite

Using English Baits to Catch Serbian Multi-Word Terminology cite

Resource-based WordNet Augmentation and Enrichment cite

Multi-word Expressions for Abusive Speech Detection in Serbian cite

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back) cite

Bilingual lexical extraction based on word alignment for improving corpus search cite

English for Geology Students. 2 cite

Knowledge and Rule-Based Diacritic Restoration in Serbian cite

English for Geology Students. 1 cite

Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities cite

Using Lexical Resources for Irony and Sarcasm Classification cite

An Approach to Development of Bilingual Lexical Resources cite

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian cite

English for Geology Students 1 – Dyslexia friendly cite

English for Geology Students 2 - Dyslexia friendly cite

From DELA Based Dictionary to Leximirka Lexical Database

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis

Rule-based Automatic Multi-word Term Extraction and Lemmatization

Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface

Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model

Using English Baits to Catch Serbian Multi-Word Terminology

Resource-based WordNet Augmentation and Enrichment

Multi-word Expressions for Abusive Speech Detection in Serbian

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Bilingual lexical extraction based on word alignment for improving corpus search

English for Geology Students. 2

Knowledge and Rule-Based Diacritic Restoration in Serbian

English for Geology Students. 1

Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities

Using Lexical Resources for Irony and Sarcasm Classification

An Approach to Development of Bilingual Lexical Resources

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

English for Geology Students 1 – Dyslexia friendly

English for Geology Students 2 - Dyslexia friendly