Претрага
114 items
-
Measuring semantic relevance of words in synsets
Obradović Ivan, Krstev Cvetana, Vitas Duško. "Measuring semantic relevance of words in synsets" in Text and Language, Structures · Functions · Interrelations. Quantitative Perspectives, P. Grzybek, E. Kelih, J. Mačutek (eds.), Wien:Praesens Verlag (2010): 133-144
-
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...... Cvetana Krstev, Ranka Stanković, Vitas Duško Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Description of Morphological Features of Serbian: a Revision using Feature System Declaration | Cvetana Krstev, Ranka Stanković, Vitas Duško | Proceedings of the 5th I ...
... Cvetana Krstev 1 , Ranka Stanković2 , Duško Vitas 3 1 professor, Faculty of Philology, Belgrade, 2 assistant professor, Faculty of Mining and Geology, Belgrade 3 professor, Faculty of Mathematics, Belgrade E-mail: cvetana@matf.bg.ac.rs, ranka@rgf.bg.ac.rs, vitas@matf.bg.ac.rs Abstract In this paper ...
... and Multiflex for compound inflection (Savary, 2008). When our own system for automatic detection of compound inflectional properties (Krstev & Vitas, 2009) asked for yet another description of morphological features, we realized the necessity of producing a comprehensive description that could ...Cvetana Krstev, Ranka Stanković, Vitas Duško. "A Description of Morphological Features of Serbian: a Revision using Feature System Declaration" in Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2010, Valetta, Malta : European Language Resources Association (2010)
-
The Effects of Multi-Word Tagging on Text Disambiguation
Utvić Miloš, Obradović Ivan, Krstev Cvetana, Vitas Duško. "The Effects of Multi-Word Tagging on Text Disambiguation" in Proceedings of the 29th International Conference on Lexis and Grammar, LGC 2010, September 2010, Belgrade, Serbia, D. Vitas and C. Krstev (eds.), Belgrade:Faculty of Mathematics, University of Belgrade (2010): 333-342
-
The Nooj System as Module within an Integrated Language Processing Environment
... Processing Environment Ranka Stanković, Duško Vitas, Cvetana Krstev Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Nooj System as Module within an Integrated Language Processing Environment | Ranka Stanković, Duško Vitas, Cvetana Krstev | Proceedings of the ...
... rgf.bg.ac.rs The NooJ system as module within an integrated language processing environment Ranka Stanković, ranka@rgf.bg.ac.yu Duško Vitas, vitas@matf.bg.ac.yu Cvetana Krstev, cvetena@matf.bg.ac.yu 1. Introduction In this paper we describe the main structure and possible applications ...
... Pavlović-Lažetić G., Vitas D., Obradović I.: Using Textual and Lexical Resources in Developing Serbian Wordnet. Romanian Journal of Information Science and Technology, Romanian Academy, Publishing House of the Romanian Academy, vol. 7, No. 1-2, pp. 147- 161, (2004) Krstev C., Vitas D., Stanković ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
A System for Named Entity Recognition Based on Local Grammars
Krstev Cvetana, Obradović Ivan, Utvić Miloš, Vitas Duško. "A System for Named Entity Recognition Based on Local Grammars" in Journal of Logic and Computation 24 no. 2, :Oxford University Press (2014): 473-489. https://doi.org/10.1093/logcom/exs079
-
E-Dictionaries and Finite-State Automata for the Recognition of Named Entities
Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš. "E-Dictionaries and Finite-State Automata for the Recognition of Named Entities" in Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, July 2010, Blois, France, A. Maletti and M. Constant (eds.), :Association for Computational Linguistics (2011): 48-56
-
Белешка о дигитализацији речника
У раду ће се анализирати ограничења која проистичу из линеарног процеса традиционалне израде речника на примеру Речника САНУ. Начин да се превазиђу ова ограничења се састоји у формирању електронске лексикографске базе која не представља само пуку дигиталну транскрипцију папирног издања речника. Посебно се указује на чињеницу да текст речника може представљати корпус и приказују се одабрани примери анализе таквог корпуса формираног из текстове 1. и 19. тома Речника САНУ.... зоре. (стр. 60) Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић60 ЛИТЕРАТУРА Витас 2007: Душко Витас, „О проблему не(пре)познате речи у обради текс- това на српском језику”, зборник матице српске за филологију и линг- вистику, L, 111–120. Витас/Крстев 2015: Душко Витас и Цветана Крстев, „Нацрт ...
... 2023-10-14 04:07:54 Белешка о дигитализацији речника Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Белешка о дигитализацији речника | Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић | Српски језик ...
... „Нацрт за информа- тизовани речник српског језика”, научни састанак слависта у вукове дане, 44/3, 105–116 Витас 2016: Душко Витас, „Инфраструктура за изучавање и обраду српског језика”, у: зборник института за српски језик сану III (уредник Срето Танасић), Београд: Институт за српски језик САНУ ...Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић. "Белешка о дигитализацији речника" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch3
-
A Lexical Approach to Acronyms and their Definitions
In this paper we present a comprehensive approach to acronyms for Natural-Language Processing (NLP) of Serbian texts. The proposed procedure includes extraction of acronyms and their definitions that are usual Multi-Word Units (MWUs), shallow parsing of MWUs that enables MWU lemmatization and production of entries in morphological electronic dictionaries, both for MWU and acronyms, that are provided with grammatical, syntactic, semantic and domain information. This approach enables representation that reflects complex relations between acronyms and their definitions.... to Acronyms and their Definitions Cvetana Krstev, Duško Vitas, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Lexical Approach to Acronyms and their Definitions | Cvetana Krstev, Duško Vitas, Ranka Stanković | Proceedings of the 7th Language ...
... available at: www.dr.rgf.bg.ac.rs A Lexical Approach to Acronyms and their Definitions Cvetana Krstev∗, Duško Vitas∗, Ranka Stanković† University of Belgrade, Belgrade, Serbia, ∗(cvetana|vitas)@matf.bg.ac.rs, †ranka@rgf.bg.ac.rs Abstract In this paper we present a comprehensive approach to acronyms ...
... Poster Session. Krstev, C., R. Stanković, I. Obradović, D. Vitas, and M. Utvić, 2010. Automatic construction of a morpho- logical dictionary of multi-word units. In IceTAL, vol- ume 6233 of LNCS. Springer. Krstev, C. and D. Vitas, 2005. Corpus and Lexicon – Mu- tual Incompletness. In Proc. of ...Cvetana Krstev, Duško Vitas, Ranka Stanković. "A Lexical Approach to Acronyms and their Definitions" in Proceedings of the 7th Language & Technology Conference, November 27-29, 2015, Poznań, Poland, Springer (2015)
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...... Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] An Approach to Efficient Processing of Multi-Word Units | Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas | Computational Linguistics - Applications ...
... Belgrade, Serbia, e-mail: ranka@rgf.bg.ac.rs Duško Vitas University of Belgrade — Faculty of Mathematics, Studentski trg 16, 11000 Belgrade, Serbia, e-mail: vitas@rgf.bg.ac.rs 1 2 Cvetana Krstev, Ivan Obradović, Ranka Stanković, and Duško Vitas 1 Introduction Morphological electronic dictionaries ...
... (2008) 7. Krstev, C., Stanković, R., Obradović, I., Vitas, D., Utvić, M.: Automatic Construction of a Morphological Dictionary of Multi-Word Units. In: IceTAL, pp. 226–237. Springer, Reykavik, Iceland (2010) 8. Krstev, C., Stanković, R., Vitas, D., Koeva, S.: E-Connecting Balkan Languages. In: Proc ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
Distribution of canonical syllable types in Serbian
Obradović Ivan, Obuljen Aljoša, Vitas Duško, Krstev Cvetana, Radulović Vanja. "Distribution of canonical syllable types in Serbian" in Text and Language, Structures · Functions · Interrelations. Quantitative Perspectives, P. Grzybek, E. Kelih, J. Mačutek (eds.), Wien:Praesens Verlag (2010): 145-157
-
E-Connecting Balkan Languages
In this paper we present a versatile language processing tool that can be successfully used for many Balkan languages. This tool relies for its work on several sophisticated textual and lexical resources that were developed for most of Balkan languages. These resources are based on several de facto standards in natural language processing.... ng Balkan Languages Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] E-Connecting Balkan Languages | Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva | Proceedings of the Workshop Workshop ...
... .bg.ac.rs Ranka Stanković Faculty of Mining and Geology University of Belgrade ranka@rgf.bg.ac.rs Duško Vitas Faculty of Mathematics University of Belgrade vitas@matf.bg.ac.rs Svetla Koeva Dep. of Computational Linguistics Institute for Bulgarian svetla@dcl.bas.bg ...
... Krstev, R. Stanković, D. Vitas, I. Obradović. WS4LR: A Workstation for Lexical Resources, Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, May 2006, pp. 1692- 1697, 2006. [10] C. Krstev, R. Stanković, D. Vitas, I. Obradović, The Usage ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva. "E-Connecting Balkan Languages" in Proceedings of the Workshop Workshop on Multilingual resources, technologies and evaluation for Central and Eastern European Languages, 17 September 2009, eds. C. Vertan, S. Piperidis, E. Paskaleva and Milena Slavcheva, Borovets, Bulgaria : Association for Computational Linguistics Stroudsburg, PA, USA (2009)
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval... Stanković Ranka, Vitas Duško, Obradović Ivan Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines | Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan | ...
... , Duško Vitas 3 , Ivan Obradović4 1 professor, Faculty of Philology, Belgrade, 2 assistant, Faculty of Mining and Geology, Belgrade 3 professor, Faculty of Mathematics, Belgrade, 4 professor, Faculty of Mining and Geology, Belgrade E-mail: cvetana@matf.bg.ac.yu, ranka@rgf.bg.ac.yu, vitas@matf ...
... Krstev, C., Stanković, R., Vitas, D., Obradović, I. (2006). WS4LR: A Workstation for Lexical Resources, Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, May 2006, pp. 1692-1697 Krstev, C., Vitas, D., Maurel, D., Tran, M. (2005) ...Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008)
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansion... Obradović, Cvetana Krstev, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Production of morphological dictionaries of multi-word units using a multipurpose tool | Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas | Proceedings of the Co ...
... of Philology, Studentski trg 3, 11000 Belgrade, Serbia Email: cvetana@matf.bg.ac.rs Duško Vitas University of Belgrade — Faculty of Mathematics, Studentski trg 16, 11000 Belgrade, Serbia Email: vitas@matf.bg.ac.rs Abstract—In this paper we outline the use of the multipurpose software tool LeXimir ...
... pp. 111–141. [10] C. Krstev, R. Stanković, D. Vitas, and I. Obradović, “The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines,” in 6th LREC, Marrakech, Marocco, 2008. [11] C. Krstev, R. Stanković, D. Vitas, and S. Koeva, “E-Connecting Balkan Languages,” ...Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
On the compatibility of lexical resources for NooJ
Lexical resources for many languages are provided for the NooJ linguistic development environment. Meta-data descriptions of morphosyntactic and semantic properties of these languages and their resources are a mandatory part of each language module. In this paper we analyze how well the meta-data actually describe resources for a chosen subset of languages and to what extent are they compatible across languages to support multilingual processing. We show that there is place for improvement in both directions.... NooJ Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] On the compatibility of lexical resources for NooJ | Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović | Automatic ...
... Proceedings of the 2011 International NooJ Conference 1 ON THE COMPATIBILITY OF LEXICAL RESOURCES FOR NOOJ RANKA STANKOVIĆ, MILOŠ UTVIĆ, DUŠKO VITAS, CVETANA KRSTEV AND IVAN OBRADOVIĆ Abstract Lexical resources for many languages are provided for the NooJ linguistic development environment ...
... Petkevič, V., Simov, K., Tadić, M., Vitas, D., 2003, “The MULTEXT -East Morphosyntactic Specifications for Slavic Languages”, in Proc. of the Workshop on Morphological Processing of Slavic Languages: EACL 2003, Budapest, Hungary, eds. T. Erjavec and D. Vitas, pp. 25-32 Obradović, I. Stanković ...Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović. "On the compatibility of lexical resources for NooJ" in Automatic Processing of Various Levels of Linguistic Phenomena: Selected Papers from the 2011 International Nooj Conference, Cambridge Scholars Publishing (2012): 96-108
-
WS4LR - a Worksation for Lexical Resources
... Lexical Resources Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] WS4LR - a Worksation for Lexical Resources | Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović | Proceedings of the Fifth In ...
... Stanković2 , Duško Vitas 3 and Ivan Obradović2 1Faculty of Philology, Studentski trg 3, CS-11000 Belgrade, 2Faculty of Mining and Geology, Đušina 7, CS-11000 Belgrade, 3Faculty of Mathematics, Studentski trg 16, CS-11000 Belgrade E-mail: cvetana@matf.bg.ac.yu, ranka@rgf.bg.ac.yu, vitas@matf.bg.ac ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006)
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...... Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Dictionary of the Serbian Academy: from the Text to the Lexical Database | Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo ...
... Database Ranka Stanković1, Rada Stijović2, Duško Vitas1, Cvetana Krstev1, Olga Sabo2 1University of Belgrade, 2Institute for Serbian Language, Serbian Academy of Sciences and Arts E-mail: ranka.stankovic@rgf.bg.ac.rs, rada.stijovic@isj.sanu.ac.rs, vitas@matf.bg.ac.rs, cvetana@matf.bg.ac.rs, ol ...
... of the dictionary should be modernized many years ago (Sabo & Vitas 1989). However, only in recent years were these ideas revitalized, and various possibilities of updating the work on this vocabulary have since been considered (Vitas & Krstev, 2015; Ivanović et al. 2016). The digitization (which ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the dataobtainedfromSrpKorandlocalgrammarsassistsinmakingadecisionbetween several candidates in cases of ambiguity. The evaluation results reveal that,dependingonthetext,accuracyrangesfrom95.03%to99.36%,whilethe precision (average 98.93%) is always higher than the recall (average 94.94%).... n in Serbian Cvetana Krstev, Ranka Stanković, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Knowledge and Rule-Based Diacritic Restoration in Serbian | Cvetana Krstev, Ranka Stanković, Duško Vitas | Proceedings of the Third International Conference ...Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
-
Combining Heterogeneous Lexical Resources
... Lexical Resources Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Combining Heterogeneous Lexical Resources | Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović ...
... Heterogeneous Lexical Resources Cvetana Krstev, professor, Faculty of Philology, Belgrade, cvetana@matf.bg.ac.yu Duško Vitas, professor, Faculty of Mathematics, Belgrade, vitas@matf.bg.ac.yu Ranka Stankoviæ , assistant, Faculty of Mining and Geology, Ðušina 7, Belgrade, ranka@rgf.bg.ac.yu Ivan ...
... wordnet (SWN) is being developed in the scope of the Balkanet project (Stamou, 2002) following the model adopted for the EuroWordnet project (Vitas, 2003). The current size of SWN is 6290 synsets with 10583 literal string-sense pairs. Since the core of wordnets developed for Balkan languages ...Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić. "Combining Heterogeneous Lexical Resources" in Proceedings of the Fourth Interantional Conference on Language Resources and Evaluation, Lisabon, Portugal , May 2004, vol. 4, ELRA - European Language Resources Association (2004)
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)