Претрага
76 items
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...... Vitas, D. (2011), "Production of morphological dictionaries of multi-word units using a multipurpose tool", In: Proceedings of the Computational Linguistics-Applications Conference, October 17–19, 2011. Jachranka, Poland (pp. 77-84), K. Jassem, P. W. Fuglewicz, M. Piasecki and A. Przepiórkowski ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
A Lexical Approach to Acronyms and their Definitions
In this paper we present a comprehensive approach to acronyms for Natural-Language Processing (NLP) of Serbian texts. The proposed procedure includes extraction of acronyms and their definitions that are usual Multi-Word Units (MWUs), shallow parsing of MWUs that enables MWU lemmatization and production of entries in morphological electronic dictionaries, both for MWU and acronyms, that are provided with grammatical, syntactic, semantic and domain information. This approach enables representation that reflects complex relations between acronyms and their definitions.... In IceTAL, vol- ume 6233 of LNCS. Springer. Krstev, C. and D. Vitas, 2005. Corpus and Lexicon – Mu- tual Incompletness. In Proc. of the Corpus Linguistics Conference, Birmingham. Liberman, Mark Y and Kenneth W Church, 1992. Text analysis and word pronunciation in text-to-speech syn- thesis. Advances ...Cvetana Krstev, Duško Vitas, Ranka Stanković. "A Lexical Approach to Acronyms and their Definitions" in Proceedings of the 7th Language & Technology Conference, November 27-29, 2015, Poznań, Poland, Springer (2015)
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...... Ahačič, K., Ledinek, N., & Perdih, A. (2015). Fran: The Next Generation Slovenian Dictionary Portal. In Natural Language Processing, Corpus Linguistics, Lexicography. Eight International Conference Bratislava, Slovakia, pp. 21-22. Berg, D. L., Gonnet, G. H., & Tompa, F. W. (1988). The New Oxford ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
Белешка о дигитализацији речника
У раду ће се анализирати ограничења која проистичу из линеарног процеса традиционалне израде речника на примеру Речника САНУ. Начин да се превазиђу ова ограничења се састоји у формирању електронске лексикографске базе која не представља само пуку дигиталну транскрипцију папирног издања речника. Посебно се указује на чињеницу да текст речника може представљати корпус и приказују се одабрани примери анализе таквог корпуса формираног из текстове 1. и 19. тома Речника САНУ.... – from File System to lemon Based Lex- ical Database, John P. McCrae et al. (eds.) Proc. of the 11th LREC – W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL 2018), Miyazaki, Japan, ISBN 979-10-95546-19-1 Станковић и др. 2018б: Stanković, Ranka; Rada Stijović, Duško ...Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић. "Белешка о дигитализацији речника" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch3
-
Using technology for knowledge transfer between academia and enterprises
Ivan Obradović, Ranka Stanković (2014)... C., & Vitas, D. (2011). Production of morphological dictionaries of multi-word units using a multipurpose tool. In Proceedings of the Computational Linguistics-Applications Conference, CLA '11, 77-84. © Institute of Knowledge Asset Management - ISBN 978-88-96687-04-8 Proceedings of IFKAD - ...Ivan Obradović, Ranka Stanković. "Using technology for knowledge transfer between academia and enterprises" in Knowledge and Management Models for Sustainable Growth, Proc. of IFKAD 2014, 9th International Forum on Knowledge Asset Dynamics, 11-13 June 2013, Matera, Italy, Bari : IFKAD (2014)
-
From DELA Based Dictionary to Leximirka Lexical Database
Biljana Lazić, Mihailo Škorić (2020)In this paper, we will present an approach in transforming Serbian language Morphological dictionaries from a DELA text format to a lexical database dubbed Leximirka. Considering the benefits of storing data within a database when compared to storing them in textual documents, we will outline some of the functionality that the database has made possible. We will also show how hand-made rules that use category labels lexical entries are marked with can be used to link lexical entries. ...... Lexical Database”. In Proceedings of the 11th International Conference on Language Re- sources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), McCrae, John P., Chris- tian Chiarcos, Thierry Declerck, Jorge Gracia and Bettina Klimek. Paris, France: ...Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4
-
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
U ovom radu predstavljamo srpski književni korpus koji se razvija pod okriljem COST Akcije „Distant Reading for European Literary History” CA16204. Koristeći ovaj korpus romana napisanih pre više od jednog veka, razvili smo i učinili javno dostupnim Sistem za prepoznavanje imenovanih entiteta (NER) obučen da prepozna 7 različitih tipova imenovanih entiteta, sa konvolucionom neuronskom mrežom (CNN), koja ima F1 rezultat od ≈91% na test skupu podataka. Ovaj model je dalje ocenjen na posebnom skupu podataka za evaluaciju. Završavamo poređenje ...... An Annotated Dataset of Literary En- tities. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua- ge Technologies, volume 1, pages 2138–2144. Niels Dekker, Tobias Kuhn, and Marieke van Erp. 2019. Evaluating Named Entity ...Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković, Milica Ikonić Nešić. "Serbian NER&Beyond: The Archaic and the Modern Intertwinned" in Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, INCOMA Ltd. Shoumen, BULGARIA (2021). https://doi.org/10.26615/978-954-452-072-4_141
-
Named Entity Recognition for Distant Reading in ELTeC
Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos, Ranka Stanković (2020)Akcija COST „Udaljeno čitanje za evropsku književnu istoriju“, koja je počela 2017. godine, ima među svojim glavnim ciljevima stvaranje višejezične zbirke evropskih književnih tekstova (ELTeC) otvorenog koda. U ovom radu predstavljamo rad koji je obavljen na ručnom označavanju selekcije ELTeC kolekcije za imenovane entitete, kao i na proceni postojećih alata za prepoznavanje imenovanih entiteta u pogledu njihove sposobnosti da automatski urade takve anotacije. U poslednjem paragrafu se razmatraju zajedničke tačke između ove inicijative i CLARIN-a.... d D a ta se t o f L ite ra ry E n tit ie s . In Proceedings ofthe 2019 Conference ofthe NorthAmerican Chapter ofthe Associationfor Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), p a g e s 2 1 3 8 -2 1 4 4 , M in n e ap o lis , M in n e so ta , Ju n ...
... d T ex t A n n o ta tio n . In Proceedings ofthe Demonstrations at the 13th Conference ofthe European Chapter ofthe Association for Computational Linguistics, p a g e s 1 0 2 -1 0 7 , A v ig n o n , F ra n ce , A p ril. A sso c ia tio n fo r C o m p u ta tio n a l L in g u is tic s . h t tp ...
... R ., Q u ares m a , P., d a G ra c a V olpes N u n e s, M ., M a m ed e , N . J ., O liv e ira , C ., a n d D ias , M . C ., ed ito rs , Computational Processing ofthe Portuguese Language: 7thInternational Workshop, PROPOR2006. Itatiaia, Brazil, May 2006(PROPOR'2006), p ag e s 8 0 -8 9 . S ...Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos, Ranka Stanković. "Named Entity Recognition for Distant Reading in ELTeC" in CLARIN Annual Conference 2020, Oct 2020, Virtual Event, France, CLARIN (2020)
-
Vebran Web Services for Corpus Query Expansion
Ranka Stanković, Miloš Utvić (2020)U ovom radu se govori o razvoju veb usluga Vebran i njihovoj primeni u poboljšanju pretraživanja korpusa. Veb-servisi Vebran koriste se za konsultovanje spoljnih leksičkih izvora za srpski jezik (uglavnom elektronski morfološki rečnici i srpski Vordnet) i proširivanje korisničkih upita radi dobijanja relevantnijih rezultata iz srpskih korpusa.... CQP_Tutorial.pdf Hardie, Andrew. “CQPweb — Combining Power, Flexibility and Usability in a Corpus Analysis Tool”. International Journal of Corpus Linguistics Vol. 17, no. 3 (2012): 380–409 Kolonja, Ljiljana, Ranka Stanković, Ivan Obradović, Olivera Kitanović and Aleksandar Cvjetić. “Development of ...Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5
-
Building learning capacity by blending different sources of knowledge
... C., & Vitas, D. (2011). Production of morphological dictionaries of multi-word units using a multipurpose tool. In Proceedings of the Computational Linguistics-Applications Conference, CLA '11, 77-84. Tynjälä, P. (1999). Towards expert knowledge? A comparison between a constructivist and a t ...Ivan Obradović, Ranka Stanković, Olivera Kitanović, Dalibor Vorkapić. "Building learning capacity by blending different sources of knowledge" in International Journal of Learning and Intellectual Capital (2016). https://doi.org/10.1504/IJLIC.2016.075698
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... Stanković and Duško Vitas. “Knowledge and Rule- Based Diacritic Restoration in Serbian”. In Proceedings of the Third Inter- national Conference Computational Linguistics in Bulgaria (CLIB 2018), Koeva, Svetla. Sofia, Bulgaria: Institute for Bulgarian Language “Prof. Lyubomir Andreychin”, 41–51. Bulgarian Academy ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
-
Development of terminological resources for expert knowledge: a case study in mining
Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Aleksandar Cvjetić. "Development of terminological resources for expert knowledge: a case study in mining" in Knowledge Management Research & Practice, Palgrave Macmillan (2015). https://doi.org/10.1057/kmrp.2015.10
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
Developing Termbases for Expert Terminology under the TBX Standard
... Multi-Word Units Using a Multipurpose Tool. In K. Jassem, P. W. Fuglewicz, M. Piasecki, and A. Przepi rkowski, editors, Pro- ceedings of the Computational Linguistics-A pplications Conference, October 17-19, 2011. Jachranka, Poland, pages 77-84. Polish Information Processing Society, 2011. Ranka Stankovi¢ ...Ranka Stanković, Ivan Obradović, and Miloš Utvić. "Developing Termbases for Expert Terminology under the TBX Standard" in Natural Language Processing for Serbian - Resources and Applications, Belgrade : University of Belgrade, Faculty of Mathematics (2014)
-
Sistem poslovne inteligencije za upravljanje zaštitom na radu u rudarskoj industriji
Ljiljana R. Kolonja (2016-09-23)Postojeća organizacija zaštite na radu na površinskim kopovima uglja često nije umogućnosti da blagovremeno i tačno ukaže na nedostatke, koji mogu u bližoj ilidaljoj budućnosti da budu uzrok narušavanja bezbednosti radnika, ali i uzrokoštećenja materijalnih dobara rudnika. Nemogućnost donošenja pravovremenihodluka je moguće izbeći uvođenjem softverski podržanih sistema za praćenje ianalizu bezbednosti i zaštite na radu.Osnovni cilj ove doktorske disertacije je definisanje metodologije i razvoj modelasistema upravljanja zaštitom na radu, koji će omogućiti savremen, adekvatan isveobuhvatan način praćenja i analize ...... 393‐399. KRSTEV, C., OBRADOVIĆ, I., STANKOVIĆ, R. & VITAS, D. 2013. An approach to efficient processing of multi‐word units. Computational Linguistics. Springer. KRUCHTEN, P. 2004. The rational unified process: an introduction, Addison‐Wesley Professional. LARMAN, C. 2005. Applying UML ...
... 25, 132‐148. BADAMI, V. 2003. Payback on Business Intelligence. DM Review. BERNARDINO, J. & TERESO, M. 2013. Business Intelligence Tools. Computational Intelligence and Decision Making. Springer. BERRY, M. J. & LINOFF, G. 1997. Data mining techniques: for marketing, sales, and customer support ...Ljiljana R. Kolonja. "Sistem poslovne inteligencije za upravljanje zaštitom na radu u rudarskoj industriji" in Универзитет у Београду, Универзитет у Београду, Рударско-геолошки факултет (2016-09-23)
-
Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду
... Efficient Processing of Multi-Word Units", In: ADAM PRZEPIÓRKOWSKI, MACIEJ PIASECKI, KRZYSZTOF JASSEM, AND PIOTR FUGLEWICZ (Eds.), Computational Linguistics, Studies in Computational Intelligence, Springer Berlin Heidelberg, Volume 458, pp. 109–129. LILIĆ N., OBRADOVIĆ I., STANKOVIĆ R., 2013: “INVENTS: A ...
... Extracting Translational Equivalents from Aligned Texts. in: Methods and Applications of Quantitative Linguistics, Selected papers of the 8th International Conference on Quantitative Linguistics mailto:bih@yahoo.com mailto:bih@yahoo.com 4. СПИСАК ИЗАБРАНИХ НАУЧНИХ И СТРУЧНИХ РАДОВА НАСТАВНИКА И ...
... му је из области лингвистике, електронског учења, вештачке интелигенције и примењене математике. Члан је IQLA - International Quan- titative Linguistics Association и члан JeRTeh – Друштва за језичке ресурсе и технологије. Такође, члан је и Дру- штва математичара. Добитник је награде „Вук Кара- ...главни и одговорни уредник Душан Поломчић. Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду, Београд : Универзитет у Београду, Рударско-геолошки факултет, 2016