Претрага
849 items
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Serbian ELTeC Sub-Collection in Wikidata
This paper presents an example of integration of Wikidata with digital libraries and external systems, as well as some best practices for speeding up the process of data preparation and import to Wikidata, on the use case of SrpELTeC, Serbian subcollection of the ELTeC multilingual collection (European Literary Text Collection). After preliminary work on the manual Wikidata population with SrpELTeC novels, the goal was to automate the process of preparing and importing information, so different solutions were analysed and ...Milica Ikonić Nešić, Ranka Stanković, Biljana Rujević. "Serbian ELTeC Sub-Collection in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.4
-
Веб-алат за управљање грађом Речника САНУ и анотација листића
Грађа на основу које се израђује Речник српскохрватског књижевног и народног језика САНУ, а која садржи материјал из преко 4.500 писаних извора и 300 рукописних збирки речи са подручја народних говора штокавског наречја, забележена је на око 5.000.000 листића. Богат лексички материјал, који обухвата књижевни и народни језик у протекла два века и на основу кога треба да се напише још најмање 15 томова Речника, пружа могућност и за разноврсна лингвистичка и ванлингвистичка истраживања. Из тог разлога се приступило ...Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
-
Parallel Bidirectionally Pretrained Taggers as Feature Generators
In a setting where multiple automatic annotation approaches coexist and advance separately but none completely solve a specific problem, the key might be in their combination and integration. This paper outlines a scalable architecture for Part-of-Speech tagging using multiple standalone annotation systems as feature generators for a stacked classifier. It also explores automatic resource expansion via dataset augmentation and bidirectional training in order to increase the number of taggers and to maximize the impact of the composite system, which ...Ranka Stanković, Mihailo Škorić, Branislava Šandrih Todorović. "Parallel Bidirectionally Pretrained Taggers as Feature Generators" in Applied Sciences, MDPI AG (2022). https://doi.org/10.3390/app12105028
-
Sentiment Analysis of Serbian Old Novels
In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
-
Annotation of the Serbian ELTeC Collection
Ovaj rad predstavlja takozvano izdanje nivoa 2 kolekcije tekstova SrpELTeC razvijene u okviru aktivnosti Radne grupe 2 – Metode i alati COST akcije CA 16204 (Distant Reading for European Literary History) i njene specifikacije šeme. Izdanje nivoa 2 je nastavak izdanja nivoa 1, koje se koristi kao ulaz za morfosintaksičke i NER anotacije romana. Srpska obrada nivoa-2 je navedena kroz potrebne korake, uključujući metode i alate koji se koriste u tom procesu. Neki statistički podaci iz srpske kolekcije nivoa ...udaljeno čitanje, literarni korpus, tagiranje, prepoznavanje imenovanih entiteta, lematizacija, ELTeCRanka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić. "Annotation of the Serbian ELTeC Collection" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.3
-
Transformer-Based Composite Language Models for Text Evaluation and Classification
Parallel natural language processing systems were previously successfully tested on the tasks of part-of-speech tagging and authorship attribution through mini-language modeling, for which they achieved significantly better results than independent methods in the cases of seven European languages. The aim of this paper is to present the advantages of using composite language models in the processing and evaluation of texts written in arbitrary highly inflective and morphology-rich natural language, particularly Serbian. A perplexity-based dataset, the main asset for the ...Mihailo Škorić, Miloš Utvić, Ranka Stanković. "Transformer-Based Composite Language Models for Text Evaluation and Classification" in Mathematics, MDPI AG (2023). https://doi.org/10.3390/math11224660
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution
This paper explores the effectiveness of parallel stylometric document embeddings in solving the authorship attribution task by testing a novel approach on literary texts in 7 different languages, totaling in 7051 unique 10,000-token chunks from 700 PoS and lemma annotated documents. We used these documents to produce four document embedding models using Stylo R package (word-based, lemma-based, PoS-trigrams-based, and PoS-mask-based) and one document embedding model using mBERT for each of the seven languages. We created further derivations of these ...Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder. "Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution" in Mathematics, MDPI AG (2022). https://doi.org/10.3390/math10050838
-
Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data
Овај рад описује студију случаја о генерисању повезаних података креираних на основу обечежених текстуалних корпуса коришћењем формата размене података у обради природних језика (NIF). Као основа за ово истраживање послужио је подскуп корпуса ELTeC, који се састоји од 900 романа из периода 1840-1920 за 9 европских језика. Верзија романа са коментарима, у такозваном TEI level-2 формату, трансформисана је у NIF, формат заснован на RDF/OWL који има за циљ постизање интероперабилности између алата за обраду природних језика, језичких ресурса и ...Ranka Stanković, Christian Chiarcos, Miloš Utvić, Olivera Kitanović. "Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
-
Terenska nastava iz geofizike: Arheološki lokaliteti na području Trstenika
... Касноантичке виле рустике у Србији, докторска дисертација, Београд, Филозофски факултет, Универзитет у Београду,305. Стокић, М., Петровић, Б., Спасојевић, И., Станковић, Н., Ђанковић, Н., Марковић, А., 2017. Елаборат из предмета: теренска настава из геофизике 1 и 2 - Резултати геофизичких истраживања ...
... Slika 2. Plan i rezultati magnetometrijskih istraživanja na lokalitetu Šljivik - Stragari, a) poligoni merenja, b) karta vertikalnog gradijenta ZMP, c) prikaz filtrirane karte pod b) u formi sen- čenog reljefa, d) pozitivna arheološka sondа (Марковић и др., 2017; Вучковић и др., 2017) 34 C u r ...
... groba, ve- Slika 3. Plan i rezultati geofizičkih istraživanja na lokalitetu Đurovača, a) trase elektro- metrijskih i seizmometrijskih ispitivanja, b) elektrometrijski presek (gore) i seizmometrijski profil (dole), c) pozitivna arheološka sonda, d) izgled savremenog crkvišta, preuređenog od strane ...Dragana Đurić, Jelena Vukčević, Dejan Vučković, Ivana Vasiljević, Vesna Cvetkov . "Terenska nastava iz geofizike: Arheološki lokaliteti na području Trstenika" in Aktuelna interdisciplinarana istraživanja tehnologije u arheologiji jugoistočne Evrope: zbornik radova / Prvi skup Sekcije za arheometriju, arheotehnologiju, geoarheologiju i eksperimentalnu arheologiju Srpskog arheološkog društva, 28.02.2020., Beograd, Beograd : Srpsko arheološko društvo (2020)
-
Corpus-based bilingual terminology extraction in the power engineering domain
Ovaj rad predstavlja resurse i alate koji se koriste za ekstrkciju i evaluaciju dvojezične, englesko-srpske terminologije u domenu energetike. Resursi se sastoje od postojeće opšte i domenske leksike i domenskog paralelnog korpusa; alati uključuju ekstraktore termina za oba jezika i alat za poravnavanje segmenata koji pripadaju korpusnim rečenicama. Sistem je testiran variranjem funkcije podudaranja koja utvrđuje prisustvo ekstrahovanog termina u poravnatom segmentu (odsečak), u rasponu od veoma labavog do strogog. Procena rezultata je pokazala da je preciznost izdvajanja termina ...Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev. "Corpus-based bilingual terminology extraction in the power engineering domain" in Terminology, John Benjamins Publishing Company (2022). https://doi.org/10.1075/term.20038.iva
-
Football terminology: compilation and transformation into OntoLex-Lemon resource
У овом раду представља се пројекат који је у развоју, креирање првог дигиталног фудбалског речника на српском језику, као и да демонстрација примене модела OntoLex и љегових модула. OntoLex-FrAC модул укључује информације о учесталости и примерима употребе екстрахованих из корпуса. У овом случају, креиран је корпус за специфичан домен под називом СрФудКо, који садржи чланке вести о фудбалу на српском језику. Вишечлани термини аутоматски су екстраховани из српског корпуса, а затим ручно евалуирани и класификовани као спортски или ...Jelena Lazarević, Ranka Stanković, Mihailo Škorić, Biljana Rujević. "Football terminology: compilation and transformation into OntoLex-Lemon resource" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
-
Geologic Information System of Serbia
Geologic information system of Serbia (GeolISS) represents repository for digital archiving, query, retrieving, analysis and geologic data visualization. The GeolISS is implemented through ESRI ArcGIS technology, and is designed to operate as a personal geodatabase (MS Jet 4.0 Engine) and SDE enterprise geodatabase in MS SQL Server. The objective of GeolISS implementation is integration of existing geologic archives, data from published maps at different scales, newly acquired field data, as well as Web publishing of geologic information. Physical implementation ...... Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Geologic Information System of Serbia | Branislav Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović | ...Branislav Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović. "Geologic Information System of Serbia" in Proceedings of the 17th Meeting of the Association of European Geological Societies, 14.-18. september 2011., Beograd : Srpsko geološko društvo (2011)
-
Ventex: an expert system for mine ventilation systems analysis
roblems relating to mine ventilation systems analysis are usually of a very complex nature. They involve the estimation of numerous interdependent parameters pertaining to the network status, ventilation status, ventilation system stability, air current losses, climate conditions, gas state, fire risk and dangerous dust pollution risk. The solution to these problems relies, to a great extent, on numerical methods. With the development of computer technology these methods were incorporated as numerical routes in software packages, such as the mine ...Nikola Lilić, Ranka Stanković, Ivan Obradović. "Ventex: an expert system for mine ventilation systems analysis" in Mining Technology, United Kingdom (1997)
-
Coupling of artificial intelligence methods in the development of hybrid intelligent systems
In this paper we present an approach which couples various artificial intelligence (AI) methods in the solution of complex problems that cannot adequately be solved by a single AI method. We argue that the resulting, hybrid intelligent systems (HIS) can be successfully implemented with the use of available AI software libraries. Different coupling methods are analyzed and a classification of hybrid systems based on the chosen method is given. Two case studies of hybrid systems used in mining engineering ...hibridni inteligentni sistemi, spregnuti sistemi, metode veštačke inteligencije, rudarske primene veštačke inteligencije... hybrid intelligent systems Ranka Stanković, Ivan Obradović, Nikola Lilić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Coupling of artificial intelligence methods in the development of hybrid intelligent systems | Ranka Stanković, Ivan Obradović, Nikola Lilić ...
... Intelligent Hybrid Systems, John Wiley & Sons. [2] Lilić N., Obradović I., Stanković R., (1997), “Ventex: An Expert System for Mine Ventilation Systems Analysis“, Mining Technology, Vol. 79, No 915, pp. 295-302. [3] Lilić N., Stanković R., Obradović I., (1998), “An Outline of a Hybrid Intelligent System ...
... ASRTP’98, High Tatras, Slovak Republic, pp. 658-661. [4] Lilić N., Stanković R., Obradović I., (1999), “Coupling intelligent methods in hybrid system for air pollution prediction“, InfoJISA, vol.2-3, pp. 41-43. [5] Lilić N., Stanković R., Obradović I., (2000), Hibridni sistem za planiranje i analizu ...Ranka Stanković, Ivan Obradović, Nikola Lilić. "Coupling of artificial intelligence methods in the development of hybrid intelligent systems" in X Kongres Matematičara, Matematički fakultet, Beograd (2001)
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Electronic Dictionaries - from File System to lemon Based Lexical Database
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...... Based Lexical Database Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Electronic Dictionaries - from File System to lemon Based Lexical Database | Ranka Stanković, Cvetana Krstev, Biljana Lazić ...
... the type of the relation, using some simple string matching and replacement, and the newly constructed lemmas had to (a) exist in dictionaries; and (b) have an inverse marker. For instance, verbs afirmisati and afirmirati are two vari- ants (the first one being preferred today in Serbian) of the same ...
... A declarative model for the lexicon- ontology interface. Web Semantics: Science, Services and Agents on the World Wide Web, 9(1):29–51. Courtois, B. and Silberztein, M. (1990). Dictionnaires électroniques du français, volume 87 of Langue français. Larousse, Paris. Farrar, S. and Langendoen, ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
Metodologija utvrđivanja i procena tehno-ekonomskih faktora i parametara za površinsku eksploataciju u elaboratima rudnih rezervi nemetaličnih mineralnih sirovina
Zoran Stanković (1986)Površinska eksploatacija, tehno-ekonomski fakzori, elaborati, rudne rezerve, nemetalične mineralne sirovineZoran Stanković. Metodologija utvrđivanja i procena tehno-ekonomskih faktora i parametara za površinsku eksploataciju u elaboratima rudnih rezervi nemetaličnih mineralnih sirovina, Beograd:Rudarsko-geološki fakultet, 1986
-
Нумеричке симулације процеса конвекције у Земљином омотачу
Станковић Никола, Цветков Весна, Цветковић Владица. "Нумеричке симулације процеса конвекције у Земљином омотачу" in Српски геолошки конгрес, Врњачка Бања, 17‐20. мај 2020 no. 2, Врњачка Бања:Српско геолошко друштво (2018): 726-731