Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

489 items

SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch

Ranka Stanković, Mihailo Škorić, Petar Popović (2022)

Serbian ELTeC collection (100 novels and extended) developed within COST action CA16204 Distant Reading for European Literary History comprises at this moment 111 novels published in the period 1840-1920. Such a valuable resource is and will be used for various lexical and linguistic research, by using different tools and methodologies. In this paper, three platforms on which these novels are published will be presented: “Udaljeno ˇcitanje”, Aurora and Sketch Engine.

удаљено читање, књижевни корпус, дигитална библиотека, конкорданце, ELTeC

Ranka Stanković, Mihailo Škorić, Petar Popović. "SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch" in Infotheca, Faculty of Philology, University of Belgrade (2022). https://doi.org/10.18485/infotheca.2021.21.2.7
It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map

Olja Perišić, Ranka Stanković, Milica Ikonić Nešić, Mihailo Škorić (2023)

The paper will present the results of the project `“It-Sr-NER: Web services for named entities recognition, linking and mapping,” in which teams from the University of Turin and the Society for Language Resources and Technologies JeRTeh participated, and whose goal was the development of the It-Sr-NER web service for named entity annotations in the text and displaying them on the map. Named entities in these services are names of persons, places, organizations, demonyms (ethnicities), events and works of art.

General Engineering

Olja Perišić, Ranka Stanković, Milica Ikonić Nešić, Mihailo Škorić. "It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map" in Infotheca, Belgrade : Faculty of Philology, University of Belgrade (2023). https://doi.org/10.18485/infotheca.2023.23.1.3
Нове технологије за оживљавање старих текстова

Цветана Крстев, Ранка Станковић, Бранислава Шандрих Тодоровић, Милица Иконић Нешић (2023)

удаљено читање, књижевни корпус, обрада српског језика, анотација врстом речи, лематизација, именовани ентитети

Цветана Крстев, Ранка Станковић, Бранислава Шандрих Тодоровић, Милица Иконић Нешић. "Нове технологије за оживљавање старих текстова" in Зборник радова Међународне научне конференције Дигитална хуманистика и словенско културно наслеђе II, Београд, 28-29 јуни 2021., Београд : Савез славистичких друштава Србије (2023)
Fourth Summer Datathon on Linguistic Linked Open Data

Tijana Radović, Ranka Stanković (2023)

The 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD-22) was held in Spain, in Cersedilla near Madrid, in May 2022, and organized by the COST Action NexusLinguarum. The school gathered interested researchers, academics, students who wanted to acquire and/or expand their knowledge in the field of linguistic linked data science. During the school, a spectrum of topics from the field of linked data was presented, from various ontologies, through document integration, annotation and natural language text processing tools ...

linguistic linked open data, sentiment analysis, linked data, RDF

Tijana Radović, Ranka Stanković. "Fourth Summer Datathon on Linguistic Linked Open Data" in Infotheca, Faculty of Philology, University of Belgrade (2023). https://doi.org/10.18485/infotheca.2023.23.1.6
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)

In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...

Corpus, Distant Reading, Digital Humanities, Linked Data, Named Entity Recognition, Text Analytics

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
Serbian ELTeC Sub-Collection in Wikidata

Milica Ikonić Nešić, Ranka Stanković, Biljana Rujević (2021)

This paper presents an example of integration of Wikidata with digital libraries and external systems, as well as some best practices for speeding up the process of data preparation and import to Wikidata, on the use case of SrpELTeC, Serbian subcollection of the ELTeC multilingual collection (European Literary Text Collection). After preliminary work on the manual Wikidata population with SrpELTeC novels, the goal was to automate the process of preparing and importing information, so different solutions were analysed and ...

Википодаци, удаљенои читање, књижевни корпус, повезивање именованих ентитета, ELTeC, SrpELTeC

Milica Ikonić Nešić, Ranka Stanković, Biljana Rujević. "Serbian ELTeC Sub-Collection in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.4
Веб-алат за управљање грађом Речника САНУ и анотација листића

Рада Стијовић, Ранка Станковић, Михаило Шкорић (2020)

Грађа на основу које се израђује Речник српскохрватског књижевног и народног језика САНУ, а која садржи материјал из преко 4.500 писаних извора и 300 рукописних збирки речи са подручја народних говора штокавског наречја, забележена је на око 5.000.000 листића. Богат лексички материјал, који обухвата књижевни и народни језик у протекла два века и на основу кога треба да се напише још најмање 15 томова Речника, пружа могућност и за разноврсна лингвистичка и ванлингвистичка истраживања. Из тог разлога се приступило ...

лексикографска грађа, листићи, лексикографски алат, дигитализација, анотација

Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
Parallel Bidirectionally Pretrained Taggers as Feature Generators

Ranka Stanković, Mihailo Škorić, Branislava Šandrih Todorović (2022)

In a setting where multiple automatic annotation approaches coexist and advance separately but none completely solve a specific problem, the key might be in their combination and integration. This paper outlines a scalable architecture for Part-of-Speech tagging using multiple standalone annotation systems as feature generators for a stacked classifier. It also explores automatic resource expansion via dataset augmentation and bidirectional training in order to increase the number of taggers and to maximize the impact of the composite system, which ...

анотација, обрада природног језика, издвајање обележја, композитне структуре, врста речи

Ranka Stanković, Mihailo Škorić, Branislava Šandrih Todorović. "Parallel Bidirectionally Pretrained Taggers as Feature Generators" in Applied Sciences, MDPI AG (2022). https://doi.org/10.3390/app12105028
Improvement of geodatabase queries within GeolISS

Ranka Stanković (2008)

... 03:36:37 Improvement of geodatabase queries within GeolISS Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Improvement of geodatabase queries within GeolISS | Ranka Stanković | Review of the National Center for Digitization | 2008 | | http://dr ...
... chard2/ https://www.seegrid.csiro.au/twiki/bin/view/CGIModel/GeoSciML http://www.isotc211.org/ http://www.esri.com/ Ranka Stanković 74 [6] Blagojević B.,Trivić B., Stanković R. Banjac N., (2008) “Short note about implementation of Geologic information system of Serbia” u časopisu Zapisnici Srpskog ...
... Krstev, C., Stanković, R., Vitas, D., Obradović, I. (2006). “WS4LR: A Workstation for Lexical Resources”. In Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, May 2006, pp. 1692–1697 [10] Krstev, C., Vitas D., Stanković R., Obradović ...
Ranka Stanković. "Improvement of geodatabase queries within GeolISS" in Review of the National Center for Digitization, Beograd : Faculty of Mathematics, Belgrade (2008)
Sentiment Analysis of Serbian Old Novels

Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović (2022)

In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...

sentiment lexicon, sentiment analysis, distant-reading, machine learning, old novels

Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
Annotation of the Serbian ELTeC Collection

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić (2021)

Ovaj rad predstavlja takozvano izdanje nivoa 2 kolekcije tekstova SrpELTeC razvijene u okviru aktivnosti Radne grupe 2 – Metode i alati COST akcije CA 16204 (Distant Reading for European Literary History) i njene specifikacije šeme. Izdanje nivoa 2 je nastavak izdanja nivoa 1, koje se koristi kao ulaz za morfosintaksičke i NER anotacije romana. Srpska obrada nivoa-2 je navedena kroz potrebne korake, uključujući metode i alate koji se koriste u tom procesu. Neki statistički podaci iz srpske kolekcije nivoa ...

udaljeno čitanje, literarni korpus, tagiranje, prepoznavanje imenovanih entiteta, lematizacija, ELTeC

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić. "Annotation of the Serbian ELTeC Collection" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.3
Transformer-Based Composite Language Models for Text Evaluation and Classification

Mihailo Škorić, Miloš Utvić, Ranka Stanković (2023)

Parallel natural language processing systems were previously successfully tested on the tasks of part-of-speech tagging and authorship attribution through mini-language modeling, for which they achieved significantly better results than independent methods in the cases of seven European languages. The aim of this paper is to present the advantages of using composite language models in the processing and evaluation of texts written in arbitrary highly inflective and morphology-rich natural language, particularly Serbian. A perplexity-based dataset, the main asset for the ...

General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Mihailo Škorić, Miloš Utvić, Ranka Stanković. "Transformer-Based Composite Language Models for Text Evaluation and Classification" in Mathematics, MDPI AG (2023). https://doi.org/10.3390/math11224660
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić (2022)

In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...

Wikidata, linked data, SPARQL, distant reading, literary corpus, named entity linking, ELTeC

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution

Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder (2022)

This paper explores the effectiveness of parallel stylometric document embeddings in solving the authorship attribution task by testing a novel approach on literary texts in 7 different languages, totaling in 7051 unique 10,000-token chunks from 700 PoS and lemma annotated documents. We used these documents to produce four document embedding models using Stylo R package (word-based, lemma-based, PoS-trigrams-based, and PoS-mask-based) and one document embedding model using mBERT for each of the seven languages. We created further derivations of these ...

General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder. "Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution" in Mathematics, MDPI AG (2022). https://doi.org/10.3390/math10050838
Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data

Ranka Stanković, Christian Chiarcos, Miloš Utvić, Olivera Kitanović (2023)

Овај рад описује студију случаја о генерисању повезаних података креираних на основу обечежених текстуалних корпуса коришћењем формата размене података у обради природних језика (NIF). Као основа за ово истраживање послужио је подскуп корпуса ELTeC, који се састоји од 900 романа из периода 1840-1920 за 9 европских језика. Верзија романа са коментарима, у такозваном TEI level-2 формату, трансформисана је у NIF, формат заснован на RDF/OWL који има за циљ постизање интероперабилности између алата за обраду природних језика, језичких ресурса и ...

повезани отворени подаци, корпус, SrpELTeC, NIF

Ranka Stanković, Christian Chiarcos, Miloš Utvić, Olivera Kitanović. "Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
Geologic Information System of Serbia

Branislav Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović (2011)

Geologic information system of Serbia (GeolISS) represents repository for digital archiving, query, retrieving, analysis and geologic data visualization. The GeolISS is implemented through ESRI ArcGIS technology, and is designed to operate as a personal geodatabase (MS Jet 4.0 Engine) and SDE enterprise geodatabase in MS SQL Server. The objective of GeolISS implementation is integration of existing geologic archives, data from published maps at different scales, newly acquired field data, as well as Web publishing of geologic information. Physical implementation ...

Geologic information system, Conceptual model, Logical model, Implementation, Geodatabase

... Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Geologic Information System of Serbia | Branislav Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović | ...
Branislav Blagojević, Branislav Trivić, Ranka Stanković, Nenad Banjac, Olivera Kitanović. "Geologic Information System of Serbia" in Proceedings of the 17th Meeting of the Association of European Geological Societies, 14.-18. september 2011., Beograd : Srpsko geološko društvo (2011)
Can the dynamics of a subducted slab account for the Upper Cretaceous magmatism in the Sava-Vardar Zone and Timok Magmatic Complex? A Numerical Modelling Approach

Nikola Stanković, Ana Mladenović, Dejan Prelević, Vesna Cvetkov, Vladica Cvetković (2024)

Nikola Stanković, Ana Mladenović, Dejan Prelević, Vesna Cvetkov, Vladica Cvetković. "Can the dynamics of a subducted slab account for the Upper Cretaceous magmatism in the Sava-Vardar Zone and Timok Magmatic Complex? A Numerical Modelling Approach" in 16th Alpine Workshop, European Geosciences Union (2024)
Corpus-based bilingual terminology extraction in the power engineering domain

Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev (2022)

Ovaj rad predstavlja resurse i alate koji se koriste za ekstrkciju i evaluaciju dvojezične, englesko-srpske terminologije u domenu energetike. Resursi se sastoje od postojeće opšte i domenske leksike i domenskog paralelnog korpusa; alati uključuju ekstraktore termina za oba jezika i alat za poravnavanje segmenata koji pripadaju korpusnim rečenicama. Sistem je testiran variranjem funkcije podudaranja koja utvrđuje prisustvo ekstrahovanog termina u poravnatom segmentu (odsečak), u rasponu od veoma labavog do strogog. Procena rezultata je pokazala da je preciznost izdvajanja termina ...

Library and Information Sciences, Communication, Language and Linguistics

Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev. "Corpus-based bilingual terminology extraction in the power engineering domain" in Terminology, John Benjamins Publishing Company (2022). https://doi.org/10.1075/term.20038.iva
Претрага корпуса заснована на употреби екстерних лексичких ресурса путем веб-сервиса

Милош Утвић, Ранка Станковић, Александра Томашевић, Михаило Шкорић, Биљана Лазић (2019)

У раду се разматра хибридни приступ претрази корпуса, илустрован на примеру алатки OCWB и NoSketch Engine, примењених на специјални корпус из области рударства (РудКор) и Корпус савременог српског језика (СрпКор). Разматрани приступ комбинује постојеће могућности алатки OCWB и NoSketch Engine, које своју претрагу заснивају на лингвистичкој анотацији корпуса, са новим могућностима претраге у виду консултовања екстерних језичких ресурса (морфолошки електронски речници српског језика и лексичка база података Српски ворднет). Хибридни приступ је реализован надоградњом вебсучеља која поменуте алатке користе ...

корпус, рударство, претраживање информација, проширивање упита, лексички ресурси, лексичке релације

... Process- ing, Brno : Masaryk University, 65–70. Станковић 2009: Ранка Станковић, модели експанзије упита над текстуел- ним ресурсима (необјављена докторска дисертација, Београд: Универзи- тет у Београду, Математички факултет). Станковић и др. 2017: Ranka Stanković, Cvetana Krstev, Ivan Obradović, Ol- ivera ...
... Милош Утвић, Ранка Станковић, Александра Томашевић, Михаило Шкорић, Биљана Лазић Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Претрага корпуса заснована на употреби екстерних лексичких ресурса путем веб-сервиса | Милош Утвић, Ранка Станковић, Александра Томашевић ...
... Computer Science, 10190, Cham: Springer, 162–185. DOI: 10.1007/978-3-319-59268-8_8, https:/doi.org/10.1007/978- 3-319-59268-8_8. Станковић и др. 2018: Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev, “Resource based WordNet augmentation and enrichment”, In: Proceedings ...
Милош Утвић, Ранка Станковић, Александра Томашевић, Михаило Шкорић, Биљана Лазић. "Претрага корпуса заснована на употреби екстерних лексичких ресурса путем веб-сервиса" in Научни састанак слависта у Вукове дане - Vol. 48/3 Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch12
An Italian-Serbian Sentence Aligned Parallel Literary Corpus

Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić (2023)

This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...

Aligned corpus, parallel corpus, Serbian, Italian, literature

Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388

Претрага

489 items

SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch cite

It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map cite

Нове технологије за оживљавање старих текстова cite

Fourth Summer Datathon on Linguistic Linked Open Data cite

Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection cite

Serbian ELTeC Sub-Collection in Wikidata cite

Веб-алат за управљање грађом Речника САНУ и анотација листића cite

Parallel Bidirectionally Pretrained Taggers as Feature Generators cite

Improvement of geodatabase queries within GeolISS cite

Sentiment Analysis of Serbian Old Novels cite

Annotation of the Serbian ELTeC Collection cite

Transformer-Based Composite Language Models for Text Evaluation and Classification cite

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back) cite

Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution cite

Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data cite

Geologic Information System of Serbia cite

Can the dynamics of a subducted slab account for the Upper Cretaceous magmatism in the Sava-Vardar Zone and Timok Magmatic Complex? A Numerical Modelling Approach cite

Corpus-based bilingual terminology extraction in the power engineering domain cite

Претрага корпуса заснована на употреби екстерних лексичких ресурса путем веб-сервиса cite

An Italian-Serbian Sentence Aligned Parallel Literary Corpus cite

SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch

It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map

Нове технологије за оживљавање старих текстова

Fourth Summer Datathon on Linguistic Linked Open Data

Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection

Serbian ELTeC Sub-Collection in Wikidata

Веб-алат за управљање грађом Речника САНУ и анотација листића

Parallel Bidirectionally Pretrained Taggers as Feature Generators

Improvement of geodatabase queries within GeolISS

Sentiment Analysis of Serbian Old Novels

Annotation of the Serbian ELTeC Collection

Transformer-Based Composite Language Models for Text Evaluation and Classification

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution

Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data

Geologic Information System of Serbia

Can the dynamics of a subducted slab account for the Upper Cretaceous magmatism in the Sava-Vardar Zone and Timok Magmatic Complex? A Numerical Modelling Approach

Corpus-based bilingual terminology extraction in the power engineering domain

Претрага корпуса заснована на употреби екстерних лексичких ресурса путем веб-сервиса

An Italian-Serbian Sentence Aligned Parallel Literary Corpus