Претрага
98 items
-
WS4LR - a Worksation for Lexical Resources
... forms their possible grammatical categories. The DELAS dictionary and the set of transducers describing inflectional properties are used to produce the morphological dictionary of word forms, the so called DELAF. All the entries in this dictionary have the following form: form,lemma[:categories]* ...
... using this type of dictionaries, but offer no facilities for dictionary development and management. WS4LR enables manipulation of dictionaries both of lemmas and of inflected forms. In LADL format, all the entries in the dictionary of simple word lemmas, the so called DELAS, have the following ...
... Loria: http://led.loria.fr/collab.php#projet45 pd WS4LR moduls WSLR moduls + CONVERSION + DICTIONARY MANAGMENT + WORDNET DEVELOPMENT + EXPLOITATION OF ALIGNED TEXTS (from Use Case View) DICTIONARY MANAGMENT + Simple words manipulation + Compound words management + Nooj dictionaries management ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006)
-
On the compatibility of lexical resources for NooJ
Lexical resources for many languages are provided for the NooJ linguistic development environment. Meta-data descriptions of morphosyntactic and semantic properties of these languages and their resources are a mandatory part of each language module. In this paper we analyze how well the meta-data actually describe resources for a chosen subset of languages and to what extent are they compatible across languages to support multilingual processing. We show that there is place for improvement in both directions.... differences often exist between semantic codes in metadata (in the *.def file) and those in the text dictionary. For example, in Croatian, the semantic code cons appears in the text dictionary (hraniti,V+FLX=BRANITI+Sem=cons+Prelaz=pov), although it does not exist in the *.def file. Conversely ...
... texts (Table 5). On the other hand, inhabitants of states and cities are not marked as proper names in the French dictionary (e.g. Anglais,N+z1+m+s) as opposed to the Serbian dictionary (e.g. Englez,N+m+s+1+v+Hum+NProp+Top+Inh). Another interesting illustration is the proper name Londres, which ...
... official NooJ resources web page2. The next section offers a comparison of annotation systems used in NooJ resources based on an analysis of the Dictionary Properties’ Definition files. The section that follows outlines the results of lexical analysis of the application of NooJ resources to aligned ...Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović. "On the compatibility of lexical resources for NooJ" in Automatic Processing of Various Levels of Linguistic Phenomena: Selected Papers from the 2011 International Nooj Conference, Cambridge Scholars Publishing (2012): 96-108
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... Gonzalo, J., and Verdejo, F. (2001). Monolingual and bilingual dictionary approaches to the enrichment of the spanish wordnet with adjectives. In Carnegie Mellon University, Pittsburgh. Fellbaum, C., Ed. (1998). WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press. Horák, A. and Smrž ...
... Sanskrit English dictionary is presented. The dictionary entries were automatically extracted and linked to PWN. In (Simões et al., 2016) an approach is given to enrichment of Portuguese WordNet with synonyms available in a standard monolingual Portuguese dictionary. Dictionary word senses were ...
... bilingual dictionary of English and the target language. There are, however, several other approaches. In (Chugur et al., 2001) a monolingual and a bilingual Spanish-English dictionary were used for the enrichment of the Spanish WordNet with adjective synsets, where the bilingual dictionary was used ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...... Bibliša also uses the “Dictionary of librarianship: English-Serbian and Serbian-English” that is available online at http://btr.nb.rs. The production of this dictionary started in 2001 at the National Library of Serbia's, with the aim of producing a domain specific dictionary in various formats ...
... international project “Multilingual dictionary of library terminology” that started in the 1990s, with the aim of collecting and presenting the librarianship terminology for 16 European languages [Kovačević et al., 2004]. The online version of the Dictionary currently contains 23 400 terms – ...
... of the dictionary, which we transformed into a MS SQL Server 2008 relational database. Due to the complexity of the Excel format the transformation turned out to be a rather complicated procedure. Nevertheless, we managed to preserve all important features of the original dictionary: the relations ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... (Courtois, 1990). E-dictionaries in the same for- mat have been produced for many other lan- guages. In dictionary of lemmas (DELAS) each lem- ma is described in full detail so that a dictionary of forms containing all necessary grammatical information (DELAF) can be generated from it, and subsequently ...
... text file from the chosen domain using Unitex. 2. Analysis of unrecognized words as the most probable source of terminology and expand- ing the dictionary of simple words: 2.1 Retrieval of unrecognized words; 2.2 Manual filtering, preparation of a list of extracted terms in canonical forms (for ...
... some grammatical categories (e.g. adding the gender for the nouns); 2.3 Automatic prediction of the inflectional class and the production of dictionary en- try in DELA format (detailed description of the algorithm is given in section 3); 2.4 Compiling the dictionaries of newly ac- quired terms ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Wordnet Development Using a Multifunctional Tool
Ivan Obradović, Ranka Stanković (2007)In this paper we present a multifunctional tool for manipulating heterogeneous language resources. The tool handles electronic dictionaries, wordnets and aligned texts, and provides for their synchronous use in various tasks. We focus here on the description of the possibilities this tool offers in the development of wordnets. Besides the wordnet module which enables parallel handling of two wordnets, other modules, such as the module for morphological dictionaries and the module for aligned texts, as well as available finite ...... wordnets Figure 1. The UML diagram showing WS4LR modules 3.3 Dictionary management The electronic dictionaries that WS4LR manipulates are monolingual morphological dictionaries, but also a bilingual word list and a multilingual dictionary of proper names. However, the main task of this module is ...
... can be very useful in performing this task. pd WS4LR moduls WSLR moduls + CONVERSION + DICTIONARY MANAGMENT + WORDNET DEVELOPMENT + EXPLOITATION OF ALIGNED TEXTS (from Use Case View) DICTIONARY MANAGMENT + Simple words manipulation + Compound words management + Nooj dictionaries management ...
... bilingual and multilingual dictionaries in wordnet development, we will now briefly describe the basic features of the dictionary management module. The lemma in a morphological dictionary of simple words has the following format: lemma.Knnn [+SinSem]* where lemma is the word form used in traditional ...Ivan Obradović, Ranka Stanković. "Wordnet Development Using a Multifunctional Tool" in Proceedings of the International Workshop Computer Aided Language Processing (CALP) '2007, Borovets, Bulgaria, September 2007, - (2007)
-
Softverski alati za korišćenje resursa za srpski jezik
Ivan Obradović, Ranka Stanković (2008)... Italian, Spanish, Norwegian, Arabic, German, Polish, and Bulgarian. Entries in the morphological dictionary of simple word, named DELAS (Dictionnaire électronique des mots simples – electronic dictionary of simple words), are in the following form: lema.Knnn [+SinSem]* where lema in general stands ...
... of the word. The generated inflectional forms of the word are stored in the morphological dictionary of simple word forms called DELAF (Dictionnaire électronique des formes fléchies – electronic dictionary of word forms), where the main format of data is the fol- lowing: oblik,lema[:kategorije]* ...
... of the morphological dictionary of simple words is already significant, but it is nevertheless be- ing continually developed, whereas the scope of the dictionary of compound words is at the moment more modest, which focused further dictionary development on this dictionary. The chosen format for ...Ivan Obradović, Ranka Stanković. "Softverski alati za korišćenje resursa za srpski jezik" in INFOteka: časopis za informatiku i bibliotekarstvo, Belgrade, Serbia : Zajednica biblioteka univerziteta u Srbiji (2008)
-
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++
Branislava Šandrih, Ranka Stanković (2020)U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...... match if they are represented by the same set. For example, if there is one “dictionary words” chunk and another “words from dictionary” chunk, their corresponding set representations are {dictionary, words} and {words, dictionary}, respectively (‘from’ should be discarded as functional word). Since these ...
... category codes in the Serbian dictionary are a/b/c, for the positive/comparative/superlative forms. The Unitex English dictionary does not have a code for the positive, while the codes for the comparative and superlative are C and S, respectively. The second English dictionary followed the MULTEXT-EAST ...
... general-purpose dictionary, similarly as in (Tsvetkov and Wintner, 2010). We compared different configurations for the extraction of domain ter- minology on both, source and target, sides. For the source side, we compare two cases. In the first case, we use an existing bilingual domain dictionary, similarly ...Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
-
Building Terminological Resources in an e-Learning Environment
... relation between the dictionary and the content of the database. Some other controlled dictionaries that can be derived from RudOnto are the Geostatistics dictionary, Mine safety protection dictionary, Mineral resource exploitation dictionary, Petroleum exploitation dictionary, but also dictionaries ...
... available multilingual terminological resource in printed form involving Serbian is a Dictionary of mining in five languages (Serbian, English, French, German and Russian), published by the Mining Institute [9]. This dictionary has been used for almost three decades as the main reference, especially for ...
... among other things, for production of controlled dictionaries. A controlled dictionary is a consistent collection of terms selected for a specific purpose, chosen by its author. For example, a controlled dictionary from RudOnto is used for labeling and annotation of cartographic content both ...Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Building Terminological Resources in an e-Learning Environment" in Proceedings of the Third International Conference on e-Learning, eLearning-2012, September 2012, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2012)
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... supports Serbian electronic dictionaries (Krstev, 2008) was analyzed and entries with one of the following semantic markers were selected: Aug (augmentative, 103), Pej (pejorative, 89), POG (derogatory, 41). The additional 602 items from srHurtLex were added to the list. The Dictionary of Serbian Language ...
... rules, and dictionary of abusive and derogatory terms. They noticed that offensive language was characterized by extreme subjectivity. Hence, they used a combination of term’s offensiveness and subjectivity impact to calculate and assign a weight from 1 to 5 to each entry of their dictionary. Those weights ...
... weights were later corrected through adaptive learning on the training data set resulting in the Insulting or Abusive Language Dictionary (IALD). The dictionary contains terms such as “chew Somebody’s ass out” that become a search template by replacing somebody with * or by adding suffixes or prefixes ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
A Mathematical Learning Environment Based on Serbian Language Resources
In recent years, in line with ever growing usage of Information technology, the learning environments are changing. The amount of available learning materials in various forms has increased. These new environments demand comprehensive learning systems, which enable management of the learning corpus with special attention paid to relevant lexical resources. In this paper we present the concept of a Mathematical Learning Environment in Serbian (MLES), which is based on a corpus of mathematical materials and various lexical resources, enabling ...... represented in the DELAS dictionary of simple forms as ekstremum, N1+DOM=Math+FLX=N1 or the verb “faktorisati” (to factorize) represented in the same dictionary as faktorisati, V21+DOM=Math+FLX=V21. Some of the inflected forms of the noun esktremum in the DELAF dictionary are: ekstremuma,ekstremum ...
... terminology in Serbian is unsatisfactorily represented in terminological resources. Thus for example, in the Dictionary of the Serbian Academy of Sciences and Arts, out of 200,000 dictionary entries only 369 are marked as belonging to the Mathematics domain. One of the aims of MLES is to contribute ...
... (Figure 3). Results of the third component are annotated and linked texts, where every mathematical term in the text is linked to the appropriate dictionary entry or relevant corpus content related to that term. This system component also extracts mathematical concepts from problems related to ...Radojičić Marija, Obradović Ivan, Stanković Ranka, Utvić Miloć, Kaplar Sebastijan. "A Mathematical Learning Environment Based on Serbian Language Resources" in Proceedings of the 7th International Scientific Conference Technics and Informatics in Education, Faculty of Technical Sciences, Čačak (2018)
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... Serbian e-dictionary of MWUs. The approach was evaluated on the example of terminology extraction from a mining corpus, with results for extraction, normalization and lemmatization presented successively. Automatic generation of a complete lemma in the form required by the electronic dictionary of Serbian ...
... management and exploitation of lexical resources, distributed via owner. Krstev C., Vitas D. (2015). SrpRec, Serbian morphological electronic dictionary, http://www-igm.univ-mlv.fr/~unitex/index.php?page= 5, CC BY-NC-ND. Lazić B., Stanković R. (2015), MineCorp, Serbian corpus from mining ...
... of results is to follow and if the approved MWT is to be entered into some kind of a dictionary or a terminological data-base. In this case we need a lemmatized MWT, that is, a MWT in the form of a dictionary head-word. The problem of lemmatization of special kind of MWUs, 507 person ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
E-Connecting Balkan Languages
In this paper we present a versatile language processing tool that can be successfully used for many Balkan languages. This tool relies for its work on several sophisticated textual and lexical resources that were developed for most of Balkan languages. These resources are based on several de facto standards in natural language processing.... Bulgarian Grammar dictionary (DELAS dictionary) consists of 127,000 lemmas distributed as follows: app. 85,000 simple lemmas belong to general lexis, app. 6,000 lemmas represent domain specific lexis and app. 36,000 lemmas are simple proper names. The corresponding DELAF dictionary consists of app ...
... Linguistique) under the guidance of Maurice Gross. The format of a DELAS-type dictionary basically consist of simple word lemmas accompanied with inflectional class codes which enable production of a DELAF-type dictionary which consists of all inflectional forms with their grammatical information ...
... responsible for generation of all inflectional forms of each DELAS lemma corresponds to each inflectional class code. The Serbian morphological dictionary of simple words contains 121,000 lemmas which yield the production of approximately 1,450,000 different lexical words. Close to 87,000 simple ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva. "E-Connecting Balkan Languages" in Proceedings of the Workshop Workshop on Multilingual resources, technologies and evaluation for Central and Eastern European Languages, 17 September 2009, eds. C. Vertan, S. Piperidis, E. Paskaleva and Milena Slavcheva, Borovets, Bulgaria : Association for Computational Linguistics Stroudsburg, PA, USA (2009)
-
Vebran Web Services for Corpus Query Expansion
Ranka Stanković, Miloš Utvić (2020)U ovom radu se govori o razvoju veb usluga Vebran i njihovoj primeni u poboljšanju pretraživanja korpusa. Veb-servisi Vebran koriste se za konsultovanje spoljnih leksičkih izvora za srpski jezik (uglavnom elektronski morfološki rečnici i srpski Vordnet) i proširivanje korisničkih upita radi dobijanja relevantnijih rezultata iz srpskih korpusa.... type dictionary basically consists of simple word lemmas, each accompanied by inflectional class code. Every inflectional class code is associated with a corresponding finite-state transducer responsible for the generation of all inflectional forms of DELAS lemma. Thus, the DELAS-type dictionary with ...
... finite-state transducers for inflection enables the production of a DELAF- type dictionary which consists of all inflectional forms of DELAS-lemmas with their corresponding grammatical information. The Serbian morpholog- ical dictionary of simple words contains 190,000 lemmas which yield the production of a ...
... Externally developed Dictionary of library and information sciences10 encompasses the terminology of theory and practice of librarianship and in- formation sciences and a wide range of close or related fields, in Serbian, English and German languages. The languages in this dictionary have equal status ...Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5
-
Football terminology: compilation and transformation into OntoLex-Lemon resource
У овом раду представља се пројекат који је у развоју, креирање првог дигиталног фудбалског речника на српском језику, као и да демонстрација примене модела OntoLex и љегових модула. OntoLex-FrAC модул укључује информације о учесталости и примерима употребе екстрахованих из корпуса. У овом случају, креиран је корпус за специфичан домен под називом СрФудКо, који садржи чланке вести о фудбалу на српском језику. Вишечлани термини аутоматски су екстраховани из српског корпуса, а затим ручно евалуирани и класификовани као спортски или ...Jelena Lazarević, Ranka Stanković, Mihailo Škorić, Biljana Rujević. "Football terminology: compilation and transformation into OntoLex-Lemon resource" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
-
An Integrated Environment for Management and Exploitation of Linguistic Resources
Ranka Stanković, Ivan Obradović (2009)... of lemmas in several dictionaries, such as the dictionary of to- ponyms or dictionary of proper names. This is important for practical reasons, because smaller dictionaries are easier to manage. The most important feature of the dictionary manage- ment module is the possibility of very ...
... procedure for automatic (or semiautomatic) construction of this type of dictionary on basis of a simple list of compounds was added. The procedure for automatic construction of entries for a DELAC dictionary is based on a set of rules [10]. The rule design strategy results from expert ...
... (SWN), a lexical data- base representing the semantic network of words in Serbian. Within this group of resources, the multilingual ontological dictionary of proper names Prolex should also be men- tioned. V Besides these different types of dictionaries, the Group is engaged in developing ...Ranka Stanković, Ivan Obradović. "An Integrated Environment for Management and Exploitation of Linguistic Resources" in Proceedings of the International Multiconference on Computer Science and Information Technology, Computational Linguistics – Applications Workshop (CLA09), Mrągowo, Poland, October 2009, Piscataway : IEEE (2009)
-
Белешка о дигитализацији речника
У раду ће се анализирати ограничења која проистичу из линеарног процеса традиционалне израде речника на примеру Речника САНУ. Начин да се превазиђу ова ограничења се састоји у формирању електронске лексикографске базе која не представља само пуку дигиталну транскрипцију папирног издања речника. Посебно се указује на чињеницу да текст речника може представљати корпус и приказују се одабрани примери анализе таквог корпуса формираног из текстове 1. и 19. тома Речника САНУ.... Vitas, Cvetana Krstev, Ranka М. Stanković A NOTE ON A DICTIONARY DIGITIZATION Summary The paper analyses the limitations of the linear process of the traditional dictionary production and illustrates them on the example of the SASA Dictionary. These limitations can be overcome by the establishment ...
... n of a paper dictionary edition. It is additionally stressed that a dictionary text represents itself a valuable corpus for various research purposes which is illustrated by a few examples of the analysis of such corpus compiled form the first and 19th volume of the SASA Dictionary. ...
... (Павловић-Лаже- тић 1996), опредељивање за начин организовања корпуса и његове експлоата- ције, као и одређивање система за писање речника (DWS – dictionary writing system) који мора бити тесно повезан са структуром базе и корпуса. Одлуке о ове две компоненте су дакле најуже повезане са концепцијом ...Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић. "Белешка о дигитализацији речника" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch3
-
Чији је пример? Анализа лексичких обележја на примерима Речника САНУ
У овом раду поставља се питање: да ли се може утврдити ко је аутор неког текста уколико се анализирају искључиво његова лексичка обележја? Како бисмо покушали да добијемо одговор на ово питање, посматрали смо примере у оквиру речничког чланка појединачне лексеме Речника САНУ, који су забележени у пет томова (и то: I, II, XVIII, XIX и XX). Сваки пример је преузет из неког извора на шта упућују скраћенице, наведене у заградама. Од преко 5.000 понуђених извора, определили смо се ...... GDEX: Automatically Finding Good Dictionary Examples in a Cor- pus, In E. Bernal & J. DeCesaris (eds.). Proceedings of the XIII EURALEX International Congress, Barcelona: Universitat Pompeu Fabra, 425–432. Косем 2017: Iztok Kosem, Dictionary examples, In Dictionary of Modern Slov- ene: Problems and ...
... get an answer, we observed examples that support lexical entries listed in five of the total of twenty volumes of the Dictionary of Serbian Academy of Science and Arts. Each dictionary example is documented with its author, so we decided to examine only examples that origin from twelve great names in ...
... др. 2018: Iztok Kosem, Kristina Koppel, Tanara Zingano Kuhn, Jan Michelfeit & Carole Tiberius, Identification and Automatic Extraction of Good Dictionary Examples: the Case(s) of GDEX, International Journal of Lexicography. Чији је пример? анализа лексичких обележја на примерима речника сану 315 ...Бранислава Б. Шандрих, Ранка М. Станковић, Мирјана С. Гочанин. "Чији је пример? Анализа лексичких обележја на примерима Речника САНУ" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch13
-
Речник САНУ као база терминолошких речника (на примеру речника кулинарства)
... obtained list was compared to a list of the lexical entry of the SASA dictionary; extracted the entries from the Dictionary that have no information about the culinary domain (should be amended), and words that do not exist in the Dictionary (should be entered). In this research are also used syntactic ...
... а рад на Речнику убрза. SASA DICTIONARY AS A BASE FOR TERMINOLOGICAL DICTIONARIES (ON THE EXAMPLE OF CULINARY VOCABULARY) In this paper is discussed the possibility of creating culinary vocabulary based on the culinary lexica contained in the Dictionary of Serbo-Croatian Literary and folk ...
... lexicon contained in the SASA Dictionary and show how traditional vocabulary during the digitization process becomes a base for terminology dictionaries that allows different uses. In addition, the process of digitalization and modernization of work on the SASA Dictionary provide its enrichment with ...Рада Стијовић, Олга Сабо, Ранка Станковић. "Речник САНУ као база терминолошких речника (на примеру речника кулинарства)" in Словенска терминологија данас, Београд : Српска академија наука и уметности (2017)