Collected Item: “The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines”

Publication type

Paper in conference proceedings

Document version

published

Language

English

Author(s) (Milan Marković, Nikola Nikolić)

Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan

Paper title (Title - subtitle)

The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines

Name of the conference (proceedings), place and date held

LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008

Proceedings editor(s)

Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias

Publisher (Belgrade : Prosveta)

European Language Resources Association (ELRA)

Year of publication

2008

Paper abstract in Serbian

In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic and morphological expansion of the query, the latter being very important in highly inflective languages, such as Serbian. Wordnets can also be used for adding another language to a query, if appropriate, thus making the query bilingual. Problems encountered in retrieving documents of interest are discussed and illustrated by examples. A brief description of resources is given, followed by an outline of the web tool which enables their integration. Finally, a set of examples is chosen in order to illustrate the use of the lexical resources and tool in question. Results obtained for these examples show that the number of documents obtained through a query by using our approach can double and even quadruple in some cases.

First page of the paper

219

Last page of the paper

224

Total number of pages (only if the pages are not numbered)

6

ISBN of the source publication

2-9517408-4-0

Keywords in Serbian (separated by ", ")

LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval

Link

http://www.lrec-conf.org/proceedings/lrec2008/

Broader category of the paper according to the MPNT rulebook

M60

Narrower category of the paper according to the MPNT rulebook

M63

Access level

Open access

License

Creative Commons – Attribution-Share Alike 4.0 International

File format

.pdf