Paralelização de algoritmos de busca de documentos mais relevantes na web utilizando GPUs (Parallelization of algorithms for retrieving the most relevant documents on the web using GPUs)

Files

Versão Final - Paralelização de Algoritmos de Busca de Documentos mais Relevantes na Web Utilizando GPUs.pdf (2.18 MB)

Date

2019-02-13

Authors

Gaioso, Roussian Di Ramos Alves

Publisher

Universidade Federal de São Carlos

Abstract

Search engines face performance challenges because of the large number of documents and the growing query load on the Web. The success of a search engine depends on the ability of its query processing system to find, in a short time, the documents that match the information needs expressed in user queries. Despite the large number of documents, users are typically interested in only a few results per query, so for most queries only a few documents are highly relevant. DAAT (document-at-a-time) dynamic pruning algorithms exploit this to improve the efficiency of query processing systems, avoiding time wasted scoring documents that are unlikely to be relevant. To handle the scale and dynamics of user query traffic, query processing must make efficient use of hardware resources. The main objective of this doctoral thesis is to investigate the use of parallel computing on the GPU architecture to identify the documents most relevant to a given query. To this end, parallelization strategies that aim to reduce the response latency of a single query and to increase query throughput are proposed and evaluated on the GPU. The proposals address two categories: DAAT algorithms and dynamic pruning algorithms. For DAAT algorithms, partitioning strategies are proposed, together with an investigation of where occurrences of the same document are located in the GPU memory hierarchy. For dynamic pruning algorithms, policies for propagating the threshold among processors are proposed, and their impact on the efficiency of the parallel algorithms is analyzed. To verify efficiency in practice, the parallel proposals were implemented and tested on the Pascal GPU architecture, where they achieved speedups of 4x to 40x relative to the baseline algorithms.
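To make the idea of DAAT dynamic pruning concrete, the following is a minimal sequential sketch in Python. It is not the thesis's GPU implementation and is neither WAND nor MaxScore as such: it applies a simplified upper-bound check in the same spirit. The function name `daat_top_k`, the posting-list layout (a dictionary mapping each term to `(doc_id, score)` pairs sorted by `doc_id`), and the sample scores are all assumptions made for illustration.

```python
import heapq


def daat_top_k(postings, k, prune=True):
    """Illustrative document-at-a-time top-k with a simplified pruning check.

    postings: dict mapping term -> list of (doc_id, score), sorted by doc_id
              (a hypothetical layout chosen for this sketch).
    Returns (top-k list of (score, doc_id), number of documents fully scored).
    """
    if k <= 0:
        return [], 0
    terms = [t for t, plist in postings.items() if plist]
    # Per-term upper bound: the largest score the term can contribute.
    upper = {t: max(s for _, s in postings[t]) for t in terms}
    pos = {t: 0 for t in terms}  # one cursor per posting list
    heap = []                    # min-heap of (score, doc_id), at most k entries
    scored = 0
    while True:
        active = [t for t in terms if pos[t] < len(postings[t])]
        if not active:
            break
        # Document-at-a-time: handle the smallest doc_id under any cursor.
        doc = min(postings[t][pos[t]][0] for t in active)
        matching = [t for t in active if postings[t][pos[t]][0] == doc]
        # Threshold: lowest score in the current top-k (none until k entries).
        threshold = heap[0][0] if len(heap) == k else float("-inf")
        bound = sum(upper[t] for t in matching)
        # Skip full scoring when even the upper bound cannot beat the threshold.
        if not prune or bound > threshold:
            score = sum(postings[t][pos[t]][1] for t in matching)
            scored += 1
            if len(heap) < k:
                heapq.heappush(heap, (score, doc))
            elif score > threshold:
                heapq.heapreplace(heap, (score, doc))
        for t in matching:
            pos[t] += 1
    return sorted(heap, key=lambda e: (-e[0], e[1])), scored


# Hypothetical posting lists for a two-term query.
example_postings = {
    "gpu": [(1, 2.0), (3, 0.5), (5, 1.0)],
    "search": [(1, 0.5), (2, 0.4), (5, 0.2), (7, 0.3)],
}
top_pruned, scored_pruned = daat_top_k(example_postings, k=2)
top_full, scored_full = daat_top_k(example_postings, k=2, prune=False)
```

In this sketch the pruned and exhaustive runs return the same top-k, but the pruned run fully scores fewer documents. The threshold is the value that, in a parallel setting such as the one the abstract describes, would need to be shared among processors; how and when to share it is the kind of policy question the thesis studies.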

Keywords

Web search, Query processing, DAAT algorithms, Pruning algorithms, WAND algorithm, MaxScore algorithm, Parallel algorithms, GPU architecture

Citation

GAIOSO, Roussian Di Ramos Alves. Paralelização de algoritmos de busca de documentos mais relevantes na web utilizando GPUs. 2019. Thesis (Doctorate in Computer Science) – Universidade Federal de São Carlos, São Carlos, 2019. Available at: https://repositorio.ufscar.br/handle/20.500.14289/11481.

URI

https://repositorio.ufscar.br/handle/20.500.14289/11481

Collections

Theses and Dissertations
