Método escalável para aproximar uma MST utilizando um grafo de k vizinhos mais próximos
Carregando...
Arquivos
Data
Autores
Título da Revista
ISSN da Revista
Título de Volume
Editor
Universidade Federal de São Carlos
Resumo
Finding the Minimum Spanning Tree (MST) consists of inducing a spanning tree whose
sum of edge costs is minimal. Among many applications, the MST is widely used in
unsupervised machine learning algorithms, such as clustering. With increasing volume,
data used for analysis and decision making, platforms such as Apache Spark have become
more influential. However, there is a difficulty in finding the MST in a distributed envi-
ronment when the point-to-point distance needs to be calculated, since this calculation
has a quadratic complexity with respect to the number of data points, making it very
inefficient and even unfeasible for large data sets. In this work an efficient method to
find a good approximation of the MST is presented. The method consists of applying the
Boruvka algorithm in a distributed way over an approximated k nearest neighbors graph
constructed through a modified version of the NNDescent algorithm on the Apache Spark
platform. In all cases of experiments, the approximate MST reached less than 2% of the
cost higher than the cost of the exact MST when the parameter k of the NNDescent
algorithm was greater than just 1% of the number of points in the data sets. Furthermore,
the method has showed computational complexity linearly proportional to the number of
points and is slightly sublinear in relation to parameter k.
Descrição
Palavras-chave
Citação
IGOR RAPHAEL MAGOLLO,. Método Escalável Para Aproximar uma MST Utilizando um Grafo de k Vizinhos Mais Próximos. 2021. Trabalho de Conclusão de Curso (Graduação em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2021. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/14446.
Coleções
item.page.endorsement
item.page.review
item.page.supplemented
item.page.referenced
Licença Creative Commons
Exceto quando indicado de outra forma, a licença deste item é descrita como CC0 1.0 Universal
