Utilização de agrupamento como método de pré-processamento em problemas de regressão linear

Motta, Gabriel Gonçalves

View/Open

TCC Gabriel Motta - Utilização de agrupamento como método de pré-processamento em problemas de regressão linear..pdf (448.1Kb)

Date

2021-11-19

Author

Motta, Gabriel Gonçalves

Metadata

Show full item record

Abstract

For real-world data to be explored, pre-processing is needed in order to ease machine learning applications. Nowadays, various pre-processing techniques are available, and this paper aims to show the impacts of using clustering as one of them, more specifically in linear regression problems. Two experiments were carried out, using two different databases: the first one describing data from a series of properties from Ames, a city from Iowa, in the United States of America, and the second one containing information about COVID-19. It was observed that in both cases, clustering before applying the regression model improves regressor performance, based on database nature. Besides the improvement, it is recommended to use clustering alongside other pre-processing techniques.

URI

https://repositorio.ufscar.br/handle/ufscar/15156

Collections

The following license files are associated with this item:

Creative Commons

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Brazil