Enriquecendo a previsão de séries temporais usando informação textual
Fecha
2021-02-25Autor
Cruz, Lord Flaubert Steve Ataucuri
Metadatos
Mostrar el registro completo del ítemResumen
The ability to extract knowledge and forecast stock trends is crucial to mitigate investors' risks and uncertainties in the market. The stock trend is affected by non-linearity, complexity, noise, and especially the surrounding events. External factors such as daily news became one of the investors' primary resources for making decisions about buying or selling assets. However, this kind of information appears very fast. There are thousands of news generated by numerous web sources, taking a long time to analyze them, which can cost millions of dollars losses for investors due to a late decision. Recent contextual language models have transformed the area of natural language processing. However, classification models that use news that influence stock values need to deal with the unlabeled, class imbalance, and dissimilar texts. Recent studies show that the prediction of time series substantially improves by considering external information. This work proposes a hybrid methodology with three phases, one for news mining, a model for representation compact features, and the forecast model of time series, which merge for a more accurate prediction of prices. Initially, a small corpus is built using as support the time series. After that, we label the corpus based on semi-supervised learning to assign labels to other unlabeled news. In the second phase, the mining model with a classifier is used, whose output is concatenated with time series features, so the compact model representation extracts new features in a latent space. Finally, we predicted future prices with this fused knowledge. In a case study with Bitcoin cryptocurrency, the proposed methodology achieved a 1.62% decrease in the mean absolute percentage error.
Colecciones
El ítem tiene asociados los siguientes ficheros de licencia:
Excepto si se señala otra cosa, la licencia del ítem se describe como Attribution-NonCommercial-NoDerivs 3.0 Brazil
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Análise de desempenho de consultas OLAP espaçotemporais em função da ordem de processamento dos predicados convencional, espacial e temporal
Joaquim Neto, Cesar (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Ciência da Computação - PPGCC, Câmpus São Carlos, 08/03/2016)By providing ever-growing processing capabilities, many database technologies have been becoming important support tools to enterprises and institutions. The need to include (and control) new data types to the existing ... -
Análise de séries temporais fuzzy para previsão e identificação de padrões comportamentais dinâmicos
Santos, Fábio José Justo dos (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Ciência da Computação - PPGCC, Câmpus São Carlos, 30/04/2015)The good results obtained by the fuzzy approaches applied in the analysis of time series (TS) has contributed significantly to the growth of the area. Although there are satisfactory results in TS analysis with methods ... -
Modelos de volatilidade estatística
Ishizawa, Danilo Kenji (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Estatística - PPGEs, , 22/08/2008)In the financial market usually notices are taken of the shares sequentially over the time in order to characterize them a time series. However, the major interest is to forecast the behavior of these shares. Motivated by ...