Aprendizado de máquina multirrótulo para predição de doenças associadas a RNAs longos não codificantes
Carregando...
Data
Autores
Título da Revista
ISSN da Revista
Título de Volume
Editor
Universidade Federal de São Carlos
Resumo
Long non-coding RNAs (lncRNAs) are RNA molecules longer than 200 nucleotides that are not translated into proteins. These RNAs play crucial roles in processes such as cell cycle regulation, epigenetics, cell differentiation, and both post-transcriptional and transcriptional regulation. Abnormal expression of lncRNAs has been increasingly associated with various human diseases, including cardiovascular, neurological, and oncological conditions. While different biotechnological approaches have been employed to identify and detect lncRNAs, these methods face significant challenges related to cost, operational complexity, and experimental procedure. Consequently, the development of artificial intelligence-based models for disease prediction using lncRNA data presents itself as a promising alternative. This study focuses on the implementation of machine learning algorithms for predicting diseases associated with lncRNAs, addressing it as a multi-label classification problem, given that a single lncRNA can be linked to multiple diseases. Various methods, including problem transformation approaches and algorithm adaptation approaches, were tested. The classifiers' performance was assessed using evaluation metrics for multi-label problems, including Precision, Recall, F-measure, and Hamming Loss. Furthermore, for comparative analysis, statistical tests were performed, specifically the Friedman and Nemenyi tests. Based on the analysis of the results, hypotheses regarding the performance of the algorithms were formulated. However, despite the observed differences, the methods showed an overall unsatisfactory performance, which we attribute to the high sparsity presented in the dataset. Thus, for future work, the investigation of more robust methods to address this issue was proposed, aiming to enhance prediction performance and improve the understanding of biological interactions.
Descrição
Palavras-chave
Citação
BERTONI, Lívia Umberto. Aprendizado de máquina multirrótulo para predição de doenças associadas a RNAs longos não codificantes. 2025. Trabalho de Conclusão de Curso (Graduação em Biotecnologia) – Universidade Federal de São Carlos, São Carlos, 2025. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/21465.
Coleções
item.page.endorsement
item.page.review
item.page.supplemented
item.page.referenced
Licença Creative Commons
Exceto quando indicado de outra forma, a licença deste item é descrita como Attribution-NonCommercial-NoDerivs 3.0 Brazil
