Aprendizado de Máquina para classificação das linhagens de Copia e Gypsy de angiospermas

Carregando...
Imagem de Miniatura

Título da Revista

ISSN da Revista

Título de Volume

Editor

Universidade Federal de São Carlos

Resumo

This study used machine learning algorithms (neural network, decision tree and close neighbors algorithm) to create classification models of 11 lineages of the Copia and Gypsy superfamilies, using angiosperm DNA sequences as training. Of eight models, three were efficient and were able to satisfactorily classify the sequences, in addition to being potentially efficient in classifying data from angiosperm species that were not in the dataset used for training. A comparison of the classification of the three most efficient models was also carried out with the prediction made by the Blast program, which has an algorithm based on sequence alignment, as a result, excellent classification metrics were obtained, however, considering 80% identity and 80 % coverage for there to be a prediction, it failed to classify 30% of the sequences.

Descrição

Citação

TAVARES, Thayana Vieira. Aprendizado de Máquina para classificação das linhagens de Copia e Gypsy de angiospermas. 2021. Trabalho de Conclusão de Curso (Graduação em Biotecnologia) – Universidade Federal de São Carlos, São Carlos, 2021. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/15496.

Coleções

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced

Licença Creative Commons

Exceto quando indicado de outra forma, a licença deste item é descrita como Attribution-NonCommercial-NoDerivs 3.0 Brazil