Machine Learning for the classification of Copia and Gypsy lineages of angiosperms
Publisher
Universidade Federal de São Carlos
Abstract
This study used machine learning algorithms (neural network, decision tree, and nearest-neighbors algorithm) to build classification models for 11 lineages of the Copia and Gypsy superfamilies, using angiosperm DNA sequences as training data. Of the eight models built, three were efficient: they classified the sequences satisfactorily and are potentially able to classify data from angiosperm species absent from the training dataset. The classifications produced by the three most efficient models were also compared with predictions made by BLAST, a program whose algorithm is based on sequence alignment. BLAST achieved excellent classification metrics; however, when 80% identity and 80% coverage were required for a prediction to be made, it failed to classify 30% of the sequences.
Citation
TAVARES, Thayana Vieira. Aprendizado de Máquina para classificação das linhagens de Copia e Gypsy de angiospermas. 2021. Undergraduate thesis (Bachelor's degree in Biotechnology) – Universidade Federal de São Carlos, São Carlos, 2021. Available at: https://repositorio.ufscar.br/handle/20.500.14289/15496.
Creative Commons License
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Brazil
