MIDB: um modelo de integração de dados biológicos
Resumo
In bioinformatics, there is a huge volume of data related to biomolecules and to nucleotide and amino acid sequences that reside (in almost their totality) in several Biological Data Bases (BDBs). For a specific sequence, there are some informational classifications: genomic data, evolution-data, structural data, and others. Some BDBs store just one or some of these classifications. Those BDBs are hosted in different sites and servers, with several data base management systems with different data models. Besides, instances and schema might have semantic heterogeneity. In such scenario, the objective of this project is to propose a biological data integration model, that adopts new schema integration and instance integration techniques. The proposed integration model has a special mechanism of schema integration and another mechanism that performs the instance integration (with support of a dictionary) allowing conflict resolution in the attribute values; and a Clustering Algorithm is used in order to cluster similar entities. Besides, a domain specialist participates managing those clusters. The proposed model was validated through a study case focusing on schema and instance integration about nucleotide sequence data from organisms of Actinomyces gender, captured from four different data sources. The result is that about 97.91% of the attributes were correctly categorized in the schema integration, and the instance integration was able to identify that about 50% of the clusters created need support from a specialist, avoiding errors on the instance resolution. Besides, some contributions are presented, as the Attributes Categorization, the Clustering Algorithm, the distance functions proposed and the proposed model itself.
Collections
Itens relacionados
Apresentado os itens relacionados pelo título, autor e assunto.
-
GromaXy: uma ferramenta para integração do Galaxy com o GROMACS
Souza, Alfredo Guilherme da Silva (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Ciência da Computação - PPGCC, Câmpus São Carlos, 08/08/2017)Considering the significant advances in biomolecular researches, the raise of various limitations is predicted. Thereby, reinforcing the link between science and technology, by introducing the new and effective development ... -
Contribuições do DFMEA na integração entre desenvolvimento de produtos e engenharia da qualidade: casos em empresas de grande porte
Bueno, Fernanda Campos (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Engenharia de Produção - PPGEP, Câmpus São Carlos, 06/03/2018)This research aims to examine the role of DFMEA in the context of functional integration between Product Development (PD) and Quality Engineering (QE) through case studies in Brazilian industrial companies. Data were ... -
Currículo e integração curricular em um curso de graduação em medicina: concepções manifestadas pelos docentes que o vivenciam
Rodrigues, Aline de Fatima Cruz (Universidade Federal de São Carlos, UFSCar, Programa de Pós-Graduação em Educação - PPGE, Câmpus São Carlos, 10/09/2018)The present research aims to describe and analyze the relationship between the proposal of integrated curriculum of the undergraduate course in Medicine, Federal University of São Carlos (UFSCar) and the conceptions ...