MIDB : um modelo de integração de dados biológicos
Perlin, Caroline Beatriz
MetadataShow full item record
In bioinformatics, there is a huge volume of data related to biomolecules and to nucleotide and amino acid sequences that reside (in almost their totality) in several Biological Data Bases (BDBs). For a specific sequence, there are some informational classifications: genomic data, evolution-data, structural data, and others. Some BDBs store just one or some of these classifications. Those BDBs are hosted in different sites and servers, with several data base management systems with different data models. Besides, instances and schema might have semantic heterogeneity. In such scenario, the objective of this project is to propose a biological data integration model, that adopts new schema integration and instance integration techniques. The proposed integration model has a special mechanism of schema integration and another mechanism that performs the instance integration (with support of a dictionary) allowing conflict resolution in the attribute values; and a Clustering Algorithm is used in order to cluster similar entities. Besides, a domain specialist participates managing those clusters. The proposed model was validated through a study case focusing on schema and instance integration about nucleotide sequence data from organisms of Actinomyces gender, captured from four different data sources. The result is that about 97.91% of the attributes were correctly categorized in the schema integration, and the instance integration was able to identify that about 50% of the clusters created need support from a specialist, avoiding errors on the instance resolution. Besides, some contributions are presented, as the Attributes Categorization, the Clustering Algorithm, the distance functions proposed and the proposed model itself.
Showing items related by title, author, creator and subject.
Santana, Jhonne Pedro Pedott; http://lattes.cnpq.br/7058366901763632 (Universidade Federal de São Carlos, UFSCar, Programa de Pós-graduação em Genética Evolutiva e Biologia Molecular, Câmpus São Carlos, 23/03/2016)Knowledge about horizontal gene transfer has been proposed even before the determination of the molecular structure of DNA. It has been experimentally shown that micro-homologies rich in adenine and cytosine mediates the ...
Integração entre ecologia de bacias hidrográficas e educação ambiental para a conservação dos rios da serra do mar no estado do Paraná. Marques, Paulo Henrique Carneiro; http://genos.cnpq.br:12010/dwlattes/owa/consultapesq.prc_querylist (Universidade Federal de São Carlos, UFSCar, Programa de Pós-graduação em Ecologia e Recursos Naturais, , 07/10/2004)The catchment areas on the eastern and western slopes of Serra do Mar Mountains (Paraná state, Brasil) are sites of strategic interest to the conservation of riverine ecossystems, as they provide water supply to about ...
Atividade curricular de integração entre ensino, pesquisa e extensão (ACIEPE) : anseios, conjunturas e contornos de inovações curriculares em movimento. Souza, Marcos Lopes de; http://lattes.cnpq.br/2396459642306557 (Universidade Federal de São Carlos, UFSCar, Programa de Pós-graduação em Educação, , 28/05/2007)This research describes and discusses about the conjuncture and the movements of the program so-called as Curricular Integration Activity between Teaching, Research and Outreach (ACIEPE) since its implantation in Sao ...