17 resultados para A.C. Mix, unpublished data
Resumo:
The principal topic of this work is the application of data mining techniques, in particular of machine learning, to the discovery of knowledge in a protein database. In the first chapter a general background is presented. Namely, in section 1.1 we overview the methodology of a Data Mining project and its main algorithms. In section 1.2 an introduction to the proteins and its supporting file formats is outlined. This chapter is concluded with section 1.3 which defines that main problem we pretend to address with this work: determine if an amino acid is exposed or buried in a protein, in a discrete way (i.e.: not continuous), for five exposition levels: 2%, 10%, 20%, 25% and 30%. In the second chapter, following closely the CRISP-DM methodology, whole the process of construction the database that supported this work is presented. Namely, it is described the process of loading data from the Protein Data Bank, DSSP and SCOP. Then an initial data exploration is performed and a simple prediction model (baseline) of the relative solvent accessibility of an amino acid is introduced. It is also introduced the Data Mining Table Creator, a program developed to produce the data mining tables required for this problem. In the third chapter the results obtained are analyzed with statistical significance tests. Initially the several used classifiers (Neural Networks, C5.0, CART and Chaid) are compared and it is concluded that C5.0 is the most suitable for the problem at stake. It is also compared the influence of parameters like the amino acid information level, the amino acid window size and the SCOP class type in the accuracy of the predictive models. The fourth chapter starts with a brief revision of the literature about amino acid relative solvent accessibility. Then, we overview the main results achieved and finally discuss about possible future work. The fifth and last chapter consists of appendices. Appendix A has the schema of the database that supported this thesis. Appendix B has a set of tables with additional information. Appendix C describes the software provided in the DVD accompanying this thesis that allows the reconstruction of the present work.
Resumo:
Eur. J. Biochem. 270, 3904–3915 (2003)
Resumo:
J Biol Inorg Chem (2003) 8: 777–786
Resumo:
XVIII Jornadas de Paleontología, 2002
Resumo:
Dissertação apresentada para a obtenção do Grau de Mestre em Genética Molecular e Biomedicina, pela Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia
Resumo:
The Mid Miocene marine formations of Salles area (former "Sallomacian" stage) have been studied again from numerous outcrops and cores. The deep structural framework influences notably of the characteristics and distribution of the deposits, which are neritic. The stratigraphy is stated precisely thanks to the planktonic fauna and floradetailed examination (probably Serravallian zones NN6 - N12). Several paleobiofacies are reconstituted from the rich invertebrate faunas, which give also paleoclimatic data.
Resumo:
Climatic changes that affected the Northeastern Atlantic frontage are analyzed on the basis of the evolution of faunas and floras from the late Oligocene onwards. The study deals with calcareous nannoplankton, marine micro- and macrofaunas, some terrestrial vertebrates and vegetal assemblages. The climate, first tropical, underwent a progressive cooling (North-South thermic gradient). Notable climatic deteriorations (withdrawal towards the South or disappearance of taxa indicative of warm climate and appearance of "cold" taxa) are evidenced mainly during the Middle Miocene and the late Pliocene. Faunas and floras of modern pattern have regained, after the Pleistocene glaciations, a new climatic ranging of a temperate type in the northern part.
Resumo:
The main goal of the present work is the use of mineralogical data corresponding to sediment fine fractions (silt and clay) of Quaternary littoral deposits for the definition of a more detailed vertical zonography and to discriminate the most significant morphoclimatic changes concerned with sediment source areas and sediment deposition areas. The analysis of the available mineralogical data reveals a vertical evolution of the mineral composition. The following aspects deserve particular reference: 1) fine fractions (<38 nm) are composed of quartz and phyllosilicates associated to feldspars, prevailing over other minerals; however in certain sections iron hydroxides and evaporitic minerals occur in significant amounts; 2) clay fractions (<2 nm) show a general prevalence of illite associated with kaolinite and oscillations, in relative terms, of kaolinite and illite contents. Qualitative and quantitative lateral and vertical variations of clay and non clay minerals allow the discrimination of sedimentary sequences and the establishment of the ritmicity and periodicity of the morphoclimatic Quaternary episodes that occurred in the Cortegaça and Maceda beaches. To each one of the sedimentary sequences corresponds, in a first stage, a littoral environment that increasingly became more continental. Climate would be mild to cold, sometimes with humidity - aridity oscillations. Warmer and moister episodes alternated with cooler and dryer ones.
Resumo:
Eight depositional sequences (DS) delimited by regional disconformities had been recognized in the Miocene of Lisbon and Setúbal Peninsula areas. In the case of the western coast of the Setúbal Peninsula, outcrops consisting of Lower Burdigalian to Lower Tortonian sediments were studied. The stratigraphic zonography and the environmental considerations are mainly supported on data concerning to foraminifera, ostracoda, vertebrates and palynomorphs. The first mineralogical and geochemical data determined for Foz da Fonte, Penedo Sul and Penedo Norte sedimentary sequences are presented. These analytical data mainly correspond to the sediments' fine fractions. Mineralogical data are based on X-ray diffraction (XRD), carried out on both the less than 38 nm and 2 nm fractions. Qualitative and semi-quantitative determinations of clay and non-clay minerals were obtained for both fractions. The clay minerals assemblages complete the lithostratigraphic and paleoenvironmental data obtained by stratigraphic and palaeontological studies. Some palaeomagnetic and isotopic data are discussed and correlated with the mineralogical data. Multivariate data analysis (Principal Components Analysis) of the mineralogical data was carried out using both R-mode and Q-mode factor analysis.
Resumo:
Preliminary results of the systematic and biostratigraphical study of the ostracods from the Lower Toarcian (Polymorphum and Levisoni Zones) of Peniche are presented. Most of the identified species are recognized in other European countries. Biodiversity and species abundance are high in the first Zone, decreasing dramatically in the second one.
Resumo:
This paper suggests that a convenient score test against non-nested alternatives can be constructed from the linear combination of the likelihood functions of the competing models. It is shown that this procedure is essentially a test for the correct specification of the conditional distribution of the variable of interest.
Resumo:
J. Am. Chem. Soc., 2009, 131 (23), pp 7990–7998 DOI: 10.1021/ja809448r
Resumo:
J Biol Inorg Chem. 2008 Jun;13(5):779-87. doi: 10.1007/s00775-008-0365-8
Resumo:
J Biol Inorg Chem (2003) 8: 777–786 DOI 10.1007/s00775-003-0479-y