952 results for Multivariate statistical method


Relevance: 30.00%

Abstract:

Two objects with homologous landmarks are said to be of the same shape if the configuration of landmarks of one object can be exactly matched with that of the other by translation, rotation/reflection, and scaling. In an earlier paper, the authors proposed statistical analysis of shape by considering logarithmic differences of all possible Euclidean distances between landmarks. Tests of significance for differences in the shape of objects and methods of discrimination between populations were developed with such data. In the present paper, the corresponding statistical methodology is developed by triangulation of the landmarks and by considering the angles as natural measurements of shape. This method is applied to the study of sexual dimorphism in hominids.
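The reason angles work as shape measurements can be checked numerically: the interior angles of a triangle of landmarks do not change under translation, rotation, reflection, or scaling. The sketch below is only an illustration under assumptions; the landmark coordinates and the helper names `triangle_angles` and `similarity` are hypothetical and do not come from the paper.

```python
import math

def triangle_angles(a, b, c):
    """Interior angles (radians) at vertices a, b, c of a 2-D triangle."""
    def angle_at(p, q, r):
        # Angle at p between the rays p->q and p->r.
        v1 = (q[0] - p[0], q[1] - p[1])
        v2 = (r[0] - p[0], r[1] - p[1])
        dot = v1[0] * v2[0] + v1[1] * v2[1]
        cross = v1[0] * v2[1] - v1[1] * v2[0]
        # abs(cross) makes the angle unsigned, so reflections give the same value.
        return math.atan2(abs(cross), dot)
    return (angle_at(a, b, c), angle_at(b, c, a), angle_at(c, a, b))

def similarity(point, scale, theta, shift):
    """Rotate by theta, scale, then translate a 2-D point."""
    x, y = point
    c, s = math.cos(theta), math.sin(theta)
    return (scale * (c * x - s * y) + shift[0],
            scale * (s * x + c * y) + shift[1])

# Hypothetical landmark triangle and an arbitrary similarity transformation.
landmarks = [(0.0, 0.0), (4.0, 0.0), (1.0, 3.0)]
moved = [similarity(p, 2.5, 0.7, (10.0, -3.0)) for p in landmarks]

original_angles = triangle_angles(*landmarks)
moved_angles = triangle_angles(*moved)
```

Because the two angle triples agree, statistics computed on angles describe shape alone, independent of how each object was positioned or sized when it was measured.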

Relevance: 30.00%

Abstract:

Structural genomics aims to solve a large number of protein structures that represent the protein space. Currently an exhaustive solution for all structures seems prohibitively expensive, so the challenge is to define a relatively small set of proteins with new, currently unknown folds. This paper presents a method that assigns each protein a probability of having an unsolved fold. The method makes extensive use of ProtoMap, a sequence-based classification, and SCOP, a structure-based classification. According to ProtoMap, the protein space encodes the relationship among proteins as a graph whose vertices correspond to 13,354 clusters of proteins. A representative fold for a cluster with at least one solved protein is determined after superposition of all SCOP (release 1.37) folds onto ProtoMap clusters. Distances within the ProtoMap graph are computed from each representative fold to the neighboring folds. The distribution of these distances is used to create a statistical model for distances among those folds that are already known and those that have yet to be discovered. The distributions of distances for solved and unsolved proteins differ significantly. This difference makes it possible to use Bayes' rule to derive a statistical estimate that any protein has a yet undetermined fold. Proteins that score the highest probability to represent a new fold constitute the target list for structural determination. Our predicted probabilities for unsolved proteins correlate very well with the proportion of new folds among recently solved structures (new SCOP 1.39 records) that are disjoint from our original training set.
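The Bayes' rule step can be sketched as follows. This is a minimal illustration, not the paper's model: the Gaussian form of the two distance distributions, their parameters, the prior of 0.3, and the function names are all assumptions chosen only to make the arithmetic concrete.

```python
import math

def normal_pdf(x, mean, sd):
    """Density of a normal distribution (stand-in for a fitted distance model)."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def prob_new_fold(distance, prior_new, new_params, known_params):
    """Posterior P(new fold | distance) from Bayes' rule with two classes."""
    like_new = normal_pdf(distance, *new_params)
    like_known = normal_pdf(distance, *known_params)
    numerator = like_new * prior_new
    return numerator / (numerator + like_known * (1.0 - prior_new))

# Hypothetical (mean, sd) of graph distance for each class; assumed values.
NEW_FOLD = (6.0, 2.0)
KNOWN_FOLD = (3.0, 1.5)
PRIOR_NEW = 0.3  # assumed prior fraction of clusters with an unsolved fold

far = prob_new_fold(8.0, PRIOR_NEW, NEW_FOLD, KNOWN_FOLD)
near = prob_new_fold(2.0, PRIOR_NEW, NEW_FOLD, KNOWN_FOLD)
```

Under these assumed distributions, a cluster far from every solved representative receives a higher posterior than one close to a solved fold, which is the ordering used to build a target list.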

Relevance: 30.00%

Abstract:

We present statistical methods for analyzing replicated cDNA microarray expression data and report the results of a controlled experiment. The study was conducted to investigate inherent variability in gene expression data and the extent to which replication in an experiment produces more consistent and reliable findings. We introduce a statistical model to describe the probability that mRNA is contained in the target sample tissue, converted to probe, and ultimately detected on the slide. We also introduce a method to analyze the combined data from all replicates. Of the 288 genes considered in this controlled experiment, 32 would be expected to produce strong hybridization signals because of the known presence of repetitive sequences within them. Results based on individual replicates, however, show that there are 55, 36, and 58 highly expressed genes in replicates 1, 2, and 3, respectively. On the other hand, an analysis using the combined data from all 3 replicates reveals that only 2 of the 288 genes are incorrectly classified as expressed. Our experiment shows that any single microarray output is subject to substantial variability. By pooling data from replicates, we can provide a more reliable analysis of gene expression data. Therefore, we conclude that designing experiments with replications will greatly reduce misclassification rates. We recommend that at least three replicates be used when designing experiments with cDNA microarrays, particularly when gene expression data from single specimens are being analyzed.

Relevance: 30.00%

Abstract:

In this study, we estimate the statistical significance of structure prediction by threading. We introduce a single parameter ɛ that serves as a universal measure determining the probability that the best alignment is indeed a native-like analog. Parameter ɛ takes into account both length and composition of the query sequence and the number of decoys in threading simulation. It can be computed directly from the query sequence and potential of interactions, eliminating the need for sequence reshuffling and realignment. Although our theoretical analysis is general, here we compare its predictions with the results of gapless threading. Finally we estimate the number of decoys from which the native structure can be found by existing potentials of interactions. We discuss how this analysis can be extended to determine the optimal gap penalties for any sequence-structure alignment (threading) method, thus optimizing it to maximum possible performance.

Relevance: 30.00%

Abstract:

The availability of complete genome sequences and mRNA expression data for all genes creates new opportunities and challenges for identifying DNA sequence motifs that control gene expression. An algorithm, “MobyDick,” is presented that decomposes a set of DNA sequences into the most probable dictionary of motifs or words. This method is applicable to any set of DNA sequences: for example, all upstream regions in a genome or all genes expressed under certain conditions. Identification of words is based on a probabilistic segmentation model in which the significance of longer words is deduced from the frequency of shorter ones of various lengths, eliminating the need for a separate set of reference data to define probabilities. We have built a dictionary with 1,200 words for the 6,000 upstream regulatory regions in the yeast genome; the 500 most significant words (some with as few as 10 copies in all of the upstream regions) match 114 of 443 experimentally determined sites (a significance level of 18 standard deviations). When analyzing all of the genes up-regulated during sporulation as a group, we find many motifs in addition to the few previously identified by analyzing the expression subclusters individually. Applying MobyDick to the genes derepressed when the general repressor Tup1 is deleted, we find known as well as putative binding sites for its regulatory partners.

Relevance: 30.00%

Abstract:

The distribution of optimal local alignment scores of random sequences plays a vital role in evaluating the statistical significance of sequence alignments. These scores can be well described by an extreme-value distribution. The distribution’s parameters depend upon the scoring system employed and the random letter frequencies; in general they cannot be derived analytically, but must be estimated by curve fitting. For obtaining accurate parameter estimates, a form of the recently described ‘island’ method has several advantages. We describe this method in detail, and use it to investigate the functional dependence of these parameters on finite-length edge effects.
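To make the parameter-estimation problem concrete, the sketch below draws scores from an extreme-value (Gumbel) distribution and recovers its two parameters from the sample mean and standard deviation. This is a simple method-of-moments fit used purely for illustration; it is not the 'island' method described in the abstract, and the true parameter values, sample size, and function names are assumptions.

```python
import math
import random
import statistics

EULER_GAMMA = 0.5772156649015329

def sample_gumbel(rng, mu, lam):
    """Draw one Gumbel value by inverting F(x) = exp(-exp(-lam * (x - mu)))."""
    u = rng.random()
    while u == 0.0:  # avoid log(0)
        u = rng.random()
    return mu - math.log(-math.log(u)) / lam

def fit_gumbel_moments(scores):
    """Method-of-moments estimates: var = pi^2 / (6 lam^2), mean = mu + gamma / lam."""
    sd = statistics.stdev(scores)
    lam = math.pi / (sd * math.sqrt(6.0))
    mu = statistics.fmean(scores) - EULER_GAMMA / lam
    return mu, lam

def p_value(score, mu, lam):
    """Probability that a random optimal score is at least `score`."""
    return 1.0 - math.exp(-math.exp(-lam * (score - mu)))

# Simulated "random alignment scores" with assumed true parameters.
rng = random.Random(1)
TRUE_MU, TRUE_LAMBDA = 30.0, 0.25
scores = [sample_gumbel(rng, TRUE_MU, TRUE_LAMBDA) for _ in range(50000)]
mu_hat, lambda_hat = fit_gumbel_moments(scores)
```

Once the location and scale parameters are estimated, `p_value` converts any observed alignment score into a significance estimate, which is why the accuracy of the fitted parameters matters.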

Relevance: 30.00%

Abstract:

Transformed-rule up and down psychophysical methods have gained great popularity, mainly because they combine criterion-free responses with an adaptive procedure allowing rapid determination of an average stimulus threshold at various criterion levels of correct responses. The statistical theory underlying the methods now in routine use is based on sets of consecutive responses with assumed constant probabilities of occurrence. The response rules requiring consecutive responses prevent the possibility of using the most desirable response criterion, that of 75% correct responses. The earliest transformed-rule up and down method, whose rules included nonconsecutive responses, did not contain this limitation but failed to become generally accepted, lacking a published theoretical foundation. Such a foundation is provided in this article and is validated empirically with the help of experiments on human subjects and a computer simulation. In addition to allowing the criterion of 75% correct responses, the method is more efficient than the methods excluding nonconsecutive responses in their rules.
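The limitation of consecutive-response rules can be seen from a short calculation. For the familiar n-down/1-up rule, the stimulus level goes down only after n consecutive correct responses, so the track settles where the probability of n correct in a row equals 0.5. The sketch below computes those convergence points; it covers only this standard family of rules and is an assumption about which consecutive rules are meant, since the abstract does not list them. The function name is hypothetical.

```python
def consecutive_rule_target(n_down):
    """Proportion correct at which an n-down/1-up staircase converges.

    The track is in equilibrium when p ** n_down == 0.5.
    """
    return 0.5 ** (1.0 / n_down)

# Convergence points for 1-down, 2-down, 3-down, and 4-down rules.
targets = {n: consecutive_rule_target(n) for n in (1, 2, 3, 4)}
```

The 2-down rule converges near 70.7% correct and the 3-down rule near 79.4%, so no rule in this family lands on 75%, which is consistent with the gap the abstract describes.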

Relevance: 30.00%

Abstract:

A statistical modeling approach is proposed for use in searching large microarray data sets for genes that have a transcriptional response to a stimulus. The approach is unrestricted with respect to the timing, magnitude or duration of the response, or the overall abundance of the transcript. The statistical model makes an accommodation for systematic heterogeneity in expression levels. Corresponding data analyses provide gene-specific information, and the approach provides a means for evaluating the statistical significance of such information. To illustrate this strategy we have derived a model to depict the profile expected for a periodically transcribed gene and used it to look for budding yeast transcripts that adhere to this profile. Using objective criteria, this method identifies 81% of the known periodic transcripts and 1,088 genes, which show significant periodicity in at least one of the three data sets analyzed. However, only one-quarter of these genes show significant oscillations in at least two data sets and can be classified as periodic with high confidence. The method provides estimates of the mean activation and deactivation times, induced and basal expression levels, and statistical measures of the precision of these estimates for each periodic transcript.

Relevance: 30.00%

Abstract:

Two objects with homologous landmarks are said to be of the same shape if the configuration of landmarks of one object can be exactly matched with that of the other by translation, rotation/reflection, and scaling. The observations on an object are coordinates of its landmarks with reference to a set of orthogonal coordinate axes in an appropriate dimensional space. The origin, choice of units, and orientation of the coordinate axes with respect to an object may be different from object to object. In such a case, how do we quantify the shape of an object, find the mean and variation of shape in a population of objects, compare the mean shapes in two or more different populations, and discriminate between objects belonging to two or more different shape distributions? We develop some methods that are invariant to translation, rotation, and scaling of the observations on each object and thereby provide generalizations of multivariate methods for shape analysis.
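One way to obtain measurements with the required invariance is to work with differences of log inter-landmark distances: distances are unaffected by translation, rotation, and reflection, and a common scale factor cancels when log distances are subtracted. The sketch below illustrates this under assumptions; the abstract does not name its features, and the coordinates and function names are hypothetical.

```python
import math
from itertools import combinations

def log_distance_differences(landmarks):
    """Pairwise differences of log Euclidean distances between landmarks."""
    log_d = [math.log(math.dist(p, q)) for p, q in combinations(landmarks, 2)]
    return [a - b for a, b in combinations(log_d, 2)]

def similarity_transform(point, scale, theta, shift, reflect=False):
    """Optionally reflect, then rotate, scale, and translate a 2-D point."""
    x, y = point
    if reflect:
        y = -y
    c, s = math.cos(theta), math.sin(theta)
    return (scale * (c * x - s * y) + shift[0],
            scale * (s * x + c * y) + shift[1])

# Hypothetical four-landmark object recorded in two different coordinate systems.
shape = [(0.0, 0.0), (4.0, 0.0), (5.0, 3.0), (1.0, 2.0)]
moved = [similarity_transform(p, 0.4, 1.1, (-7.0, 2.0), reflect=True) for p in shape]

features_original = log_distance_differences(shape)
features_moved = log_distance_differences(moved)
```

Both feature vectors are identical up to rounding, so ordinary multivariate methods (means, covariance, discriminant functions) applied to them compare shape rather than position, orientation, or size.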

Relevance: 30.00%

Abstract:

The helix-coil transition equilibrium of polypeptides in aqueous solution was studied by molecular dynamics simulation. The peptide growth simulation method was introduced to generate dynamic models of polypeptide chains in a statistical (random) coil or an alpha-helical conformation. The key element of this method is to build up a polypeptide chain during the course of a molecular transformation simulation, successively adding whole amino acid residues to the chain in a predefined conformation state (e.g., alpha-helical or statistical coil). Thus, oligopeptides of the same length and composition, but having different conformations, can be incrementally grown from a common precursor, and their relative conformational free energies can be calculated as the difference between the free energies for growing the individual peptides. This affords a straightforward calculation of the Zimm-Bragg sigma and s parameters for helix initiation and helix growth. The calculated sigma and s parameters for the polyalanine alpha-helix are in reasonable agreement with the experimental measurements. The peptide growth simulation method is an effective way to study quantitatively the thermodynamics of local protein folding.

Relevance: 30.00%

Abstract:

Ketamine is a widely used drug, and its misuse has been associated with serious consequences for human health. Although the pharmacological properties of this agent at therapeutic doses are well known, few studies have examined the side effects induced by non-therapeutic doses, including effects on anxiety and aggression. In this context, animal models are an important step in investigating and elucidating the mechanism of action at the behavioral level. The zebrafish (Danio rerio) is a new, interesting, and promising model organism: it shows high physiological, genetic, and neurochemical similarity to humans, well-defined behavioral responses, and rapid absorption of compounds of interest from aqueous media, and it offers several advantages over mammalian models, such as low-cost, practical maintenance that is feasible in small spaces. It is therefore necessary to carry out behavioral assays together with robust and fast statistical analyses, such as ANOVA and multivariate methods, and to develop sensitive, precise, and fast analytical methods for determining compounds of interest in biological matrices from the animal. The objectives of this work were to investigate the effects of ketamine on anxiety and aggression in adult zebrafish using light-dark tests and mirror tests with univariate (ANOVA) and multivariate (PCA, HCA, and SIMCA) statistical methods, and to develop an analytical method for determining ketamine in a biological matrix from the animal using liquid-liquid extraction and gas chromatography coupled to a nitrogen-phosphorus detector (GC-NPD).
The behavioral results indicated that ketamine produced a significant dose-dependent effect in adult zebrafish on latency to enter the light area, on the number of crossings between areas, and on the time spent exploring the light area. The SIMCA and PCA results showed greater similarity between the control group and the treatment groups exposed to the lower doses (5 and 20 mg L⁻¹), and between the groups exposed to 40 and 60 mg L⁻¹. In the PCA, two principal components accounted for 88.74% of all the information in the system, with 62.59% of the cumulative information described by the first principal component. The HCA and SIMCA classifications followed a logical progression in the distribution of samples by class. The higher ketamine doses induced a more homogeneous distribution of samples, whereas the lower doses and the control resulted in more dispersed distributions. In the mirror test, ketamine did not induce significant effects on the animals' behavior. These results suggest that ketamine modulates anxiety-related behavior without inducing aggression. Validation of the chromatographic method indicated extraction recoveries between 33.65% and 70.89%. The calibration curve was linear, with R² above 0.99. The limit of detection (LOD) was 1 ng and the limit of quantification (LOQ) was 5 ng. The accuracy of the chromatographic method ranged from -24.83% to -1.258%, the intra-assay precision from 2.67% to 14.5%, and the inter-assay precision from 1.93% to 13.9%.

Relevance: 30.00%

Abstract:

This work presents multi-element geochemical results for stream sediments in the state of São Paulo, obtained through the institutional project of the Geological Survey of Brazil (Serviço Geológico do Brasil) titled "Levantamento Geoquímico de Baixa Densidade no Brasil" (Low-Density Geochemical Survey of Brazil). Analytical data from 1,422 stream sediment samples, obtained by ICP-MS (inductively coupled plasma mass spectrometry) for 32 chemical elements (Al, Ba, Be, Ca, Ce, Co, Cr, Cs, Cu, Fe, Ga, Hf, K, La, Mg, Mn, Mo, Nb, Ni, P, Pb, Rb, Sc, Sn, Sr, Th, Ti, U, V, Y, Zn, and Zr), were processed and examined using univariate and multivariate statistical analysis. Treating the data with univariate statistical techniques yielded geochemical background values for the 32 elements across the whole state of São Paulo. Georeferenced analysis of the single-element geochemical distributions revealed the geological compartmentalization of the area. The two main geological provinces of the state of São Paulo, the Paraná Basin and the Crystalline Complex, stand out clearly in most of the geochemical distributions. Major geological units, such as the Serra Geral Formation and the Bauru Group, were also clearly highlighted. Other geochemical features indicated possible contaminated areas and unmapped geological units. Applying multivariate statistical methods to the geochemical data with 24 variables (Al, Ba, Ce, Co, Cr, Cs, Cu, Fe, Ga, La, Mn, Nb, Ni, Pb, Rb, Sc, Sr, Th, Ti, U, V, Y, Zn, and Zr) made it possible to define the main geochemical signatures and associations across the state of São Paulo and to correlate them with the main lithological domains. Q-mode cluster analysis yielded eight groups of geochemically correlated samples which, once georeferenced, reproduced the main geological compartments of the state: the Crystalline Complex, the Itararé and Passa Dois Groups, the Serra Geral Formation, and the Bauru and Caiuá Groups.
Multigroup discriminant analysis statistically confirmed the classification of the groups formed by cluster analysis and identified the main discriminating variables: Fe, Co, Sc, V, and Cu. Principal component analysis, used together with factor analysis with varimax rotation, yielded the main multivariate factors and their respective element associations. Georeferencing the multivariate factor scores delimited the areas where the element associations occur and produced multivariate maps for the whole state. Finally, it is concluded that the statistical methods applied are indispensable for the treatment, presentation, and interpretation of geochemical data. Furthermore, based on an integrated view of the results, this work recommends: (1) carrying out low-density geochemical surveys throughout the country as a priority, since they are highly effective in defining regional backgrounds and delimiting geochemical provinces of metallogenic and environmental interest; (2) carrying out continuous geological mapping at an adequate scale (larger than 1:100,000) in areas that point to the possible existence of units not mapped on current geological maps.

Relevance: 30.00%

Abstract:

Background: The Clinical Learning Environment, Supervision and Nurse Teacher scale is a reliable and valid instrument to evaluate the quality of the clinical learning process in international nursing education contexts. Objectives: This paper reports the development and psychometric testing of the Spanish version of the Clinical Learning Environment, Supervision and Nurse Teacher scale. Design: Cross-sectional validation study of the scale. Setting: 10 public and private hospitals in the Alicante area, and the Faculty of Health Sciences (University of Alicante, Spain). Participants: 370 student nurses on clinical placement (January 2011–March 2012). Methods: The Clinical Learning Environment, Supervision and Nurse Teacher scale was translated using the modified direct translation method. Statistical analyses were performed using PASW Statistics 18 and AMOS 18.0.0 software. A multivariate analysis was conducted to assess construct validity. Cronbach’s alpha coefficient was used to evaluate instrument reliability. Results: An exploratory factor analysis identified the five dimensions from the original version and explained 66.4% of the variance. Confirmatory factor analysis supported the factor structure of the Spanish version of the instrument. Cronbach’s alpha coefficient for the scale was .95, ranging from .80 to .97 for the subscales. Conclusion: This version of the Clinical Learning Environment, Supervision and Nurse Teacher scale showed acceptable psychometric properties for use as an assessment scale in Spanish-speaking countries.
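For readers unfamiliar with the reliability statistic reported above, Cronbach's alpha for k items is k/(k-1) times one minus the ratio of summed item variances to the variance of the total score. The sketch below computes it on a small made-up response matrix; the data and the function name are hypothetical and unrelated to the study's 370 respondents.

```python
import statistics

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    items = list(zip(*responses))  # one tuple of scores per item
    item_variances = [statistics.variance(col) for col in items]
    total_variance = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)

# Hypothetical answers of five respondents to a three-item subscale.
responses = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 5, 5],
    [2, 3, 2],
    [4, 4, 5],
]
alpha = cronbach_alpha(responses)
```

Values close to 1 indicate that the items vary together across respondents, which is the sense in which the reported .95 for the whole scale signals high internal consistency.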

Relevance: 30.00%

Abstract:

A microwave-assisted extraction (MAE) procedure to isolate phenolic compounds from almond skin byproducts was optimized. A three-level, three-factor Box–Behnken design was used to evaluate the effect of almond skin weight, microwave power, and irradiation time on total phenolic content (TPC) and antioxidant activity (DPPH). Almond skin weight was the most important parameter in the studied responses. The best extraction was achieved using 4 g, 60 s, 100 W, and 60 mL of 70% (v/v) ethanol. TPC, antioxidant activity (DPPH, FRAP), and chemical composition (HPLC-DAD-ESI-MS/MS) were determined by using the optimized method from seven different almond cultivars. Successful discrimination was obtained for all cultivars by using multivariate linear discriminant analysis (LDA), suggesting the influence of cultivar type on polyphenol content and antioxidant activity. The results show the potential of almond skin as a natural source of phenolics and the effectiveness of MAE for the reutilization of these byproducts.
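The layout of a three-level, three-factor Box–Behnken design can be sketched in coded units (-1, 0, +1). Each pair of factors is run at all four combinations of its low and high levels while the remaining factor is held at its center level, and replicated center points are added. The number of center points (3) and the function name are assumptions; the abstract gives neither the run count nor the actual factor ranges, so no real-unit levels are shown.

```python
from itertools import combinations, product

def box_behnken(n_factors=3, n_center=3):
    """Coded Box-Behnken runs built from all factor pairs (standard for 3 factors)."""
    runs = []
    for i, j in combinations(range(n_factors), 2):
        for a, b in product((-1, 1), repeat=2):
            run = [0] * n_factors
            run[i], run[j] = a, b  # vary one pair, hold the rest at center
            runs.append(tuple(run))
    runs.extend([(0,) * n_factors] * n_center)  # replicated center points
    return runs

# Columns stand for (almond skin weight, microwave power, irradiation time).
design = box_behnken()
edge_runs = [run for run in design if any(run)]
```

With three factors this gives 12 edge-midpoint runs plus the center replicates, far fewer than the 27 runs of a full three-level factorial, while still supporting a quadratic response-surface model.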

Relevance: 30.00%

Abstract:

Introduction: In October 2008 the Valencian Community began its vaccination program against human papillomavirus (HPV) for 14-year-old girls. The aim of this study is to assess knowledge about HPV infection and its vaccine among mothers of adolescents and to identify the factors associated with willingness to vaccinate their daughters. Material and methods: Cross-sectional observational study using a questionnaire addressed to mothers of female students born in 1995 and enrolled in secondary schools in the province of Valencia during 2010-2011. Stratified random cluster sample (n = 1,279). Statistical analysis: percentages, confidence intervals, odds ratios (OR), chi-squared tests, and multivariate logistic regression. Results: Eight hundred thirty-three questionnaires were completed (65.1%). Of the mothers, 76.6% had vaccinated their daughters against HPV. The vaccine was known to 93.8%, mainly through television (71.5%). A favorable recommendation from a health professional was received by 78.5%, which improved vaccination of their daughters (OR: 2.4). Overall knowledge about HPV infection and the vaccine was low. Mothers' confidence in vaccines as a preventive method improves vaccination against HPV (OR: 3.8). Fear of adverse effects (45.6%) was the leading reason for refusal. Conclusions: The media do not appear to influence the decision to vaccinate. It would be advisable to minimize the perception of risk regarding this vaccine. Advice from a health professional favors vaccination if the professional actively intervenes in a positive sense. There is a gap between level of knowledge and the decision to vaccinate.
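As a reminder of how an odds ratio such as those reported above is read, the sketch below computes a crude OR and an approximate 95% confidence interval (Woolf's log method) from a 2×2 table. The counts are invented for illustration only; the abstract does not publish its tables, and its ORs may be adjusted estimates from the logistic regression rather than crude values. The function name is hypothetical.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Crude odds ratio and Woolf confidence interval for a 2x2 table.

    a: exposed, outcome present      b: exposed, outcome absent
    c: unexposed, outcome present    d: unexposed, outcome absent
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1.0 / a + 1.0 / b + 1.0 / c + 1.0 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper

# Hypothetical counts: rows = received favorable advice (yes/no),
# columns = daughter vaccinated (yes/no).
estimate, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
```

An interval that lies entirely above 1 indicates that the exposure (here, receiving favorable advice) is associated with higher odds of the outcome.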