999 resultados para Biology, Biostatistics|Statistics
Resumo:
Motivation: A consensus sequence for a family of related sequences is, as the name suggests, a sequence that captures the features common to most members of the family. Consensus sequences are important in various DNA sequencing applications and are a convenient way to characterize a family of molecules. Results: This paper describes a new algorithm for finding a consensus sequence, using the popular optimization method known as simulated annealing. Unlike the conventional approach of finding a consensus sequence by first forming a multiple sequence alignment, this algorithm searches for a sequence that minimises the sum of pairwise distances to each of the input sequences. The resulting consensus sequence can then be used to induce a multiple sequence alignment. The time required by the algorithm scales linearly with the number of input sequences and quadratically with the length of the consensus sequence. We present results demonstrating the high quality of the consensus sequences and alignments produced by the new algorithm. For comparison, we also present similar results obtained using ClustalW. The new algorithm outperforms ClustalW in many cases.
Resumo:
Allozyme analysis was used to address the question of the source of the Australian populations of the monarch butterfly Danaus plexippus (L.). The study had three major aims: (1) To compare the levels of diversity of Australian and Hawaiian populations with potential source populations. (2) To determine whether eastern and western North American populations were sufficiently divergent for the Australian populations to be aligned to a source population. (3) To compare the differentiation among regions in Australia and North America to test the prediction of greater genetic structure in Australia, as a consequence of reduced migratory behaviour. The reverse was found, with F-ST values an order of magnitude lower in Australia than in North America. Predictably, Australian and Hawaiian populations had lower allelic diversity, but unexpected higher heterozygosity values than North American populations. It was not possible to assign the Australian populations to a definitive source, although the high levels of similarity of Australian populations to each other suggest a single colonization event. The possibility that the Australian populations have not been here long enough to reach equilibrium is discussed. (C) 2002 The Linnean Society of London, Biological Journal of the Linnean Society, 2002, 75, 437-452.
Resumo:
We present a novel maximum-likelihood-based algorithm for estimating the distribution of alignment scores from the scores of unrelated sequences in a database search. Using a new method for measuring the accuracy of p-values, we show that our maximum-likelihood-based algorithm is more accurate than existing regression-based and lookup table methods. We explore a more sophisticated way of modeling and estimating the score distributions (using a two-component mixture model and expectation maximization), but conclude that this does not improve significantly over simply ignoring scores with small E-values during estimation. Finally, we measure the classification accuracy of p-values estimated in different ways and observe that inaccurate p-values can, somewhat paradoxically, lead to higher classification accuracy. We explain this paradox and argue that statistical accuracy, not classification accuracy, should be the primary criterion in comparisons of similarity search methods that return p-values that adjust for target sequence length.
Resumo:
The eastern shovelnose ray, Aptychotrema rostrata (Rhinobatidae), is an endemic batoid common to the east coast of Australia. The reproductive cycle was studied in Moreton Bay, south-eastern Queensland, over a 14-month period. Aptychotrema rostrata is an aplacental yolksac viviparous species with an annual, seasonal reproductive cycle in Moreton Bay. Females mature at 54-66 cm total length, and males at 60-68 cm total length. Gravid females were observed during September-November and parturition occurred in November-December. Vitellogenesis does not proceed in parallel with gestation. Ovulation and copulation probably occur during July-September, resulting in a gestational period of 3-5 months. Uterine fecundity ranges from 4 to 18, with a significant positive relationship between uterine fecundity and maternal body length. In mature males, a peak in the proportion of mature spermatocysts in the testes was observed in July, whereas gonadosomatic index peaked in April.
Resumo:
We collected data on plasma levels of testosterone+5a-dihydrotestosterone (T+DHT) and corticosterone (CORT) from adult female green sea turtles (Chelonia mydas) from southern Queensland during distinct stages of their reproductive cycle. Those females capable of breeding in a given year had elevated plasma steroid levels (T+DHT 0.91 +/- 0.08; CORT 1.05 +/- 0.29 ng/ml), associated with follicular development, until courtship began in October. At the beginning of the nesting season in November plasma levels of 2 CORT were related to when the female first nested (r(2) = 0.06; F = 10.45; P = 0.01). However, they were not correlated with the number of clutches a female laid in that season (F = 3.65; P = 0.08). We repeatedly sampled 23 turtles over the nesting season and profiled changes in steroids immediately following oviposition of each clutch. Levels of T+DHT (range 0.41-0.58 ng/ml) and CORT (range 2.13-2.81 ng/ml) were similar through the early stages of the nesting season and inter-nesting period, and declined to near basal levels (T+DHT 0.37 +/- 0.03 and CORT 1.85 +/- ng/ml) following the last clutch for the season. Steroid hormone levels were also low (T+DHT 0.38 +/- 0.16; CORT 0.46 +/- 0.21 ng/ml) in four independent post-breeding (atretic) females; samples for these females were taken at a time when body condition was presumably at the lowest for the season. Subtle changes in the nesting environment, such as variation in nesting habitat or the time of night that nesting occurred, were associated with a small and slow CORT increase. We suggest CORT is increased in nesting females to assist in lipid transfer to prepare the ovarian follicles and/or the reproductive organs for ovulation.
Resumo:
This article develops a weighted least squares version of Levene's test of homogeneity of variance for a general design, available both for univariate and multivariate situations. When the design is balanced, the univariate and two common multivariate test statistics turn out to be proportional to the corresponding ordinary least squares test statistics obtained from an analysis of variance of the absolute values of the standardized mean-based residuals from the original analysis of the data. The constant of proportionality is simply a design-dependent multiplier (which does not necessarily tend to unity). Explicit results are presented for randomized block and Latin square designs and are illustrated for factorial treatment designs and split-plot experiments. The distribution of the univariate test statistic is close to a standard F-distribution, although it can be slightly underdispersed. For a complex design, the test assesses homogeneity of variance across blocks, treatments, or treatment factors and offers an objective interpretation of residual plot.
Resumo:
Motivation: A major issue in cell biology today is how distinct intracellular regions of the cell, like the Golgi Apparatus, maintain their unique composition of proteins and lipids. The cell differentially separates Golgi resident proteins from proteins that move through the organelle to other subcellular destinations. We set out to determine if we could distinguish these two types of transmembrane proteins using computational approaches. Results: A new method has been developed to predict Golgi membrane proteins based on their transmembrane domains. To establish the prediction procedure, we took the hydrophobicity values and frequencies of different residues within the transmembrane domains into consideration. A simple linear discriminant function was developed with a small number of parameters derived from a dataset of Type II transmembrane proteins of known localization. This can discriminate between proteins destined for Golgi apparatus or other locations (post-Golgi) with a success rate of 89.3% or 85.2%, respectively on our redundancy-reduced data sets.
Resumo:
Measurement of exchange of substances between blood and tissue has been a long-lasting challenge to physiologists, and considerable theoretical and experimental accomplishments were achieved before the development of the positron emission tomography (PET). Today, when modeling data from modern PET scanners, little use is made of earlier microvascular research in the compartmental models, which have become the standard model by which the vast majority of dynamic PET data are analysed. However, modern PET scanners provide data with a sufficient temporal resolution and good counting statistics to allow estimation of parameters in models with more physiological realism. We explore the standard compartmental model and find that incorporation of blood flow leads to paradoxes, such as kinetic rate constants being time-dependent, and tracers being cleared from a capillary faster than they can be supplied by blood flow. The inability of the standard model to incorporate blood flow consequently raises a need for models that include more physiology, and we develop microvascular models which remove the inconsistencies. The microvascular models can be regarded as a revision of the input function. Whereas the standard model uses the organ inlet concentration as the concentration throughout the vascular compartment, we consider models that make use of spatial averaging of the concentrations in the capillary volume, which is what the PET scanner actually registers. The microvascular models are developed for both single- and multi-capillary systems and include effects of non-exchanging vessels. They are suitable for analysing dynamic PET data from any capillary bed using either intravascular or diffusible tracers, in terms of physiological parameters which include regional blood flow. (C) 2003 Elsevier Ltd. All rights reserved.