894 results for SEQUENCE DATABASES


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: A large number of probabilistic models used in sequence analysis assign non-zero probability values to most input sequences. The most common way to decide whether a given probability is sufficient is Bayesian binary classification, in which the probability of the model characterizing the sequence family of interest is compared to that of an alternative probability model. A null model can serve as this alternative. This is the scoring technique used by sequence analysis tools such as HMMER, SAM and INFERNAL. The most prevalent null models are position-independent residue distributions, which include the uniform distribution, the genomic distribution, the family-specific distribution and the target sequence distribution. This paper presents a study evaluating the impact of the choice of null model on the final result of classifications. In particular, we are interested in minimizing the number of false predictions in a classification, a crucial issue for reducing the costs of biological validation.

Results: For all the tests, the target null model presented the lowest number of false positives when random sequences were used as a test. The study was performed on DNA sequences using GC content as the measure of content bias, but the results should also be valid for protein sequences. To broaden the application of the results, the study was performed using randomly generated sequences. Previous studies were performed on amino acid sequences, using only one probabilistic model (HMM) and a specific benchmark, and lack more general conclusions about the performance of null models. Finally, a benchmark test with P. falciparum confirmed these results.

Conclusions: Of the evaluated models, the best suited for classification are the uniform model and the target model. However, the uniform model presents a GC bias that can cause more false positives for candidate sequences with extreme compositional bias, a characteristic not described in previous studies. In these cases the target model is more dependable for biological validation due to its higher specificity.
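The comparison between a family model and a null model described in this abstract can be illustrated with a log-odds score. The following is only a minimal sketch, not the scoring code of HMMER, SAM or INFERNAL: the four-position `family_model`, the pseudocount and all function names are assumptions made for illustration. It shows how a GC-rich candidate scores lower against a target-composition null than against the uniform null.

```python
import math
from collections import Counter

ALPHABET = "ACGT"

def uniform_null():
    """Uniform null model: every base has probability 0.25."""
    return {b: 0.25 for b in ALPHABET}

def target_null(seq, pseudocount=1.0):
    """Null model estimated from the composition of the target sequence itself.
    The pseudocount (an assumption) keeps every probability non-zero."""
    counts = Counter(seq)
    total = len(seq) + pseudocount * len(ALPHABET)
    return {b: (counts.get(b, 0) + pseudocount) / total for b in ALPHABET}

def log_odds(seq, model, null):
    """Sum over positions of log2(P_model(base) / P_null(base)).
    `model` must have one probability table per position of `seq`."""
    return sum(math.log2(model[i][b] / null[b]) for i, b in enumerate(seq))

# Hypothetical position-specific model with a GC-rich consensus "GCGC".
family_model = [
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
]

candidate = "GCGC"
score_uniform = log_odds(candidate, family_model, uniform_null())
score_target = log_odds(candidate, family_model, target_null(candidate))
```

Because the target null already assigns high probability to G and C in a GC-rich candidate, the same match earns a smaller score, which is one way to read the higher specificity reported for the target model.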

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: Blastocladiella emersonii is an aquatic fungus of the Chytridiomycete class, which is at the base of the fungal phylogenetic tree. Some ancestral characteristics shared by fungi and animals, or by fungi and plants, could therefore have been retained in this aquatic fungus and lost in late-diverging fungal species. To identify B. emersonii sequences associated with these ancestral characteristics, two approaches were followed: (1) a large-scale comparative analysis between putative unigene sequences (uniseqs) from B. emersonii and three databases constructed ad hoc with fungal proteins, animal proteins and plant unigenes deposited in GenBank, and (2) a pairwise comparison between B. emersonii full-length cDNA sequences and their putative orthologues in the ascomycete Neurospora crassa and the basidiomycete Ustilago maydis.

Results: Comparative analyses of B. emersonii uniseqs with fungal, animal and plant databases through the two approaches mentioned above produced 166 B. emersonii sequences, which were identified as putatively absent from other fungi or not previously described. Through these approaches we found: (1) possible orthologues of genes previously identified as specific to animals and/or plants, and (2) genes conserved in fungi, but with a large difference in divergence rate in B. emersonii. Among these sequences, we observed cDNAs encoding enzymes from the coenzyme B12-dependent propionyl-CoA pathway, a metabolic route not previously described in fungi, and validated their expression in Northern blots.

Conclusion: Using two different approaches involving comparative sequence analyses, we identified sequences from the early-diverging fungus B. emersonii previously considered specific to animals or plants, as well as sequences from the same fungus that are highly divergent relative to other fungi.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR.

Results: To understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites.

Conclusion: Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: Identification of nontuberculous mycobacteria (NTM) based on phenotypic tests is time-consuming, labor-intensive and expensive, and often provides erroneous or inconclusive results. In the molecular method referred to as PRA-hsp65, a fragment of the hsp65 gene is amplified by PCR and then analyzed by restriction digest; this rapid approach offers the promise of accurate, cost-effective species identification. The aim of this study was to determine whether species identification of NTM using PRA-hsp65 is sufficiently reliable to serve as the routine methodology in a reference laboratory.

Results: A total of 434 NTM isolates were obtained from 5019 cultures submitted to the Institute Adolpho Lutz, Sao Paulo, Brazil, between January 2000 and January 2001. Species identification was performed for all isolates using conventional phenotypic methods and PRA-hsp65. Phenotypic evaluation and PRA-hsp65 were concordant for 321 (74%) isolates, and these assignments were presumed to be correct. For the remaining 113 discordant isolates, definitive species identification was obtained by sequencing a 441 bp fragment of hsp65. PRA-hsp65 identified 30 isolates with hsp65 alleles representing 13 previously unreported PRA-hsp65 patterns. Overall, species identification by PRA-hsp65 was significantly more accurate than by phenotypic methods (392 (90.3%) vs. 338 (77.9%), respectively; p < 0.0001, Fisher's test). Among the 333 isolates representing the most common pathogenic species, PRA-hsp65 provided an incorrect result for only 1.2%.

Conclusion: PRA-hsp65 is a rapid and highly reliable method and deserves consideration by any clinical microbiology laboratory charged with performing species identification of NTM.
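The accuracy comparison reported above (392 vs. 338 correct identifications out of 434) can be re-derived approximately with a two-sided Fisher exact test. The sketch below is an illustration under an explicit assumption: it treats the two methods as independent samples in a 2x2 table, which the abstract does not state. The function name `fisher_exact_two_sided` is hypothetical, and the code is not the authors' analysis.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].
    Sums the hypergeometric probabilities of all tables (with the same
    margins) that are no more likely than the observed one."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Small relative tolerance guards against floating-point ties.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))

total = 434
pra_correct, phenotype_correct = 392, 338

# Rows: method; columns: correct / incorrect identifications.
p_value = fisher_exact_two_sided(pra_correct, total - pra_correct,
                                 phenotype_correct, total - phenotype_correct)

pra_pct = round(100 * pra_correct / total, 1)
phenotype_pct = round(100 * phenotype_correct / total, 1)
```

Under this independence assumption the resulting `p_value` falls below the 0.0001 threshold quoted in the abstract; a paired test on the same isolates would be a different (and arguably more appropriate) design.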

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: Some organisms can survive extreme desiccation by entering a state of suspended animation known as anhydrobiosis. The free-living mycophagous nematode Aphelenchus avenae can be induced to enter anhydrobiosis by pre-exposure to moderate reductions in relative humidity (RH) prior to extreme desiccation. This preconditioning phase is thought to allow modification of the transcriptome by activation of genes required for desiccation tolerance.

Results: To identify such genes, a panel of expressed sequence tags (ESTs) enriched for sequences upregulated in A. avenae during preconditioning was created. A subset of 30 genes with significant matches in databases, together with a number of apparently novel sequences, was chosen for further study. Several of the recognisable genes are associated with water stress, encoding, for example, two new hydrophilic proteins related to the late embryogenesis abundant (LEA) protein family. Expression studies confirmed EST panel members to be upregulated by evaporative water loss, and the majority of genes were also induced by osmotic stress and cold, but rather fewer by heat. We attempted to use RNA interference (RNAi) to demonstrate the importance of this gene set for anhydrobiosis, but found A. avenae to be recalcitrant with the techniques used. We therefore developed a cross-species RNAi procedure using A. avenae sequences in another anhydrobiotic nematode, Panagrolaimus superbus, which is amenable to gene silencing. Of 20 A. avenae ESTs screened, a significant reduction in desiccation survival in treated P. superbus populations was observed with two sequences, one of which was novel, while the other encoded a glutathione peroxidase. To confirm a role for glutathione peroxidases in anhydrobiosis, RNAi with cognate sequences from P. superbus was performed and was also shown to reduce desiccation tolerance in this species.

Conclusions: This study has identified and characterised the expression profiles of members of the anhydrobiotic gene set in A. avenae. It also demonstrates the potential of RNAi for the analysis of anhydrobiosis and provides the first genetic data to underline the importance of effective antioxidant systems in metazoan desiccation tolerance.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Mutations in the solute carrier family 26 (sulfate transporter), member 2 (SLC26A2) gene result in a spectrum of autosomal recessive chondrodysplasias that range from the mildest recessive form of multiple epiphysial dysplasia (rMED) through the most common diastrophic dysplasia (DTD) to lethal atelosteogenesis type II and achondrogenesis IB. The clinical variability has been ascribed to the quantitative effect of mutations on sulfate transporter activity. Here we describe two Brazilian sisters, born to healthy and non-consanguineous parents, with Robin sequence, mild shortening of upper and lower limbs, brachymetacarpalia/tarsalia, additional and accelerated carpal ossification, marked genu valgum, and multiple epiphysial dysplasia. This phenotype was intermediate between DTD and rMED, and both girls are compound heterozygous for two SLC26A2 mutations: a Finnish founder mutation (c.-26+2T>C) and R279W. This combination of mutations has been observed in individuals with different phenotypes, including DTD, DTD variant, and rMED. The distinct phenotype of our cases reinforces the previously suggested hypothesis that other factors may be influencing the phenotype.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In this paper, a procedure for the on-line process control of variables is proposed. This procedure consists of inspecting the m-th item of every m items produced and deciding, at each inspection, whether the process is out of control. Two sets of limits, warning (µ0 ± W) and control (µ0 ± C), are used. If the value of the monitored statistic falls beyond the control limits, or if a sequence of h observations falls between the warning limits and the control limits, production is stopped for adjustment; otherwise, production goes on. The properties of an ergodic Markov chain are used to obtain an expression for the average cost per item. The parameters (the sampling interval m, the widths W and C of the warning and control limits (W < C), and the sequence length h) are optimized by minimizing the cost function. A numerical example illustrates the proposed procedure.
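The stopping rule described in this abstract can be sketched in a few lines. This is an illustration only: the function name `first_stop`, the example data and the parameter values are hypothetical, and the sketch assumes that the "sequence of h observations" means h consecutive inspected values in the warning zone, which the abstract does not spell out. The cost model and the Markov chain analysis are not reproduced.

```python
def first_stop(observations, mu0, W, C, h):
    """Return the 1-based index of the inspection at which production is
    stopped, or None if no stop is signalled.

    Stop when a value falls beyond mu0 +/- C, or when h consecutive values
    (an assumption) fall between the warning and control limits."""
    run = 0  # current run length of values in the warning zone
    for i, x in enumerate(observations, start=1):
        deviation = abs(x - mu0)
        if deviation > C:
            return i           # beyond the control limits
        if deviation > W:
            run += 1           # between warning and control limits
            if run >= h:
                return i
        else:
            run = 0            # back inside the warning limits
    return None

# Hypothetical production stream; only every m-th item is inspected.
items = [10.0, 10.2, 9.9, 11.5, 10.1, 11.8, 10.0, 9.7]
m = 2
inspected = items[m - 1::m]    # items number m, 2m, 3m, ...

stop_at = first_stop(inspected, mu0=10.0, W=1.0, C=3.0, h=2)
```

In this made-up example the second and third inspected values both land in the warning zone, so with h = 2 the rule stops production at the third inspection even though no value crosses a control limit.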

Relevance:

20.00% 20.00%

Publisher:

Abstract:

To reliably determine the main physical parameters (masses and ages) of young stars, we must know their distances. While the average distance to nearby star-forming regions (<300 pc) is often known, the distances to individual stars are usually unknown. Individual distances to members of young moving groups can be derived from their radial velocities and proper motions using the convergent-point strategy. We investigate the kinematic properties of the Lupus moving group with the primary objective of deriving individual distances to all group members.
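The abstract gives no formula, but the classical moving-cluster relation behind convergent-point distances is easy to state: the tangential velocity is v_t = v_r · tan(λ), where λ is the angular separation between the star and the convergent point, and v_t = 4.74047 · μ · d with μ in arcsec/yr and d in parsecs. The sketch below applies that textbook relation. It is not necessarily the authors' procedure, and the function name and input values are hypothetical rather than Lupus measurements.

```python
import math

# Converts (arcsec/yr) * pc into km/s: 1 AU/yr expressed in km/s.
KAPPA = 4.74047

def moving_group_distance_pc(v_r_kms, lambda_deg, mu_mas_yr):
    """Distance in parsecs from the classical moving-cluster relation.

    v_r_kms    : radial velocity in km/s
    lambda_deg : angular separation between star and convergent point (deg)
    mu_mas_yr  : proper motion toward the convergent point (mas/yr)
    """
    v_t = v_r_kms * math.tan(math.radians(lambda_deg))  # tangential velocity
    mu_arcsec = mu_mas_yr / 1000.0
    return v_t / (KAPPA * mu_arcsec)

# Hypothetical star, for illustration only.
distance = moving_group_distance_pc(v_r_kms=5.0, lambda_deg=45.0, mu_mas_yr=30.0)
```

The relation becomes ill-conditioned when λ approaches 0° or 90°, which is one reason individual distances from this method carry star-dependent uncertainties.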

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: Xanthomonads are plant-associated bacteria responsible for diseases on economically important crops. Xanthomonas fuscans subsp. fuscans (Xff) is one of the causal agents of common bacterial blight of bean. In this study, the complete genome sequence of strain Xff 4834-R was determined and compared to other Xanthomonas genome sequences.

Results: Comparative genomics analyses revealed core characteristics shared between Xff 4834-R and other xanthomonads, including chemotaxis elements, two-component systems, TonB-dependent transporters, secretion systems (from T1SS to T6SS) and multiple effectors. For instance, a repertoire of 29 Type 3 Effectors (T3Es), including two Transcription Activator-Like Effectors, was predicted. Mobile elements were associated with major modifications in the genome structure and gene content in comparison to other Xanthomonas genomes. Notably, a deletion of 33 kbp affects flagellum biosynthesis in Xff 4834-R. The presence of a complete flagellar cluster was assessed in a collection of more than 300 strains representing different species and pathovars of Xanthomonas. Five percent of the tested strains presented a deletion in the flagellar cluster and were non-motile. Moreover, half of the Xff strains isolated from the same epidemic as 4834-R were non-motile, and this ratio was conserved in the strains colonizing the next bean seed generations.

Conclusions: This work describes the first genome of a Xanthomonas strain pathogenic on bean and reports the existence of non-motile xanthomonads belonging to different species and pathovars. Isolation of such Xff variants from a natural epidemic may suggest that flagellar motility is not a key function for in planta fitness.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Given a large image set in which very few images have labels, how can we guess labels for the remaining majority? How can we spot images that need brand new labels, different from the predefined ones? How can we summarize these data to route the user’s attention to what really matters? Here we answer all these questions. Specifically, we propose QuMinS, a fast, scalable solution to two problems: (i) low-labor labeling (LLL): given an image set in which very few images have labels, find the most appropriate labels for the rest; and (ii) mining and attention routing: in the same setting, find clusters, the top-N_O outlier images, and the N_R images that best represent the data. Experiments on satellite images spanning up to 2.25 GB show that, in contrast to state-of-the-art labeling techniques, QuMinS scales linearly with the data size and is up to 40 times faster than top competitors (GCap) while achieving better or equal accuracy; it spots images that potentially require unpredicted labels; and it works even with tiny initial label sets, i.e., nearly five examples. We also report a case study of our method’s practical usage to show that QuMinS is a viable tool for automatic coffee crop detection from remote sensing images.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[EN] Meagre has been proposed as a candidate for marine finfish diversification in commercial aquaculture (Quémèner, 2002; Mateos, 2007). Despite its high on-growing potential, the most important bottleneck for this species is the limited production of fry. Larval rearing of this species is performed mainly by adapting seabream culture techniques, with varying success (Roo et al., 2007). However, since limited information is available about the optimal feeding sequences and nutritional requirements of meagre, more research is needed on larval rearing protocols and nutrition. The present results (elevated larval growth rate, high survival, short rotifer period) are very promising for successful implementation at industrial scale, which would help to solve the continuing lack of fry of this species in the Mediterranean and the Canary Islands.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[EN] The complete embryonic and larval development of the Canarian abalone (Haliotis tuberculata coccinea Reeve.) is described for the first time across 39 stages, from fertilization to the appearance of the third tubule on the cephalic tentacles, and illustrated in a microphotographic sequence. Eggs obtained by induced spawning with hydrogen peroxide from the GIA captive broodstock were stocked at a density of 10 eggs/mL and kept at 23 ± 0.5 °C for 62 h until the formation of the third tubule. Live eggs and larvae were continuously observed on a 24 h basis at ×400 magnification under transmitted light. At each stage, specific morphological features, illustrated by microscopic photographs, were described, as well as the time required for their appearance. Fertilized egg diameter was 205 ± 8 µm (mean ± SD), whereas the length and width of larvae ready to undergo metamorphosis were 216.6 ± 5.3 µm and 172 ± 8.8 µm, respectively. The knowledge of larval morphological development acquired through this study will contribute to the improvement of larval rearing techniques for this abalone species.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

[EN] In this paper we present a method for the regularization of a set of unstructured 3D points obtained from a sequence of stereo images. This method takes into account the information supplied by the disparity maps computed between pairs of images to constrain the regularization of the set of 3D points. We propose a model based on an energy composed of two terms: an attachment term that minimizes the distance from the 3D points to the projective lines of camera points, and a second term that regularizes the set of 3D points while preserving the discontinuities present in the disparity maps. We embed this energy in a 2D finite element method. After minimization, this method results in a large system of equations that can be optimized for fast computation. We derive an efficient implicit numerical scheme which reduces the number of calculations and memory allocations.