955 resultados para phylogenetic analysis, complete genome, composition vector, correlation-related distance metric


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pós-graduação em Zootecnia - FMVZ

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data.Results: Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution.Conclusions: While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis - a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and 'two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Xylella fastidiosa inhabits the plant xylem, a nutrient-poor environment, so that mechanisms to sense and respond to adverse environmental conditions are extremely important for bacterial survival in the plant host. Although the complete genome sequences of different Xylella strains have been determined, little is known about stress responses and gene regulation in these organisms. In this work, a DNA microarray was constructed containing 2,600 ORFs identified in the genome sequencing project of Xylella fastidiosa 9a5c strain, and used to check global gene expression differences in the bacteria when it is infecting a symptomatic and a tolerant citrus tree. Different patterns of expression were found in each variety, suggesting that bacteria are responding differentially according to each plant xylem environment. The global gene expression profile was determined and several genes related to bacterial survival in stressed conditions were found to be differentially expressed between varieties, suggesting the involvement of different strategies for adaptation to the environment. The expression pattern of some genes related to the heat shock response, toxin and detoxification processes, adaptation to atypical conditions, repair systems as well as some regulatory genes are discussed in this paper. DNA microarray proved to be a powerful technique for global transcriptome analyses. This is one of the first studies of Xylella fastidiosa gene expression in vivo which helped to increase insight into stress responses and possible bacterial survival mechanisms in the nutrient-poor environment of xylem vessels.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study focused on the structure and composition of archaeal communities in sediments of tropical mangroves in order to obtain sufficient insight into two Brazilian sites from different locations (one pristine and another located in an urban area) and at different depth levels from the surface. Terminal restriction fragment length polymorphism (T-RFLP) of PCR-amplified 16S rRNA gene fragments was used to scan the archaeal community structure, and 16S rRNA gene clone libraries were used to determine the community composition. Redundancy analysis of T-RFLP patterns revealed differences in archaeal community structure according to location, depth and soil attributes. Parameters such as pH, organic matter, potassium and magnesium presented significant correlation with general community structure. Furthermore, phylogenetic analysis revealed a community composition distributed differently according to depth where, in shallow samples, 74.3% of sequences were affiliated with Euryarchaeota and 25.7% were shared between Crenarchaeota and Thaumarchaeota, while for the deeper samples, 24.3% of the sequences were affiliated with Euryarchaeota and 75.7% with Crenarchaeota and Thaumarchaeota. Archaeal diversity measurements based on 16S rRNA gene clone libraries decreased with increasing depth and there was a greater difference between depths (<18% of sequences shared) than sites (>25% of sequences shared). Taken together, our findings indicate that mangrove ecosystems support a diverse archaeal community; it might possibly be involved in nutrient cycles and are affected by sediment properties, depth and distinct locations. (C) 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Motivation An actual issue of great interest, both under a theoretical and an applicative perspective, is the analysis of biological sequences for disclosing the information that they encode. The development of new technologies for genome sequencing in the last years, opened new fundamental problems since huge amounts of biological data still deserve an interpretation. Indeed, the sequencing is only the first step of the genome annotation process that consists in the assignment of biological information to each sequence. Hence given the large amount of available data, in silico methods became useful and necessary in order to extract relevant information from sequences. The availability of data from Genome Projects gave rise to new strategies for tackling the basic problems of computational biology such as the determination of the tridimensional structures of proteins, their biological function and their reciprocal interactions. Results The aim of this work has been the implementation of predictive methods that allow the extraction of information on the properties of genomes and proteins starting from the nucleotide and aminoacidic sequences, by taking advantage of the information provided by the comparison of the genome sequences from different species. In the first part of the work a comprehensive large scale genome comparison of 599 organisms is described. 2,6 million of sequences coming from 551 prokaryotic and 48 eukaryotic genomes were aligned and clustered on the basis of their sequence identity. This procedure led to the identification of classes of proteins that are peculiar to the different groups of organisms. Moreover the adopted similarity threshold produced clusters that are homogeneous on the structural point of view and that can be used for structural annotation of uncharacterized sequences. The second part of the work focuses on the characterization of thermostable proteins and on the development of tools able to predict the thermostability of a protein starting from its sequence. By means of Principal Component Analysis the codon composition of a non redundant database comprising 116 prokaryotic genomes has been analyzed and it has been showed that a cross genomic approach can allow the extraction of common determinants of thermostability at the genome level, leading to an overall accuracy in discriminating thermophilic coding sequences equal to 95%. This result outperform those obtained in previous studies. Moreover, we investigated the effect of multiple mutations on protein thermostability. This issue is of great importance in the field of protein engineering, since thermostable proteins are generally more suitable than their mesostable counterparts in technological applications. A Support Vector Machine based method has been trained to predict if a set of mutations can enhance the thermostability of a given protein sequence. The developed predictor achieves 88% accuracy.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Phase 1: To validate Near-Infrared Reflectance Analysis (NIRA) as a fast, reliable and suitable method for routine evaluation of human milk’s nitrogen and fat content. Phase 2: To determine whether fat content, protein content and osmolality of HM before and after fortification may affect gastroesophageal reflux (GER) in symptomatic preterm infants. Patients and Methods: Phase 1: 124 samples of expressed human milk (55 from preterm mothers and 69 from term mothers) were used to validate NIRA against traditional methods (Gerber method for fat and Kjeldhal method for nitrogen). Phase 2: GER was evaluated in 17 symptomatic preterm newborns fed naïve and fortified HM by combined pH/intraluminal-impedance monitoring (pH-MII). HM fat and protein content was analysed by a Near-Infrared-Reflectance-Analysis (NIRA). HM osmolality was tested before and after fortification. GER indexes measured before and after fortification were compared, and were also related with HM fat and protein content and osmolality before and after fortification. Results: Phase 1: · A strong agreement was found between traditional methods’ and NIRA’s results (expressed as g/100 g of milk), both for fat and nitrogen content in term (mean fat content: NIRA=2.76; Gerber=2.76; mean nitrogen content: NIRA=1.88; Kjeldhal =1.92) and preterm (mean fat content: NIRA=3.56; Kjeldhal=3.52; mean nitrogen content: NIRA=1.91; Kjeldhal =1.89) mother’s milk. · Nitrogen content of the milk samples, measured by NIRA, ranged from 1.18 to 2.71 g/100 g of milk in preterm milk and from 1.48 to 2.47 in term milk; fat content ranged from 1.27 to 6.23 g/100 g of milk in preterm milk and from 1.01 to 6.01 g/100 g of milk in term milk. Phase 2: · An inverse correlation was found between naïve HM protein content and acid reflux index (RIpH: p=0.041, rho=-0.501). · After fortification, osmolality often exceeded the values recommended for infant feeds; furthermore, a statistically significant (p<.05) increase in non acid reflux indexes was observed. Conclusions: NIRA can be used as a fast, reliable and suitable tool for routine monitoring of macronutrient content of human milk. Protein content of naïve HM may influence acid GER in preterm infants. A standard fortification of HM may worsen non acid GER indexes and, due to the extreme variability in HM composition, may overcome both recommended protein intake and HM osmolality. Thus, an individualized fortification, based on the analysis of the composition of naïve HM, could optimize both nutrient intake and feeding tolerance.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Multilocus sequence analysis (MLSA) based on recN, rpoA and thdF genes was done on more than 30 species of the family Enterobacteriaceae with a focus on Cronobacter and the related genus Enterobacter. The sequences provide valuable data for phylogenetic, taxonomic and diagnostic purposes. Phylogenetic analysis showed that the genus Cronobacter forms a homogenous cluster related to recently described species of Enterobacter, but distant to other species of this genus. Combining sequence information on all three genes is highly representative for the species' %GC-content used as taxonomic marker. Sequence similarity of the three genes and even of recN alone can be used to extrapolate genetic similarities between species of Enterobacteriaceae. Finally, the rpoA gene sequence, which is the easiest one to determine, provides a powerful diagnostic tool to identify and differentiate species of this family. The comparative analysis gives important insights into the phylogeny and genetic relatedness of the family Enterobacteriaceae and will serve as a basis for further studies and clarifications on the taxonomy of this large and heterogeneous family.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Equine Actinobacillus species were analysed phylogenetically by 16S rRNA gene (rrs) sequencing focusing on the species Actinobacillus equuli, which has recently been subdivided into the non-haemolytic A. equuli subsp. equuli and the haemolytic A. equuli subsp. haemolyticus. In parallel we determined the profile for RTX toxin genes of the sample of strains by PCR testing for the presence of the A. equuli haemolysin gene aqx, and the toxin genes apxI, apxII, apxIII and apxIV, which are known in porcine pathogens such as Actinobacillus pleuropneumoniae and Actinobacillus suis. The rrs-based phylogenetic analysis revealed two distinct subclusters containing both A. equuli subsp. equuli and A. equuli subsp. haemolyticus distributed through both subclusters with no correlation to taxonomic classification. Within one of the rrs-based subclusters containing the A. equuli subsp. equuli type strain, clustered as well the porcine Actinobacillus suis strains. This latter is known to be also phenotypically closely related to A. equuli. The toxin gene analysis revealed that all A. equuli subsp. haemolyticus strains from both rrs subclusters specifically contained the aqx gene while the A. suis strains harboured the genes apxI and apxII. The aqx gene was found to be specific for A. equuli subsp. haemolyticus, since A. equuli subsp. equuli contained no aqx nor any of the other RTX genes tested. The specificity of aqx for the haemolytic equine A. equuli and ApxI and ApxII for the porcine A. suis indicates a role of these RTX toxins in host species predilection of the two closely related species of bacterial pathogens and allows PCR based diagnostic differentiation of the two.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

ABSTRACT: Transcription factors (TFs) are proteins that have played a central role both in evolution and in domestication, and are major regulators of development in living organisms. Plant genome sequences reveal that approximately 7% of all genes encode putative TFs. The DOF (DNA binding with One Finger) TF family has been associated with vital processes exclusive to higher plants and to their close ancestors (algae, mosses and ferns). These are seed maturation and germination, light-mediated regulation, phytohormone and plant responses to biotic and abiotic stresses, etc. In Hordeum vulgare and Oryza sativa, 26 and 30 different Dof genes, respectively, have been annotated. Brachypodium distachyon has been the first Pooideae grass to be sequenced and, due to its genomic, morphological and physiological characteristics, has emerged as the model system for temperate cereals, such as wheat and barley. RESULTS: Through searches in the B. distachyon genome, 27 Dof genes have been identified and a phylogenetic comparison with the Oryza sativa and the Hordeum vulgare DOFs has been performed. To explore the evolutionary relationship among these DOF proteins, a combined phylogenetic tree has been constructed with the Brachypodium DOFs and those from rice and barley. This phylogenetic analysis has classified the DOF proteins into four Major Cluster of Orthologous Groups (MCOGs). Using RT-qPCR analysis the expression profiles of the annotated BdDof genes across four organs (leaves, roots, spikes and seeds) has been investigated. These results have led to a classification of the BdDof genes into two groups, according to their expression levels. The genes highly or preferentially expressed in seeds have been subjected to a more detailed expression analysis (maturation, dry stage and germination). CONCLUSIONS: Comparison of the expression profiles of the Brachypodium Dof genes with the published functions of closely related DOF sequences from the cereal species considered here, deduced from the phylogenetic analysis, indicates that although the expression profile has been conserved in many of the putative orthologs, in some cases duplication followed by subsequent divergence may have occurred (neo-functionalization).

Relevância:

100.00% 100.00%

Publicador:

Resumo:

An increasing number of proteins with weak sequence similarity have been found to assume similar three-dimensional fold and often have similar or related biochemical or biophysical functions. We propose a method for detecting the fold similarity between two proteins with low sequence similarity based on their amino acid properties alone. The method, the proximity correlation matrix (PCM) method, is built on the observation that the physical properties of neighboring amino acid residues in sequence at structurally equivalent positions of two proteins of similar fold are often correlated even when amino acid sequences are different. The hydrophobicity is shown to be the most strongly correlated property for all protein fold classes. The PCM method was tested on 420 proteins belonging to 64 different known folds, each having at least three proteins with little sequence similarity. The method was able to detect fold similarities for 40% of the 420 sequences. Compared with sequence comparison and several fold-recognition methods, the method demonstrates good performance in detecting fold similarities among the proteins with low sequence identity. Applied to the complete genome of Methanococcus jannaschii, the method recognized the folds for 22 hypothetical proteins.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pax proteins are a family of transcription factors with a highly conserved paired domain; many members also contain a paired-type homeodomain and/or an octapeptide. Nine mammalian Pax genes are known and classified into four subgroups: Pax-1/9, Pax-2/5/8, Pax-3/7, and Pax-4/6. Most of these genes are involved in nervous system development. In particular, Pax-6 is a key regulator that controls eye development in vertebrates and Drosophila. Although the Pax-4/6 subgroup seems to be more closely related to Pax-2/5/8 than to Pax-3/7 or Pax-1/9, its evolutionary origin is unknown. We therefore searched for a Pax-6 homolog and related genes in Cnidaria, which is the lowest phylum of animals that possess a nervous system and eyes. A sea nettle (a jellyfish) genomic library was constructed and two pax genes (Pax-A and -B) were isolated and partially sequenced. Surprisingly, unlike most known Pax genes, the paired box in these two genes contains no intron. In addition, the complete cDNA sequences of hydra Pax-A and -B were obtained. Hydra Pax-B contains both the homeodomain and the octapeptide, whereas hydra Pax-A contains neither. DNA binding assays showed that sea nettle Pax-A and -B and hydra Pax-A paired domains bound to a Pax-5/6 site and a Pax-5 site, although hydra Pax-B paired domain bound neither. An alignment of all available paired domain sequences revealed two highly conserved regions, which cover the DNA binding contact positions. Phylogenetic analysis showed that Pax-A and especially Pax-B were more closely related to Pax-2/5/8 and Pax-4/6 than to Pax-1/9 or Pax-3/7 and that the Pax genes can be classified into two supergroups: Pax-A/Pax-B/Pax-2/5/8/4/6 and Pax-1/9/3/7. From this analysis and the gene structure, we propose that modern Pax-4/6 and Pax-2/5/8 genes evolved from an ancestral gene similar to cnidarian Pax-B, having both the homeodomain and the octapeptide.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The evolution of novelty in tightly integrated biological systems, such as hormones and their receptors, seems to challenge the theory of natural selection: it has not been clear how a new function for any one part (such as a ligand) can be selected for unless the other members of the system (e.g., a receptor) are already present. Here I show—based on identification and phylogenetic analysis of steroid receptors in basal vertebrates and reconstruction of the sequences and functional attributes of ancestral proteins—that the first steroid receptor was an estrogen receptor, followed by a progesterone receptor. Genome mapping and phylogenetic analyses indicate that the full complement of mammalian steroid receptors evolved from these ancient receptors by two large-scale genome expansions, one before the advent of jawed vertebrates and one after. Specific regulation of physiological processes by androgens and corticoids are relatively recent innovations that emerged after these duplications. These findings support a model of ligand exploitation in which the terminal ligand in a biosynthetic pathway is the first for which a receptor evolves; selection for this hormone also selects for the synthesis of intermediates despite the absence of receptors, and duplicated receptors then evolve affinity for these substances. In this way, novel hormone-receptor pairs are created, and an integrated system of increasing complexity elaborated. This model suggests that ligands for some “orphan” receptors may be found among intermediates in the synthesis of ligands for phylogenetically related receptors.