980 results for Nucleotide sequence


Relevance:

20.00% 20.00%

Abstract:

Members of the human APOBEC3 family of editing enzymes can inhibit various mobile genetic elements. APOBEC3A (A3A) can block the retrotransposon LINE-1 and the parvovirus adeno-associated virus type 2 (AAV-2) but does not inhibit retroviruses. In contrast, APOBEC3G (A3G) can block retroviruses but has only limited effects on AAV-2 or LINE-1. What dictates this differential target specificity remains largely undefined. Here, we modeled the structure of A3A based on its homology with the C-terminal domain of A3G and further compared the sequence of human A3A to those of 11 nonhuman primate orthologues. We then used these data to perform a mutational analysis of A3A, examining its ability to restrict LINE-1, AAV-2, and foreign plasmid DNA and to edit a single-stranded DNA substrate. The results revealed an essential functional role for the predicted single-stranded DNA-docking groove located around the A3A catalytic site. Within this region, amino acid differences between A3A and A3G are predicted to affect the shape of the polynucleotide-binding groove. Correspondingly, transferring some of these A3A residues to A3G endows the latter protein with the ability to block LINE-1 and AAV-2. These results suggest that the target specificity of APOBEC3 family members is partly defined by structural features influencing their interaction with polynucleotide substrates.

Relevance:

20.00% 20.00%

Abstract:

Methicillin-resistant Staphylococcus aureus (MRSA) is a major cause of nosocomial infections worldwide. To differentiate reliably among S. aureus isolates, we recently developed double locus sequence typing (DLST) based on the analysis of partial sequences of clfB and spa genes. In the present study, we evaluated the usefulness of DLST for epidemiological investigations of MRSA by routinely typing 1242 strains isolated in Western Switzerland. Additionally, particular local and international collections were typed by pulsed field gel electrophoresis (PFGE) and DLST to check the compatibility of DLST with the results obtained by PFGE, and for international comparisons. Using DLST, we identified the major MRSA clones of Western Switzerland, and demonstrated the close relationship between local and international clones. The congruence of 88% between the major PFGE and DLST clones indicated that our results obtained by DLST were compatible with earlier results obtained by PFGE. DLST could thus easily be incorporated in a routine surveillance procedure. In addition, the unambiguous definition of DLST types makes this method more suitable than PFGE for long-term epidemiological surveillance. Finally, the comparison of the results obtained by DLST, multilocus sequence typing, PFGE, Staphylococcal cassette chromosome mec typing and the detection of Panton-Valentine leukocidin genes indicated that no typing scheme should be used on its own. It is only the combination of data from different methods that gives the best chance of describing precisely the epidemiology and phylogeny of MRSA.

Relevance:

20.00% 20.00%

Abstract:

BACKGROUND: Establishing the genetic basis of phenotypes such as skeletal dysplasia in model organisms can provide insights into biologic processes and their role in human disease. METHODS: We screened mutagenized mice and observed a neonatal lethal skeletal dysplasia with an autosomal recessive pattern of inheritance. Through genetic mapping and positional cloning, we identified the causative mutation. RESULTS: Affected mice had a nonsense mutation in the thyroid hormone receptor interactor 11 gene (Trip11), which encodes the Golgi microtubule-associated protein 210 (GMAP-210); the affected mice lacked this protein. Golgi architecture was disturbed in multiple tissues, including cartilage. Skeletal development was severely impaired, with chondrocytes showing swelling and stress in the endoplasmic reticulum, abnormal cellular differentiation, and increased cell death. Golgi-mediated glycosylation events were altered in fibroblasts and chondrocytes lacking GMAP-210, and these chondrocytes had intracellular accumulation of perlecan, an extracellular matrix protein, but not of type II collagen or aggrecan, two other extracellular matrix proteins. The similarities between the skeletal and cellular phenotypes in these mice and those in patients with achondrogenesis type 1A, a neonatal lethal form of skeletal dysplasia in humans, suggested that achondrogenesis type 1A may be caused by GMAP-210 deficiency. Sequence analysis revealed loss-of-function mutations in the 10 unrelated patients with achondrogenesis type 1A whom we studied. CONCLUSIONS: GMAP-210 is required for the efficient glycosylation and cellular transport of multiple proteins. The identification of a mutation affecting GMAP-210 in mice, and then in humans, as the cause of a lethal skeletal dysplasia underscores the value of screening for abnormal phenotypes in model organisms and identifying the causative mutations.

Relevance:

20.00% 20.00%

Abstract:

Three-dimensional sequence stratigraphy is a potent exploration and development tool for the discovery of subtle stratigraphic traps. Reservoir morphology, heterogeneity and subtle stratigraphic trapping mechanisms can be better understood through systematic horizontal identification of the sedimentary facies of systems tracts. This identification is provided by three-dimensional attribute maps, used as an important complement to sequential analysis of two-dimensional seismic lines and well log data. On new prospects as well as on already-producing fields, the additional input of sequential analysis on three-dimensional data enables the identification, location and precise delimitation of new potentially productive zones. The first part of this paper presents four typical horizontal seismic facies assigned to the successive systems tracts of a third- or fourth-order sequence deposited in inner to outer neritic conditions on a clastic shelf. The construction of this synthetic representative sequence is based on the observed reproducibility of the horizontal seismic facies response to cyclic eustatic events in more than 35 sequences recorded in the Gulf Coast Plio-Pleistocene and Late Miocene, offshore Louisiana in the West Cameron region of the Gulf of Mexico. The second part shows how three-dimensional sequence stratigraphy can contribute to locating and understanding sedimentary facies associated with productive zones. A case study in the early Middle Miocene Cibicides opima sands shows multiple stacked gas accumulations in the top slope fan, prograding wedge and basal transgressive systems tract of the third-order sequence between SB 15.5 and SB 13.8 Ma.

Relevance:

20.00% 20.00%

Abstract:

Synthesis report: HIV-positive individuals are a population at risk for cardiovascular diseases such as myocardial or cerebral infarction. These diseases result from accelerated formation of atherosclerosis. They are largely explained by the dyslipidemia observed in this population, which is due to external factors such as advanced immunosuppression, uncontrolled viremia, and the effects of antiretroviral therapy. Recently, single nucleotide polymorphisms (SNPs) associated with dyslipidemia have been identified on a genome-wide scale by Genome-Wide Association Studies (GWAS). The main aim of this study is to evaluate and validate the cumulative effect of the SNPs identified in these GWAS on dyslipidemia in HIV-positive patients. In addition, the identification of non-genetic factors that contribute to dyslipidemia demonstrates the importance of the external factors mentioned above, in particular those of antiretroviral therapy. The study participants come from three groups: 426 people selected for a previous study, 222 people selected arbitrarily from the "Swiss HIV Cohort" and 103 people with "New-Onset Diabetes mellitus" identified in previous studies. These individuals contributed more than 34,000 lipid measurements over a mean duration of more than 7 years. For the study, 33 SNPs identified in GWAS and 9 SNPs identified in other published studies not covered by GWAS were included. Genotyping was completed for 745 (99.2%) of the 751 participants. For the statistical analyses, the antiretroviral therapies were divided into three groups (promoting dyslipidemia slightly, moderately, and strongly), and three genetic scores were created (favorable profile, moderately favorable profile, and unfavorable profile promoting dyslipidemia).
First, the effect of one or two variant alleles on lipid values was analyzed with a regression model for each SNP, adjusting the model for the non-genetic variables. Second, the SNPs with a p-value >= 0.2 were carried into a multi-SNP model, which was also adjusted for the non-genetic variables. Because this study is based on previously identified SNPs, it evaluates only the established association between each SNP and the criteria defined beforehand: total cholesterol, HDL cholesterol, non-HDL cholesterol, or triglycerides. The results of the study confirm those in the literature. This study shows that SNPs associated with dyslipidemia must be analyzed in the context of antiretroviral therapy, taking demographics into account and considering HIV-related values (CD4+, viremia). These SNPs show a tendency to predict prolonged dyslipidemia in the individual. Indeed, a patient on an antiretroviral therapy that promotes dyslipidemia and with an unfavorable genetic background has a 3-fold higher risk of elevated non-HDL cholesterol, a 5-fold higher risk of lowered HDL cholesterol, and a 4- to 5-fold higher risk of hypertriglyceridemia than a patient with a favorable genetic background on an antiretroviral therapy that promotes dyslipidemia only slightly. Given the correlation between SNPs and antiretroviral therapy, clinicians should incorporate genetic information in order to choose an antiretroviral therapy suited to the patient's genetic background.

Relevance:

20.00% 20.00%

Abstract:

Models of codon evolution have attracted particular interest because of their unique capabilities to detect selection forces and their high fit when applied to sequence evolution. We describe here a novel approach for modeling codon evolution, which is based on the Kronecker product of matrices. The 61 × 61 codon substitution rate matrix is created using the Kronecker product of three 4 × 4 nucleotide substitution matrices, the equilibrium frequency of codons, and the selection rate parameter. The entries of the nucleotide substitution matrices and the selection rate are considered as parameters of the model, which are optimized by maximum likelihood. Our fully mechanistic model allows the instantaneous substitution matrix between codons to be fully estimated with only 19 parameters instead of 3,721, by using the biological interdependence existing between positions within codons. We illustrate the properties of our model using computer simulations and assess its relevance by comparing the AICc measures of our model and other models of codon evolution on simulations and a large range of empirical data sets. We show that our model fits most biological data better than current codon models. Furthermore, the parameters in our model can be interpreted in a similar way to the exchangeability rates found in empirical codon models.
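The dimension bookkeeping in this abstract can be illustrated with a minimal sketch. It only shows how three 4 × 4 per-position matrices combine by Kronecker product into a 64 × 64 matrix, which shrinks to 61 × 61 once the three stop codons of the standard genetic code are removed (61 × 61 = 3,721 entries). The function names, the base ordering and the toy matrix values are assumptions made for illustration; the equilibrium codon frequencies and the selection parameter mentioned in the abstract are deliberately left out, so this is not the authors' model.

```python
from itertools import product

BASES = "TCAG"  # assumed base ordering, chosen only for this illustration
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def kron(a, b):
    """Kronecker product of two square matrices given as lists of lists."""
    n, m = len(a), len(b)
    return [
        [a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
        for i in range(n * m)
    ]


def codon_matrix(p1, p2, p3):
    """Combine three 4 x 4 per-position matrices into a 61 x 61 sense-codon matrix."""
    full = kron(kron(p1, p2), p3)  # 64 x 64, rows and columns indexed by codon
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    keep = [i for i, c in enumerate(codons) if c not in STOP_CODONS]
    sense = [codons[i] for i in keep]
    return sense, [[full[i][j] for j in keep] for i in keep]


# Toy per-position matrix (hypothetical values, reused at all three positions).
toy = [[0.7 if i == j else 0.1 for j in range(4)] for i in range(4)]
sense_codons, matrix = codon_matrix(toy, toy, toy)
print(len(sense_codons), len(matrix), len(matrix[0]))
```

Each entry of the combined matrix is the product of the three per-position entries, which is why a handful of nucleotide-level parameters can determine every codon-level entry.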

Relevance:

20.00% 20.00%

Abstract:

Rho GTPases are conformational switches that control a wide variety of signaling pathways critical for eukaryotic cell development and proliferation. They represent attractive targets for drug design, as their aberrant function and deregulated activity are associated with many human diseases, including cancer. Extensive high-resolution structures (>100) and recent mutagenesis studies have laid the foundation for the design of new structure-based chemotherapeutic strategies. Although the inhibition of Rho signaling with drug-like compounds is an active area of current research, very little attention has been devoted to directly inhibiting Rho by targeting potential allosteric non-nucleotide binding sites. By avoiding the nucleotide binding site, compounds may minimize the potential for undesirable off-target interactions with other ubiquitous GTP and ATP binding proteins. Here we describe the application of molecular dynamics simulations, principal component analysis, sequence conservation analysis, and ensemble small-molecule fragment mapping to provide an extensive mapping of potential small-molecule binding pockets on Rho family members. Characterized sites include novel pockets in the vicinity of the conformationally responsive switch regions as well as distal sites that appear to be related to the conformations of the nucleotide binding region. Furthermore, the use of accelerated molecular dynamics simulation, an advanced sampling method that extends the accessible time-scale of conventional simulations, is found to enhance the characterization of novel binding sites when conformational changes are important for the protein mechanism.

Relevance:

20.00% 20.00%

Abstract:

We propose and validate a multivariate classification algorithm for characterizing changes in human intracranial electroencephalographic data (iEEG) after learning motor sequences. The algorithm is based on a Hidden Markov Model (HMM) that captures spatio-temporal properties of the iEEG at the level of single trials. Continuous iEEG was acquired during two sessions (one before and one after a night of sleep) in two patients with depth electrodes implanted in several brain areas. They performed a visuomotor sequence (serial reaction time task, SRTT) using the fingers of their non-dominant hand. Our results show that the decoding algorithm correctly classified single iEEG trials from the trained sequence as belonging to either the initial training phase (day 1, before sleep) or a later consolidated phase (day 2, after sleep), whereas it failed to do so for trials belonging to a control condition (pseudo-random sequence). Accurate single-trial classification was achieved by taking advantage of the distributed pattern of neural activity. However, across all the contacts, the hippocampus contributed most significantly to the classification accuracy for both patients, as did one fronto-striatal contact for one patient. Together, these human intracranial findings demonstrate that a multivariate decoding approach can detect learning-related changes at the level of single-trial iEEG. Because it allows an unbiased identification of brain sites contributing to a behavioral effect (or experimental condition) at the level of a single subject, this approach could be usefully applied to assess the neural correlates of other complex cognitive functions in patients implanted with multiple electrodes.

Relevance:

20.00% 20.00%

Abstract:

It is often supposed that a protein's rate of evolution and its amino acid content are determined by the function and anatomy of the protein. Here we examine an alternative possibility, namely that the requirement to specify in the unprocessed RNA, in the vicinity of intron-exon boundaries, information necessary for removal of introns (e.g., exonic splice enhancers) affects both amino acid usage and rates of protein evolution. We find that the majority of amino acids show skewed usage near intron-exon boundaries, and that differences in the trends for the 2-fold and 4-fold blocks of both arginine and leucine show this to be owing to effects mediated at the nucleotide level. More specifically, there is a robust relationship between the extent to which an amino acid is preferred/avoided near boundaries and its enrichment/paucity in splice enhancers. As might then be expected, the rate of evolution is lowest near intron-exon boundaries, at least in part owing to splice enhancers, such that domains flanking intron-exon junctions evolve on average at under half the rate of exon centres from the same gene. In contrast, the rate of evolution of intronless retrogenes is highest near the domains where intron-exon junctions previously resided. The proportion of sequence near intron-exon boundaries is one of the stronger predictors of a protein's rate of evolution in mammals yet described. We conclude that after intron insertion selection favours modification of amino acid content near intron-exon junctions, so as to enable efficient intron removal, these changes then being subject to strong purifying selection even if nonoptimal for protein function. Thus there exists a strong force operating on protein evolution in mammals that is not explained directly in terms of the biology of the protein.

Relevance:

20.00% 20.00%

Abstract:

The CD209 gene family that encodes C-type lectins in primates includes CD209 (DC-SIGN), CD209L (L-SIGN) and CD209L2. Understanding the evolution of these genes can help clarify the duplication events generating this family and the process leading to the repeated neck region, and can identify protein domains under selective pressure. We compiled sequences from 14 primates representing 40 million years of evolution and from three non-primate mammal species. Phylogenetic analyses used Bayesian inference, and nucleotide substitutional patterns were assessed by codon-based maximum likelihood. Analyses suggest that CD209 genes emerged from a first duplication event in the common ancestor of anthropoids, yielding CD209L2 and an ancestral CD209 gene, which, in turn, duplicated in the common Old World primate ancestor, giving rise to CD209L and CD209. K(A)/K(S) values averaged over the entire tree were 0.43 (CD209), 0.52 (CD209L) and 0.35 (CD209L2), consistent with overall signatures of purifying selection. We also assessed the Toll-like receptor (TLR) gene family, which shares with CD209 genes a common profile of evolutionary constraint. The general feature of purifying selection of CD209 genes, despite an apparent redundancy (gene absence and gene loss), may reflect the need to faithfully recognize a multiplicity of pathogen motifs, commensals and a number of self-antigens.

Relevance:

20.00% 20.00%

Abstract:

Background: The human chromosome 8p23.1 region contains a 3.8–4.5 Mb segment which can be found in different orientations (defined as genomic inversion) among individuals. The identification of single nucleotide polymorphisms (SNPs) tightly linked to the genomic orientation of a given region should be useful to indirectly evaluate the genotypes of large genomic orientations in the individuals. Results: We have identified 16 SNPs, which are in linkage disequilibrium (LD) with the 8p23.1 inversion as detected by fluorescent in situ hybridization (FISH). The variability of the 8p23.1 orientation in 150 HapMap samples was predicted using this set of SNPs and was verified by FISH in a subset of samples. Four genes (NEIL2, MSRA, CTSB and BLK) were found differentially expressed (p<0.0005) according to the orientation of the 8p23.1 region. Finally, we have found variable levels of mosaicism for the orientation of the 8p23.1 as determined by FISH. Conclusion: By means of dense SNP genotyping of the region, haplotype-based computational analyses and FISH experiments we could infer and verify the orientation status of alleles in the 8p23.1 region by detecting two short haplotype stretches at both ends of the inverted region, which are likely the relic of the chromosome in which the original inversion occurred. Moreover, an impact of 8p23.1 inversion on gene expression levels cannot be ruled out, since four genes from this region have statistically significant different expression levels depending on the inversion status. FISH results in lymphoblastoid cell lines suggest the presence of mosaicism regarding the 8p23.1 inversion.
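The idea of a SNP "tagging" an inversion can be made concrete with a small sketch of one common linkage disequilibrium measure, r², computed between a SNP allele and the inversion orientation on phased haplotypes. The abstract does not say which LD statistic or software was used, so the choice of r², the 0/1 coding, the function name and the toy haplotypes are all assumptions made for illustration.

```python
def r_squared(haplotypes):
    """r^2 between two biallelic loci.

    haplotypes: list of (snp_allele, orientation) pairs, each coded 0 or 1
    (hypothetical coding: orientation 1 = inverted, 0 = non-inverted).
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n  # frequency of SNP allele 1
    p_b = sum(b for _, b in haplotypes) / n  # frequency of the inverted orientation
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b  # disequilibrium coefficient D
    return d * d / denom


# Toy data: the SNP allele always matches the orientation, so r^2 is at its maximum.
tagging = [(1, 1)] * 3 + [(0, 0)] * 5
# Toy data: all four combinations equally frequent, so the loci are independent.
unlinked = [(1, 1), (1, 0), (0, 1), (0, 0)]
print(r_squared(tagging), r_squared(unlinked))
```

A SNP with r² close to 1 against the FISH-determined orientation would let the orientation be predicted from genotype data alone, which is the use the abstract describes.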

Relevance:

20.00% 20.00%

Abstract:

The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.

Relevance:

20.00% 20.00%

Abstract:

The construction of metagenomic libraries has permitted the study of microorganisms resistant to isolation, and the analysis of 16S rDNA sequences has been used for over two decades to examine bacterial biodiversity. Here, we show that the analysis of random sequence reads (RSRs) instead of 16S is a suitable shortcut to estimate the biodiversity of a bacterial community from metagenomic libraries. We generated 10,010 RSRs from a metagenomic library of microorganisms found in human faecal samples. We then searched them against a prokaryotic sequence database using the program BLASTN to assign a taxon to each RSR. The results were compared with those obtained by screening and analysing the clones containing 16S rDNA sequences in the whole library. We found that the biodiversity observed by RSR analysis is consistent with that obtained by 16S rDNA. We also show that RSRs are suitable for comparing the biodiversity of different metagenomic libraries. RSRs can thus provide a good estimate of the biodiversity of a metagenomic library and, as an alternative to 16S, this approach is both faster and cheaper.

Relevance:

20.00% 20.00%

Abstract:

A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence-analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a ‘coding statistic’ is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data, and the method is applied to the yeast chromosome III sequence and to C. elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.
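The decomposition step described above can be sketched as a two-component normal mixture fitted to per-window statistics, with the weight of the "coding" component read off as the estimated coding fraction of windows. The abstract does not state how the decomposition is carried out, so the use of expectation-maximization (EM) here is an assumption, as are the function names, the convention that coding windows score higher, and the synthetic data.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_normals(values, iterations=200, min_sd=1e-6):
    """Fit a two-component normal mixture by EM (an assumed fitting procedure).

    Returns (weight_hi, (mean_lo, sd_lo), (mean_hi, sd_hi)), where weight_hi
    is the mixing weight of the component started from the upper half of the data.
    """
    ordered = sorted(values)
    half = len(ordered) // 2
    # Start from the means of the lower and upper halves of the data.
    mean_lo = sum(ordered[:half]) / half
    mean_hi = sum(ordered[half:]) / (len(ordered) - half)
    sd_lo = sd_hi = max((ordered[-1] - ordered[0]) / 4.0, min_sd)
    weight_hi = 0.5
    for _ in range(iterations):
        # E-step: probability that each window belongs to the high component.
        resp = []
        for x in values:
            hi = weight_hi * normal_pdf(x, mean_hi, sd_hi)
            lo = (1.0 - weight_hi) * normal_pdf(x, mean_lo, sd_lo)
            total = hi + lo
            resp.append(hi / total if total > 0.0 else 0.5)
        # M-step: re-estimate the weight, means and standard deviations.
        n_hi = sum(resp)
        n_lo = len(values) - n_hi
        if n_hi < 1e-9 or n_lo < 1e-9:
            break  # one component vanished; keep the last estimates
        mean_hi = sum(r * x for r, x in zip(resp, values)) / n_hi
        mean_lo = sum((1.0 - r) * x for r, x in zip(resp, values)) / n_lo
        var_hi = sum(r * (x - mean_hi) ** 2 for r, x in zip(resp, values)) / n_hi
        var_lo = sum((1.0 - r) * (x - mean_lo) ** 2 for r, x in zip(resp, values)) / n_lo
        sd_hi = max(math.sqrt(var_hi), min_sd)
        sd_lo = max(math.sqrt(var_lo), min_sd)
        weight_hi = n_hi / len(values)
    return weight_hi, (mean_lo, sd_lo), (mean_hi, sd_hi)


# Synthetic window statistics (hypothetical): 300 noncoding-like, 700 coding-like.
rng = random.Random(0)
windows = [rng.gauss(0.0, 1.0) for _ in range(300)]
windows += [rng.gauss(5.0, 1.0) for _ in range(700)]
coding_fraction, noncoding_params, coding_params = fit_two_normals(windows)
print(f"estimated coding fraction: {coding_fraction:.2f}")
```

The two synthetic components are well separated here; with real coding statistics the distributions overlap more, which is where the characterized error bounds mentioned in the abstract would matter.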