954 resultados para Sequence Analysis, DNA


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A porcine BAC clone harboring the tightly linked IFNAR1 and IFNGR2 genes was identified by comparative analysis of the publicly available porcine BAC end sequences. The complete 168,835 bp insert sequence of this clone was determined. Sequence comparisons of the genomic sequence with EST sequences from public databases were performed and allowed a detailed annotation of the IFNAR1 and IFNGR2 genes. The analyzed genes showed a conserved genomic organization with their known mammalian orthologs, however the sequence conservation of these genes across species was relatively low. In addition to the IFNAR1 and IFNGR2 genes, which were completely sequenced, the analyzed BAC clone also contained parts of an orphan gene encoding a putative transmembrane protein (TMEM50B). In contrast to the IFNAR1 and IFNGR2 genes the sequence conservation of the TMEM50B gene across different mammalian species was extremely high.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Defensins are a family of evolutionary ancient antimicrobial peptides consisting of three sub-families: alpha-, beta- and theta-defensins. This investigation was focused on the genomic characterization of equine beta-defensins and the investigation of the potential clustering of beta-defensin genes in the equine genome. Six genomic BAC clones were isolated from the CHORI-241 library and one of these was mapped by FISH to ECA 27q17. This location was confirmed by RH-mapping. The contiguous 212 kb sequence of this clone was determined. Sequence analysis revealed the identification of ten pseudogenes and nine genes, six of which were highly homologous to human beta-defensin DEFB4. Clustering of the beta-defensin genes was confirmed and the order of the genes on the analyzed BAC was related to the corresponding defensin cluster on HSA 8. The knowledge about the sequence and the genomic structure of the equine beta-defensin genes will improve the classification of different paralogous defensin genes and is a prerequisite for subsequent functional studies. Additionally, the first alpha-defensin-like sequence outside the groups of primates, lagomorphs and rodents (glires) was identified.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Multilocus sequence analysis (MLSA) based on recN, rpoA and thdF genes was done on more than 30 species of the family Enterobacteriaceae with a focus on Cronobacter and the related genus Enterobacter. The sequences provide valuable data for phylogenetic, taxonomic and diagnostic purposes. Phylogenetic analysis showed that the genus Cronobacter forms a homogenous cluster related to recently described species of Enterobacter, but distant to other species of this genus. Combining sequence information on all three genes is highly representative for the species' %GC-content used as taxonomic marker. Sequence similarity of the three genes and even of recN alone can be used to extrapolate genetic similarities between species of Enterobacteriaceae. Finally, the rpoA gene sequence, which is the easiest one to determine, provides a powerful diagnostic tool to identify and differentiate species of this family. The comparative analysis gives important insights into the phylogeny and genetic relatedness of the family Enterobacteriaceae and will serve as a basis for further studies and clarifications on the taxonomy of this large and heterogeneous family.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: The Enterococcus faecium genogroup, referred to as clonal complex 17 (CC17), seems to possess multiple determinants that increase its ability to survive and cause disease in nosocomial environments. METHODS: Using 53 clinical and geographically diverse US E. faecium isolates dating from 1971 to 1994, we determined the multilocus sequence type; the presence of 16 putative virulence genes (hyl(Efm), esp(Efm), and fms genes); resistance to ampicillin (AMP) and vancomycin (VAN); and high-level resistance to gentamicin and streptomycin. RESULTS: Overall, 16 different sequence types (STs), mostly CC17 isolates, were identified in 9 different regions of the United States. The earliest CC17 isolates were part of an outbreak that occurred in 1982 in Richmond, Virginia. The characteristics of CC17 isolates included increases in resistance to AMP, the presence of hyl(Efm) and esp(Efm), emergence of resistance to VAN, and the presence of at least 13 of 14 fms genes. Eight of 41 of the early isolates with resistance to AMP, however, were not in CC17. CONCLUSIONS: Although not all early US AMP isolates were clonally related, E. faecium CC17 isolates have been circulating in the United States since at least 1982 and appear to have progressively acquired additional virulence and antibiotic resistance determinants, perhaps explaining the recent success of this species in the hospital environment.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this study, we present a trilocus sequence typing (TLST) scheme based on intragenic regions of two antigenic genes, ace and salA (encoding a collagen/laminin adhesin and a cell wall-associated antigen, respectively), and a gene associated with antibiotic resistance, lsa (encoding a putative ABC transporter), for subspecies differentiation of Enterococcus faecalis. Each of the alleles was analyzed using 50 E. faecalis isolates representing 42 diverse multilocus sequence types (ST(M); based on seven housekeeping genes) and four groups of clonally linked (by pulsed-field gel electrophoresis [PFGE]) isolates. The allelic profiles and/or concatenated sequences of the three genes agreed with multilocus sequence typing (MLST) results for typing of 49 of the 50 isolates; in addition to the one exception, two isolates were found to have identical TLST types but were single-locus variants (differing by a single nucleotide) by MLST and were therefore also classified as clonally related by MLST. TLST was also comparable to PFGE for establishing short-term epidemiological relationships, typing all isolates classified as clonally related by PFGE with the same type. TLST was then applied to representative isolates (of each PFGE subtype and isolation year) of a collection of 48 hospital isolates and demonstrated the same relationships between isolates of an outbreak strain as those found by MLST and PFGE. In conclusion, the TLST scheme described here was shown to be successful for investigating short-term epidemiology in a hospital setting and may provide an alternative to MLST for discriminating isolates.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sequence analysis and optimal matching are useful heuristic tools for the descriptive analysis of heterogeneous individual pathways such as educational careers, job sequences or patterns of family formation. However, to date it remains unclear how to handle the inevitable problems caused by missing values with regard to such analysis. Multiple Imputation (MI) offers a possible solution for this problem but it has not been tested in the context of sequence analysis. Against this background, we contribute to the literature by assessing the potential of MI in the context of sequence analyses using an empirical example. Methodologically, we draw upon the work of Brendan Halpin and extend it to additional types of missing value patterns. Our empirical case is a sequence analysis of panel data with substantial attrition that examines the typical patterns and the persistence of sex segregation in school-to-work transitions in Switzerland. The preliminary results indicate that MI is a valuable methodology for handling missing values due to panel mortality in the context of sequence analysis. MI is especially useful in facilitating a sound interpretation of the resulting sequence types.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The fragmentation of electrospray-generated multiply deprotonated RNA and mixed-sequence RNA/DNA pentanucleotides upon low-energy collision-induced dissociation (CID) in a hybrid quadrupole time-of-flight mass spectrometer was investigated. The goal of unambiguous sequence identification of mixed-sequence RNA/DNA oligonucleotides requires detailed understanding of the gas-phase dissociation of this class of compounds. The two major dissociation events, base loss and backbone fragmentation, are discussed and the unique fragmentation behavior of oligoribonucleotides is demonstrated. Backbone fragmentation of the all-RNA pentanucleotides is characterized by abundant c-ions and their complementary y-ions as the major sequence-defining fragment ion series. In contrast to the dissociation of oligodeoxyribonucleotides, where backbone fragmentation is initiated by the loss of a nucleobase which subsequently leads to the formation of the w- and [a-base]-ions, backbone dissociation of oligoribonucleotides is essentially decoupled from base loss. The different behavior of RNA and DNA oligonucleotides is related to the presence of the 2'-hydroxyl substituent, which is the only structural alteration between the DNA and RNA pentanucleotides studied. CID of mixed-sequence RNA/DNA pentanucleotides results in a combination of the nucleotide-typical backbone fragmentation products, with abundant w-fragment ions generated by cleavage of the phosphodiester backbone adjacent to the deoxy building blocks, whereas backbone cleavage adjacent to ribonucleotides induces the formation of c- and y-ions. (C) 2002 American Society for Mass Spectrometry.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The mouse p53 protein generated by alternative splicing (p53as) has amino acid substitutions at its C terminus that result in constitutively active sequence-specific DNA binding (active form), whereas p53 protein itself binds inefficiently (latent form) unless activated by C-terminal modification. Exogenous p53as expression activated transcription of reporter plasmids containing p53 binding sequences and inhibited growth of mouse and human cells lacking functional endogenous p53. Inducible p53as in stably transfected p53 null fibroblasts increased p21WAF1/Cip-1/Sdi and decreased bcl-2 protein steady-state levels. Endogenous p53as and p53 proteins differed in response to cellular DNA damage. p53 protein was induced transiently in normal keratinocytes and fibroblasts whereas p53as protein accumulation was sustained in parallel with induction of p21WAF1/Cip-1/Sdi protein and mRNA, in support of p53as transcriptional activity. Endogenous p53 and p53as proteins in epidermal tumor cells responded to DNA damage with different kinetics of nuclear accumulation and efficiencies of binding to a p53 consensus DNA sequence. A model is proposed in which C-terminally distinct p53 protein forms specialize in functions, with latent p53 forms primarily for rapid non-sequence-specific binding to sites of DNA damage and active p53 forms for sustained regulation of transcription and growth.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The intensely studied MHC has become the paradigm for understanding the architectural evolution of vertebrate multigene families. The 4-Mb human MHC (also known as the HLA complex) encodes genes critically involved in the immune response, graft rejection, and disease susceptibility. Here we report the continuous 1,796,938-bp genomic sequence of the HLA class I region, linking genes between MICB and HLA-F. A total of 127 genes or potentially coding sequences were recognized within the analyzed sequence, establishing a high gene density of one per every 14.1 kb. The identification of 758 microsatellite provides tools for high-resolution mapping of HLA class I-associated disease genes. Most importantly, we establish that the repeated duplication and subsequent diversification of a minimal building block, MIC-HCGIX-3.8–1-P5-HCGIV-HLA class I-HCGII, engendered the present-day MHC. That the currently nonessential HLA-F and MICE genes have acted as progenitors to today’s immune-competent HLA-ABC and MICA/B genes provides experimental evidence for evolution by “birth and death,” which has general relevance to our understanding of the evolutionary forces driving vertebrate multigene families.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The cell matrix adhesion regulator (CMAR) gene has been suggested to be a signal transduction molecule influencing cell adhesion to collagen and, through this, possibly involved in tumor suppression. The originally reported CMAR cDNA was 464 bp long with a tyrosine phosphorylation site at the extreme 3′ end, which mutagenesis studies had shown to be central to the function of this gene. Since the discovery of a 4-bp insertion polymorphism within the originally reported coding region, further sequence information has been obtained. The cDNA has been extended 5′ by ≈2 kb revealing a 559-bp region showing strong homology to the proposed 5′ untranslated sequence of a murine protein kinase receptor family member, variant in kinase (vik). CMAR genomic sequencing has shown the presence of an intron, the intron/exon boundary lying within this region of homology. An RNA transcript for CMAR of ≈2.5 kb has also been identified. The data suggest complex mechanisms for control of expression of two closely associated genes, CMAR and the vik- associated sequence.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The DNA binding activity of p53 is crucial for its tumor suppressor function and is subject to tight regulation. Previous studies revealed that the inhibitory function of the p53 C terminus is implicated in the latent, low affinity sequence-specific DNA binding activity of p53 in the uninduced state. Sequence-specific DNA binding of p53 has been shown to be activated by several posttranslational modifications and interacting proteins that target predominantly the C terminus. Moreover, several authors have shown that synthetic peptides corresponding to p53 C-terminal sequences activate p53 sequence-specific DNA binding. In an effort to identify the interaction site of p53 with these activating peptides we assessed complex formation between p53 deletion constructs and C-terminal activating peptides by peptide affinity precipitation. This study revealed that two distal regions of the p53 molecule contribute synergistically to the interaction with activating C-terminal peptides: amino acids 80–93 and 364–393. The C-terminal residues 364–393 are already well characterized as having negative regulatory function. DNA binding analyses with these deletion constructs reveal a comparable negative regulatory activity for residues 80–93, defining this region as a previously unidentified negative regulatory domain of p53. Furthermore, synthetic peptides spanning this newly identified proline-rich negative regulatory region (residues 80–93) are able to activate p53 sequence-specific DNA binding in vitro. We suggest that both negative regulatory regions, residues 80–93 and 364–393, contribute cooperatively to the maintenance of the latent, low-affinity DNA binding conformation of p53.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The HIV Reverse Transcriptase and Protease Sequence Database is an on-line relational database that catalogs evolutionary and drug-related sequence variation in the human immunodeficiency virus (HIV) reverse transcriptase (RT) and protease enzymes, the molecular targets of anti-HIV therapy (http://hivdb.stanford.edu). The database contains a compilation of nearly all published HIV RT and protease sequences, including submissions from International Collaboration databases and sequences published in journal articles. Sequences are linked to data about the source of the sequence sample and the antiretroviral drug treatment history of the individual from whom the isolate was obtained. During the past year 3500 sequences have been added and the data model has been expanded to include drug susceptibility data on sequenced isolates. Database content has also been integrated with didactic text and the output of two sequence analysis programs.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The human prion gene contains five copies of a 24 nt repeat that is highly conserved among species. An analysis of folding free energies of the human prion mRNA, in particular in the repeat region, suggested biased codon selection and the presence of RNA patterns. In particular, pseudoknots, similar to the one predicted by Wills in the human prion mRNA, were identified in the repeat region of all available prion mRNAs available in GenBank, but not those of birds and the red slider turtle. An alignment of these mRNAs, which share low sequence homology, shows several co-variations that maintain the pseudoknot pattern. The presence of pseudoknots in yeast Sup35p and Rnq1 suggests acquisition in the prokaryotic era. Computer generated three-dimensional structures of the human prion pseudoknot highlight protein and RNA interaction domains, which suggest a possible effect in prion protein translation. The role of pseudoknots in prion diseases is discussed as individuals with extra copies of the 24 nt repeat develop the familial form of Creutzfeldt–Jakob disease.