970 resultados para DNA sequence analysis


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The potential for mitochondrial (mt) DNA mutation accumulation during antiretroviral therapy (ART), and preferential accumulation in patients with lipoatrophy compared with control participants, remains controversial. We sequenced the entire mitochondrial genome, both before ART and after ART exposure, in 29 human immunodeficiency virus (HIV)-infected Swiss HIV Cohort Study participants initiating a first-line thymidine analogue-containing ART regimen. No accumulation of mtDNA mutations or deletions was detected in 13 participants who developed lipoatrophy or in 16 control participants after significant and comparable ART exposure (median duration, 3.3 and 3.7 years, respectively). In HIV-infected persons, the development of lipoatrophy is unlikely to be associated with accumulation of mtDNA mutations detectable in peripheral blood.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A porcine BAC clone harboring the tightly linked IFNAR1 and IFNGR2 genes was identified by comparative analysis of the publicly available porcine BAC end sequences. The complete 168,835 bp insert sequence of this clone was determined. Sequence comparisons of the genomic sequence with EST sequences from public databases were performed and allowed a detailed annotation of the IFNAR1 and IFNGR2 genes. The analyzed genes showed a conserved genomic organization with their known mammalian orthologs, however the sequence conservation of these genes across species was relatively low. In addition to the IFNAR1 and IFNGR2 genes, which were completely sequenced, the analyzed BAC clone also contained parts of an orphan gene encoding a putative transmembrane protein (TMEM50B). In contrast to the IFNAR1 and IFNGR2 genes the sequence conservation of the TMEM50B gene across different mammalian species was extremely high.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Defensins are a family of evolutionary ancient antimicrobial peptides consisting of three sub-families: alpha-, beta- and theta-defensins. This investigation was focused on the genomic characterization of equine beta-defensins and the investigation of the potential clustering of beta-defensin genes in the equine genome. Six genomic BAC clones were isolated from the CHORI-241 library and one of these was mapped by FISH to ECA 27q17. This location was confirmed by RH-mapping. The contiguous 212 kb sequence of this clone was determined. Sequence analysis revealed the identification of ten pseudogenes and nine genes, six of which were highly homologous to human beta-defensin DEFB4. Clustering of the beta-defensin genes was confirmed and the order of the genes on the analyzed BAC was related to the corresponding defensin cluster on HSA 8. The knowledge about the sequence and the genomic structure of the equine beta-defensin genes will improve the classification of different paralogous defensin genes and is a prerequisite for subsequent functional studies. Additionally, the first alpha-defensin-like sequence outside the groups of primates, lagomorphs and rodents (glires) was identified.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Multilocus sequence analysis (MLSA) based on recN, rpoA and thdF genes was done on more than 30 species of the family Enterobacteriaceae with a focus on Cronobacter and the related genus Enterobacter. The sequences provide valuable data for phylogenetic, taxonomic and diagnostic purposes. Phylogenetic analysis showed that the genus Cronobacter forms a homogenous cluster related to recently described species of Enterobacter, but distant to other species of this genus. Combining sequence information on all three genes is highly representative for the species' %GC-content used as taxonomic marker. Sequence similarity of the three genes and even of recN alone can be used to extrapolate genetic similarities between species of Enterobacteriaceae. Finally, the rpoA gene sequence, which is the easiest one to determine, provides a powerful diagnostic tool to identify and differentiate species of this family. The comparative analysis gives important insights into the phylogeny and genetic relatedness of the family Enterobacteriaceae and will serve as a basis for further studies and clarifications on the taxonomy of this large and heterogeneous family.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Since the mapping of the human genome and the technical innovations in the field of biotechnology, patent law has gone through great controversies. Protection is required for an investor to make an investment but how broad should the given protection be? Whether the invention is a mi- cro-organism capable of dissolving crude oil, or the gene of a soya plant, the genetic engineering required for their production entails vast amounts of capi- tal. The policy in that respect is tailored by legislative acts and judicial decisions, ensuring a fair balance be- tween the interests of patent right holders and third parties. However, the policy differs from jurisdiction to jurisdiction, thus creating inconsistencies with re- gards to the given protection to the same invention, and as a result this could deter innovation and pro- mote stagnation. The most active actors shaping the patent policy on an international level are the patent offices of the United States of America, Japan and the European Patent Organization. These three patent offices have set up a cooperation programme in order to promote and improve efficiency with regards to their patent policies on a global scale. However, recent judicial de- velopments have shown that the policy in respect to the field of biotechnology differs between the patent regimes of the United States of America and the two- layer system of the European Patent Organisation/ the European Union.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Musculoskeletal infections are infections of the bone and surrounding tissues. They are currently diagnosed based on culture analysis, which is the gold standard for pathogen identification. However, these clinical laboratory methods are frequently inadequate for the identification of the causative agents, because a large percentage (25-50%) of confirmed musculoskeletal infections are false negatives in which no pathogen is identified in culture. My data supports these results. The goal of this project was to use PCR amplification of a portion of the 16S rRNA gene to test an alternative approach for the identification of these pathogens and to assess the diversity of the bacteria involved. The advantages of this alternative method are that it should increase sample sensitivity and the speed of detection. In addition, bacteria that are non-culturable or in low abundance can be detected using this molecular technique. However, a complication of this approach is that the majority of musculoskeletal infections are polymicrobial, which prohibits direct identification from the infected tissue by DNA sequencing of the initial 16S rDNA amplification products. One way to solve this problem is to use denaturing gradient gel electrophoresis (DGGE) to separate the PCR products before DNA sequencing. Denaturing gradient gel electrophoresis (DGGE) separates DNA molecules based on their melting point, which is determined by their DNA sequence. This analytical technique allows a mixture of PCR products of the same length that electrophoreses through agarose gels as one band, to be separated into different bands and then used for DNA sequence analysis. In this way, the DGGE allows for the identification of individual bacterial species in polymicrobial-infected tissue, which is critical for improving clinical outcomes. By combining the 16S rDNA amplification and the DGGE techniques together, an alternative approach for identification has been used. The 16S rRNA gene PCR-DGGE method includes several critical steps: DNA extraction from tissue biopsies, amplification of the bacterial DNA, PCR product separation by DGGE, amplification of the gel-extracted DNA, and DNA sequencing and analysis. Each step of the method was optimized to increase its sensitivity and for rapid detection of the bacteria present in human tissue samples. The limit of detection for the DNA extraction from tissue was at least 20 Staphylococcus aureus cells and the limit of detection for PCR was at least 0.05 pg of template DNA. The conditions for DGGE electrophoreses were optimized by using a double gradient of acrylamide (6 – 10%) and denaturant (30-70%), which increased the separation between distinct PCR products. The use of GelRed (Biotium) improved the DNA visualization in the DGGE gel. To recover the DNA from the DGGE gels the gel slices were excised, shredded in a bead beater, and the DNA was allowed to diffuse into sterile water overnight. The use of primers containing specific linkers allowed the entire amplified PCR product to be sequenced and then analyzed. The optimized 16S rRNA gene PCR-DGGE method was used to analyze 50 tissue biopsy samples chosen randomly from our collection. The results were compared to those of the Memorial Hermann Hospital Clinical Microbiology Laboratory for the same samples. The molecular method was congruent for 10 of the 17 (59%) culture negative tissue samples. In 7 of the 17 (41%) culture negative the molecular method identified a bacterium. The molecular method was congruent with the culture identification for 7 of the 33 (21%) positive cultured tissue samples. However, in 8 of the 33 (24%) the molecular method identified more organisms. In 13 of the 15 (87%) polymicrobial cultured tissue samples the molecular method identified at least one organism that was also identified by culture techniques. Overall, the DGGE analysis of 16S rDNA is an effective method to identify bacteria not identified by culture analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

(1) A mathematical theory for computing the probabilities of various nucleotide configurations is developed, and the probability of obtaining the correct phylogenetic tree (model tree) from sequence data is evaluated for six phylogenetic tree-making methods (UPGMA, distance Wagner method, transformed distance method, Fitch-Margoliash's method, maximum parsimony method, and compatibility method). The number of nucleotides (m*) necessary to obtain the correct tree with a probability of 95% is estimated with special reference to the human, chimpanzee, and gorilla divergence. m* is at least 4,200, but the availability of outgroup species greatly reduces m* for all methods except UPGMA. m* increases if transitions occur more frequently than transversions as in the case of mitochondrial DNA. (2) A new tree-making method called the neighbor-joining method is proposed. This method is applicable either for distance data or character state data. Computer simulation has shown that the neighbor-joining method is generally better than UPGMA, Farris' method, Li's method, and modified Farris method on recovering the true topology when distance data are used. A related method, the simultaneous partitioning method, is also discussed. (3) The maximum likelihood (ML) method for phylogeny reconstruction under the assumption of both constant and varying evolutionary rates is studied, and a new algorithm for obtaining the ML tree is presented. This method gives a tree similar to that obtained by UPGMA when constant evolutionary rate is assumed, whereas it gives a tree similar to that obtained by the maximum parsimony tree and the neighbor-joining method when varying evolutionary rate is assumed. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sequence analysis and optimal matching are useful heuristic tools for the descriptive analysis of heterogeneous individual pathways such as educational careers, job sequences or patterns of family formation. However, to date it remains unclear how to handle the inevitable problems caused by missing values with regard to such analysis. Multiple Imputation (MI) offers a possible solution for this problem but it has not been tested in the context of sequence analysis. Against this background, we contribute to the literature by assessing the potential of MI in the context of sequence analyses using an empirical example. Methodologically, we draw upon the work of Brendan Halpin and extend it to additional types of missing value patterns. Our empirical case is a sequence analysis of panel data with substantial attrition that examines the typical patterns and the persistence of sex segregation in school-to-work transitions in Switzerland. The preliminary results indicate that MI is a valuable methodology for handling missing values due to panel mortality in the context of sequence analysis. MI is especially useful in facilitating a sound interpretation of the resulting sequence types.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genetic variability of milk protein genes may influence the nutritive value or processing and functional properties of the milk. While numerous protein variants are known in ruminants, knowledge about milk protein variability in horses is still limited. Mare's milk is, however, produced for human consumption in many countries. Beta-lactoglobulin belonging to the protein family of lipocalins, which are known as common food- and airborne allergens, is a major whey protein. It is absent from human milk and thus a key agent in provoking cow's milk protein allergy. Mare's milk is, however, usually better tolerated by most affected people. Several functions of β-lactoglobulin have been discussed, but its ultimate physiological role remains unclear. In the current study, the open reading frames of the two equine β-lactoglobulin paralogues LGB1 and LGB2 were re-sequenced in 249 horses belonging to 14 different breeds in order to predict the existence of protein variants at the DNA-level. Thereby, only a single signal peptide variant of LGB1, but 10 different putative protein variants of LGB2 were identified. In horses, both genes are expressed and in such this is a striking previously unknown difference in genetic variability between the two genes. It can be assumed that LGB1 is the ancestral paralogue, which has an essential function causing a high selection pressure. As horses have very low milk fat content this unknown function might well be related to vitamin-uptake. Further studies are, however, needed, to elucidate the properties of the different gene products.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this dissertation, the cytogenetic characteristics of bone marrow cells from 41 multiple myeloma patients were investigated. These cytogenetic data were correlated with the total DNA content as measured by flow cytometry. Both the cytogenetic information and DNA content were then correlated with clinical data to determine if diagnosis and prognosis of multiple myeloma could be improved.^ One hundred percent of the patients demonstrated abnormal chromosome numbers per metaphase. The average chromosome number per metaphase ranged from 42 to 49.9, with a mean of 44.99. The percent hypodiploidy ranged from 0-100% and the percent hyperdiploidy from 0-53%. Detailed cytogenetic analyses were very difficult to perform because of the paucity of mitotic figures and the poor chromosome morphology. Thus, detailed chromosome banding analysis on these patients was impossible.^ Thirty seven percent of the patients had normal total DNA content, whereas 63% had abnormal amounts of DNA (one patient with less than normal amounts and 25 patients with greater than normal amounts of DNA).^ Several clinical parameters were used in the statistical analyses: tumor burden, patient status at biopsy, patient response status, past therapy, type of treatment and percent plasma cells. Only among these clinical parameters were any statistically significant correlations found: pretreatment tumor burden versus patient response, patient biopsy status versus patient response and past therapy versus patient response.^ No correlations were found between percent hypodiploid, diploid, hyperdiploid or DNA content, and the patient response status, nor were any found between those patients with: (a) normal plasma cells, low pretreatment tumor mass burden and more than 50% of the analyzed metaphases with 46 chromosomes; (b) normal amounts of DNA, low pretreatment tumor mass burden and more than 50% of the metaphases with 46 chromosomes; (c) normal amounts of DNA and normal quantities of plasma cells; (d) abnormal amounts of DNA, abnormal amounts of plasma cells, high pretreatment tumor mass burden and less than 50% of the metaphases with 46 chromosomes.^ Technical drawbacks of both cytogenetic and DNA content analysis in these multiple myeloma patients are discussed along with the lack of correlations between DNA content and chromosome number. Refined chromosome banding analysis awaits technical improvements before we can understand which chromosome material (if any) makes up the "extra" amounts of DNA in these patients. None of the correlations tested can be used as diagnostic or prognostic aids for multiple myeloma. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Linked polyamides bind in the minor groove of double-stranded DNA in a partially sequence-specific manner. This report analyzes the theoretical limits of DNA sequence discrimination by linked polyamides composed of two to four different types of heterocyclic rings, determining (i) the optimal choice of base-binding specificity for each ring and (ii) the optimal design for a polyamide composed of these rings to target a given DNA sequence and designed to maximize the fraction of the total polyamide binding to the specified target sequence relative to all other sequences. The results show that, fortuitously, polyamides composed of pyrrole, a naturally occurring G-excluding element, and imidazole, a rationally designed G-favoring element, have features similar to the theoretical optimum design for polyamides composed of two different rings. The results also show that, in polyamides composed of two or three types of heterocyclic rings, choosing a nonspecific “placeholder” ring, which binds equally strongly to each of the four bases, along with one or two base-specific rings will often enhance sequence specificity over a polyamide composed entirely of base-specific rings.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sequence divergence acts as a potent barrier to homologous recombination; much of this barrier derives from an antirecombination activity exerted by mismatch repair proteins. An inverted repeat assay system with recombination substrates ranging in identity from 74% to 100% has been used to define the relationship between sequence divergence and the rate of mitotic crossing-over in yeast. To elucidate the role of the mismatch repair machinery in regulating recombination between mismatched substrates, we performed experiments in both wild-type and mismatch repair defective strains. We find that a single mismatch is sufficient to inhibit recombination between otherwise identical sequences, and that this inhibition is dependent on the mismatch repair system. Additional mismatches have a cumulative negative effect on the recombination rate. With sequence divergence of up to approximately 10%, the inhibitory effect of mismatches results mainly from antirecombination activity of the mismatch repair system. With greater levels of divergence, recombination is inefficient even in the absence of mismatch repair activity. In both wild-type and mismatch repair defective strains, an approximate log-linear relationship is observed between the recombination rate and the level of sequence divergence.