925 resultados para flanking sequence


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This book gives a general view of sequence analysis, the statistical study of successions of states or events. It includes innovative contributions on life course studies, transitions into and out of employment, contemporaneous and historical careers, and political trajectories. The approach presented in this book is now central to the life-course perspective and the study of social processes more generally. This volume promotes the dialogue between approaches to sequence analysis that developed separately, within traditions contrasted in space and disciplines. It includes the latest developments in sequential concepts, coding, atypical datasets and time patterns, optimal matching and alternative algorithms, survey optimization, and visualization. Field studies include original sequential material related to parenting in 19th-century Belgium, higher education and work in Finland and Italy, family formation before and after German reunification, French Jews persecuted in occupied France, long-term trends in electoral participation, and regime democratization. Overall the book reassesses the classical uses of sequences and it promotes new ways of collecting, formatting, representing and processing them. The introduction provides basic sequential concepts and tools, as well as a history of the method. Chapters are presented in a way that is both accessible to the beginner and informative to the expert.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The recently released Affymetrix Human Gene 1.0 ST array has two major differences compared with standard 3' based arrays: (i) it interrogates the entire mRNA transcript, and (ii) it uses DNA targets. To assess the impact of these differences on array performance, we performed a series of comparative hybridizations between the Human Gene 1.0 ST and the Affymetrix HG-U133 Plus 2.0 and the Illumina HumanRef-8 BeadChip arrays. Additionally, both RNA and DNA targets were hybridized on HG-U133 Plus 2.0 arrays. The results show that the overall reproducibility of the Gene 1.0 ST array is best. When looking only at the high intensity probes, the reproducibility of the Gene 1.0 ST array and the Illumina BeadChip array is equally good. Concordance of array results was assessed using different inter-platform mappings. Agreements are best between the two labeling protocols using HG-U133 Plus 2.0 array. The Gene 1.0 ST array is most concordant with the HG-U133 array hybridized with cDNA targets. This may reflect the impact of the target type. Overall, the high degree of correspondence provides strong evidence for the reliability of the Gene 1.0 ST array.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We improved, evaluated, and used Sanger sequencing for quantification of single nucleotide polymorphism (SNP) variants in transcripts and gDNA samples. This improved assay resulted in highly reproducible relative allele frequencies (e.g., for a heterozygous gDNA 50.0+/-1.4%, and for a missense mutation-bearing transcript 46.9+/-3.7%) with a lower detection limit of 3-9%. It provided excellent accuracy and linear correlation between expected and observed relative allele frequencies. This sequencing assay, which can also be used for the quantification of copy number variations (CNVs), methylations, mosaicisms, and DNA pools, enabled us to analyze transcripts of the FBN1 gene in fibroblasts and blood samples of patients with suspected Marfan syndrome not only qualitatively but also quantitatively. We report a total of 18 novel and 19 known FBN1 sequence variants leading to a premature termination codon (PTC), 26 of which we analyzed by quantitative sequencing both at gDNA and cDNA levels. The relative amounts of PTC-containing FBN1 transcripts in fresh and PAXgene-stabilized blood samples were significantly higher (33.0+/-3.9% to 80.0+/-7.2%) than those detected in affected fibroblasts with inhibition of nonsense-mediated mRNA decay (NMD) (11.0+/-2.1% to 25.0+/-1.8%), whereas in fibroblasts without NMD inhibition no mutant alleles could be detected. These results provide evidence for incomplete NMD in leukocytes and have particular importance for RNA-based analyses not only in FBN1 but also in other genes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Starting from a biologically active recombinant DNA clone of exogenous unintegrated GR mouse mammary tumor virus, we have generated three subclones of PstI fragments of 1.45, 1.1, and 2.0 kb in the plasmid vector PBR322. The nucleotide sequence has been determined for the clone of 1.45 kb which includes almost the complete region of the long terminal repeat (LTR) plus an adjacent stretch of unique sequence DNA. A short region of the 2.0 kb clone, containing the beginning of the LTR, has also been sequenced. Starting with the A of an initiation codon outside the LTR, we detected an open reading frame of 960 nucleotides, potentially coding for a protein of 320 amino acids (36K). Two hundred nucleotides downstream from the termination codon, and approximately 25 nucleotides upstream from the presumptive initiation site of viral RNA synthesis, we found a promoter-like sequence. The sequence AGTAAA was detected approximately 15-20 nucleotides upstream from the 3' end of virion RNA and probably serves as a polyadenylation signal. The 1.45 kb PstI fragment has been transfected into Ltk- cells together with a plasmid containing the thymidine kinase gene of herpes simplex virus. The virus-specific RNA synthesis detected in a Tk+ cell clone was strongly stimulated by the addition of dexamethasone.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Methicillin-resistant Staphylococcus aureus (MRSA) is a major cause of nosocomial infections worldwide. To differentiate reliably among S. aureus isolates, we recently developed double locus sequence typing (DLST) based on the analysis of partial sequences of clfB and spa genes. In the present study, we evaluated the usefulness of DLST for epidemiological investigations of MRSA by routinely typing 1242 strains isolated in Western Switzerland. Additionally, particular local and international collections were typed by pulsed field gel electrophoresis (PFGE) and DLST to check the compatibility of DLST with the results obtained by PFGE, and for international comparisons. Using DLST, we identified the major MRSA clones of Western Switzerland, and demonstrated the close relationship between local and international clones. The congruence of 88% between the major PFGE and DLST clones indicated that our results obtained by DLST were compatible with earlier results obtained by PFGE. DLST could thus easily be incorporated in a routine surveillance procedure. In addition, the unambiguous definition of DLST types makes this method more suitable than PFGE for long-term epidemiological surveillance. Finally, the comparison of the results obtained by DLST, multilocus sequence typing, PFGE, Staphylococcal cassette chromosome mec typing and the detection of Panton-Valentine leukocidin genes indicated that no typing scheme should be used on its own. It is only the combination of data from different methods that gives the best chance of describing precisely the epidemiology and phylogeny of MRSA.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Three-dimensional sequence stratigraphy is a potent exploration and development tool for the discovery of subtle stratigraphic traps. Reservoir morphology, heterogeneity and subtle stratigraphic trapping mechanisms can be better understood through systematic horizontal identification of sedimentary facies of systems tracts provided by three-dimensional attribute maps used as an important complement to the sequential analysis on the two-dimensional seismic lines and the well log data. On new prospects as well as on already-producing fields, the additional input of sequential analysis on three-dimensional data enables the identification, location and precise delimitation of new potentially productive zones. The first part of this paper presents four typical horizontal seismic facies assigned to the successive systems tracts of a third- or fourth-order sequence deposited in inner to outer neritic conditions on a elastic shelf. The construction of this synthetic representative sequence is based on the observed reproducibility of the horizontal seismic facies response to cyclic eustatic events on more than 35 sequences registered in the Gulf coast Plio-Pleistocene and Late Miocene, offshore Louisiana in the West Cameron region of the Gulf of Mexico. The second part shows how three-dimensional sequence stratigraphy can contribute in localizing and understanding sedimentary facies associated with productive zones. A case study in the early Middle Miocene Cibicides opima sands shows multiple stacked gas accumulations in the top slope fan, prograding wedge and basal transgressive systems tract of the third-order sequence between SB15.5 and SB 13.8 Ma.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Purpose: Previously we reported on a premature termination mutation in SLC16A12 that leads to dominant juvenile cataract and renal glucosuria. To assess the mutation rate and genotype-phenotype correlations of SLC16A12 in juvenile or age-related forms of cataract, we performed a mutation screen in cataract patients. Methods: Clinical data of approximately 660 patients were collected, genomic DNA was isolated and analyzed. Exons 3 to 8 including flanking intron sequences of SLC16A12 were PCR amplified and DNA sequence was determined. Selected mutations were tested by cell culture assays, in silico analysis and RT-PCR. Results: We found sequence alterations at a rate of approximately 1/75 patients. None of them was found in 360 control alleles. Alterations affect splice site and regulatory region but most mutations caused an amino acid substitution. The majority of the coding region mutations maps to trans-membrane domains. One mutation located to the 5'UTR. It affects translational efficiency of SLC16A12. In addition, we identified a cataract-predisposing SNP in the non-coding region that causes allele-specific splicing of the 5'UTR region. Conclusions: Altered translational efficiency of the solute carrier SLC16A12 and its allele-specific splicing strongly support a model of challenged homeostasis to cause various forms of cataract. In addition, the pathogenic property of the here reported sequence alterations is supported by the lack of known sequence variations within the coding region of SLC16A12. Due to the relatively high mutation rate, we suggest to include SLC16A12 in diagnostic cataract screening. Generally, our data recommend the assessment of regulatory sequences for diagnostic purposes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose and validate a multivariate classification algorithm for characterizing changes in human intracranial electroencephalographic data (iEEG) after learning motor sequences. The algorithm is based on a Hidden Markov Model (HMM) that captures spatio-temporal properties of the iEEG at the level of single trials. Continuous intracranial iEEG was acquired during two sessions (one before and one after a night of sleep) in two patients with depth electrodes implanted in several brain areas. They performed a visuomotor sequence (serial reaction time task, SRTT) using the fingers of their non-dominant hand. Our results show that the decoding algorithm correctly classified single iEEG trials from the trained sequence as belonging to either the initial training phase (day 1, before sleep) or a later consolidated phase (day 2, after sleep), whereas it failed to do so for trials belonging to a control condition (pseudo-random sequence). Accurate single-trial classification was achieved by taking advantage of the distributed pattern of neural activity. However, across all the contacts the hippocampus contributed most significantly to the classification accuracy for both patients, and one fronto-striatal contact for one patient. Together, these human intracranial findings demonstrate that a multivariate decoding approach can detect learning-related changes at the level of single-trial iEEG. Because it allows an unbiased identification of brain sites contributing to a behavioral effect (or experimental condition) at the level of single subject, this approach could be usefully applied to assess the neural correlates of other complex cognitive functions in patients implanted with multiple electrodes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In order to characterize the gene encoding the ligand binding (1(st); alpha) chain of the human IFN-gamma receptor, two overlapping cosmid clones were analyzed. The gene spans over 25 kilobases (kb) of the genomic DNA and has seven exons. The extracellular domain is encoded by exons 1 to 5 and by part of exon 6. The transmembrane region is also encoded by exon 6. Exon 7 encodes the intracellular domain and the 3' untranslated portion. The gene was located on chromosome 6q23.1, as determined by in situ hybridization. The 4 kb region upstream (5') of the gene was sequenced and analyzed for promoter activity. No consensus-matching TATA or CAAT boxes in the 5' region were found. Potential binding sites for Sp1, AP-1, AP-2, and CREB nuclear factors were identified. Compatible with the presence of the Sp1/AP-2 sites and the lack of TATA box, S1-nuclease mapping experiments showed multiple transcription initiation sites. Promoter activity of the 5' flanking region was analyzed with two different reporter genes: the Escherichia coli chloramphenicol acetyltransferase and human growth hormone. The smallest 5' region of the gene that still had full promoter activity was 692 base pairs in length. In addition, we found sequences belonging to the oldest family of Alu repeats, 2 - 3 kb upstream of the gene, which could be useful for genetic studies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It is often supposed that a protein's rate of evolution and its amino acid content are determined by the function and anatomy of the protein. Here we examine an alternative possibility, namely that the requirement to specify in the unprocessed RNA, in the vicinity of intron-exon boundaries, information necessary for removal of introns (e.g., exonic splice enhancers) affects both amino acid usage and rates of protein evolution. We find that the majority of amino acids show skewed usage near intron-exon boundaries, and that differences in the trends for the 2-fold and 4-fold blocks of both arginine and leucine show this to be owing to effects mediated at the nucleotide level. More specifically, there is a robust relationship between the extent to which an amino acid is preferred/avoided near boundaries and its enrichment/paucity in splice enhancers. As might then be expected, the rate of evolution is lowest near intron-exon boundaries, at least in part owing to splice enhancers, such that domains flanking intron-exon junctions evolve on average at under half the rate of exon centres from the same gene. In contrast, the rate of evolution of intronless retrogenes is highest near the domains where intron-exon junctions previously resided. The proportion of sequence near intron-exon boundaries is one of the stronger predictors of a protein's rate of evolution in mammals yet described. We conclude that after intron insertion selection favours modification of amino acid content near intron-exon junctions, so as to enable efficient intron removal, these changes then being subject to strong purifying selection even if nonoptimal for protein function. Thus there exists a strong force operating on protein evolution in mammals that is not explained directly in terms of the biology of the protein.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The construction of metagenomic libraries has permitted the study of microorganisms resistant to isolation and the analysis of 16S rDNA sequences has been used for over two decades to examine bacterial biodiversity. Here, we show that the analysis of random sequence reads (RSRs) instead of 16S is a suitable shortcut to estimate the biodiversity of a bacterial community from metagenomic libraries. We generated 10,010 RSRs from a metagenomic library of microorganisms found in human faecal samples. Then searched them using the program BLASTN against a prokaryotic sequence database to assign a taxon to each RSR. The results were compared with those obtained by screening and analysing the clones containing 16S rDNA sequences in the whole library. We found that the biodiversity observed by RSR analysis is consistent with that obtained by 16S rDNA. We also show that RSRs are suitable to compare the biodiversity between different metagenomic libraries. RSRs can thus provide a good estimate of the biodiversity of a metagenomic library and, as an alternative to 16S, this approach is both faster and cheaper.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a ‘coding statistic’ is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data and application is made to the yeast chromosome III sequence and to C.elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.