977 resultados para GENOMIC SEQUENCE


Relevância:

70.00% 70.00%

Publicador:

Resumo:

We examined the MLL genomic translocation breakpoint in acute myeloid leukemia of infant twins. Southern blot analysis in both cases showed two identical MLL gene rearrangements indicating chromosomal translocation. The rearrangements were detectable in the second twin before signs of clinical disease and the intensity relative to the normal fragment indicated that the translocation was not constitutional. Fluorescence in situ hybridization with an MLL-specific probe and karyotype analyses suggested t(11;22)(q23;q11.2) disrupting MLL. Known 5′ sequence from MLL but unknown 3′ sequence from chromosome band 22q11.2 formed the breakpoint junction on the der(11) chromosome. We used panhandle variant PCR to clone the translocation breakpoint. By ligating a single-stranded oligonucleotide that was homologous to known 5′ MLL genomic sequence to the 5′ ends of BamHI-digested DNA through a bridging oligonucleotide, we formed the stem–loop template for panhandle variant PCR which yielded products of 3.9 kb. The MLL genomic breakpoint was in intron 7. The sequence of the partner DNA from band 22q11.2 was identical to the hCDCrel (human cell division cycle related) gene that maps to the region commonly deleted in DiGeorge and velocardiofacial syndromes. Both MLL and hCDCrel contained homologous CT, TTTGTG, and GAA sequences within a few base pairs of their respective breakpoints, which may have been important in uniting these two genes by translocation. Reverse transcriptase-PCR amplified an in-frame fusion of MLL exon 7 to hCDCrel exon 3, indicating that an MLL-hCDCrel chimeric mRNA had been transcribed. Panhandle variant PCR is a powerful strategy for cloning translocation breakpoints where the partner gene is undetermined. This application of the method identified a region of chromosome band 22q11.2 involved in both leukemia and a constitutional disorder.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000–100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

There is a need for faster and more sensitive algorithms for sequence similarity searching in view of the rapidly increasing amounts of genomic sequence data available. Parallel processing capabilities in the form of the single instruction, multiple data (SIMD) technology are now available in common microprocessors and enable a single microprocessor to perform many operations in parallel. The ParAlign algorithm has been specifically designed to take advantage of this technology. The new algorithm initially exploits parallelism to perform a very rapid computation of the exact optimal ungapped alignment score for all diagonals in the alignment matrix. Then, a novel heuristic is employed to compute an approximate score of a gapped alignment by combining the scores of several diagonals. This approximate score is used to select the most interesting database sequences for a subsequent Smith–Waterman alignment, which is also parallelised. The resulting method represents a substantial improvement compared to existing heuristics. The sensitivity and specificity of ParAlign was found to be as good as Smith–Waterman implementations when the same method for computing the statistical significance of the matches was used. In terms of speed, only the significantly less sensitive NCBI BLAST 2 program was found to outperform the new approach. Online searches are available at http://dna.uio.no/search/

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The function of the prion protein gene (PRNP) and its normal product PrPC is elusive. We used comparative genomics as a strategy to understand the normal function of PRNP. As the reliability of comparisons increases with the number of species and increased evolutionary distance, we isolated and sequenced a 66.5 kb BAC containing the PRNP gene from a distantly related mammal, the model Australian marsupial Macropus eugenii (tammar wallaby). Marsupials are separated from eutherians such as human and mouse by roughly 180 million years of independent evolution. We found that tammar PRNP, like human PRNP, has two exons. Prion proteins encoded by the tammar wallaby and a distantly related marsupial, Monodelphis domestica (Brazilian opossum) PRNP contain proximal PrP repeats with a distinct, marsupial-specific composition and a variable number. Comparisons of tammar wallaby PRNP with PRNPs from human, mouse, bovine and ovine allowed us to identify non-coding gene regions conserved across the marsupial-eutherian evolutionary distance, which are candidates for regulatory regions. In the PRNP 3' UTR we found a conserved signal for nuclear-specific polyadenylation and the putative cytoplasmic polyadenylation element (CPE), indicating that post-transcriptional control of PRNP mRNA activity is important. Phylogenetic footprinting revealed conserved potential binding sites for the MZF-1 transcription factor in both upstream promoter and intron/intron 1, and for the MEF2, MyTI, Oct-1 and NFAT transcription factors in the intron(s). The presence of a conserved NFAT-binding site and CPE indicates involvement of PrPC in signal transduction and synaptic plasticity. (c) 2004 Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Following the completion of the draft Human Genome in 2001, genomic sequence data is becoming available at an accelerating rate, fueled by advances in sequencing and computational technology. Meanwhile, large collections of astronomical and geospatial data have allowed the creation of virtual observatories, accessible throughout the world and requiring only commodity hardware. Through a combination of advances in data management, data mining and visualization, this infrastructure enables the development of new scientific and educational applications as diverse as galaxy classification and real-time tracking of earthquakes and volcanic plumes. In the present paper, we describe steps taken along a similar path towards a virtual observatory for genomes – an immersive three-dimensional visual navigation and query system for comparative genomic data.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The genomic sequence of an Australian isolate of carrot mottle umbravirus (CMoV-A) was determined from cDNA generated from dsRNA. This provides the first data on the genome organization and phylogeny of an umbravirus. The 4201-nucleotide genome contains four major open reading frames (ORFs). Analysis suggests that ORF2 encodes an RNA-dependent RNA polymerase, that ORF4 encodes a movement protein, and that the virus has no coat protein gene. The functions of ORFs 1 and 3 remain unknown. ORF2 is probably translated following ribosomal frameshifting. ORFs 3 and 4 are probably translated from a subgenomic mRNA. Sequence comparisons showed CMoV-A to be closely related to pea enation mosaic RNA2 NA2), but also to have affinities with the Bromoviridae. These findings shed light on the relationships between the luteoviruses, PEMV, and the umbraviruses and on the relationships between the carmo-like viruses and the Bromoviridae.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Members of the family Circoviridae, specifically the genus Circovirus, were thought to infect only vertebrates; however, members of a sister group under the same family, the proposed genusCyclovirus, have been detected recently in insects. In an effort to explore the diversity of cycloviruses and better understand the evolution of these novel ssDNA viruses, here we present five cycloviruses isolated from three dragonfly species (Orthetrum sabina, Xanthocnemis zealandica and Rhionaeschna multicolor) collected in Australia, New Zealand and the USA, respectively. The genomes of these five viruses share similar genome structure to other cycloviruses, with a circular ~1.7 kb genome and two major bidirectionally transcribed ORFs. The genomic sequence data gathered during this study were combined with all cyclovirus genomes available in public databases to identify conserved motifs and regulatory elements in the intergenic regions, as well as determine diversity and recombinant regions within their genomes. The genomes reported here represent four different cyclovirus species, three of which are novel. Our results confirm that cycloviruses circulate widely in winged-insect populations; in eight different cyclovirus species identified in dragonflies to date, some of these exhibit a broad geographical distribution. Recombination analysis revealed both intra-and inter-species recombination events amongst cycloviruses, including genomes recovered from disparate sources (e.g. goat meat and human faeces). Similar to other well-characterized circular ssDNA viruses, recombination may play an important role in cyclovirus evolution. © 2013 SGM.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Human parvovirus B19 is a minute ssDNA virus causing a wide variety of diseases, including erythema infectiosum, arthropathy, anemias, and fetal death. After primary infection, genomic DNA of B19 has been shown to persist in solid tissues of not only symptomatic but also of constitutionally healthy, immunocompetent individuals. In this thesis, the viral DNA was shown to persist as an apparently intact molecule of full length, and without persistence-specific mutations. Thus, although the mere presence of B19 DNA in tissue can not be used as a diagnostic criterion, a possible role in the pathogenesis of diseases e.g. through mRNA or protein production can not be excluded. The molecular mechanism, the host-cell type and the possible clinical significance of B19 DNA tissue persistence are yet to be elucidated. In the beginning of this work, the B19 genomic sequence was considered highly conserved. However, new variants were found: V9 was detected in 1998 in France, in serum of a child with aplastic crisis. This variant differed from the prototypic B19 sequences by ~10 %. In 2002 we found, persisting in skin of constitutionally healthy humans, DNA of another novel B19 variant, LaLi. Genetically this variant differed from both the prototypic sequences and the variant V9 also by ~10%. Simultaneously, B19 isolates with DNA sequences similar to LaLi were introduced by two other groups, in the USA and France. Based on phylogeny, a classification scheme based on three genotypes (B19 types 1-3) was proposed. Although the B19 virus is mainly transmitted via the respiratory route, blood and plasma-derived products contaminated with high levels of B19 DNA have also been shown to be infectious. The European Pharmacopoeia stipulates that, in Europe, from the beginning of 2004, plasma pools for manufacture must contain less than 104 IU/ml of B19 DNA. Quantitative PCR screening is therefore a prerequisite for restriction of the B19 DNA load and obtaining of safe plasma products. Due to the DNA sequence variation among the three B19 genotypes, however, B19 PCR methods might fail to detect the new variants. We therefore examined the suitability of the two commercially available quantitative B19 PCR tests, LightCycler-Parvovirus B19 quantification kit (Roche Diagnostics) and RealArt Parvo B19 LC PCR (Artus), for detection, quantification and differentiation of the three B19 types known, including B19 types 2 and 3. The former method was highly sensitive for detection of the B19 prototype but was not suitable for detection of types 2 and 3. The latter method detected and differentiated all three B19 virus types. However, one of the two type-3 strains was detected at a lower sensitivity. Then, we assessed the prevalence of the three B19 virus types among Finnish blood donors, by screening pooled plasma samples derived from >140 000 blood-donor units: none of the pools contained detectable levels of B19 virus types 2 or 3. According to the results of other groups, B19 type 2 was absent also among Danish blood-donors, and extremely rare among symptomatic European patients. B19 type 3 has been encountered endemically in Ghana and (apparently) in Brazil, and sporadical cases have been detected in France and the UK. We next examined the biological characteristics of these virus types. The p6 promoter regions of virus types 1-3 were cloned in front of a reporter gene, the constructs were transfected into different cell lines, and the promoter activities were measured. As a result, we found that the activities of the three p6 promoters, although differing in sequence by >20%, were of equal strength, and most active in B19-permissive cells. Furthermore, the infectivity of the three B19 types was examined in two B19-permissive cell lines. RT-PCR revealed synthesis of spliced B19 mRNAs, and immunofluorescence verified the production of NS1 and VP proteins in the infected cells. These experiments suggested similar host-cell tropism and showed that the three virus types are strains of the same species, i.e. human parvovirus B19. Last but not least, the sera from subjects infected in the past either with B19 type 1 or type 2 (as evidenced by tissue persistence of the respective DNAs), revealed in VP1/2- and VP2-EIAs a 100 % cross-reactivity between virus types 1 and 2. These results, together with similar studies by others, indicate that the three B19 genotypes constitute a single serotype.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

X-ray diffraction studies on single crystals of a few viruses have led to the elucidation of their three dimensional structure at near atomic resolution. Both the tertiary structure of the coat protein subunit and the quaternary organization of the icosahedral capsid in these viruses are remarkably similar. These studies have led to a critical re-examination of the structural principles in the architecture of isometric viruses and suggestions of alternative mechanisms of assembly. Apart from their role in the assembly of the virus particle, the coat proteins of certian viruses have been shown to inhibit the replication of the cognate RNA leading to cross-protection. The coat protein amino acid sequence and the genomic sequence of several spherical plant RNA viruses have been determined in the last decade. Experimental data on the mechanisms of uncoating, gene expression and replication of several classes of viruses have also become available. The function of the non-structural proteins of some viruses have been determined. This rapid progress has provided a wealth of information on several key steps in the life cycle of RNA viruses. The function of the viral coat protein, capsid architecture, assembly and disassembly and replication of isometric RNA plant viruses are discussed in the light of this accumulated knowledge.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Sixteen million nucleotide sequence of genome of various organisms have been analysed to detect and study the extent of occurrence of simple repetitive sequences. Two sequence motifs (TG/CA)n and (CT/AG)n capable of adopting unusual DNA structures, left handed Z-conformation and triple-helical conformation respectively, are found to be abundant in rodent and human genomes, but almost completely absent in bacterial genome. (TG/CA)n and (CT/AG)n sequences are present mostly in the intron or 5'/3' flanking regions of the genes. The presence of such repeat motifs in genomic sequence of higher eukaryotes has been correlated with their possible functional significance in nucleosome organization, recombination and gene expression.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

X-ray diffraction studies on single crystals of a few viruses have led to the elucidation of their three dimensional structure at near atomic resolution. Both the tertiary structure of the coat protein subunit and the quaternary morganization of the icosahedral capsid in these viruses are remarkably similar. These studies have led to a critical re-examination of the structural principles in the architecture of isometric viruses and suggestions of alternative mechanisms of assembly. Apart from their role in the assembly of the virus particle, the coat proteins of certian viruses have been shown to inhibit the replication of the cognate RNA leading to cross-protection. The coat protein amino acid sequence and the genomic sequence of several spherical plant RNA viruses have been determined in the last decade. Experimental data on the mechanisms of uncoating, gene expression and replication of several classes of viruses have also become available. The function of the non-structural proteins of some viruses have been determined. This rapid progress has provided a wealth of information on several key steps in the life cycle of RNA viruses. The function of the viral coat protein, capsid architecture, assembly and disassembly and replication of isometric RNA plant viruses are discussed in the light of this accumulated knowledge.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Thiolases are important in fatty-acid degradation and biosynthetic pathways. Analysis of the genomic sequence of Mycobacterium smegmatis suggests the presence of several putative thiolase genes. One of these genes appears to code for an SCP-x protein. Human SCP-x consists of an N-terminal domain (referred to as SCP2 thiolase) and a C-terminal domain (referred as sterol carrier protein 2). Here, the cloning, expression, purification and crystallization of this putative SCP-x protein from M. smegmatis are reported. The crystals diffracted X-rays to 2.5 angstrom resolution and belonged to the triclinic space group P1. Calculation of rotation functions using X-ray diffraction data suggests that the protein is likely to possess a hexameric oligomerization with 32 symmetry which has not been observed in the other six known classes of this enzyme.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Physical clustering of genes has been shown in plants; however, little is known about gene clusters that have different functions, particularly those expressed in the tomato fruit. A class I 17.6 small heat shock protein (Sl17.6 shsp) gene was cloned and used as a probe to screen a tomato (Solanum lycopersicum) genomic library. An 8.3-kb genomic fragment was isolated and its DNA sequence determined. Analysis of the genomic fragment identified intronless open reading frames of three class I shsp genes (Sl17.6, Sl20.0, and Sl20.1), the Sl17.6 gene flanked by Sl20.1 and Sl20.0, with complete 5' and 3' UTRs. Upstream of the Sl20.0 shsp, and within the shsp gene cluster, resides a box C/D snoRNA cluster made of SlsnoR12.1 and SlU24a. Characteristic C and D, and C' and D', boxes are conserved in SlsnoR12.1 and SlU24a while the upstream flanking region of SlsnoR12.1 carries TATA box 1, homol-E and homol-D box-like cis sequences, TM6 promoter, and an uncharacterized tomato EST. Molecular phylogenetic analysis revealed that this particular arrangement of shsps is conserved in tomato genome but is distinct from other species. The intronless genomic sequence is decorated with cis elements previously shown to be responsive to cues from plant hormones, dehydration, cold, heat, and MYC/MYB and WRKY71 transcription factors. Chromosomal mapping localized the tomato genomic sequence on the short arm of chromosome 6 in the introgression line (IL) 6-3. Quantitative polymerase chain reaction analysis of gene cluster members revealed differential expression during ripening of tomato fruit, and relatively different abundances in other plant parts.