50 results for patent sequence datasets
Abstract:
Conventionally, protein structure prediction via threading relies on some nonoptimal method to align a protein sequence to each member of a library of known structures. We show how a score function (force field) can be modified so as to allow the direct application of a dynamic programming algorithm to the problem. This involves an approximation whose damage can be minimized by an optimization process during score function parameter determination. The method is compared to sequence-to-structure alignments using a more conventional pair-wise score function and the frozen approximation. The new method produces results comparable to the frozen approximation, but is faster and has fewer adjustable parameters. It is also free of memory of the template's original amino acid sequence, and does not suffer from a problem of nonconvergence, which can be shown to occur with the frozen approximation. Alignments generated by the simplified score function can then be ranked using a second score function with the approximations removed. © 1999 John Wiley & Sons, Inc.
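The idea that a score depending only on a residue and the template site it occupies permits direct dynamic programming can be illustrated with a minimal sketch. This is not the authors' score function: the `site_scores` table, the linear `gap` penalty and the function name `align_to_template` are all hypothetical, chosen only to show why removing pair-wise dependence makes the standard recurrence applicable.

```python
from typing import Dict, List, Tuple


def align_to_template(
    seq: str,
    site_scores: List[Dict[str, float]],
    gap: float = -1.0,
) -> Tuple[float, List[Tuple[int, int]]]:
    """Globally align `seq` to template sites by dynamic programming.

    site_scores[j][aa] is a hypothetical score for placing residue `aa`
    at template site j. It does not depend on which residues occupy
    other sites, which is the property that makes the recurrence exact.
    """
    n, m = len(seq), len(site_scores)
    # best[i][j]: best score for aligning seq[:i] with the first j sites
    best = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        best[i][0] = i * gap
    for j in range(1, m + 1):
        best[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            match = best[i - 1][j - 1] + site_scores[j - 1].get(seq[i - 1], 0.0)
            best[i][j] = max(match, best[i - 1][j] + gap, best[i][j - 1] + gap)

    # Trace back from the corner to recover (residue index, site index) pairs.
    pairs: List[Tuple[int, int]] = []
    i, j = n, m
    while i > 0 and j > 0:
        match = best[i - 1][j - 1] + site_scores[j - 1].get(seq[i - 1], 0.0)
        if best[i][j] == match:
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif best[i][j] == best[i - 1][j] + gap:
            i -= 1  # residue left unaligned
        else:
            j -= 1  # template site left unoccupied
    pairs.reverse()
    return best[n][m], pairs


# Toy template with two sites that favour A and C respectively.
toy_sites = [{"A": 2.0}, {"C": 2.0}]
score, pairs = align_to_template("AGC", toy_sites)
```

With a pair-wise score function the contribution of a site would depend on residues placed elsewhere, so the table entries could not be filled independently; that is the situation the frozen approximation works around.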
Abstract:
Fluorescence in situ hybridization of a tile path of DNA subclones has previously enabled the cytogenetic definition of the minimal DNA sequence which spans the FRA16D common chromosomal fragile site, located at 16q23.2. Homozygous deletion of the FRA16D locus has been reported in adenocarcinomas of stomach, colon, lung and ovary. We have sequenced the 270 kb containing the FRA16D fragile site and the minimal homozygously deleted region in tumour cells. This sequence enabled localization of some of the tumour cell breakpoints to regions which contain AT-rich secondary structures similar to those associated with the FRA10B and FRA16B rare fragile sites. The FRA16D DNA sequence also led to the identification of an alternatively spliced gene, named FOR (fragile site FRA16D oxidoreductase), exons of which span both the fragile site and the minimal region of homozygous deletion. In addition, the complete DNA sequence of the FRA16D-containing FOR intron reveals no evidence of additional authentic transcripts. Alternatively spliced FOR transcripts (FOR I, FOR II and FOR III) encode proteins which share N-terminal WW domains and differ at their C-terminus, with FOR III having a truncated oxidoreductase domain. FRA16D-associated deletions selectively affect the FOR gene transcripts. Three out of five previously mapped translocation breakpoints in multiple myeloma are also located within the FOR gene. FOR is therefore the principal genetic target for DNA instability at 16q23.2 and perturbation of FOR function is likely to contribute to the biological consequences of DNA instability at FRA16D in cancer cells.
Abstract:
Two small RNAs regulate the timing of Caenorhabditis elegans development(1,2). Transition from the first to the second larval stage fates requires the 22-nucleotide lin-4 RNA(1,3,4), and transition from late larval to adult cell fates requires the 21-nucleotide let-7 RNA(2). The lin-4 and let-7 RNA genes are not homologous to each other, but are each complementary to sequences in the 3' untranslated regions of a set of protein-coding target genes that are normally negatively regulated by the RNAs(1,2,5,6). Here we have detected let-7 RNAs of approximately 21 nucleotides in samples from a wide range of animal species, including vertebrate, ascidian, hemichordate, mollusc, annelid and arthropod, but not in RNAs from several cnidarian and poriferan species, Saccharomyces cerevisiae, Escherichia coli or Arabidopsis. We did not detect lin-4 RNA in these species. We found that let-7 temporal regulation is also conserved: let-7 RNA expression is first detected at late larval stages in C. elegans and Drosophila, at 48 hours after fertilization in zebrafish, and in adult stages of annelids and molluscs. The let-7 regulatory RNA may control late temporal transitions during development across animal phylogeny.
Abstract:
Endoparasitoid wasps produce maternal protein secretions, which are transported into the body of insect hosts at oviposition to regulate host physiology for successful development of their offspring. Venturia canescens calyx fluid contains so-called virus-like particles (VLPs) that are essential for immune evasion of the developing parasitoid inside the host. VLPs consist of four major proteins. In this paper, we describe the isolation and molecular cloning of a gene (vlp2) that is a constituent of VLPs and discuss its possible role in VLP structure and function.
Abstract:
Human N-acetyltransferase Type I (NAT1) catalyses the acetylation of many aromatic amine and hydrazine compounds and it has been implicated in the catabolism of folic acid. The enzyme is widely expressed in the body, although there are considerable differences in the level of activity between tissues. A search of the mRNA databases revealed the presence of several NAT1 transcripts in human tissue that appear to be derived from different promoters. Because little is known about NAT1 gene regulation, the present study was undertaken to characterize one of the putative promoter sequences of the NAT1 gene located just upstream of the coding region. We show with reverse-transcriptase PCR that mRNA transcribed from this promoter (Promoter 1) is present in a variety of human cell lines, but not in quiescent peripheral blood mononuclear cells. Using deletion mutant constructs, we identified a 20 bp sequence located 245 bases upstream of the translation start site which was sufficient for basal NAT1 expression. It comprised an AP-1 (activator protein 1)-binding site, flanked on either side by a TCATT motif. Mutational analysis showed that the AP-1 site and the 3' TCATT sequence were necessary for gene expression, whereas the 5' TCATT appeared to attenuate promoter activity. Electromobility shift assays revealed two specific bands made up by complexes of c-Fos/Fra, c-Jun, YY-1 (Yin and Yang 1) and possibly Oct-1. PMA treatment enhanced expression from the NAT1 promoter via the AP-1-binding site. Furthermore, in peripheral blood mononuclear cells, PMA increased endogenous NAT1 activity and induced mRNA expression from Promoter 1, suggesting that it is functional in vivo.
Abstract:
Background: A major goal in the post-genomic era is to identify and characterise disease susceptibility genes and to apply this knowledge to disease prevention and treatment. Rodents and humans have remarkably similar genomes and share closely related biochemical, physiological and pathological pathways. In this work we utilised the latest information on the mouse transcriptome as revealed by the RIKEN FANTOM2 project to identify novel human disease-related candidate genes. We define a new term, patholog, to mean a homolog of a human disease-related gene encoding a product (transcript, anti-sense or protein) potentially relevant to disease. Rather than just focus on Mendelian inheritance, we applied the analysis to all potential pathologs regardless of their inheritance pattern. Results: Bioinformatic analysis and human curation of 60,770 RIKEN full-length mouse cDNA clones produced 2,578 sequences that showed similarity (70-85% identity) to known human-disease genes. Using a newly developed biological information extraction and annotation tool (FACTS) in parallel with human expert analysis of 17,051 MEDLINE scientific abstracts we identified 182 novel potential pathologs. Of these, 36 were identified by computational tools only, 49 by human expert analysis only and 97 by both methods. These pathologs were related to neoplastic (53%), hereditary (24%), immunological (5%), cardio-vascular (4%), or other (14%) disorders. Conclusions: Large scale genome projects continue to produce a vast amount of data with potential application to the study of human disease. For this potential to be realised we need intelligent strategies for data categorisation and the ability to link sequence data with relevant literature. This paper demonstrates the power of combining human expert annotation with FACTS, a newly developed bioinformatics tool, to identify novel pathologs from within large-scale mouse transcript datasets.
Abstract:
Phylogenetic hypotheses are presented for Pultenaea based on cpDNA (trnL-F and ndhF) and nrDNA (ITS) sequence data. Pultenaea, as it is currently circumscribed, comprises six strongly supported lineages whose relationships with each other and 18 closely related genera are weak or conflicting among datasets. The lack of resolution among the six Pultenaea clades and their relatives appears to be the result of a rapid radiation, which is evident in molecular data from both the chloroplast and nuclear genomes. The molecular data provide no support for the monophyly of Pultenaea as it currently stands. Given these results, Pultenaea could be split into many smaller genera. We prefer the taxonomically stable alternative of subsuming all 19 genera currently recognised in Pultenaea sensu lato (= the Mirbelia group) into an expanded concept of Pultenaea that would comprise approximately 470 species.
Abstract:
Microsatellites or simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Single-locus SSR markers have been developed for a number of species, although there is a major bottleneck in developing SSR markers whereby flanking sequences must be known to design 5'-anchors for polymerase chain reaction (PCR) primers. Inter SSR (ISSR) fingerprinting was developed such that no sequence knowledge was required. Primers based on a repeat sequence, such as (CA)n, can be made with a degenerate 3'-anchor, such as (CA)8RG or (AGC)6TY. The resultant PCR reaction amplifies the sequence between two SSRs, yielding a multilocus marker system useful for fingerprinting, diversity analysis and genome mapping. PCR products are radiolabelled with P-32 or P-33 via end-labelling or PCR incorporation, and separated on a polyacrylamide sequencing gel prior to autoradiographic visualisation. A typical reaction yields 20-100 bands per lane depending on the species and primer. We have used ISSR fingerprinting in a number of plant species, and report here some results on two important tropical species, sorghum and banana. Previous investigators have demonstrated that ISSR analysis usually detects a higher level of polymorphism than that detected with restriction fragment length polymorphism (RFLP) or random amplified polymorphic DNA (RAPD) analyses. Our data indicate that this is not a result of greater polymorphism genetically, but rather of technical reasons related to the detection methodology used for ISSR analysis.
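The degenerate 3'-anchored primers named in this abstract can be spelled out as concrete oligonucleotides with a short sketch. The helper names `issr_primer` and `expand_degenerate` are hypothetical; the only outside fact assumed is the standard IUPAC nucleotide ambiguity code, in which R stands for A or G and Y for C or T.

```python
from itertools import product
from typing import List

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def issr_primer(repeat: str, copies: int, anchor: str) -> str:
    """Build a 3'-anchored ISSR primer such as (CA)8RG."""
    return repeat * copies + anchor


def expand_degenerate(primer: str) -> List[str]:
    """List every concrete sequence covered by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


ca8rg = expand_degenerate(issr_primer("CA", 8, "RG"))
agc6ty = expand_degenerate(issr_primer("AGC", 6, "TY"))
```

Each of these primers contains a single two-fold degenerate base, so each expands to two sequences. The anchor pins the primer to one end of the repeat, which is what lets amplification proceed without knowledge of the flanking sequence.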
Abstract:
Phenylalanine hydroxylase (PAH) is activated by its substrate phenylalanine and inhibited by its cofactor tetrahydrobiopterin (BH4). The crystal structure of PAH revealed that the N-terminal sequence of the enzyme (residues 19-29) partially covered the enzyme active site, and suggested its involvement in regulation. We show that the protein lacking this N-terminal sequence does not require activation by phenylalanine, shows an altered structural response to phenylalanine, and is not inhibited by BH4. Our data support the model where the N-terminal sequence of PAH acts as an intrasteric autoregulatory sequence, responsible for transmitting the effect of phenylalanine activation to the active site. © 2001 Federation of European Biochemical Societies. Published by Elsevier Science B.V. All rights reserved.
Abstract:
Henneguya lesteri n. sp. (Myxosporea) is described from sand whiting, Sillago analis, from the southern Queensland coast of Australia. H. lesteri displays a preference for the pseudobranchs and is typically positioned along the afferent blood vessels, displacing the adjoining lamellae and disrupting their normal array. The plasmodia appeared as whitish-hyaline, elliptical cysts (mean dimensions 230 x 410 µm) attached to the oral mucosa lining of the hyoid arch on the inner surface of the operculum. Infections of the gills were also found, in which the plasmodia were spherical, averaged 240 x 240 µm in size and were located on the inner hemibranch margin. The parasites lodged in the gill filament crypts and generated a mild hyperplastic response of the branchial epithelium. In histological sections, the plasmodium wall and adjoining ectoplasm appeared as a finely granulated, weakly eosinophilic layer. Ultrastructurally, this section of the host-parasite interface contained an intricate complex of pinocytotic channels. H. lesteri is polysporic, disporoblastic and pansporoblast forming. Sporogenesis is asynchronous, with the earliest developmental stages aligned predominantly along the plasmodium periphery, and maturing sporoblasts and spores toward the center. Ultrastructural details of sporoblast and spore development are in agreement with previously described myxosporeans. The mature spore is drop-shaped, length (mean) 9.1 µm, width 4.7 µm, thickness 2.5 µm, and comprises 2 polar capsules positioned closely together, a binucleated sporoplasm and a caudal process of 12.6 µm. The polar capsules are elongated, 3.2 x 1.6 µm, with 4 turns of the polar filament. Mean length of the everted filament is 23.2 µm. Few studies have analyzed the 18S gene of marine Myxosporea. In fact, H. lesteri is the first marine species of Henneguya to be characterized at the molecular level: we determined 1966 bp of the small-subunit (18S) rDNA. The results indicated that differences between this and the hitherto studied freshwater Henneguya species are greater than differences among the freshwater Henneguya species.
Abstract:
There have been no reports of DNA sequences of hepatitis B virus (HBV) strains from Australian Aborigines, although the hepatitis B surface antigen (HBsAg) was discovered among them. To investigate the characteristics of DNA sequences of HBV strains from Australian Aborigines, the complete nucleotide sequences of HBV strains were determined and subjected to molecular evolutionary analysis. Serum samples positive for HBsAg were collected from five Australian Aborigines. Phylogenetic analysis of the five complete nucleotide sequences compared with DNA sequences of 54 global HBV isolates from international databases revealed that three of the five were classified into genotype D and were most closely related in terms of evolutionary distance to a strain isolated from a healthy blood donor in Papua New Guinea. Two of the five were classified into a novel variant genotype C, which has not been reported previously, and were closely related to a strain isolated from Polynesians, particularly in the X and Core genes. These two strains of variant genotype C differed from known genotype C strains by 5.9-7.4% over the complete nucleotide sequence and 4.0-5.6% in the small-S gene, and had residues Arg(122), Thr(127) and Lys(160) characteristic of serotype ayw3, which have not been reported previously in genotype C. In conclusion, this is the first report of the characteristics of complete nucleotide sequences of HBV from Australian Aborigines. These results contribute to the investigation of the worldwide spread of HBV, the relationship between serotype and genotype and the ancient common origin of Australian Aborigines.
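Percentage differences between strains, like those quoted in this abstract, are commonly obtained by comparing aligned sequences position by position. The abstract does not state which distance measure was used, so the sketch below is only an assumption: it computes an uncorrected pairwise difference (p-distance) over two pre-aligned sequences, skipping gapped columns. The function name `percent_divergence` and the toy sequences are hypothetical.

```python
def percent_divergence(aligned_a: str, aligned_b: str) -> float:
    """Percentage of differing positions between two aligned sequences.

    Columns where either sequence has a gap ('-') are ignored. This is an
    uncorrected p-distance, not a model-based evolutionary distance.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x != y:
            differences += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * differences / compared


# Toy example: one mismatch in ten compared positions.
toy = percent_divergence("ACGTACGTAC", "ACGTACGTAA")
```

Genotype assignment in practice rests on phylogenetic analysis against reference isolates, as the abstract describes; a raw percentage of this kind is only a summary of how far apart two sequences are.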