15 results for CpGV resistance baculovirus whole genome sequencing
in University of Queensland eSpace - Australia
Abstract:
Translational pausing may occur due to a number of mechanisms, including the presence of non-optimal codons, and it is thought to play a role in the folding of specific polypeptide domains during translation and in the facilitation of signal peptide recognition during sec-dependent protein targeting. In this whole genome analysis of Escherichia coli we have found that non-optimal codons in the signal peptide-encoding sequences of secretory genes are overrepresented relative to the mature portions of these genes; this is in addition to their overrepresentation in the 5'-regions of genes encoding non-secretory proteins. We also find increased non-optimal codon usage at the 3' ends of most E. coli genes, in both non-secretory and secretory sequences. Whereas presumptive translational pausing at the 5' and 3' ends of E. coli messenger RNAs may clearly have a general role in translation, we suggest that it also has a specific role in sec-dependent protein export, possibly in facilitating signal peptide recognition. This finding may have important implications for our understanding of how the majority of non-cytoplasmic proteins are targeted, a process that is essential to all biological cells. (C) 2004 Elsevier Inc. All rights reserved.
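The comparison described in this abstract (non-optimal codon usage in the signal peptide-encoding region versus the mature portion of a gene) can be illustrated with a minimal sketch. The codon set, the toy sequence, and the function names below are assumptions made for illustration only; the abstract does not say which codons the authors classed as non-optimal or how they tested for overrepresentation.

```python
def codons(seq):
    """Split a coding sequence into complete codons, dropping any trailing partial codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def nonoptimal_fraction(codon_list, nonoptimal):
    """Fraction of codons in codon_list that belong to the nonoptimal set."""
    if not codon_list:
        return float("nan")
    return sum(c in nonoptimal for c in codon_list) / len(codon_list)


def compare_regions(cds, signal_codons, nonoptimal):
    """Return (signal-region fraction, mature-region fraction) of non-optimal codons."""
    cs = codons(cds)
    return (
        nonoptimal_fraction(cs[:signal_codons], nonoptimal),
        nonoptimal_fraction(cs[signal_codons:], nonoptimal),
    )


# Illustrative set only: a few codons commonly cited as rare in E. coli.
# This is an assumption, not the set used in the study.
NONOPTIMAL = {"AGG", "AGA", "CTA", "ATA"}

# Made-up sequence: the first 4 codons stand in for a signal peptide region.
toy_cds = "ATGAGGCTAATA" + "CTGGAAAAACGT"
signal_frac, mature_frac = compare_regions(toy_cds, 4, NONOPTIMAL)
```

In a real analysis the signal peptide boundary would come from annotation for each gene, and the two fractions would be compared across many genes with an appropriate statistical test.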
Abstract:
Topological measures of large-scale complex networks are applied to a specific artificial regulatory network model created through a whole genome duplication and divergence mechanism. This class of networks shares topological features with natural transcriptional regulatory networks. Specifically, these networks display scale-free and small-world topology and possess subgraph distributions similar to those of natural networks. Thus, the topologies inherent in natural networks may be in part due to their method of creation rather than being exclusively shaped by subsequent evolution under selection. The evolvability of the dynamics of these networks is also examined by evolving networks in simulation to obtain three simple types of output dynamics. The networks obtained from this process show a wide variety of topologies and numbers of genes, indicating that it is relatively easy to evolve these classes of dynamics in this model. (c) 2006 Elsevier Ireland Ltd. All rights reserved.
Abstract:
Chlamydia pneumoniae is an obligate intracellular respiratory pathogen that causes 10% of community-acquired pneumonia and has been associated with cardiovascular disease. Both whole-genome sequencing and specific gene typing suggest that there is relatively little genetic variation in human isolates of C. pneumoniae. To date, there has been little genomic analysis of strains from human cardiovascular sites. The genotypes of C. pneumoniae present in human atherosclerotic carotid plaque were analysed and several polymorphisms in the variable domain 4 (VD4) region of the outer-membrane protein-A (ompA) gene and the intergenic region between the ygeD and uridine kinase (ygeD-urk) genes were found. While one genotype was identified that was the same as one reported previously in humans (respiratory and cardiovascular), another genotype was found that was identical to a genotype from non-human sources (frog/koala).
Abstract:
Background: Determination of the subcellular location of a protein is essential to understanding its biochemical function. This information can provide insight into the function of hypothetical or novel proteins. These data are difficult to obtain experimentally but have become especially important since many whole genome sequencing projects have been finished and many resulting protein sequences are still lacking detailed functional information. In order to address this paucity of data, many computational prediction methods have been developed. However, these methods have varying levels of accuracy and perform differently based on the sequences that are presented to the underlying algorithm. It is therefore useful to compare these methods and monitor their performance. Results: In order to perform a comprehensive survey of prediction methods, we selected only methods that accepted large batches of protein sequences, were publicly available, and were able to predict localization to at least nine of the major subcellular locations (nucleus, cytosol, mitochondrion, extracellular region, plasma membrane, Golgi apparatus, endoplasmic reticulum (ER), peroxisome, and lysosome). The selected methods were CELLO, MultiLoc, Proteome Analyst, pTarget and WoLF PSORT. These methods were evaluated using 3763 mouse proteins from SwissProt that represent the source of the training sets used in development of the individual methods. In addition, an independent evaluation set of 2145 mouse proteins from LOCATE with a bias towards the subcellular localization underrepresented in SwissProt was used. The sensitivity and specificity were calculated for each method and compared to a theoretical value based on what might be observed by random chance. Conclusion: No individual method had a sufficient level of sensitivity across both evaluation sets that would enable reliable application to hypothetical proteins. 
All methods showed lower performance on the LOCATE dataset and variable performance on individual subcellular localizations was observed. Proteins localized to the secretory pathway were the most difficult to predict, while nuclear and extracellular proteins were predicted with the highest sensitivity.
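The per-location evaluation described in this abstract can be sketched as follows. This is a minimal illustration that assumes the standard definitions (sensitivity = TP / (TP + FN), specificity = TN / (TN + FP)); the abstract does not state the formulas the authors used, and the labels below are made up.

```python
def sensitivity_specificity(true_labels, predicted_labels, location):
    """Sensitivity and specificity for one subcellular location, treated as the positive class."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == location and pred == location:
            tp += 1
        elif truth != location and pred == location:
            fp += 1
        elif truth == location and pred != location:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical annotations and predictions for six proteins.
truth = ["nucleus", "nucleus", "cytosol", "ER", "nucleus", "cytosol"]
pred = ["nucleus", "cytosol", "cytosol", "nucleus", "nucleus", "ER"]

sens, spec = sensitivity_specificity(truth, pred, "nucleus")
```

Repeating the call for each location and each prediction method gives the kind of per-location table from which differences between compartments could be read.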
Abstract:
The southern cattle tick, Boophilus microplus (Canestrini), causes annual economic losses in the hundreds of millions of dollars to cattle producers throughout the world, and ranks as the most economically important tick from a global perspective. Control failures attributable to the development of pesticide resistance have become commonplace, and novel control technologies are needed. The availability of the genome sequence will facilitate the development of these new technologies, and we are proposing sequencing to a 4-6X draft coverage. Many existing biological resources are available to facilitate a genome sequencing project, including several inbred laboratory tick strains, a database of approximately 45,000 expressed sequence tags compiled into a B. microplus Gene Index, a bacterial artificial chromosome (BAC) library, an established B. microplus cell line, and genomic DNA suitable for library synthesis. Collaborative projects are underway to map BACs and cDNAs to specific chromosomes and to sequence selected BAC clones. When completed, the genome sequences from the cow, B. microplus, and the B. microplus-borne pathogens Babesia bovis and Anaplasma marginale will enhance studies of host-vector-pathogen systems. Genes involved in the regeneration of amputated tick limbs and transitions through developmental stages are largely unknown. Studies of these and other interesting biological questions will be advanced by tick genome sequence data. Comparative genomics offers the prospect of new insight into many, perhaps all, aspects of the biology of ticks and the pathogens they transmit to farm animals and people. The B. microplus genome sequence will fill a major gap in comparative genomics: a sequence from the Metastriata lineage of ticks. The purpose of the article is to synergize interest in and provide rationales for sequencing the genome of B. microplus and for publicizing currently available genomic resources for this tick.
Abstract:
The SOX family of transcription factors is found throughout the animal kingdom and is important in a variety of developmental contexts. Genome analysis has identified 20 Sox genes in human and mouse, which can be subdivided into 8 groups, based on sequence comparison and intron-exon structure. Most of the SOX groups identified in mammals are represented by a single SOX sequence in invertebrate model organisms, suggesting a duplication and divergence mechanism has operated during vertebrate evolution. We have now analysed the Sox gene complement in the pufferfish, Fugu rubripes, in order to shed further light on the diversity and origins of the Sox gene family. Major differences were found between the Sox family in Fugu and those in humans and mice. In particular, Fugu does not have orthologues of Sry, Sox15 and Sox30, which appear to be specific to mammals, while Sox19, found in Fugu and zebrafish but absent in mammals, seems to be specific to fishes. Six mammalian Sox genes are represented by two copies each in Fugu, indicating a large-scale gene duplication in the fish lineage. These findings point to recent Sox gene loss, duplication and divergence occurring during the evolution of tetrapod and teleost lineages, and provide further evidence for large-scale segmental or a whole-genome duplication occurring early in the radiation of teleosts. (C) 2004 Elsevier B.V. All rights reserved.
Abstract:
The planctomycetes are a phylum of bacteria that have a unique cell compartmentalisation and yeast-like budding cell division and peptidoglycan-less proteinaceous cell walls. We wished to further our understanding of these unique organisms at the molecular level by searching for conserved amino acid sequence motifs and domains in the proteins encoded by Rhodopirellula baltica. Using BLAST and single-linkage clustering, we have discovered several new protein domains and sequence motifs in this planctomycete. R. baltica has multiple members of the newly discovered GEFGR protein family and the ASPIC C-terminal domain family, whilst most other organisms for which whole genome sequence is available have no more than one. Many of the domains and motifs appear to be restricted to the planctomycetes. It is possible that these protein domains and motifs may have been lost or replaced in other phyla, or they may have undergone multiple duplication events in the planctomycete lineage. One of the novel motifs probably represents a novel N-terminal export signal peptide. With their unique cell biology, it may be that the planctomycete cell compartmentalisation plan in particular needs special membrane transport mechanisms. The discovery of these new domains and motifs, many of which are associated with secretion and cell-surface functions, will help to stimulate experimental work and thus enhance further understanding of this fascinating group of organisms. (C) 2004 Federation of European Microbiological Societies. Published by Elsevier B.V. All rights reserved.
Abstract:
The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. (C) 2004 Elsevier Inc. All rights reserved.
Abstract:
Columnar cell lesions (CCLs) of the breast are a spectrum of lesions that have posed difficulties to pathologists for many years, prompting discussion concerning their biologic and clinical significance. We present a study of CCL in context with hyperplasia of usual type (HUT) and the more advanced lesions ductal carcinoma in situ (DCIS) and invasive ductal carcinoma. A total of 81 lesions from 18 patients were subjected to a comprehensive morphologic review based upon a modified version of Schnitt's classification system for CCL, immunophenotypic analysis (estrogen receptor [ER], progesterone receptor [PgR], Her2/neu, cytokeratin 5/6 [CK5/6], cytokeratin 14 [CK14], E-cadherin, p53) and, for the first time, a whole genome molecular analysis by comparative genomic hybridization. Multiple CCLs from 3 patients were studied in particular detail, with topographic information and/or showing a morphologic spectrum of CCL within individual terminal duct lobular units. CCLs were ER and PgR positive, CK5/6 and CK14 negative, and exhibited low numbers of genetic alterations and recurrent 16q loss, features that are similar to those of low grade in situ and invasive carcinoma. The molecular genetic profiles closely reflect the degree of proliferation and atypia in CCL, indicating some of these lesions represent both a morphologic and molecular continuum. In addition, overlapping chromosomal alterations between CCL and more advanced lesions within individual terminal duct lobular units suggest a commonality in molecular evolution. These data further support the hypothesis that CCLs are a nonobligate, intermediary step in the development of some forms of low grade in situ and invasive carcinoma. Copyright: © 2005 Lippincott Williams & Wilkins, Inc.
Abstract:
Complete vertebrate genome sequencing has revealed a remarkable stability and uniformity in the protein-coding gene set, which at first glance might suggest that gene duplication events are relatively rare. This may be a red herring, or at least a red cichlid, as the Lake Malawi cichlid fishes show rapid and extensive duplication and diversification of their retinal cone photoreceptor opsin genes.
Abstract:
In Mesoamerica, tropical dry forest is a highly threatened habitat, and species endemic to this environment are under extreme pressure. The tree species Lonchocarpus costaricensis is endemic to the dry northwest of Costa Rica and southwest Nicaragua. It is a locally important species but, as land has been cleared for agriculture, populations have experienced considerable reduction and fragmentation. To assess current levels and distribution of genetic diversity in the species, a combination of chloroplast-specific (cpDNA) and whole genome DNA markers (amplified fragment length polymorphism, AFLP) were used to fingerprint 121 individual trees in 6 populations. Two cpDNA haplotypes were identified, distributed among populations such that populations at the extremes of the distribution showed lowest diversity. A large number (487) of AFLP markers were obtained and indicated that diversity levels were highest in the two coastal populations (Cobano, Matapalo, H = 0.23, 0.28 respectively). Population differentiation was low overall, F-ST = 0.12, although Matapalo was strongly differentiated from all other populations (F-ST = 0.16-0.22), apart from Cobano (F-ST = 0.11). Spatial genetic structure was present in both datasets at different scales: cpDNA was structured at a range-wide distribution scale, whilst AFLP data revealed genetic neighbourhoods on a population scale. In general, the habitat degradation of recent times appears not to have yet impacted diversity levels in mature populations. However, although no data on seed or saplings were collected, it seems likely that reproductive mechanisms in the species will have been affected by land clearance. It is recommended that efforts should be made to conserve the extant genetic resource base and further research undertaken to investigate diversity levels in the progeny generation.
Abstract:
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteasome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes.
Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
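The gene-length bias described in this abstract can be illustrated with a small sketch. This is not GONOME's actual statistic, which the abstract does not describe; it only contrasts two null models for a GO term: one where every gene is equally likely to be hit ("balls in a bag") and one where a gene is hit in proportion to its length. The toy annotation, the counts, and all function names are hypothetical.

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def term_fraction_by_count(genes, term):
    """Expected hit fraction if every gene is equally likely to be hit."""
    return sum(term in g["terms"] for g in genes) / len(genes)


def term_fraction_by_length(genes, term):
    """Expected hit fraction if genes are hit in proportion to their length."""
    total = sum(g["end"] - g["start"] for g in genes)
    covered = sum(g["end"] - g["start"] for g in genes if term in g["terms"])
    return covered / total


# Hypothetical annotation: one long gene carries the term of interest.
genes = [
    {"start": 0, "end": 9000, "terms": {"development"}},
    {"start": 10000, "end": 11000, "terms": {"translation"}},
    {"start": 12000, "end": 13000, "terms": {"translation"}},
    {"start": 14000, "end": 15000, "terms": {"metabolism"}},
]

# Suppose 20 positions fall inside genes and 14 of them hit a "development" gene.
n_positions, n_in_term = 20, 14

p_count = term_fraction_by_count(genes, "development")
p_length = term_fraction_by_length(genes, "development")

pval_count = binomial_upper_tail(n_in_term, n_positions, p_count)
pval_length = binomial_upper_tail(n_in_term, n_positions, p_length)
```

Under the count-based null the term looks strongly enriched, while under the length-weighted null the same observation is unremarkable. That is the kind of spurious signal the abstract attributes to ignoring gene size.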
Abstract:
In late summer 1999, an outbreak of human encephalitis occurred in the northeastern United States that was concurrent with extensive mortality in crows (Corvus species) as well as the deaths of several exotic birds at a zoological park in the same area. Complete genome sequencing of a flavivirus isolated from the brain of a dead Chilean flamingo (Phoenicopterus chilensis), together with partial sequence analysis of envelope glycoprotein (E-glycoprotein) genes amplified from several other species including mosquitoes and two fatal human cases, revealed that West Nile (WN) virus circulated in natural transmission cycles and was responsible for the human disease. Antigenic mapping with E-glycoprotein-specific monoclonal antibodies and E-glycoprotein phylogenetic analysis confirmed these viruses as WN. This North American WN virus was most closely related to a WN virus isolated from a dead goose in Israel in 1998.
Abstract:
To identify transcription factors (TFs) involved in jasmonate (JA) signaling and plant defense, we screened 1,534 Arabidopsis (Arabidopsis thaliana) TFs by real-time quantitative reverse transcription-PCR for altered transcript levels at 6 h following either methyl JA treatment or inoculation with the incompatible pathogen Alternaria brassicicola. We identified 134 TFs that showed a significant change in expression, including many APETALA2/ethylene response factor (AP2/ERF), MYB, WRKY, and NAC TF genes with unknown functions. Twenty TF genes were induced by both the pathogen and methyl JA and these included 10 members of the AP2/ERF TF family, primarily from the B1a and B3 subclusters. Functional analysis of the B1a TF AtERF4 revealed that AtERF4 acts as a novel negative regulator of JA-responsive defense gene expression and resistance to the necrotrophic fungal pathogen Fusarium oxysporum and antagonizes JA inhibition of root elongation. In contrast, functional analysis of the B3 TF AtERF2 showed that AtERF2 is a positive regulator of JA-responsive defense genes and resistance to F. oxysporum and enhances JA inhibition of root elongation. Our results suggest that plants coordinately express multiple repressor- and activator-type AP2/ERFs during pathogen challenge to modulate defense gene expression and disease resistance.
Abstract:
The recent emergence of decreased susceptibility of Neisseria gonorrhoeae strains to penicillin in New Caledonia has led clinicians to change the treatment strategy. In addition, this important health issue has emphasized the need for a rapid means of detecting penicillin resistance in N. gonorrhoeae in order to select an effective treatment and limit the spread of resistant strains. In recent years, the use of fluorescence resonance energy transfer on the LightCycler has proven to be a valuable tool for the screening of mutations occurring in the genome of various microorganisms. In this study, we developed a real-time PCR assay coupled with a fluorometric hybridization probes system to detect a penicillin resistance-associated mutation on the N. gonorrhoeae ponA gene. Following an extensive evaluation involving 136 isolates, melting curve analysis correctly showed a 5 °C Tm shift in all N. gonorrhoeae strains possessing this mutation, as determined by conventional sequencing analysis. Moreover, the mutation profiles obtained with the real-time PCR showed good correlation with the pattern of penicillin susceptibility generated with classical antibiograms. Overall, our molecular assay allowed an accurate and reproducible determination of the susceptibility to penicillin corresponding to a mutation present in all chromosomally mediated resistant strains of N. gonorrhoeae.