926 resultados para Genome annotation
Resumo:
Coleoptera is the most diverse group of insects with over 360,000 described species divided into four suborders: Adephaga, Archostemata, Myxophaga, and Polyphaga. In this study, we present six new complete mitochondrial genome (mtgenome) descriptions, including a representative of each suborder, and analyze the evolution of mtgenomes from a comparative framework using all available coleopteran mtgenomes. We propose a modification of atypical cox1 start codons based on sequence alignment to better reflect the conservation observed across species as well as findings of TTG start codons in other genes. We also analyze tRNA-Ser(AGN) anticodons, usually GCU in arthropods, and report a conserved UCU anticodon as a possible synapomorphy across Polyphaga. We further analyze the secondary structure of tRNA-Ser(AGN) and present a consensus structure and an updated covariance model that allows tRNAscan-SE (via the COVE software package) to locate and fold these atypical tRNAs with much greater consistency. We also report secondary structure predictions for both rRNA genes based on conserved stems. All six species of beetle have the same gene order as the ancestral insect. We report noncoding DNA regions, including a small gap region of about 20 bp between tRNA-Ser(UCN) and nad1 that is present in all six genomes, and present results of a base composition analysis.
Resumo:
Escherichia coli ST131 is now recognised as a leading contributor to urinary tract and bloodstream infections in both community and clinical settings. Here we present the complete, annotated genome of E. coli EC958, which was isolated from the urine of a patient presenting with a urinary tract infection in the Northwest region of England and represents the most well characterised ST131 strain. Sequencing was carried out using the Pacific Biosciences platform, which provided sufficient depth and read-length to produce a complete genome without the need for other technologies. The discovery of spurious contigs within the assembly that correspond to site-specific inversions in the tail fibre regions of prophages demonstrates the potential for this technology to reveal dynamic evolutionary mechanisms. E. coli EC958 belongs to the major subgroup of ST131 strains that produce the CTX-M-15 extended spectrum β-lactamase, are fluoroquinolone resistant and encode the fimH30 type 1 fimbrial adhesin. This subgroup includes the Indian strain NA114 and the North American strain JJ1886. A comparison of the genomes of EC958, JJ1886 and NA114 revealed that differences in the arrangement of genomic islands, prophages and other repetitive elements in the NA114 genome are not biologically relevant and are due to misassembly. The availability of a high quality uropathogenic E. coli ST131 genome provides a reference for understanding this multidrug resistant pathogen and will facilitate novel functional, comparative and clinical studies of the E. coli ST131 clonal lineage.
Resumo:
Escherichia coli strains causing urinary tract infection (UTI) are increasingly recognized as belonging to specific clones. E. coli clone O25b:H4-ST131 has recently emerged globally as a leading multi-drug resistant pathogen causing urinary tract and bloodstream infections in hospitals and the community. While most molecular studies to date examine the mechanisms conferring multi-drug resistance in E. coli ST131, relatively little is known about their virulence potential. Here we examined E. coli ST131 clinical isolates from two geographically diverse collections, one representing the major pathogenic lineages causing UTI across the United Kingdom and a second representing UTI isolates from patients presenting at two large hospitals in Australia. We determined a draft genome sequence for one representative isolate, E. coli EC958, which produced CTX-M-15 extended-spectrum β-lactamase, CMY-23 type AmpC cephalosporinase and was resistant to ciprofloxacin. Comparative genome analysis indicated that EC958 encodes virulence genes commonly associated with uropathogenic E. coli (UPEC). The genome sequence of EC958 revealed a transposon insertion in the fimB gene encoding the activator of type 1 fimbriae, an important UPEC bladder colonization factor. We identified the same fimB transposon insertion in 59% of the ST131 UK isolates, as well as 71% of ST131 isolates from Australia, suggesting this mutation is common among E. coli ST131 strains. Insertional inactivation of fimB resulted in a phenotype resembling a slower off-to-on switching for type 1 fimbriae. Type 1 fimbriae expression could still be induced in fimB-null isolates; this correlated strongly with adherence to and invasion of human bladder cells and bladder colonisation in a mouse UTI model. We conclude that E. coli ST131 is a geographically widespread, antibiotic resistant clone that has the capacity to produce numerous virulence factors associated with UTI.
Resumo:
Metastasis accounts for the poor prognosis of the majority of solid tumors. The phenotypic transition of nonmotile epithelial tumor cells to migratory and invasive “mesenchymal” cells (epithelial-to-mesenchymal transition [EMT]) enables the transit of cancer cells from the primary tumor to distant sites. There is no single marker of EMT; rather, multiple measures are required to define cell state. Thus, the multiparametric capability of high-content screening is ideally suited for the comprehensive analysis of EMT regulators. The aim of this study was to generate a platform to systematically identify functional modulators of tumor cell plasticity using the bladder cancer cell line TSU-Pr1-B1 as a model system. A platform enabling the quantification of key EMT characteristics, cell morphology and mesenchymal intermediate filament vimentin, was developed using the fluorescent whole-cell-tracking reagent CMFDA and a fluorescent promoter reporter construct, respectively. The functional effect of genome-wide modulation of protein-coding genes and miRNAs coupled with those of a collection of small-molecule kinase inhibitors on EMT was assessed using the Target Activation Bioapplication integrated in the Cellomics ArrayScan platform. Data from each of the three screens were integrated to identify a cohort of targets that were subsequently examined in a validation assay using siRNA duplexes. Identification of established regulators of EMT supports the utility of this screening approach and indicated capacity to identify novel regulators of this plasticity program. Pathway analysis coupled with interrogation of cancer-related expression profile databases and other EMT-related screens provided key evidence to prioritize further experimental investigation into the molecular regulators of EMT in cancer cells.
Resumo:
Background The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. Results RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene. Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. Conclusions This transcriptomic dataset is a useful resource for molecular genetic studies of the koala, for evolutionary genetic studies of marsupials, for validation and annotation of the koala genome sequence, and for investigation of koala retrovirus. Annotated transcripts can be browsed and queried at http://koalagenome.org
Resumo:
This project is a step forward in the study of text mining where enhanced text representation with semantic information plays a significant role. It develops effective methods of entity-oriented retrieval, semantic relation identification and text clustering utilizing semantically annotated data. These methods are based on enriched text representation generated by introducing semantic information extracted from Wikipedia into the input text data. The proposed methods are evaluated against several start-of-art benchmarking methods on real-life data-sets. In particular, this thesis improves the performance of entity-oriented retrieval, identifies different lexical forms for an entity relation and handles clustering documents with multiple feature spaces.
Resumo:
Acoustic sensors allow scientists to scale environmental monitoring over large spatiotemporal scales. The faunal vocalisations captured by these sensors can answer ecological questions, however, identifying these vocalisations within recorded audio is difficult: automatic recognition is currently intractable and manual recognition is slow and error prone. In this paper, a semi-automated approach to call recognition is presented. An automated decision support tool is tested that assists users in the manual annotation process. The respective strengths of human and computer analysis are used to complement one another. The tool recommends the species of an unknown vocalisation and thereby minimises the need for the memorization of a large corpus of vocalisations. In the case of a folksonomic tagging system, recommending species tags also minimises the proliferation of redundant tag categories. We describe two algorithms: (1) a “naïve” decision support tool (16%–64% sensitivity) with efficiency of O(n) but which becomes unscalable as more data is added and (2) a scalable alternative with 48% sensitivity and an efficiency ofO(log n). The improved algorithm was also tested in a HTML-based annotation prototype. The result of this work is a decision support tool for annotating faunal acoustic events that may be utilised by other bioacoustics projects.