950 results for Similarity analysis
Abstract:
Expressed sequence tags (ESTs) have proved to be a useful tool for discovering and identifying functional genes, especially in species for which genetic information is unavailable. A total of 180 ESTs were generated from a cDNA library of gametophytic Gracilaria lemaneiformis in this study. These clones were clustered into 151 groups, among which 8 groups are highly homologous to chloroplast genes and are abundant in the library. A search of the red algal EST database showed that 22 groups match registered ESTs of Rhodophyta and 6 match those of Gracilaria. A search of the protein database revealed that 73 non-redundant clones have significant similarity to known sequences, the majority of which are involved in photosynthesis, DNA transcription or translation; 6, 4 and 3 clones are associated with growth or development, signal transduction, and stress or defense response, respectively.
Abstract:
We tested the applicability of random amplified polymorphic deoxyribonucleic acid (RAPD) analysis for identifying three marine fish cell lines (FG, SPH, and RSBK) and as a possible tool for detecting cross-contamination. Sixty commercial 10-mer RAPD primers were tested on the cell lines and on samples collected from individual fish. The results showed that the cell lines could be assigned to their corresponding species on the basis of identical patterns produced by 35-48% of the primers tested. The total mean similarity indices for cell lines versus individual fish of the corresponding species ranged from 0.825 to 0.851, indicating the existence of genetic variation in these cell lines relative to their species of origin. In addition, four primers were obtained that gave a monomorphic band pattern within each species/line but different patterns among the species/lines. These primers can be useful for identifying these cell lines and for characterizing their genetic variation relative to their species of origin. These results support the use of RAPD analysis as an effective tool for species identification and for cross-contamination testing among different cell lines.
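As a rough illustration of how a band-sharing similarity index can be computed from RAPD patterns, the sketch below uses the Dice (Nei and Li) coefficient, 2 x shared bands / (bands in A + bands in B). The abstract does not say which index was used, so the choice of coefficient, the function name, and the band sizes are all assumptions made for illustration.

```python
def band_sharing_index(bands_a, bands_b):
    """Dice (Nei-Li) similarity between two sets of scored band sizes.

    Assumption: the study's exact similarity index is not stated in the
    abstract; Dice is used here only as a common choice for RAPD data.
    """
    a, b = set(bands_a), set(bands_b)
    if not a and not b:
        raise ValueError("at least one lane must contain bands")
    shared = len(a & b)
    return 2 * shared / (len(a) + len(b))


# Hypothetical band sizes (bp) for one primer; not data from the study.
cell_line = [350, 520, 780, 1100, 1500]
fish = [350, 520, 780, 1100, 1350, 1500]

index = band_sharing_index(cell_line, fish)
print(round(index, 3))
```

Averaging such an index over all informative primers would give one mean value per cell line and species pair, which is the kind of figure the abstract reports.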
Abstract:
Prophenoloxidase (proPO) is a conserved copper-containing enzyme that plays important roles in the immune response of crustaceans and insects. In the present study, the full-length cDNA of a prophenoloxidase (designated EsproPO) was cloned from haemocytes of the Chinese mitten crab Eriocheir sinensis by expressed sequence tag (EST) and PCR techniques. The isolated 3549 bp full-length cDNA of EsproPO contained a 2040 bp open reading frame (ORF) encoding a putative proPO protein of 679 amino acids, a 5'-untranslated region (UTR) of 68 bp, and a long 3'-UTR of 1441 bp. Two putative copper-binding sites, a proteolytic activation site, and a complement-like motif (GCGWPQHM) were identified in the deduced amino acid sequence of EsproPO. Homology analysis revealed that EsproPO was highly similar to other proPOs from crustaceans, with identities from 52% to 68%. The conserved domains and motifs, and the high similarity with other proPOs, suggested that EsproPO was a member of the proPO family. The mRNA expression of EsproPO and PO specific activities in hepatopancreas, gill, gonad, muscle, heart, eye and haemocytes were measured by quantitative real-time PCR and colorimetric assay, respectively. EsproPO mRNA transcripts and PO specific activity could be detected in all the examined tissues, with the highest levels of both in the hepatopancreas. Three peaks of EsproPO mRNA expression were recorded at 2 h, 12 h and 48 h in haemocytes of the Chinese mitten crab after Vibrio anguillarum challenge, which was consistent with the temporal profile of PO specific activity. The mRNA expression pattern and the fluctuation of PO activity after V. anguillarum stimulation indicated that EsproPO was potentially involved in the acute response against invading bacteria in the Chinese mitten crab. (c) 2007 Elsevier Ltd. All rights reserved.
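The lengths reported for EsproPO are internally consistent, which a few lines of arithmetic can confirm. This sketch assumes that the ORF length includes the stop codon and that the full-length cDNA is the sum of the 5'-UTR, ORF and 3'-UTR; the function and variable names are illustrative.

```python
def encoded_residues(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in bp."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # Assumption: the reported ORF length counts the stop codon.
    return codons - 1 if includes_stop else codons


utr5_bp, orf_bp, utr3_bp = 68, 2040, 1441  # values from the abstract

total_bp = utr5_bp + orf_bp + utr3_bp
residues = encoded_residues(orf_bp)
print(total_bp, residues)
```

Under these assumptions the totals agree with the 3549 bp cDNA and the 679-residue protein given in the abstract.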
Abstract:
Anti-lipopolysaccharide factor (ALF) is a basic protein that binds and neutralizes LPS and exhibits strong antibacterial activity against Gram-negative R-type bacteria. The ALF gene of the Chinese mitten crab Eriocheir sinensis (Milne Edwards, 1853) (denoted as EsALF) was identified from haemocytes by expressed sequence tag (EST) and PCR approaches. The full-length cDNA of EsALF consisted of 700 nucleotides with a canonical polyadenylation signal sequence AATAAA, a polyA tail, and an open reading frame of 363 bp encoding 120 amino acids. The high similarity between the deduced amino acid sequence of EsALF and the ALFs of other species indicated that EsALF should be a member of the ALF family. The mRNA expression of EsALF in heart, gonad, gill, haemocytes, eyestalk and muscle was examined by Northern blot analysis, and EsALF transcripts were mainly detected in haemocytes, heart and gonad. The temporal expression of EsALF in haemocytes after Vibrio anguillarum challenge was recorded by quantitative real-time RT-PCR. The relative expression level of EsALF was up-regulated rapidly at 2 h post-injection and reached 3-fold that of the blank group. After a drastic decrease to the original level from 4 to 8 h, the expression level increased again and reached 4-fold that of the blank group at 12 h post-injection. The genomic DNA sequence of the EsALF gene consists of 1174 bp containing three exons and two introns. The coding sequence of the EsALF mature peptide was cloned and expressed in Escherichia coli BL21(DE3)-pLysS to further elucidate its biological functions. The purified recombinant product showed bactericidal activity against both Gram-positive (G(+)) and Gram-negative (G(-)) bacteria, which demonstrated that rEsALF was a broad-spectrum antibacterial peptide. All these results indicated that EsALF was an acute-phase protein involved in the immune responses of the Chinese mitten crab, and provided a potential therapeutic agent for disease control in aquaculture. (c) 2007 Elsevier Ltd. All rights reserved.
Abstract:
Translationally controlled tumor protein (TCTP) is one of the abundant and ubiquitously expressed proteins in metazoans. In the present study, the first molluscan TCTP (denoted as VpTCTP) was identified from Venerupis philippinarum haemocytes by EST and RACE approaches. The full-length cDNA of VpTCTP consisted of 1148 nucleotides with an open reading frame of 555 bp encoding 184 amino acids. The deduced amino acid sequence of VpTCTP shared high similarity with TCTPs from other species, indicating that VpTCTP should be a new member of the TCTP family. Several highly conserved motifs, including a 5'-terminal oligopyrimidine (5'TOP) starting sequence and rich AU and AUUT elements in the 3'UTR, were also identified in VpTCTP. The tissue and temporal expression of VpTCTP after Vibrio anguillarum challenge was recorded by quantitative real-time RT-PCR. VpTCTP transcripts could be detected in all examined tissues, with the highest expression level in haemocytes and the lowest in hepatopancreas. In the time-course expression in haemocytes, the relative expression of VpTCTP mRNA was down-regulated sharply from 6 h to 12 h post-infection. The expression level was then markedly up-regulated and reached 3.4-fold that of the control group at 48 h post-challenge. As time progressed, the expression of VpTCTP recovered to the original level at 96 h. All these results indicated that VpTCTP was an acute-phase protein involved in the immune response of V. philippinarum. (c) 2010 Elsevier Ltd. All rights reserved.
Abstract:
Karwath, A., King, R. Homology induction: the use of machine learning to improve sequence similarity searches. BMC Bioinformatics. 23rd April 2002. 3:11. Additional File: Describes the title organisms species declaration in one string [http://www.biomedcentral.com/content/supplementary/1471-2105-3-11-S1.doc]. Sponsorship: Andreas Karwath and Ross D. King were supported by the EPSRC grant GR/L62849.
Abstract:
T. Boongoen and Q. Shen. Semi-Supervised OWA Aggregation for Link-Based Similarity Evaluation and Alias Detection. Proceedings of the 18th International Conference on Fuzzy Systems (FUZZ-IEEE'09), pp. 288-293, 2009. Sponsorship: EPSRC
Abstract:
Matthew J. Nicholson, Michael K. Theodorou and Jayne L. Brookman. (2005). Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome. Microbiology, 151 (1), 121-133. Sponsorship: BBSRC RAE2008
Abstract:
Bacteriophages, viruses that infect bacteria, are uniformly present wherever bacteria occur in high numbers, both in the external environment and in the human body. Knowledge of their diversity is limited by the difficulty of culturing the host species and by the lack of a universal marker gene present in all viruses. Metagenomics is a powerful tool that can be used to analyse viral communities in their natural environments. The aim of this study was to investigate diverse populations of uncultured viruses from clinical samples (sputum from a patient with cystic fibrosis, CF) and environmental samples (sludge from a dairy food wastewater treatment plant) containing rich bacterial populations, using genetic and metagenomic analyses. Metagenomic sequencing of viruses obtained from these samples revealed that the majority of the metagenomic reads (97-99%) were novel when compared to the NCBI protein database using BLAST. A large proportion of assembled contigs were assignable as novel phages or uncharacterised prophages, the next largest assignable group being single-stranded eukaryotic virus genomes. Sputum from a cystic fibrosis patient contained DNA typical of phages of bacteria that are traditionally involved in CF lung infections and of other bacteria that are part of the normal oral flora. The only eukaryotic virus detected in the CF sputum was Torque Teno virus (TTV). A substantial number of assigned sequences from dairy wastewater could be affiliated with phages of bacteria that are typically found in soil and aquatic environments, including wastewater. Eukaryotic viral sequences were dominated by plant pathogens from the Geminiviridae and Nanoviridae families, and animal pathogens from the Circoviridae family. Antibiotic resistance genes were detected in both metagenomes, suggesting that phages could be a source of transmissible antimicrobial resistance. Overall, the diversity of viruses in the CF sputum was low, with 89 distinct viral genotypes predicted, and higher (409 genotypes) in the wastewater. Function-based screening of a metagenomic library constructed from DNA extracted from dairy food wastewater viruses revealed candidate promoter sequences that have the ability to drive expression of GFP in a promoter-trap vector in Escherichia coli. The majority of the cloned DNA sequences selected by the assay were related to ssDNA circular eukaryotic viruses and phages, which formed a minority of the metagenome assembly, and many lacked any significant homology to known database sequences. The natural diversity of bacteriophages in wastewater samples was also examined by PCR amplification of the major capsid protein sequences, which are conserved within T4-type bacteriophages from the Myoviridae family. Phylogenetic analysis of capsid sequences revealed that dairy wastewater contained mainly diverse and uncharacterized phages, while some showed a high level of similarity with phages from geographically distant environments.
Abstract:
Meta-analyses of genome-wide association studies (GWAS) have demonstrated that the same genetic variants can be associated with multiple diseases and other complex traits. We present software called CPAG (Cross-Phenotype Analysis of GWAS) to look for similarities between 700 traits, build trees with informative clusters, and highlight underlying pathways. Clusters are consistent with pre-defined groups and literature-based validation but also reveal novel connections. We report similarity between plasma palmitoleic acid and Crohn's disease and find that specific fatty acids exacerbate enterocolitis in zebrafish. CPAG will become increasingly powerful as more genetic variants are uncovered, leading to a deeper understanding of complex traits. CPAG is freely available at www.sourceforge.net/projects/CPAG/.
Abstract:
Coccolithophores are the largest source of calcium carbonate in the oceans and are considered to play an important role in oceanic carbon cycles. Current methods to detect the presence of coccolithophore blooms from Earth observation data often produce high numbers of false positives in shelf seas and coastal zones due to the spectral similarity between coccolithophores and other suspended particulates. Current methods are therefore unable to characterise the bloom events in shelf seas and coastal zones, despite the importance of these phytoplankton in the global carbon cycle. A novel approach to detect the presence of coccolithophore blooms from Earth observation data is presented. The method builds upon previous optical work and uses a statistical framework to combine spectral, spatial and temporal information to produce maps of coccolithophore bloom extent. Validation and verification results for an area of the north east Atlantic are presented using an in situ database (N = 432) and all available SeaWiFS data for 2003 and 2004. Verification results show that the approach produces a temporal seasonal signal consistent with biological studies of these phytoplankton. Validation using the in situ coccolithophore cell count database shows a high correct recognition rate of 80% and a low false-positive rate of 0.14 (in comparison to 63% and 0.34 respectively for the established, purely spectral approach). To guide its broader use, a full sensitivity analysis for the algorithm parameters is presented.
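The two validation figures quoted above can be derived from a confusion matrix of detections against in situ observations. The sketch below is a minimal illustration that assumes "correct recognition rate" means the fraction of observed blooms that were detected and that the false-positive rate is the fraction of non-bloom observations flagged as blooms; the abstract does not define either term, and the counts are hypothetical rather than taken from the N = 432 database.

```python
def detection_rates(tp, fn, fp, tn):
    """Return (recognition rate, false-positive rate) from confusion counts.

    tp/fn: bloom observations that were detected / missed.
    fp/tn: non-bloom observations that were flagged / correctly rejected.
    """
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one bloom and one non-bloom observation")
    recognition_rate = tp / (tp + fn)
    false_positive_rate = fp / (fp + tn)
    return recognition_rate, false_positive_rate


# Hypothetical counts chosen only to reproduce rates of the quoted magnitude.
recognition, false_positive = detection_rates(tp=80, fn=20, fp=14, tn=86)
print(f"recognition rate: {recognition:.0%}, false-positive rate: {false_positive:.2f}")
```

Reporting both numbers together matters here because a purely spectral detector can raise its recognition rate simply by flagging more pixels, at the cost of more false positives in turbid coastal water.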
Abstract:
Sublittoral macrobenthic communities in the Skomer Marine Nature Reserve (SMNR), Pembrokeshire, Wales, were sampled at 10 stations in 1993, 1996, 1998, 2003, 2007 and 2009 using a Day grab and a 0.5 mm mesh. The time series is analysed using Similarities Profiles (SIMPROF) tests and associated methods. Q-mode analysis using clustering with Type 1 SIMPROF addresses multivariate structure among samples, showing that there is clear structure associated with differences among years. Inverse (r-mode) analysis using Type 2 SIMPROF decisively rejects a hypothesis that species are not associated with each other. Clustering of the variables (species) with Type 3 SIMPROF identifies groups of species which covary coherently through the time-series. The time-series is characterised by a dramatic decline in abundances and diversity between the 1993 and 1996 surveys. By 1998 there had been a shift in community composition from the 1993 situation, with different species dominating. Communities had recovered in terms of abundance and species richness, but different species dominated the community. No single factor could be identified which unequivocally explained the dramatic changes observed in the SMNR. Possible causes were the effects of dispersed oil and dispersants from the Sea Empress oil spill in February 1996 and the cessation of dredge-spoil disposal off St Anne’s Head in 1995, but the most likely cause was severe weather. With many species, and a demonstrable recovery from an impact, communities within the SMNR appear to be diverse and resilient. If attributable to natural storms, the changes observed here indicate that natural variability may be much more important than is generally taken into account in the design of monitoring programmes.
Abstract:
Aminobacter lissarensis CC495 is an aerobic facultative methylotroph capable of growth on glucose, glycerol, pyruvate and methylamine as well as the methyl halides methyl chloride and methyl bromide. Previously, cells grown on methyl chloride have been shown to express two polypeptides with apparent molecular masses of 67 and 29 kDa. The 67 kDa protein was purified and identified as a halomethane:bisulfide/halide ion methyltransferase. This study describes a single gene cluster in A. lissarensis CC495 containing the methyl halide utilisation genes cmuB, cmuA, cmuC, orf 188, paaE and hutI. The genes are arranged in the same order as, and have high similarity to, a gene cluster found in Aminobacter ciceronei IMB-1 and Hyphomicrobium chloromethanicum strain CM2, indicating that genes encoding methyl halide degradation are highly conserved in these strains. (c) 2005 Federation of European Microbiological Societies. Published by Elsevier B.V. All rights reserved.
Abstract:
We present and analyze an algorithm to measure the structural similarity of generalized trees, a new graph class which includes rooted trees. For this, we represent structural properties of graphs as strings and define the similarity of two graphs via optimal alignments of the corresponding property strings. We prove that the obtained graph similarity measures are so-called backward similarity measures. From this we find that the time complexity of our algorithm is polynomial and, hence, significantly better than the time complexity of classical graph similarity methods based on isomorphic relations. (c) 2006 Elsevier Inc. All rights reserved.
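The general idea of comparing graphs by aligning property strings can be sketched in a few lines. The example below is not the authors' algorithm: it assumes a plain rooted tree stored as a child-list dictionary, uses the breadth-first sequence of out-degrees as the property string, and scores it with a standard Needleman-Wunsch global alignment. The function names, scoring values and normalisation are all illustrative choices.

```python
from collections import deque


def degree_string(tree, root):
    """Out-degrees of a rooted tree's nodes, listed in breadth-first order."""
    order, queue = [], deque([root])
    while queue:
        node = queue.popleft()
        children = tree.get(node, [])
        order.append(len(children))
        queue.extend(children)
    return order


def alignment_score(s, t, match=1, mismatch=-1, gap=-1):
    """Optimal global alignment score; runs in O(len(s) * len(t)) time."""
    prev = [j * gap for j in range(len(t) + 1)]
    for i in range(1, len(s) + 1):
        curr = [i * gap]
        for j in range(1, len(t) + 1):
            diag = prev[j - 1] + (match if s[i - 1] == t[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def tree_similarity(tree_a, root_a, tree_b, root_b):
    """Alignment score divided by the longer string length (1.0 if identical)."""
    s, t = degree_string(tree_a, root_a), degree_string(tree_b, root_b)
    return alignment_score(s, t) / max(len(s), len(t))


# Two hypothetical four-node trees: a star and a path.
star = {"r": ["a", "b", "c"]}
path = {"r": ["a"], "a": ["b"], "b": ["c"]}

same = tree_similarity(star, "r", star, "r")
different = tree_similarity(star, "r", path, "r")
print(same, different)
```

Because the dynamic programme fills a table whose size is the product of the two string lengths, the comparison stays polynomial, in contrast to measures that must search for isomorphic subgraphs.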