3 resultados para local sequence alignment problem
em Cochin University of Science
Resumo:
There are a number of genes involved in the regulation of functional process in marine bivalves. In the case of pearl oyster, some of these genes have major role in the immune/defence function and biomineralization process involved in the pearl formation in them. As secondary filter feeders, pearl oysters are exposed to various kinds of stressors like bacteria, viruses, pesticides, industrial wastes, toxic metals and petroleum derivatives, making susceptible to diseases. Environmental changes and ambient stress also affect non-specific immunity, making the organisms vulnerable to infections. These stressors can trigger various cellular responses in the animals in their efforts to counteract the ill effects of the stress on them. These include the expression of defence related genes which encode factors such as antioxidant genes, pattern recognition receptor proteins etc. One of the strategies to combat these problems is to get insight into the disease resistance genes, and use them for disease control and health management. Similarly, although it is known that formation of pearl in molluscs is mediated by specialized proteins which are in turn regulated by specific genes encoding them, there is a paucity of sufficient information on these genes.In view of the above facts, studies on the defence related and pearl forming genes of the pearl oyster assumes importance from the point of view of both sustainable fishery management and aquaculture. At present, there is total lack of sufficient knowledge on the functional genes and their expressions in the Indian pearl oyster Pinctada fucata. Hence this work was taken up to identify and characterize the defence related and pearl forming genes, and study their expression through molecular means, in the Indian pearl oyster Pinctada fucata which are economically important for aquaculture at the southeast coast of India. The present study has successfully carried out the molecular identification, characterization and expression analysis of defence related antioxidant enzyme genes and pattern recognition proteins genes which play vital role in the defence against biotic and abiotic stressors. Antioxidant enzyme genes viz., Cu/Zn superoxide dismutase (Cu/Zn SOD), glutathione peroxidise (GPX) and glutathione-S-transferase (GST) were studied. Concerted approaches using the various molecular tools like polymerase chain reaction (PCR), random amplification of cDNA ends (RACE), molecular cloning and sequencing have resulted in the identification and characterization of full length sequences (924 bp) of the Cu/Zn SOD, most important antioxidant enzyme gene. BLAST search in NCBI confirmed the identity of the gene as Cu/Zn SOD. The presence of the characteristic amino acid sequences such as copper/zinc binding residues, family signature sequences and signal peptides were found out. Multiple sequence alignment comparison and phylogenetic analysis of the nucleotide and amino acid sequences using bioinformatics tools like BioEdit,MEGA etc revealed that the sequences were found to contain regions of diversity as well as homogeneity. Close evolutionary relationship between P. fucata and other aquatic invertebrates was revealed from the phylogenetic tree constructed using SOD amino acid sequence of P. fucata and other invertebrates as well as vertebrates
Characterization and Pathogenicity of Vibrio cholerae and Vibrio vulnificus from Marine environments
Resumo:
The genus Vibrioof the family Vibrionaceae are Gram negative, oxidasepositive, rod- or curved- rodshaped facultative anaerobes, widespread in marine and estuarine environments. Vibrio species are opportunistic human pathogens responsible for diarrhoeal disease, gastroenteritis, septicaemia and wound infections and are also pathogens of aquatic organisms, causing infections to crustaceans, bivalves and fishes. In the present study, marine environmental samples like seafood and water and sediment samples from aquafarms and mangroves were screened for the presence of Vibrio species. Of the134 isolates obtained from the various samples, 45 were segregated to the genus Vibrio on the basis of phenotypic characterization.like Gram staining, oxidase test, MoF test and salinity tolerance. Partial 16S rDNA sequence analysis was utilized for species level identification of the isolates and the strains were identified as V. cholerae(N=21), V. vulnificus(N=18), V. parahaemolyticus(N=3), V. alginolyticus (N=2) and V. azureus (N=1). The genetic relatedness and variations among the 45 Vibrio isolates were elucidated based on 16S rDNA sequences. Phenotypic characterization of the isolates was based on their response to 12 biochemical tests namely Voges-Proskauers’s (VP test), arginine dihydrolase , tolerance to 3% NaCl test, ONPG test that detects β-galactosidase activity, and tests for utilization of citrate, ornithine, mannitol, arabinose, sucrose, glucose, salicin and cellobiose. The isolates exhibited diverse biochemical patterns, some specific for the species and others indicative of their environmental source.Antibiogram for the isolates was determined subsequent to testing their susceptibility to 12 antibiotics by the disc diffusion method. Varying degrees of resistance to gentamycin (2.22%), ampicillin(62.22%), nalidixic acid (4.44%), vancomycin (86.66), cefixime (17.77%), rifampicin (20%), tetracycline (42.22%) and chloramphenicol (2.22%) was exhibited. All the isolates were susceptible to streptomycin, co-trimoxazole, trimethoprim and azithromycin. Isolates from all the three marine environments exhibited multiple antibiotic resistance, with high MAR index value. The molecular typing methods such as ERIC PCR and BOX PCR revealed intraspecies relatedness and genetic heterogeneity within the environmental isolatesof V. cholerae and V. vulnificus. The 21 strains of V. choleraewere serogroupedas non O1/ non O139 by screening for the presence O1rfb and O139 rfb marker genes by PCR. The virulence/virulence associated genes namely ctxA, ctxB, ace, VPI, hlyA, ompU, rtxA, toxR, zot, nagst, tcpA, nin and nanwere screened in V. cholerae and V. vulnificusstrains.The V. vulnificusstrains were also screened for three species specific genes viz., cps, vvhand viu. In V. cholerae strains, the virulence associated genes like VPI, hlyA, rtxA, ompU and toxR were confirmed by PCR. All the isolates, except for strain BTOS6, harbored at least one or a combination of the tested genes and V. choleraestrain BTPR5 isolated from prawn hosted the highest number of virulence associated genes. Among the V. vulnificusstrains, only 3 virulence genes, VPI, toxR and cps, were confirmed out of the 16 tested and only 7 of the isolates had these genes in one or more combinations. Strain BTPS6 from aquafarm and strain BTVE4 from mangrove samples yielded positive amplification for the three genes. The toxRgene from 9 strains of V. choleraeand 3 strains of V. vulnificus were cloned and sequenced for phylogenetic analysis based on nucleotide and the amino acid sequences. Multiple sequence alignment of the nucleotide sequences and amino acid sequences of the environmental strains of V. choleraerevealed that the toxRgene in the environmental strains are 100% homologous to themselves and to the V. choleraetoxR gene sequence available in the Genbank database. The 3 strains of V. vulnificus displayed high nucleotide and amino acid sequence similarity among themselves and to the sequences of V. cholerae and V. harveyi obtained from the GenBank database, but exhibited only 72% homology to the sequences of its close relative V. vulnificus. Structure prediction of the ToxR protein of Vibrio cholerae strain BTMA5 was by PHYRE2 software. The deduced amino acid sequence showed maximum resemblance with the structure of DNA-binding domain of response regulator2 from Escherichia coli k-12 Template based homology modelling in PHYRE2 successfully modelled the predicted protein and its secondary structure based on protein data bank (PDB) template c3zq7A. The pathogenicity studies were performed using the nematode Caenorhabditiselegansas a model system. The assessment of pathogenicity of environmental strain of V. choleraewas conducted with E. coli strain OP50 as the food source in control plates, environmental V. cholerae strain BTOS6, negative for all tested virulence genes, to check for the suitability of Vibrio sp. as a food source for the nematode;V. cholerae Co 366 ElTor, a clinical pathogenic strain and V. cholerae strain BTPR5 from seafood (Prawn) and positive for the tested virulence genes like VPI, hlyA, ompU,rtxA and toxR. It was found that V. cholerae strain BTOS6 could serve as a food source in place of E. coli strain OP50 but behavioral aberrations like sluggish movement and lawn avoidance and morphological abnormalities like pharyngeal and intestinal distensions and bagging were exhibited by the worms fed on V. cholerae Co 366 ElTor strain and environmental BTPR5 indicating their pathogenicity to the nematode. Assessment of pathogenicity of the environmental strains of V. vulnificus was performed with V. vulnificus strain BTPS6 which tested positive for 3 virulence genes, namely, cps, toxRand VPI, and V. vulnificus strain BTMM7 that did not possess any of the tested virulence genes. A reduction was observed in the life span of worms fed on environmental strain of V. vulnificusBTMM7 rather than on the ordinary laboratory food source, E. coli OP50. Behavioral abnormalities like sluggish movement, lawn avoidance and bagging were also observed in the worms fed with strain BTPS6, but the pharynx and the intestine were intact. The presence of multi drug resistant environmental Vibrio strainsthat constitute a major reservoir of diverse virulence genes are to be dealt with caution as they play a decisive role in pathogenicity and horizontal gene transfer in the marine environments.
Resumo:
Computational Biology is the research are that contributes to the analysis of biological data through the development of algorithms which will address significant research problems.The data from molecular biology includes DNA,RNA ,Protein and Gene expression data.Gene Expression Data provides the expression level of genes under different conditions.Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences which in turn are later translated into proteins.The number of copies of mRNA produced is called the expression level of a gene.Gene expression data is organized in the form of a matrix. Rows in the matrix represent genes and columns in the matrix represent experimental conditions.Experimental conditions can be different tissue types or time points.Entries in the gene expression matrix are real values.Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes such as similarity of their behavior,nature of their interaction,their respective contribution to the same pathways and so on. Similar expression patterns are exhibited by the genes participating in the same biological process.These patterns have immense relevance and application in bioinformatics and clinical research.Theses patterns are used in the medical domain for aid in more accurate diagnosis,prognosis,treatment planning.drug discovery and protein network analysis.To identify various patterns from gene expression data,data mining techniques are essential.Clustering is an important data mining technique for the analysis of gene expression data.To overcome the problems associated with clustering,biclustering is introduced.Biclustering refers to simultaneous clustering of both rows and columns of a data matrix. Clustering is a global whereas biclustering is a local model.Discovering local expression patterns is essential for identfying many genetic pathways that are not apparent otherwise.It is therefore necessary to move beyond the clustering paradigm towards developing approaches which are capable of discovering local patterns in gene expression data.A biclusters is a submatrix of the gene expression data matrix.The rows and columns in the submatrix need not be contiguous as in the gene expression data matrix.Biclusters are not disjoint.Computation of biclusters is costly because one will have to consider all the combinations of columans and rows in order to find out all the biclusters.The search space for the biclustering problem is 2 m+n where m and n are the number of genes and conditions respectively.Usually m+n is more than 3000.The biclustering problem is NP-hard.Biclustering is a powerful analytical tool for the biologist.The research reported in this thesis addresses the problem of biclustering.Ten algorithms are developed for the identification of coherent biclusters from gene expression data.All these algorithms are making use of a measure called mean squared residue to search for biclusters.The objective here is to identify the biclusters of maximum size with the mean squared residue lower than a given threshold. All these algorithms begin the search from tightly coregulated submatrices called the seeds.These seeds are generated by K-Means clustering algorithm.The algorithms developed can be classified as constraint based,greedy and metaheuristic.Constarint based algorithms uses one or more of the various constaints namely the MSR threshold and the MSR difference threshold.The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum.In metaheuristic approaches particle Swarm Optimization(PSO) and variants of Greedy Randomized Adaptive Search Procedure(GRASP) are used for the identification of biclusters.These algorithms are implemented on the Yeast and Lymphoma datasets.Biologically relevant and statistically significant biclusters are identified by all these algorithms which are validated by Gene Ontology database.All these algorithms are compared with some other biclustering algorithms.Algorithms developed in this work overcome some of the problems associated with the already existing algorithms.With the help of some of the algorithms which are developed in this work biclusters with very high row variance,which is higher than the row variance of any other algorithm using mean squared residue, are identified from both Yeast and Lymphoma data sets.Such biclusters which make significant change in the expression level are highly relevant biologically.