7 results for Protein Sequence Analysis

at Cochin University of Science


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Modern computer systems are plagued with stability and security problems: applications lose data, web servers are hacked, and systems crash under heavy load. Many of these problems or anomalies arise from rare program behavior caused by attacks or errors. A substantial percentage of web-based attacks are due to buffer overflows. Many methods have been devised to detect and prevent anomalous situations that arise from buffer overflows. The current state of the art in anomaly detection systems is relatively primitive and depends mainly on static code checking to take care of buffer overflow attacks. For protection, Stack Guards and Heap Guards are also widely used. This dissertation proposes an anomaly detection system based on the frequencies of system calls in the system call trace. System call traces represented as frequency sequences are profiled using sequence sets. A sequence set is identified by the starting sequence and the frequencies of specific system calls. The deviation of the current input sequence from the corresponding normal profile in the frequency pattern of system calls is computed and expressed as an anomaly score. A simple Bayesian model is used for accurate detection. Experimental results are reported which show that the frequency of system calls, represented using sequence sets, captures the behavior of programs under normal conditions of usage. This captured behavior allows the system to detect anomalies with a low rate of false positives. Data are presented which show that a Bayesian network on frequency variations responds effectively to induced buffer overflows. It can also help administrators detect deviations in program flow introduced by errors.
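The idea of scoring a trace by how far its system call frequencies deviate from a normal profile can be illustrated with a minimal Python sketch. This is not the dissertation's method: the sequence sets and the Bayesian model are not reproduced, the score is simply the sum of absolute differences in relative frequency, and the function names and example traces are hypothetical.

```python
from collections import Counter


def call_frequencies(trace):
    """Return the relative frequency of each system call in a trace."""
    total = len(trace)
    if total == 0:
        return {}
    counts = Counter(trace)
    return {call: n / total for call, n in counts.items()}


def anomaly_score(normal_profile, trace):
    """Sum of absolute frequency differences (an assumed, illustrative score).

    0.0 means the trace matches the profile exactly; larger values mean
    the frequency pattern deviates more from normal behavior.
    """
    observed = call_frequencies(trace)
    calls = set(normal_profile) | set(observed)
    return sum(
        abs(observed.get(c, 0.0) - normal_profile.get(c, 0.0)) for c in calls
    )


# Hypothetical traces: one normal run and one with unexpected execve calls.
normal = call_frequencies(["open", "read", "read", "write", "close"])
same = anomaly_score(normal, ["open", "read", "read", "write", "close"])
odd = anomaly_score(normal, ["open", "read", "execve", "execve", "execve"])
```

In a real detector the threshold separating normal from anomalous scores would have to be learned from data; here the two scores only show that an unfamiliar call pattern scores higher than a familiar one.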

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The resurgence of the enteric pathogen Vibrio cholerae, the causative organism of epidemic cholera, remains a major health problem in many developing countries such as India. Cholera is endemic in the southern Indian state of Kerala. Outbreaks of cholera follow a seasonal pattern in regions of endemicity. Marine aquaculture settings and mangrove environments of Kerala serve as reservoirs for V. cholerae. The non-O1/non-O139 environmental isolates of V. cholerae with an incomplete ‘virulence cassette’ must be handled with caution, as they constitute a major reservoir of diverse virulence genes in the marine environment and play a crucial role in pathogenicity and horizontal gene transfer. The genes coding for cholera toxin are borne on, and can be infectiously transmitted by, CTXΦ, a filamentous lysogenic vibriophage. Temperate phages can provide crucial virulence and fitness factors affecting cell metabolism, bacterial adhesion, colonization, immunity, antibiotic resistance and serum resistance. The present study was an attempt to screen marine environments such as aquafarms and mangroves in the coastal areas of Alappuzha and Cochin, Kerala, for the presence of lysogenic V. cholerae, and to study their pathogenicity and gene transfer potential. Phenotypic and molecular methods were used to identify isolates as V. cholerae. The thirty-one isolates which were Gram negative, oxidase positive and fermentative, with or without gas production on MOF media, and which showed yellow colonies on TCBS (Thiosulfate Citrate Bile salt Sucrose) agar were segregated as vibrios. Twenty-two environmental V. cholerae strains of both O1 and non-O1/non-O139 serogroups showed the presence of lysogenic phages on induction with mitomycin C. They produced characteristic turbid plaques in the double agar overlay assay using the indicator strain V. cholerae El Tor MAK 757.
PCR-based molecular typing with primers targeting specific conserved sequences in the bacterial genome demonstrated genetic diversity among these lysogen-containing non-O1 V. cholerae. Polymerase chain reaction was also employed as a rapid screening method to verify the presence of 9 virulence genes, namely ctxA, ctxB, ace, hlyA, toxR, zot, tcpA, ninT and nanH, using gene-specific primers. The presence of the tcpA gene in ALPVC3 was alarming, as it indicates the possibility of an epidemic by accepting the cholera. Differential induction studies used ΦALPVC3, ΦALPVC11, ΦALPVC12 and ΦEKM14, underlining the possibility of prophage induction in natural ecosystems due to abiotic factors such as antibiotics, pollutants, temperature and UV. The efficiency of prophage induction varied considerably in response to the different induction agents. The growth curve of the lysogenic V. cholerae used in the study varied drastically in the presence of strong prophage inducers such as antibiotics and UV. Bacterial cell lysis was directly proportional to the increase in phage number due to induction. Morphological characterization of the vibriophages by Transmission Electron Microscopy revealed hexagonal heads for all four phages. Vibriophage ΦALPVC3 exhibited the isometric head and contractile tail characteristic of family Myoviridae, while phages ΦALPVC11 and ΦALPVC12 demonstrated the typical hexagonal head and non-contractile tail of family Siphoviridae. ΦEKM14, the podophage, was distinguished by a short non-contractile tail and an icosahedral head. This work demonstrated that environmental parameters can influence the viability and cell adsorption rates of V. cholerae phages. Adsorption studies showed 100% adsorption of ΦALPVC3, ΦALPVC11, ΦALPVC12 and ΦEKM14 after 25, 30, 40 and 35 minutes, respectively. Exposure to high temperatures ranging from 50 °C to 100 °C drastically reduced phage viability.
The optimum concentration of NaCl required for survival of the vibriophages was 0.5 M, except for ΦEKM14, for which it was 1 M. Survival of phage particles was maximum at pH 7-8. V. cholerae is assumed to have existed long before its human host, so the pathogenic clones may have evolved from aquatic forms which later colonized the human intestine by progressive acquisition of genes. This is supported by the fact that the vast majority of V. cholerae strains are still part of the natural aquatic environment. CTXΦ has played a critical role in the evolution of the pathogenicity of V. cholerae, as it can transmit the ctxAB gene. The unusual transformation of V. cholerae strains associated with epidemics and the emergence of V. cholerae O139 demonstrate the evolutionary success of the organism in attaining greater fitness. Genetic changes in pathogenic V. cholerae constitute a natural process for developing immunity within an endemically infected population. The alternative hosts and lysogenic environmental V. cholerae strains may potentially act as cofactors in promoting cholera phage “blooms” within aquatic environments, thereby influencing transmission of phage-sensitive, pathogenic V. cholerae strains by aquatic vehicles. Differential induction of the phages is a clear indication of the impact of environmental pollution and global changes on phage induction. The development of molecular biology techniques has offered an accessible gateway for investigating the molecular events leading to genetic diversity in the marine environment. Using nucleic acids as targets, fingerprinting methods such as ERIC PCR and BOX PCR revealed that the marine environment harbours a potentially pathogenic group of bacteria with genetic diversity. The distribution of virulence-associated genes in the environmental isolates of V. cholerae provides tangible material for further investigation.
Nucleotide and protein sequence analysis, along with protein structure prediction, aid in better understanding the variation in alleles of the same gene in different ecological niches and its impact on protein structure for attaining greater fitness of pathogens. The evidence of co-evolution of virulence genes in toxigenic V. cholerae O1 from different lineages of environmental non-O1 strains is alarming. Transduction studies indicate that the acquisition of these virulence genes by lateral gene transfer, although rare, is not altogether uncommon among non-O1/non-O139 V. cholerae, and that it has a key role in diversification. All these considerations justify the need for an integrated approach towards the development of an effective surveillance system to monitor the evolution of V. cholerae strains with epidemic potential. The results presented in this study, considered together with the mechanism proposed above, strongly suggest that the bacteriophage also intervenes as a variable in shaping the cholera bacterium, a factor that cannot be ignored and that hints at imminent future epidemics.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Computational Biology is the research area that contributes to the analysis of biological data through the development of algorithms which address significant research problems. The data from molecular biology include DNA, RNA, protein and gene expression data. Gene expression data provide the expression levels of genes under different conditions. Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences, which in turn are later translated into proteins. The number of copies of mRNA produced is called the expression level of a gene. Gene expression data are organized in the form of a matrix. Rows in the matrix represent genes and columns represent experimental conditions. Experimental conditions can be different tissue types or time points. Entries in the gene expression matrix are real values. Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes, such as the similarity of their behavior, the nature of their interaction, their respective contributions to the same pathways and so on. Similar expression patterns are exhibited by genes participating in the same biological process. These patterns have immense relevance and application in bioinformatics and clinical research. They are used in the medical domain to aid more accurate diagnosis, prognosis, treatment planning, drug discovery and protein network analysis. To identify various patterns from gene expression data, data mining techniques are essential. Clustering is an important data mining technique for the analysis of gene expression data. To overcome the problems associated with clustering, biclustering is introduced. Biclustering refers to the simultaneous clustering of both rows and columns of a data matrix.
Clustering is a global model, whereas biclustering is a local model. Discovering local expression patterns is essential for identifying many genetic pathways that are not apparent otherwise. It is therefore necessary to move beyond the clustering paradigm towards approaches capable of discovering local patterns in gene expression data. A bicluster is a submatrix of the gene expression data matrix. The rows and columns in the submatrix need not be contiguous in the gene expression data matrix. Biclusters are not disjoint. Computation of biclusters is costly because one has to consider all combinations of columns and rows in order to find all the biclusters. The search space for the biclustering problem is 2^(m+n), where m and n are the number of genes and conditions, respectively. Usually m+n is more than 3000. The biclustering problem is NP-hard. Biclustering is a powerful analytical tool for the biologist. The research reported in this thesis addresses the problem of biclustering. Ten algorithms are developed for the identification of coherent biclusters from gene expression data. All these algorithms use a measure called mean squared residue to search for biclusters. The objective is to identify biclusters of maximum size with a mean squared residue lower than a given threshold.
All these algorithms begin the search from tightly coregulated submatrices called seeds. These seeds are generated by the K-Means clustering algorithm. The algorithms developed can be classified as constraint based, greedy and metaheuristic. Constraint-based algorithms use one or more of the various constraints, namely the MSR threshold and the MSR difference threshold. The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum. In the metaheuristic approaches, Particle Swarm Optimization (PSO) and variants of the Greedy Randomized Adaptive Search Procedure (GRASP) are used for the identification of biclusters. These algorithms are implemented on the Yeast and Lymphoma datasets. Biologically relevant and statistically significant biclusters are identified by all these algorithms and validated against the Gene Ontology database. All these algorithms are compared with some other biclustering algorithms. The algorithms developed in this work overcome some of the problems associated with existing algorithms. With the help of some of the algorithms developed in this work, biclusters with very high row variance, higher than the row variance obtained by any other algorithm using mean squared residue, are identified from both the Yeast and Lymphoma datasets. Such biclusters, which show significant change in expression level, are highly relevant biologically.
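The mean squared residue (MSR) that the algorithms above minimize can be made concrete with a short Python sketch. It assumes the commonly used definition, in which the residue of an entry is the entry minus its row mean, minus its column mean, plus the overall mean of the submatrix; the thesis's ten search algorithms are not reproduced, and the function name and toy matrices are hypothetical.

```python
def mean_squared_residue(matrix, rows, cols):
    """MSR of the bicluster given by row indices `rows` and column indices `cols`.

    Assumed definition: the average over the submatrix of
    (a_ij - row_mean_i - col_mean_j + overall_mean) ** 2.
    """
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_rows, n_cols = len(rows), len(cols)
    row_means = [sum(r) / n_cols for r in sub]
    col_means = [sum(sub[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    overall = sum(row_means) / n_rows
    total = sum(
        (sub[i][j] - row_means[i] - col_means[j] + overall) ** 2
        for i in range(n_rows)
        for j in range(n_cols)
    )
    return total / (n_rows * n_cols)


# Toy expression matrix: rows 0-2 differ only by a constant shift over
# columns 0-2, so that submatrix is perfectly coherent.
expr = [
    [1.0, 2.0, 3.0, 9.0],
    [2.0, 3.0, 4.0, 0.0],
    [5.0, 6.0, 7.0, 4.0],
]
msr_coherent = mean_squared_residue(expr, [0, 1, 2], [0, 1, 2])

# A 2x2 pattern in which the rows move in opposite directions.
msr_incoherent = mean_squared_residue([[1.0, 2.0], [2.0, 1.0]], [0, 1], [0, 1])
```

A bicluster whose rows differ only by an additive shift has an MSR of zero, which is why a threshold on MSR selects coherent submatrices. Because a constant submatrix also scores zero, row variance is a natural companion measure, as the abstract notes.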

Relevance:

90.00% 90.00%

Publisher:

Abstract:

DNA sequence representation methods are used to denote a gene structure effectively and help in the analysis of similarities and dissimilarities between coding sequences. Many different kinds of representations have been proposed in the literature. They can be broadly classified into Numerical, Graphical, Geometrical and Hybrid representation methods. DNA structure and function analysis are made easier by graphical and geometrical representation methods, since they give a visual representation of a DNA structure. In numerical methods, numerical values are assigned to a sequence and digital signal processing methods are used to analyze the sequence. Hybrid approaches to analyzing DNA sequences are also reported in the literature. This paper reviews the latest developments in DNA sequence representation methods. We also present a taxonomy of the various methods. A comparison of these methods is made wherever possible.
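As an illustration of what a numerical representation looks like, the sketch below maps a DNA string to four binary indicator sequences, one per base. This is one well-known scheme chosen for brevity; whether the paper covers it, and under what name, is not stated in the abstract, and the function name is hypothetical.

```python
def indicator_sequences(dna):
    """Map a DNA string to four 0/1 sequences, one per nucleotide.

    Position k of the sequence for base X is 1 if dna[k] == X, else 0.
    Characters other than A, C, G, T map to 0 in all four sequences.
    """
    dna = dna.upper()
    return {base: [1 if ch == base else 0 for ch in dna] for base in "ACGT"}


# Hypothetical short fragment.
signals = indicator_sequences("ACGTA")
```

Each of the four lists is an ordinary numeric signal, so standard digital signal processing tools such as a discrete Fourier transform can then be applied to it.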

Relevance:

90.00% 90.00%

Publisher:

Abstract:

Lignocellulosic biomass is probably the best alternative resource for biofuel production, and it is composed mainly of cellulose, hemicelluloses and lignin. Cellulose is the most abundant of the three, and the conversion of cellulose to glucose is catalyzed by the enzyme cellulase. Cellulases are groups of enzymes that act synergistically upon cellulose to produce glucose and comprise endoglucanase, cellobiohydrolase and β-glucosidase. β-Glucosidase assumes great importance because it is the rate-limiting enzyme. Endoglucanases (EG) produce nicks in the cellulose polymer, exposing reducing and non-reducing ends; cellobiohydrolases (CBH) act upon the reducing or non-reducing ends to liberate cellobiose units; and β-glucosidases (BGL) cleave the cellobiose to liberate glucose, completing the hydrolysis. β-Glucosidases undergo feedback inhibition by their own product, β-glucose, and by cellobiose, which is their substrate. Few filamentous fungi produce glucose-tolerant β-glucosidases, which can overcome this inhibition by tolerating the product concentration up to a particular threshold. The present study targeted a filamentous fungus producing glucose-tolerant β-glucosidase, which was identified by morphological as well as molecular methods. The fungus showed 99% similarity to an Aspergillus unguis strain, which comes under the Aspergillus nidulans group, to which most glucose-tolerant β-glucosidase producers belong. The culture was designated strain number NII 08123 and was deposited in the NII culture collection at CSIR-NIIST. β-Glucosidase multiplicity is a common occurrence in the fungal world, and in A. unguis this was demonstrated using zymogram analysis. A total of 5 extracellular isoforms were detected in the fungus, and the expression levels of these five isoforms varied with the carbon source available in the medium.
Three of these 5 isoforms were expressed at higher levels, as identified by increased fluorescence (due to larger amounts of MUG breakdown by enzyme action), and were speculated to contribute significantly to the total β-glucosidase activity. These isoforms were named BGL1, BGL3 and BGL5. Among the three, BGL5 was demonstrated to be the glucose-tolerant β-glucosidase, and it was a low molecular weight protein. The major fraction was a high molecular weight protein but with lesser tolerance to glucose. BGL3 was between the two in both activity and glucose tolerance. The glucose-tolerant β-glucosidase was purified and characterized, and kinetic analysis showed that the glucose inhibition constant (Ki) of the protein is 800 mM, and that the Km and Vmax of the enzyme are 4.854 mM and 2.946 mol min-1 mg protein-1, respectively. The optimum temperature was 60 °C and the optimum pH 6.0. The molecular weight of the purified protein was ~10 kDa in both SDS and Native PAGE, indicating that the glucose-tolerant BGL is a monomeric protein. The major β-glucosidase, BGL1, had pH and temperature optima of 5.0 and 60 °C, respectively. The apparent molecular weight of the native protein is 240 kDa. The Vmax and Km were 78.8 mol min-1 mg protein-1 and 0.326 mM, respectively. Degenerate primers were designed for glycosyl hydrolase families 1, 3 and 5, and the BGL genes were amplified from genomic DNA of Aspergillus unguis. Sequence analyses performed on the amplicons confirmed the presence of all three genes. An amplicon of ~500 bp was sequenced and matched a GH1 BGL from Aspergillus oryzae. The amplicons produced by the GH3 degenerate primers were sequenced, and the sequences matched GH3 family β-glucosidases from Aspergillus nidulans and Aspergillus aculeatus.
The GH5 degenerate primers also gave amplification, and sequencing results indicated the presence of a GH5 family BGL gene in the Aspergillus unguis genomic DNA. From the partial gene sequencing results, specific as well as degenerate primers were designed for TAIL PCR. Sequencing results of the 1.0 kb amplicon matched the Aspergillus nidulans β-glucosidase gene, which belongs to the GH1 family. The sequence mainly covered the N-terminal region of the matching peptide. All three BGL proteins, i.e. BGL1, BGL3 and BGL5, were purified by chromatography and electroelution from Native PAGE gels and were subjected to MALDI-TOF mass spectrometric analysis. The results showed that the BGL1 peptide mass matched β-glucosidase-I of Aspergillus flavus, which is a 92 kDa protein, with 69% protein coverage. The glucose-tolerant β-glucosidase BGL5 mass matched the catalytic C-terminal domain of β-glucosidase-F from Emericella nidulans, but the protein coverage was very low compared to the size of the Emericella nidulans protein. Relative to the size of BGL5 from Aspergillus unguis, the protein sequence coverage is more than 80%. BGL F is a glycosyl hydrolase family 3 protein. The properties of BGL5 seem to be unique, in that it is a GH3 β-glucosidase with a very low molecular weight of ~10 kDa that nonetheless has catalytic activity and glucose tolerance, which is as yet undescribed in GH β-glucosidases. The occurrence of a fully functional 10 kDa protein with glucose-tolerant BGL activity has tremendous implications, both for understanding structure-function relationships and for applications of BGL enzymes. BGL3 showed similarity to BGL1 of Aspergillus aculeatus, which is another GH3 β-glucosidase. It may be noted that though PCR could detect GH1, GH3 and GH5 β-glucosidases in the fungus, the major isoforms BGL1, BGL3 and BGL5 were all GH3 family enzymes.
This would imply that β-glucosidases belonging to other families may also co-exist in the fungus, and the other minor isoforms detected in zymograms may account for them. In biomass hydrolysis, the BGL enzyme preparation containing GT-BGL was added to cellulase, and the performance of the blends was compared with a cocktail in which commercial β-glucosidase was added to the biomass-hydrolyzing enzyme preparation. The cocktail supplemented with the A. unguis BGL preparation yielded 555 mg/g sugar in 12 h, compared to the commercial enzyme preparation, which gave only 333 mg/g in the same period, and the maximum sugar yield of 858 mg/g was attained in 36 h by the cocktail containing A. unguis BGL. While the commercial enzyme achieved an almost similar sugar yield in 24 h, there was a rapid drop in sugar concentration after that, probably indicating the conversion of glucose back to di- or oligosaccharides by the transglycosylation activity of the BGL in that preparation. Compared to this, the preparation containing the A. unguis enzyme supported peak yields for a longer duration (up to 48 h), which is important for biomass conversion to other products, since the hydrolysate has to undergo certain unit operations before it goes into the next stage, i.e. fermentation, in any bioprocess for the production of either fuels or chemicals. Most importantly, the Aspergillus unguis BGL preparation yields an approximately 1.6-fold increase in sugar release compared to the commercial BGL within 12 h, and a 2.25-fold increase in sugar release compared to the control, i.e. cellulase without BGL supplementation. The current study therefore leads to the identification of a potent new isolate producing glucose-tolerant β-glucosidase.
The organism, identified as Aspergillus unguis, comes under the Aspergillus nidulans group, to which most of the GT-BGL producers belong, and the detailed studies showed that the glucose-tolerant β-glucosidase was a very low molecular weight protein which probably belongs to glycosyl hydrolase family 3. Inhibition kinetic studies helped to determine the Ki, which is the second highest among the nidulans group of Aspergilli. This has prompted us to undertake a detailed study of the mechanism of glucose tolerance. The proteomic analyses clearly indicate the presence of a GH3 catalytic domain in the protein. Since the protein is very small yet still active and glucose tolerant, it is speculated that this could be an entirely new protein or a modification of the existing β-glucosidase with only the catalytic domain present. Hydrolysis experiments also qualify this BGL as a suitable candidate for enzyme cocktail development for biomass hydrolysis.
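The kinetic constants reported above for the glucose-tolerant enzyme (Km = 4.854 mM, Vmax = 2.946, Ki = 800 mM) can be put into a rate equation to see what a high Ki means in practice. The sketch below assumes Michaelis-Menten kinetics with purely competitive inhibition by glucose; the abstract does not state the inhibition mechanism, so this model is an assumption, and the function name is hypothetical. Vmax is used in the units printed in the abstract.

```python
KM_MM = 4.854      # Km from the abstract, in mM
VMAX = 2.946       # Vmax from the abstract, units as printed there
KI_MM = 800.0      # glucose inhibition constant from the abstract, in mM


def reaction_rate(substrate_mm, glucose_mm=0.0,
                  vmax=VMAX, km=KM_MM, ki=KI_MM):
    """Rate under Michaelis-Menten kinetics with competitive inhibition.

    Assumed model: v = Vmax * S / (Km * (1 + I / Ki) + S).
    """
    apparent_km = km * (1.0 + glucose_mm / ki)
    return vmax * substrate_mm / (apparent_km + substrate_mm)


# Without glucose, the rate at S = Km is half of Vmax.
half_max = reaction_rate(KM_MM)

# With glucose at I = Ki, the apparent Km doubles, so half of Vmax
# is reached only at S = 2 * Km.
half_max_inhibited = reaction_rate(2 * KM_MM, glucose_mm=KI_MM)
```

Under this assumed model, glucose has to reach 800 mM before the apparent Km doubles, which is one way of reading the term glucose tolerant.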

Relevance:

90.00% 90.00%

Publisher:

Abstract:

Members of the genus Vibrio of the family Vibrionaceae are Gram negative, oxidase positive, rod- or curved-rod-shaped facultative anaerobes, widespread in marine and estuarine environments. Vibrio species are opportunistic human pathogens responsible for diarrhoeal disease, gastroenteritis, septicaemia and wound infections, and are also pathogens of aquatic organisms, causing infections in crustaceans, bivalves and fishes. In the present study, marine environmental samples such as seafood, and water and sediment samples from aquafarms and mangroves, were screened for the presence of Vibrio species. Of the 134 isolates obtained from the various samples, 45 were assigned to the genus Vibrio on the basis of phenotypic characterization such as Gram staining, oxidase test, MoF test and salinity tolerance. Partial 16S rDNA sequence analysis was used for species-level identification of the isolates, and the strains were identified as V. cholerae (N=21), V. vulnificus (N=18), V. parahaemolyticus (N=3), V. alginolyticus (N=2) and V. azureus (N=1). The genetic relatedness and variation among the 45 Vibrio isolates were elucidated based on 16S rDNA sequences. Phenotypic characterization of the isolates was based on their response to 12 biochemical tests, namely the Voges-Proskauer (VP) test, arginine dihydrolase, tolerance to 3% NaCl, the ONPG test that detects β-galactosidase activity, and tests for utilization of citrate, ornithine, mannitol, arabinose, sucrose, glucose, salicin and cellobiose. The isolates exhibited diverse biochemical patterns, some specific to the species and others indicative of their environmental source. The antibiogram for the isolates was determined by testing their susceptibility to 12 antibiotics by the disc diffusion method. Varying degrees of resistance to gentamycin (2.22%), ampicillin (62.22%), nalidixic acid (4.44%), vancomycin (86.66%), cefixime (17.77%), rifampicin (20%), tetracycline (42.22%) and chloramphenicol (2.22%) were exhibited.
All the isolates were susceptible to streptomycin, co-trimoxazole, trimethoprim and azithromycin. Isolates from all three marine environments exhibited multiple antibiotic resistance, with high MAR index values. Molecular typing methods such as ERIC PCR and BOX PCR revealed intraspecies relatedness and genetic heterogeneity within the environmental isolates of V. cholerae and V. vulnificus. The 21 strains of V. cholerae were serogrouped as non-O1/non-O139 by screening for the presence of the O1 rfb and O139 rfb marker genes by PCR. The virulence and virulence-associated genes ctxA, ctxB, ace, VPI, hlyA, ompU, rtxA, toxR, zot, nagst, tcpA, nin and nan were screened in the V. cholerae and V. vulnificus strains. The V. vulnificus strains were also screened for three species-specific genes, viz. cps, vvh and viu. In the V. cholerae strains, the virulence-associated genes VPI, hlyA, rtxA, ompU and toxR were confirmed by PCR. All the isolates, except for strain BTOS6, harbored at least one or a combination of the tested genes, and V. cholerae strain BTPR5, isolated from prawn, hosted the highest number of virulence-associated genes. Among the V. vulnificus strains, only 3 virulence genes, VPI, toxR and cps, were confirmed out of the 16 tested, and only 7 of the isolates had these genes in one or more combinations. Strain BTPS6 from aquafarm samples and strain BTVE4 from mangrove samples yielded positive amplification for the three genes. The toxR gene from 9 strains of V. cholerae and 3 strains of V. vulnificus was cloned and sequenced for phylogenetic analysis based on the nucleotide and amino acid sequences. Multiple sequence alignment of the nucleotide and amino acid sequences of the environmental strains of V. cholerae revealed that the toxR gene in the environmental strains is 100% homologous among the strains and to the V. cholerae toxR gene sequence available in the GenBank database.
The 3 strains of V. vulnificus displayed high nucleotide and amino acid sequence similarity among themselves and to the sequences of V. cholerae and V. harveyi obtained from the GenBank database, but exhibited only 72% homology to the sequences of their close relative V. vulnificus. Structure prediction of the ToxR protein of Vibrio cholerae strain BTMA5 was carried out with PHYRE2 software. The deduced amino acid sequence showed maximum resemblance to the structure of the DNA-binding domain of response regulator2 from Escherichia coli K-12. Template-based homology modelling in PHYRE2 successfully modelled the predicted protein and its secondary structure based on the Protein Data Bank (PDB) template c3zq7A. The pathogenicity studies were performed using the nematode Caenorhabditis elegans as a model system. The assessment of pathogenicity of environmental strains of V. cholerae was conducted with E. coli strain OP50 as the food source in control plates; environmental V. cholerae strain BTOS6, negative for all tested virulence genes, to check the suitability of Vibrio sp. as a food source for the nematode; V. cholerae Co 366 ElTor, a clinical pathogenic strain; and V. cholerae strain BTPR5 from seafood (prawn), positive for the tested virulence genes VPI, hlyA, ompU, rtxA and toxR. It was found that V. cholerae strain BTOS6 could serve as a food source in place of E. coli strain OP50, but behavioral aberrations such as sluggish movement and lawn avoidance, and morphological abnormalities such as pharyngeal and intestinal distensions and bagging, were exhibited by the worms fed on the V. cholerae Co 366 ElTor strain and on environmental strain BTPR5, indicating their pathogenicity to the nematode. Assessment of the pathogenicity of the environmental strains of V. vulnificus was performed with V. vulnificus strain BTPS6, which tested positive for 3 virulence genes, namely cps, toxR and VPI, and V. vulnificus strain BTMM7, which did not possess any of the tested virulence genes.
A reduction was observed in the life span of worms fed on the environmental V. vulnificus strain BTMM7 compared with worms fed on the ordinary laboratory food source, E. coli OP50. Behavioral abnormalities such as sluggish movement, lawn avoidance and bagging were also observed in the worms fed with strain BTPS6, but the pharynx and the intestine were intact. Multidrug-resistant environmental Vibrio strains, which constitute a major reservoir of diverse virulence genes, must be handled with caution, as they play a decisive role in pathogenicity and horizontal gene transfer in marine environments.
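The MAR (multiple antibiotic resistance) index mentioned above is usually computed per isolate as the number of antibiotics the isolate resists divided by the number of antibiotics tested. The Python sketch below assumes that definition, since the abstract does not spell it out; the function name and the example isolate are hypothetical, while the panel size of 12 antibiotics is taken from the abstract.

```python
def mar_index(resistant_count, tested_count):
    """MAR index of one isolate: antibiotics resisted / antibiotics tested."""
    if tested_count <= 0:
        raise ValueError("tested_count must be positive")
    if not 0 <= resistant_count <= tested_count:
        raise ValueError("resistant_count must be between 0 and tested_count")
    return resistant_count / tested_count


# Hypothetical isolate resistant to 3 of the 12 antibiotics in the panel.
example_index = mar_index(3, 12)
```

An index near 0 indicates an isolate susceptible to most of the panel, and an index near 1 indicates resistance to nearly all of it.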