928 resultados para Secondary Structure Prediction
Resumo:
Dans un premier temps, nous avons modélisé la structure d’une famille d’ARN avec une grammaire de graphes afin d’identifier les séquences qui en font partie. Plusieurs autres méthodes de modélisation ont été développées, telles que des grammaires stochastiques hors-contexte, des modèles de covariance, des profils de structures secondaires et des réseaux de contraintes. Ces méthodes de modélisation se basent sur la structure secondaire classique comparativement à nos grammaires de graphes qui se basent sur les motifs cycliques de nucléotides. Pour exemplifier notre modèle, nous avons utilisé la boucle E du ribosome qui contient le motif Sarcin-Ricin qui a été largement étudié depuis sa découverte par cristallographie aux rayons X au début des années 90. Nous avons construit une grammaire de graphes pour la structure du motif Sarcin-Ricin et avons dérivé toutes les séquences qui peuvent s’y replier. La pertinence biologique de ces séquences a été confirmée par une comparaison des séquences d’un alignement de plus de 800 séquences ribosomiques bactériennes. Cette comparaison a soulevée des alignements alternatifs pour quelques unes des séquences que nous avons supportés par des prédictions de structures secondaires et tertiaires. Les motifs cycliques de nucléotides ont été observés par les membres de notre laboratoire dans l'ARN dont la structure tertiaire a été résolue expérimentalement. Une étude des séquences et des structures tertiaires de chaque cycle composant la structure du Sarcin-Ricin a révélé que l'espace des séquences dépend grandement des interactions entre tous les nucléotides à proximité dans l’espace tridimensionnel, c’est-à-dire pas uniquement entre deux paires de bases adjacentes. Le nombre de séquences générées par la grammaire de graphes est plus petit que ceux des méthodes basées sur la structure secondaire classique. Cela suggère l’importance du contexte pour la relation entre la séquence et la structure, d’où l’utilisation d’une grammaire de graphes contextuelle plus expressive que les grammaires hors-contexte. Les grammaires de graphes que nous avons développées ne tiennent compte que de la structure tertiaire et négligent les interactions de groupes chimiques spécifiques avec des éléments extra-moléculaires, comme d’autres macromolécules ou ligands. Dans un deuxième temps et pour tenir compte de ces interactions, nous avons développé un modèle qui tient compte de la position des groupes chimiques à la surface des structures tertiaires. L’hypothèse étant que les groupes chimiques à des positions conservées dans des séquences prédéterminées actives, qui sont déplacés dans des séquences inactives pour une fonction précise, ont de plus grandes chances d’être impliqués dans des interactions avec des facteurs. En poursuivant avec l’exemple de la boucle E, nous avons cherché les groupes de cette boucle qui pourraient être impliqués dans des interactions avec des facteurs d'élongation. Une fois les groupes identifiés, on peut prédire par modélisation tridimensionnelle les séquences qui positionnent correctement ces groupes dans leurs structures tertiaires. Il existe quelques modèles pour adresser ce problème, telles que des descripteurs de molécules, des matrices d’adjacences de nucléotides et ceux basé sur la thermodynamique. Cependant, tous ces modèles utilisent une représentation trop simplifiée de la structure d’ARN, ce qui limite leur applicabilité. Nous avons appliqué notre modèle sur les structures tertiaires d’un ensemble de variants d’une séquence d’une instance du Sarcin-Ricin d’un ribosome bactérien. L’équipe de Wool à l’université de Chicago a déjà étudié cette instance expérimentalement en testant la viabilité de 12 variants. Ils ont déterminé 4 variants viables et 8 létaux. Nous avons utilisé cet ensemble de 12 séquences pour l’entraînement de notre modèle et nous avons déterminé un ensemble de propriétés essentielles à leur fonction biologique. Pour chaque variant de l’ensemble d’entraînement nous avons construit des modèles de structures tertiaires. Nous avons ensuite mesuré les charges partielles des atomes exposés sur la surface et encodé cette information dans des vecteurs. Nous avons utilisé l’analyse des composantes principales pour transformer les vecteurs en un ensemble de variables non corrélées, qu’on appelle les composantes principales. En utilisant la distance Euclidienne pondérée et l’algorithme du plus proche voisin, nous avons appliqué la technique du « Leave-One-Out Cross-Validation » pour choisir les meilleurs paramètres pour prédire l’activité d’une nouvelle séquence en la faisant correspondre à ces composantes principales. Finalement, nous avons confirmé le pouvoir prédictif du modèle à l’aide d’un nouvel ensemble de 8 variants dont la viabilité à été vérifiée expérimentalement dans notre laboratoire. En conclusion, les grammaires de graphes permettent de modéliser la relation entre la séquence et la structure d’un élément structural d’ARN, comme la boucle E contenant le motif Sarcin-Ricin du ribosome. Les applications vont de la correction à l’aide à l'alignement de séquences jusqu’au design de séquences ayant une structure prédéterminée. Nous avons également développé un modèle pour tenir compte des interactions spécifiques liées à une fonction biologique donnée, soit avec des facteurs environnants. Notre modèle est basé sur la conservation de l'exposition des groupes chimiques qui sont impliqués dans ces interactions. Ce modèle nous a permis de prédire l’activité biologique d’un ensemble de variants de la boucle E du ribosome qui se lie à des facteurs d'élongation.
Resumo:
Les ARN non codants (ARNnc) sont des transcrits d'ARN qui ne sont pas traduits en protéines et qui pourtant ont des fonctions clés et variées dans la cellule telles que la régulation des gènes, la transcription et la traduction. Parmi les nombreuses catégories d'ARNnc qui ont été découvertes, on trouve des ARN bien connus tels que les ARN ribosomiques (ARNr), les ARN de transfert (ARNt), les snoARN et les microARN (miARN). Les fonctions des ARNnc sont étroitement liées à leurs structures d’où l’importance de développer des outils de prédiction de structure et des méthodes de recherche de nouveaux ARNnc. Les progrès technologiques ont mis à la disposition des chercheurs des informations abondantes sur les séquences d'ARN. Ces informations sont accessibles dans des bases de données telles que Rfam, qui fournit des alignements et des informations structurelles sur de nombreuses familles d'ARNnc. Dans ce travail, nous avons récupéré toutes les séquences des structures secondaires annotées dans Rfam, telles que les boucles en épingle à cheveux, les boucles internes, les renflements « bulge », etc. dans toutes les familles d'ARNnc. Une base de données locale, RNAstem, a été créée pour faciliter la manipulation et la compilation des données sur les motifs de structure secondaire. Nous avons analysé toutes les boucles terminales et internes ainsi que les « bulges » et nous avons calculé un score d’abondance qui nous a permis d’étudier la fréquence de ces motifs. Tout en minimisant le biais de la surreprésentation de certaines classes d’ARN telles que l’ARN ribosomal, l’analyse des scores a permis de caractériser les motifs rares pour chacune des catégories d’ARN en plus de confirmer des motifs communs comme les boucles de type GNRA ou UNCG. Nous avons identifié des motifs abondants qui n’ont pas été étudiés auparavant tels que la « tetraloop » UUUU. En analysant le contenu de ces motifs en nucléotides, nous avons remarqué que ces régions simples brins contiennent beaucoup plus de nucléotides A et U. Enfin, nous avons exploré la possibilité d’utiliser ces scores pour la conception d’un filtre qui permettrait d’accélérer la recherche de nouveaux ARN non-codants. Nous avons développé un système de scores, RNAscore, qui permet d’évaluer un ARN en se basant sur son contenu en motifs et nous avons testé son applicabilité avec différents types de contrôles.
Characterization and Pathogenicity of Vibrio cholerae and Vibrio vulnificus from Marine environments
Resumo:
The genus Vibrioof the family Vibrionaceae are Gram negative, oxidasepositive, rod- or curved- rodshaped facultative anaerobes, widespread in marine and estuarine environments. Vibrio species are opportunistic human pathogens responsible for diarrhoeal disease, gastroenteritis, septicaemia and wound infections and are also pathogens of aquatic organisms, causing infections to crustaceans, bivalves and fishes. In the present study, marine environmental samples like seafood and water and sediment samples from aquafarms and mangroves were screened for the presence of Vibrio species. Of the134 isolates obtained from the various samples, 45 were segregated to the genus Vibrio on the basis of phenotypic characterization.like Gram staining, oxidase test, MoF test and salinity tolerance. Partial 16S rDNA sequence analysis was utilized for species level identification of the isolates and the strains were identified as V. cholerae(N=21), V. vulnificus(N=18), V. parahaemolyticus(N=3), V. alginolyticus (N=2) and V. azureus (N=1). The genetic relatedness and variations among the 45 Vibrio isolates were elucidated based on 16S rDNA sequences. Phenotypic characterization of the isolates was based on their response to 12 biochemical tests namely Voges-Proskauers’s (VP test), arginine dihydrolase , tolerance to 3% NaCl test, ONPG test that detects β-galactosidase activity, and tests for utilization of citrate, ornithine, mannitol, arabinose, sucrose, glucose, salicin and cellobiose. The isolates exhibited diverse biochemical patterns, some specific for the species and others indicative of their environmental source.Antibiogram for the isolates was determined subsequent to testing their susceptibility to 12 antibiotics by the disc diffusion method. Varying degrees of resistance to gentamycin (2.22%), ampicillin(62.22%), nalidixic acid (4.44%), vancomycin (86.66), cefixime (17.77%), rifampicin (20%), tetracycline (42.22%) and chloramphenicol (2.22%) was exhibited. All the isolates were susceptible to streptomycin, co-trimoxazole, trimethoprim and azithromycin. Isolates from all the three marine environments exhibited multiple antibiotic resistance, with high MAR index value. The molecular typing methods such as ERIC PCR and BOX PCR revealed intraspecies relatedness and genetic heterogeneity within the environmental isolatesof V. cholerae and V. vulnificus. The 21 strains of V. choleraewere serogroupedas non O1/ non O139 by screening for the presence O1rfb and O139 rfb marker genes by PCR. The virulence/virulence associated genes namely ctxA, ctxB, ace, VPI, hlyA, ompU, rtxA, toxR, zot, nagst, tcpA, nin and nanwere screened in V. cholerae and V. vulnificusstrains.The V. vulnificusstrains were also screened for three species specific genes viz., cps, vvhand viu. In V. cholerae strains, the virulence associated genes like VPI, hlyA, rtxA, ompU and toxR were confirmed by PCR. All the isolates, except for strain BTOS6, harbored at least one or a combination of the tested genes and V. choleraestrain BTPR5 isolated from prawn hosted the highest number of virulence associated genes. Among the V. vulnificusstrains, only 3 virulence genes, VPI, toxR and cps, were confirmed out of the 16 tested and only 7 of the isolates had these genes in one or more combinations. Strain BTPS6 from aquafarm and strain BTVE4 from mangrove samples yielded positive amplification for the three genes. The toxRgene from 9 strains of V. choleraeand 3 strains of V. vulnificus were cloned and sequenced for phylogenetic analysis based on nucleotide and the amino acid sequences. Multiple sequence alignment of the nucleotide sequences and amino acid sequences of the environmental strains of V. choleraerevealed that the toxRgene in the environmental strains are 100% homologous to themselves and to the V. choleraetoxR gene sequence available in the Genbank database. The 3 strains of V. vulnificus displayed high nucleotide and amino acid sequence similarity among themselves and to the sequences of V. cholerae and V. harveyi obtained from the GenBank database, but exhibited only 72% homology to the sequences of its close relative V. vulnificus. Structure prediction of the ToxR protein of Vibrio cholerae strain BTMA5 was by PHYRE2 software. The deduced amino acid sequence showed maximum resemblance with the structure of DNA-binding domain of response regulator2 from Escherichia coli k-12 Template based homology modelling in PHYRE2 successfully modelled the predicted protein and its secondary structure based on protein data bank (PDB) template c3zq7A. The pathogenicity studies were performed using the nematode Caenorhabditiselegansas a model system. The assessment of pathogenicity of environmental strain of V. choleraewas conducted with E. coli strain OP50 as the food source in control plates, environmental V. cholerae strain BTOS6, negative for all tested virulence genes, to check for the suitability of Vibrio sp. as a food source for the nematode;V. cholerae Co 366 ElTor, a clinical pathogenic strain and V. cholerae strain BTPR5 from seafood (Prawn) and positive for the tested virulence genes like VPI, hlyA, ompU,rtxA and toxR. It was found that V. cholerae strain BTOS6 could serve as a food source in place of E. coli strain OP50 but behavioral aberrations like sluggish movement and lawn avoidance and morphological abnormalities like pharyngeal and intestinal distensions and bagging were exhibited by the worms fed on V. cholerae Co 366 ElTor strain and environmental BTPR5 indicating their pathogenicity to the nematode. Assessment of pathogenicity of the environmental strains of V. vulnificus was performed with V. vulnificus strain BTPS6 which tested positive for 3 virulence genes, namely, cps, toxRand VPI, and V. vulnificus strain BTMM7 that did not possess any of the tested virulence genes. A reduction was observed in the life span of worms fed on environmental strain of V. vulnificusBTMM7 rather than on the ordinary laboratory food source, E. coli OP50. Behavioral abnormalities like sluggish movement, lawn avoidance and bagging were also observed in the worms fed with strain BTPS6, but the pharynx and the intestine were intact. The presence of multi drug resistant environmental Vibrio strainsthat constitute a major reservoir of diverse virulence genes are to be dealt with caution as they play a decisive role in pathogenicity and horizontal gene transfer in the marine environments.
Resumo:
MOTIVATION: The accurate prediction of the quality of 3D models is a key component of successful protein tertiary structure prediction methods. Currently, clustering or consensus based Model Quality Assessment Programs (MQAPs) are the most accurate methods for predicting 3D model quality; however they are often CPU intensive as they carry out multiple structural alignments in order to compare numerous models. In this study, we describe ModFOLDclustQ - a novel MQAP that compares 3D models of proteins without the need for CPU intensive structural alignments by utilising the Q measure for model comparisons. The ModFOLDclustQ method is benchmarked against the top established methods in terms of both accuracy and speed. In addition, the ModFOLDclustQ scores are combined with those from our older ModFOLDclust method to form a new method, ModFOLDclust2, that aims to provide increased prediction accuracy with negligible computational overhead. RESULTS: The ModFOLDclustQ method is competitive with leading clustering based MQAPs for the prediction of global model quality, yet it is up to 150 times faster than the previous version of the ModFOLDclust method at comparing models of small proteins (<60 residues) and over 5 times faster at comparing models of large proteins (>800 residues). Furthermore, a significant improvement in accuracy can be gained over the previous clustering based MQAPs by combining the scores from ModFOLDclustQ and ModFOLDclust to form the new ModFOLDclust2 method, with little impact on the overall time taken for each prediction. AVAILABILITY: The ModFOLDclustQ and ModFOLDclust2 methods are available to download from: http://www.reading.ac.uk/bioinf/downloads/ CONTACT: l.j.mcguffin@reading.ac.uk.
Resumo:
The self-assembly and hydrogelation properties of two Fmoc-tripeptides [Fmoc = N-(fluorenyl-9-methoxycarbonyl)] are investigated, in borate buffer and other basic solutions. A remarkable difference in self-assembly properties is observed comparing Fmoc-VLK(Boc) with Fmoc-K(Boc)LV, both containing K protected by N(epsilon)-tert-butyloxycarbonate (Boc). In borate buffer, the former peptide forms highly anisotropic fibrils which show local alignment, and the hydrogels show flow-aligning properties. In contrast, Fmoc-K(Boc)LV forms highly branched fibrils that produce isotropic hydrogels with a much higher modulus (G' > 10(4) Pa), and lower concentration for hydrogel formation. The distinct self-assembled structures are ascribed to conformational differences, as revealed by secondary structure probes (CD, FTIR, Raman spectroscopy) and X-ray diffraction. Fmoc-VLK(Boc) forms well-defined beta-sheets with a cross-beta X-ray diffraction pattern, whereas Fmoc-KLV(Boc) forms unoriented assemblies with multiple stacked sheets. Interchange of the K and V residues when inverting the tripeptide sequence thus leads to substantial differences in self-assembled structures, suggesting a promising approach to control hydrogel properties.
Resumo:
An automatic method for recognizing natively disordered regions from amino acid sequence is described and benchmarked against predictors that were assessed at the latest critical assessment of techniques for protein structure prediction (CASP) experiment. The method attains a Wilcoxon score of 90.0, which represents a statistically significant improvement on the methods evaluated on the same targets at CASP. The classifier, DISOPRED2, was used to estimate the frequency of native disorder in several representative genomes from the three kingdoms of life. Putative, long (>30 residue) disordered segments are found to occur in 2.0% of archaean, 4.2% of eubacterial and 33.0% of eukaryotic proteins. The function of proteins with long predicted regions of disorder was investigated using the gene ontology annotations supplied with the Saccharomyces genome database. The analysis of the yeast proteome suggests that proteins containing disorder are often located in the cell nucleus and are involved in the regulation of transcription and cell signalling. The results also indicate that native disorder is associated with the molecular functions of kinase activity and nucleic acid binding.
Resumo:
The FunFOLD2 server is a new independent server that integrates our novel protein–ligand binding site and quality assessment protocols for the prediction of protein function (FN) from sequence via structure. Our guiding principles were, first, to provide a simple unified resource to make our function prediction software easily accessible to all via a simple web interface and, second, to produce integrated output for predictions that can be easily interpreted. The server provides a clean web interface so that results can be viewed on a single page and interpreted by non-experts at a glance. The output for the prediction is an image of the top predicted tertiary structure annotated to indicate putative ligand-binding site residues. The results page also includes a list of the most likely binding site residues and the types of predicted ligands and their frequencies in similar structures. The protein–ligand interactions can also be interactively visualized in 3D using the Jmol plug-in. The raw machine readable data are provided for developers, which comply with the Critical Assessment of Techniques for Protein Structure Prediction data standards for FN predictions. The FunFOLD2 webserver is freely available to all at the following web site: http://www.reading.ac.uk/bioinf/FunFOLD/FunFOLD_form_2_0.html.
Resumo:
Elucidating the biological and biochemical roles of proteins, and subsequently determining their interacting partners, can be difficult and time consuming using in vitro and/or in vivo methods, and consequently the majority of newly sequenced proteins will have unknown structures and functions. However, in silico methods for predicting protein–ligand binding sites and protein biochemical functions offer an alternative practical solution. The characterisation of protein–ligand binding sites is essential for investigating new functional roles, which can impact the major biological research spheres of health, food, and energy security. In this review we discuss the role in silico methods play in 3D modelling of protein–ligand binding sites, along with their role in predicting biochemical functionality. In addition, we describe in detail some of the key alternative in silico prediction approaches that are available, as well as discussing the Critical Assessment of Techniques for Protein Structure Prediction (CASP) and the Continuous Automated Model EvaluatiOn (CAMEO) projects, and their impact on developments in the field. Furthermore, we discuss the importance of protein function prediction methods for tackling 21st century problems.
Resumo:
The present findings suggest that Anopheles (Kerteszia) homunculus may comprise more than one species. The rDNA ITS2 sequence data corroborate the presence of An. homunculus l.s. in Mata Atlantica, southern Brazil, and suggest that specimens from Trinidad may belong to an unnamed morphologically similar species. There is a need for additional studies to establish the geographical distribution of An. homunculus l.s. in continental South America and in Trinidad, especially in southern Mata Atlantica, Brazil.
Resumo:
Calyptommatus and Nothobachia genera of gymnophthalmid lizards are restricted to sandy open habitats on Sao Francisco River margins, northeastern Brazil. Phylogenetic relationships and geographic distribution of the four recognized species of Calyptommatus were analyzed from partial mitochondrial cyt b, 12S, and 16S rRNA genes sequencing, taking allopatric populations of the monotypic Nothobachia ablephara as the outgroup. In Calyptommatus a basal split separated C. sinebrachiatus, a species restricted to the eastern bank of the river, from the three other species. In this clade, C. confusionibus, found on western margin, was recovered as the sister group of the two other species, C. leiolepis and C. nicterus, from opposite margins. According to approximate date estimations, C. sinebrachiatus would have separated from the other congeneric species by 4.4-6.5 my, and C. nicterus, also from eastern bank, would be diverging by 1.8-2.6 my from C. leiolepis, the sister species on the opposite margin. C. confusionibus and C. leiolepis, both from western sandy areas, would be differentiating by 2.8-5.0 my. Divergence times of about 3.0-4.0 my were estimated for allopatric populations of Nothobachia restricted to western margin. Significant differences in 16S rRNA secondary structure relatively to other vertebrates are reported. Distinct evolutionary patterns are proposed for different taxa in those sandy areas, probably related to historical changes in the course of Sao Francisco River. (C) 2010 Elsevier Inc. All rights reserved.
Resumo:
Bothropasin is a 48 kDa hemorrhagic PIII snake venom metalloprotease (SVMP) isolated from Bothrops jararaca, containing disintegrin/cysteine-rich adhesive domains. Here we present the crystal structure of bothropasin complexed with the inhibitor POL647. The catalytic domain consists of a scaffold of two subdomains organized similarly to those described for other SVMPs, including the zinc and calcium-binding sites. The free cysteine residue Cys(189) is located within a hydrophobic core and it is not available for disulfide bonding or other interactions. There is no identifiable secondary structure for the disintegrin domain, but instead it is composed mostly of loops stabilized by seven disulfide bonds and by two calcium ions. The ECD region is in a loop and is structurally related to the RGD region of RGD disintegrins, which are derived from I`ll SVMPs. The ECD motif is stabilized by the Cys(117)_Cys(310) disulfide bond (between the disintegrin and cysteine-rich domains) and by one calcium ion. The side chain of Glu(276) of the ECD motif is exposed to solvent and free to make interactions. In bothropasin, the HVR (hyper-variable region) described for other Pill SVMPs in the cysteine-rich domain, presents a well-conserved sequence with respect to several other Pill members from different species. We propose that this subset be referred to as PIII-HCR (highly conserved region) SVMPs. The differences in the disintegrin-like, cysteine-rich or disintegrin-like cysteine-rich domains may be involved in selecting target binding, which in turn could generate substrate diversity or specificity for the catalytic domain. (C) 2008 Elsevier Ltd. All rights reserved.
Resumo:
Xylella fastidiosa is an important pathogen bacterium transmitted by xylem-feedings leafhoppers that colonizes the xylem of plants and causes diseases on several important crops including citrus variegated chlorosis (CVC) in orange and lime trees. Glutathione-S-transferases (GST) form a group of multifunctional isoenzymes that catalyzes both glutathione (GSH)-dependent conjugation and reduction reactions involved in the cellular detoxification of xenobiotic and endobiotic compounds. GSTs are the major detoxification enzymes found in the intracellular space and mainly in the cytosol from prokaryotes to mammals, and may be involved in the regulation of stress-activated signals by suppressing apoptosis signal-regulating kinase 1. In this study, we describe the cloning of the glutathione-S-transferase from X. fastidiosa into pET-28a(+) vector, its expression in Escherichia coli, purification and initial structural characterization. The purification of recombinant xfGST (rxfGST) to near homogeneity was achieved using affinity chromatography and size-exclusion chromatography (SEC). SEC demonstrated that rxfGST is a homodimer in solution. The secondary and tertiary structures of recombinant protein were analyzed by circular dichroism and fluorescence spectroscopy, respectively. The enzyme was assayed for activity and the results taken together indicated that rxfGST is a stable molecule, correctly folded, and highly active. Several members of the GST family have been extensively studied. However, xfGST is part of a less-studied subfamily which yet has not been structurally and biochemically characterized. In addition, these studies should provide a useful basis for future studies and biotechnological approaches of rxfGST. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
This study aimed at investigating the structural properties and mechanisms of the antifungal action of CpOsm, a purified osmotin from Calotropis procera latex. Fluorescence and CD assays revealed that the CpOsm structure is highly stable, regardless of pH levels. Accordingly, CpOsm inhibited the spore germination of Fusarium solani in all pH ranges tested. The content of the secondary structure of CpOsm was estimated as follows: alpha-helix (20%), beta-sheet (33%), turned (19%) and unordered (28%). RMSD 1%. CpOsm was stable at up to 75 degrees C, and thermal denaturation (T(m)) was calculated to be 77.8 degrees C. This osmotin interacted with the negatively charged large unilamellar vesicles (LUVs) of 1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-rac-1-glycerol (POPG), inducing vesicle permeabilization by the leakage of calcein. CpOsm induced the membrane permeabilization of spores and hyphae from Fusarium solani, allowing for propidium iodide uptake. These results show that CpOsm is a stable protein, and its antifungal activity involves membrane permeabilization, as property reported earlier for other osmotins and thaumatin-like proteins. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
1,3-beta-Glucan depolymerizing enzymes have considerable biotechnological applications including biofuel production, feedstock-chemicals and pharmaceuticals. Here we describe a comprehensive functional characterization and low-resolution structure of a hyperthermophilic laminarinase from Thermotoga petrophila (TpLam). We determine TpLam enzymatic mode of operation, which specifically cleaves internal beta-1,3-glucosidic bonds. The enzyme most frequently attacks the bond between the 3rd and 4th residue from the non-reducing end, producing glucose, laminaribiose and laminaritriose as major products. Far-UV circular dichroism demonstrates that TpLam is formed mainly by beta structural elements, and the secondary structure is maintained after incubation at 90 degrees C. The structure resolved by small angle X-ray scattering, reveals a multi-domain structural architecture of a V-shape envelope with a catalytic domain flanked by two carbohydrate-binding modules. Crown Copyright (C) 2011 Published by Elsevier Inc. All rights reserved.