971 results for Protein Structure


Relevance:

60.00% 60.00%

Publisher:

Abstract:

Natural protein sequences are the net result of the interplay between mutation, natural selection and stochastic drift over evolutionary time. Probabilistic models of molecular evolution that account for these factors have been substantially improved in recent years. In particular, models that explicitly incorporate protein structure and interdependencies between sites have been proposed, along with statistical tools for evaluating the performance of these models. However, despite significant advances in this direction, only highly simplified representations of protein structure have been used so far. In this context, the general subject of this thesis is the modelling of the three-dimensional structure of proteins, taking into account the practical limitations imposed by phylogenetic methods that are very demanding in computing time. First, a general statistical method is presented for optimizing the parameters of a statistical potential (a pseudo-energy measuring sequence-structure compatibility). The functional form of the potential is then refined by increasing the level of detail in the structural description without increasing the computational cost. Several structural elements are explored: interactions between pairs of residues, solvent accessibility, main-chain conformation and flexibility. The potentials are then included in an evolutionary model, and their performance is evaluated in terms of statistical fit to real data and contrasted with standard evolutionary models. Finally, the resulting structurally constrained model is used to better understand the relationships between gene expression level and the selection and conservation of the corresponding protein sequences.
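To make the idea of a statistical potential as a sequence-structure pseudo-energy concrete, here is a minimal sketch. It is not the potential developed in the thesis: the function name, the two-term functional form (pair contacts plus a burial term) and every numerical value are assumptions chosen only for illustration.

```python
# Toy illustration only: the functional form and all numbers are hypothetical,
# not the potential optimized in the thesis.

def pseudo_energy(sequence, contacts, burial, pair_term, burial_term):
    """Score a sequence threaded onto a fixed structure.

    contacts: list of (i, j) site pairs that are in contact in the structure
    burial: list of booleans, True where a site is buried from solvent
    pair_term / burial_term: lookup tables of pseudo-energy contributions
    """
    energy = 0.0
    for i, j in contacts:
        # Pair terms are symmetric, so look them up in sorted order.
        pair = tuple(sorted((sequence[i], sequence[j])))
        energy += pair_term.get(pair, 0.0)
    for i, buried in enumerate(burial):
        if buried:
            energy += burial_term.get(sequence[i], 0.0)
    return energy


# Hypothetical four-site structure: sites 0 and 3 are buried and in contact.
contacts = [(0, 3), (1, 2)]
burial = [True, False, False, True]
pair_term = {("I", "L"): -1.0, ("D", "K"): -0.5}
burial_term = {"I": -0.7, "L": -0.6, "D": 0.9, "K": 0.8}

native_like = pseudo_energy("IDKL", contacts, burial, pair_term, burial_term)
swapped = pseudo_energy("DIKL", contacts, burial, pair_term, burial_term)
print(native_like, swapped)
```

In this toy setting, the sequence that places hydrophobic residues at the buried, contacting sites receives the lower pseudo-energy, which is the sense in which such a potential measures compatibility between a sequence and a structure.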

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Epithelial ovarian cancer (EOC) is the most lethal gynaecological cancer. More than 70% of patients diagnosed with an advanced-stage tumour relapse after first-line chemotherapy, so five-year survival is very low. To better understand the course of the disease, we searched for new genes responsible for the initiation and progression of EOC. Previously, cell lines were derived from the primary and recurrent tumours and/or ascites of three patients. RNA sequencing of these lines using next-generation sequencing (NGS) allowed us to identify point mutations that could point to genes deregulated in EOC. NGS is a good tool for identifying and screening mutations on a large scale. We selected PLEC1, SCRIB, NCOR2, SEMA6C, IKBKB, GLCE and ITGAE as candidate genes that carry mutations in our cell lines and have an established functional relationship with cancer. Because NGS has limited reliability, we validated these mutations by Sanger sequencing. We then studied the effect of these mutations on the protein structure and expression of PLEC1, SCRIB and SEMA6C. Only some of the mutations in the PLEC1, SCRIB and SEMA6C genes could be confirmed. PLEC1 and SCRIB are two scaffold proteins whose mutation, reported in several cancers, could induce changes in their conformation and affect their interactions and functions. The consequences of these mutations for ovarian tumorigenesis remain to be studied.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

The resurgence of the enteric pathogen Vibrio cholerae, the causative organism of epidemic cholera, remains a major health problem in many developing countries such as India. Cholera is endemic in the southern Indian state of Kerala. Outbreaks of cholera follow a seasonal pattern in regions of endemicity. Marine aquaculture settings and mangrove environments of Kerala serve as reservoirs for V. cholerae. The non-O1/non-O139 environmental isolates of V. cholerae with an incomplete ‘virulence cassette’ are to be treated with caution, as they constitute a major reservoir of diverse virulence genes in the marine environment and play a crucial role in pathogenicity and horizontal gene transfer. The genes encoding cholera toxin are borne on, and can be infectiously transmitted by, CTXΦ, a filamentous lysogenic vibriophage. Temperate phages can provide crucial virulence and fitness factors affecting cell metabolism, bacterial adhesion, colonization, immunity, antibiotic resistance and serum resistance. The present study was an attempt to screen marine environments such as aquafarms and mangroves in the coastal areas of Alappuzha and Cochin, Kerala, for the presence of lysogenic V. cholerae, and to study their pathogenicity and gene transfer potential. Phenotypic and molecular methods were used to identify isolates as V. cholerae. The thirty-one isolates that were Gram negative, oxidase positive and fermentative, with or without gas production on MOF media, and that showed yellow colonies on TCBS (Thiosulfate Citrate Bile salt Sucrose) agar were segregated as vibrios. Twenty-two environmental V. cholerae strains of both O1 and non-O1/non-O139 serogroups showed the presence of lysogenic phages on induction with mitomycin C. They produced characteristic turbid plaques in the double agar overlay assay using the indicator strain V. cholerae El Tor MAK 757.
PCR-based molecular typing with primers targeting specific conserved sequences in the bacterial genome demonstrated genetic diversity among these lysogen-containing non-O1 V. cholerae. Polymerase chain reaction was also employed as a rapid screening method to verify the presence of nine virulence genes, namely ctxA, ctxB, ace, hlyA, toxR, zot, tcpA, ninT and nanH, using gene-specific primers. The presence of the tcpA gene in ALPVC3 was alarming, as it indicates the possibility of an epidemic by accepting the cholera. Differential induction studies used ΦALPVC3, ΦALPVC11, ΦALPVC12 and ΦEKM14, underlining the possibility of prophage induction in natural ecosystems due to abiotic factors such as antibiotics, pollutants, temperature and UV. The efficiency of prophage induction varied considerably in response to the different induction agents. The growth curve of the lysogenic V. cholerae used in the study varied drastically in the presence of strong prophage inducers such as antibiotics and UV. Bacterial cell lysis was directly proportional to the increase in phage number due to induction. Morphological characterization of the vibriophages by transmission electron microscopy revealed hexagonal heads for all four phages. Vibriophage ΦALPVC3 exhibited isometric and contractile tails characteristic of the family Myoviridae, while phages ΦALPVC11 and ΦALPVC12 demonstrated the typical hexagonal head and non-contractile tail of the family Siphoviridae. ΦEKM14, the podophage, was distinguished by a short non-contractile tail and an icosahedral head. This work demonstrated that environmental parameters can influence the viability and cell adsorption rates of V. cholerae phages. Adsorption studies showed 100% adsorption of ΦALPVC3, ΦALPVC11, ΦALPVC12 and ΦEKM14 after 25, 30, 40 and 35 minutes, respectively. Exposure to high temperatures ranging from 50 °C to 100 °C drastically reduced phage viability.
The optimum concentration of NaCl required for survival of the vibriophages was 0.5 M, except for ΦEKM14, for which it was 1 M. Survival of phage particles was maximal at pH 7-8. V. cholerae is assumed to have existed long before its human host, so the pathogenic clones may have evolved from aquatic forms that later colonized the human intestine by progressive acquisition of genes. This is supported by the fact that the vast majority of V. cholerae strains are still part of the natural aquatic environment. CTXΦ has played a critical role in the evolution of the pathogenicity of V. cholerae, as it can transmit the ctxAB genes. The unusual transformation of V. cholerae strains associated with epidemics and the emergence of V. cholerae O139 demonstrate the evolutionary success of the organism in attaining greater fitness. Genetic changes in pathogenic V. cholerae constitute a natural process for developing immunity within an endemically infected population. The alternative hosts and lysogenic environmental V. cholerae strains may potentially act as cofactors in promoting cholera phage “blooms” within aquatic environments, thereby influencing transmission of phage-sensitive, pathogenic V. cholerae strains by aquatic vehicles. Differential induction of the phages is a clear indication of the impact of environmental pollution and global changes on phage induction. The development of molecular biology techniques offered an accessible gateway for investigating the molecular events leading to genetic diversity in the marine environment. Using nucleic acids as targets, fingerprinting methods such as ERIC PCR and BOX PCR revealed that the marine environment harbours a genetically diverse group of potentially pathogenic bacteria. The distribution of virulence-associated genes in the environmental isolates of V. cholerae provides tangible material for further investigation.
Nucleotide and protein sequence analysis, along with protein structure prediction, aids in better understanding the variation in alleles of the same gene in different ecological niches and its impact on protein structure for attaining greater fitness of pathogens. The evidence of co-evolution of virulence genes in toxigenic V. cholerae O1 from different lineages of environmental non-O1 strains is alarming. Transduction studies would indicate that acquisition of these virulence genes by lateral gene transfer, although rare, is not quite uncommon among non-O1/non-O139 V. cholerae and has a key role in diversification. All these considerations justify the need for an integrated approach towards the development of an effective surveillance system to monitor the evolution of V. cholerae strains with epidemic potential. The results presented in this study, if considered together with the mechanism proposed above, strongly suggest that the bacteriophage also intervenes as a variable in shaping the cholera bacterium, one that cannot be ignored, hinting at imminent future epidemics.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Human butyrylcholinesterase (BChE; EC 3.1.1.8) is a polymorphic enzyme synthesized in the liver and adipose tissue, widely distributed in the body, and responsible for hydrolysing some choline esters such as procaine, aliphatic esters such as acetylsalicylic acid, drugs such as methylprednisolone, mivacurium and succinylcholine, and drugs of use and/or abuse such as heroin and cocaine. It is encoded by the BCHE gene (OMIM 177400); more than 100 variants have been identified, some of them not fully studied, in addition to the most frequent form, called usual or wild type. Different polymorphisms of the BCHE gene have been associated with the synthesis of enzymes with varying levels of catalytic activity. The molecular bases of some of these genetic variants have been reported, including the Atypical (A), fluoride-resistant type 1 and type 2 (F-1 and F-2), silent (S), Kalow (K), James (J) and Hammersmith (H) variants. In this study, the validated instrument Lifetime Severity Index for Cocaine Use Disorder (LSI-C) was applied to a group of patients to assess the severity of “cocaine” use over their lifetime. In addition, single nucleotide polymorphisms (SNPs) in the BCHE gene known to be responsible for adverse reactions in “cocaine”-using patients were determined by sequencing the gene, and the effect of the SNPs on the function and structure of the protein was predicted using bioinformatics tools. The LSI-C instrument yielded results in four dimensions: lifetime use, recent use, psychological dependence and attempts to quit.
Molecular analysis revealed two non-synonymous coding SNPs (cSNPs) in 27.3% of the sample, c.293A>G (p.Asp98Gly) and c.1699G>A (p.Ala567Thr), located in exons 2 and 4, which correspond functionally to the Atypical (A) variant [dbSNP: rs1799807] and the Kalow (K) variant [dbSNP: rs1803274] of the BChE enzyme, respectively. In silico prediction studies classified the p.Asp98Gly SNP as pathogenic, whereas the p.Ala567Thr SNP showed neutral behaviour. Analysis of the results suggests a relationship between polymorphisms or genetic variants responsible for low catalytic activity and/or low plasma concentration of the BChE enzyme and some of the adverse reactions that occur in cocaine-using patients.
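The coding-DNA and protein descriptions of the two variants can be cross-checked with simple arithmetic: coding position n falls in codon (n - 1) // 3 + 1. The sketch below is an illustrative consistency check against the standard genetic code, not part of the study's pipeline; the function names are hypothetical, and it does not use the actual BCHE reference sequence.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_position(cdna_pos):
    """Return (codon number, position within the codon), both 1-based."""
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1


def consistent_codons(cdna_pos, ref_base, alt_base, ref_aa, alt_aa):
    """List (reference codon, mutated codon) pairs compatible with a variant.

    Amino acids are given as one-letter codes. The real reference codon is
    not looked up; this only checks that at least one codon could explain
    the reported change.
    """
    _, offset = codon_position(cdna_pos)
    hits = []
    for codon, aa in CODON_TABLE.items():
        if aa == ref_aa and codon[offset - 1] == ref_base:
            mutated = codon[: offset - 1] + alt_base + codon[offset:]
            if CODON_TABLE[mutated] == alt_aa:
                hits.append((codon, mutated))
    return hits


# c.293A>G (p.Asp98Gly): second base of codon 98.
print(codon_position(293), consistent_codons(293, "A", "G", "D", "G"))
# c.1699G>A (p.Ala567Thr): first base of codon 567.
print(codon_position(1699), consistent_codons(1699, "G", "A", "A", "T"))
```

Both variants pass the check: position 293 is the second base of codon 98, where an A to G change can turn Asp (GAT or GAC) into Gly, and position 1699 is the first base of codon 567, where a G to A change turns any Ala codon (GCN) into Thr (ACN).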

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Objectives: To determine the prevalence of, and factors associated with, the development of autoimmune hypothyroidism (AH) in a cohort of patients with systemic lupus erythematosus (SLE), and to analyse current information on the prevalence and impact of autoimmune thyroid disease (AITD) and thyroid autoimmunity in patients with SLE. Methods: This was a two-step study. First, a total of 376 patients with SLE were systematically assessed and classified as having: 1) confirmed AH, 2) positivity for thyroperoxidase/thyroglobulin antibodies (TPOAb/TgAb) without hypothyroidism, 3) non-autoimmune hypothyroidism, or 4) neither hypothyroidism nor TPOAb/TgAb positivity. Multivariate models and classification and regression trees (CART) were built to analyse the data. Second, current information was evaluated through a systematic literature review (SLR). PRISMA guidelines were followed for searching the PubMed, Scopus, SciELO and Virtual Health Library databases. Results: In our cohort, the prevalence of confirmed AH was 12% (Group 1). However, the frequency of positivity for TPOAb and TgAb was 21% and 10%, respectively (Group 2). SLE patients without AH, non-autoimmune hypothyroidism or TPOAb/TgAb positivity made up 40% of the cohort. Patients with confirmed AH were significantly older and had a later disease onset. Smoking (AOR 6.93, 95% CI 1.98-28.54, p = 0.004), the presence of Sjögren's syndrome (SS) (AOR 23.2, 95% CI 1.89-359.53, p = 0.015) and positivity for anti-cyclic citrullinated peptide (anti-CCP) antibodies (AOR 10.35, 95% CI 1.04-121.26, p = 0.047) were associated with the coexistence of SLE and AH, adjusted for gender and disease duration. Smoking and SS were confirmed as predictive factors for SLE-AH (AUC of the CART model = 0.72).
In the SLR, the prevalence of AITD in SLE ranged from 1% to 60%. The factors associated with this polyautoimmunity were female gender, older age, smoking, positivity for certain antibodies, SS, and joint and skin involvement. Conclusions: AITD is frequent in patients with SLE and does not affect the severity of SLE. The risk factors identified will help clinicians in screening for AITD. Our results should encourage smoking-cessation policies for patients with SLE.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Most newly sequenced proteins are likely to adopt a similar structure to one which has already been experimentally determined. For this reason, the most successful approaches to protein structure prediction have been template-based methods. Such prediction methods attempt to identify and model the folds of unknown structures by aligning the target sequences to a set of representative template structures within a fold library. In this chapter, I discuss the development of template-based approaches to fold prediction, from the traditional techniques to the recent state-of-the-art methods. I also discuss the recent development of structural annotation databases, which contain models built by aligning the sequences from entire proteomes against known structures. Finally, I run through a practical step-by-step guide for aligning target sequences to known structures and contemplate the future direction of template-based structure prediction.
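As a rough illustration of the core step described above, aligning a target sequence against every entry of a fold library and ranking the templates by score, here is a minimal sketch. It uses plain Needleman-Wunsch global alignment with an identity scoring scheme, which is an assumption made for brevity; the methods discussed in the chapter rely on much richer scoring (for example, sequence profiles). The function names, scoring parameters and toy library are all hypothetical.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment score (score only, no traceback)."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def rank_templates(target, fold_library):
    """Return (template name, score) pairs, best-scoring template first."""
    scores = {
        name: global_alignment_score(target, seq)
        for name, seq in fold_library.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical fold library with made-up template names and sequences.
library = {"template_a": "ACDEFG", "template_b": "WWYYWW", "template_c": "ACDFG"}
print(rank_templates("ACDEFG", library))
```

The top-ranked template would then serve as the structural scaffold for model building; in practice the alignment itself, not just its score, is needed to map target residues onto template coordinates.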

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Statistical approaches have been applied to examine amino acid pairing preferences within parallel beta-sheets. The main chain hydrogen bonding pattern in parallel beta-sheets means that, for each residue pair, only one of the residues is involved in main chain hydrogen bonding with the strand containing the partner residue. We call this the hydrogen bonded (HB) residue and the partner residue the non-hydrogen bonded (nHB) residue, and differentiate between the favourability of a pair and that of its reverse pair, e.g. Asn(HB)-Thr(nHB) versus Thr(HB)-Asn(nHB). Significantly (p <= 0.000001) favoured pairings were rationalised using stereochemical arguments. For instance, Asn(HB)-Thr(nHB) and Arg(HB)-Thr(nHB) were favoured pairs, where the residues adopted favoured chi(1) rotamer positions that allowed side-chain interactions to occur. In contrast, Thr(HB)-Asn(nHB) and Thr(HB)-Arg(nHB) were not significantly favoured, and could only form side-chain interactions if the residues involved adopted less favourable chi(1) conformations. The favourability of hydrophobic pairs e.g. Ile(HB)-Ile(nHB), Val(HB)-Val(nHB) and Leu(HB)-Ile(nHB) was explained by the residues adopting their most preferred chi(1) and chi(2) conformations, which enabled them to form nested arrangements. Cysteine-cysteine pairs are significantly favoured, although these do not form intrasheet disulphide bridges. Interactions between positively and negatively charged residues were asymmetrically preferred: those with the negatively charged residue at the HB position were more favoured. This trend was accounted for by the presence of general electrostatic interactions, which, based on analysis of distances between charged atoms, were likely to be stronger when the negatively charged residue is the HB partner. The Arg(HB)-Asp(nHB) interaction was an exception to this trend and its favourability was rationalised by the formation of specific side-chain interactions. 
This research provides rules that could be applied to protein structure prediction, comparative modelling and protein engineering and design. The methods used to analyse the pairing preferences are automated and detailed results are available (http://www.rubic.rdg.ac.uk/betapairprefsparallel/). (c) 2005 Elsevier Ltd. All rights reserved.
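The abstract does not spell out the statistic used, so the following is only a sketch of one common way to quantify directional pairing preferences: an observed-over-expected ratio in which the (HB, nHB) order is kept, so that a pair and its reverse are scored separately. The function name and the toy observations are invented for illustration and are not data from the study.

```python
from collections import Counter


def directional_propensities(pairs):
    """Observed/expected ratio for each ordered (HB residue, nHB residue) pair.

    The expected count assumes the HB and nHB residues are chosen
    independently, using their marginal frequencies at each position.
    """
    pair_counts = Counter(pairs)
    hb_counts = Counter(hb for hb, _ in pairs)
    nhb_counts = Counter(nhb for _, nhb in pairs)
    total = len(pairs)
    return {
        (hb, nhb): count / (hb_counts[hb] * nhb_counts[nhb] / total)
        for (hb, nhb), count in pair_counts.items()
    }


# Invented toy observations, listed as (HB residue, nHB residue).
observed = [("Asn", "Thr"), ("Asn", "Thr"), ("Thr", "Asn"), ("Ile", "Ile")]
propensities = directional_propensities(observed)
print(propensities[("Asn", "Thr")], propensities[("Thr", "Asn")])
```

A ratio above 1 indicates a pairing seen more often than expected by chance; a real analysis would add a significance test, such as the p-value threshold quoted in the abstract, before calling a pair favoured.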

Relevance:

60.00% 60.00%

Publisher:

Abstract:

To further elucidate the role of proteases capable of cleaving N-terminal proopiomelanocortin (N-POMC)-derived peptides, we have cloned two cDNAs encoding isoforms of the airway trypsin-like protease (AT) from mouse (MAT) and rat (RAT), respectively. The open reading frames comprise 417 amino acids (aa) and 279 aa. The mouse AT gene was located at chromosome 5E1 and contains 10 exons. The longer isoform, which we designated MAT1 and RAT1, has a simple type II transmembrane protein structure, consisting of a short cytoplasmic domain, a transmembrane domain, a SEA (63-kDa sea urchin sperm protein, enteropeptidase, agrin) module, and a serine protease domain. The human homolog of MAT1 and RAT1 is the human AT (HAT). The shorter isoform, designated MAT2 and RAT2, which contains an alternative N terminus, was formerly described in the rat as adrenal secretory serine protease (AsP) and has been shown to be involved in the processing of N-POMC-derived peptides. In contrast to the long isoform, MAT2 and RAT2 (AsP) contain neither a transmembrane domain nor a SEA domain, but instead an N-terminal signal peptide that directs the enzyme to the secretory pathway. The C terminus, covering the catalytic triad, is identical in both isoforms. Immunohistochemically, MAT/RAT was predominantly expressed in tissues of the upper gastrointestinal and respiratory tracts, but also in the adrenal gland. Moreover, isoform-specific RT-PCR and quantitative PCR analysis revealed a complex expression pattern of the two isoforms with differences between mice and rats. These findings indicate a multifunctional role of these proteases beyond adrenal proliferation.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

The Maillard reaction causes changes to protein structure and occurs in foods mainly during thermal treatment. Melanoidins, the final products of the Maillard reaction, may enter the gastrointestinal tract, which is populated by different species of bacteria. In this study, melanoidins were prepared from gluten and glucose. Their effect on the growth of faecal bacteria was determined in culture with genotype and phenotype probes to identify the different species involved. Analysis of peptic and tryptic digests showed that low molecular mass products are formed from the degradation of melanoidins. Results showed a change in the growth of bacteria. This in vitro study demonstrated that melanoidins, prepared from gluten and glucose, affect the growth of the gut microflora.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Accumulation of advanced glycation end-products (AGEs) on proteins is associated with the development of diabetic complications. Although the overall extent of modification of protein by AGEs is limited, localization of these modifications at a few critical sites might have a significant effect on protein structure and function. In the present study, we describe the sites of modification of RNase by glyoxal under physiological conditions. Arg(39) and Arg(85), which are closest to the active site of the enzyme, were identified as the primary sites of formation of the glyoxal-derived dihydroxyimidazolidine and hydroimidazolone adducts. Lower amounts of modification were detected at Arg(10), while Arg(33) appeared to be unmodified. We conclude that dihydroxyimidazolidine adducts are the primary products of modification of protein by glyoxal, that Arg(39) and Arg(85) are the primary sites of modification of RNase by glyoxal, and that modification of arginine residues during Maillard reactions of proteins is a highly selective process.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

The completion of the Human Genome Project has revealed a multitude of potential avenues for the identification of therapeutic targets. Extensive sequence information enables the identification of novel genes but does not facilitate a thorough understanding of how changes in gene expression control the molecular mechanisms underlying the development and regulation of a cell or the progression of disease. Proteomics encompasses the study of proteins expressed by a population of cells, and evaluates changes in protein expression, post-translational modifications, protein interactions, protein structure and splice variants, all of which are imperative for a complete understanding of protein function within the cell. From the outset, proteomics has been used to compare the protein profiles of cells in healthy and diseased states and as such can be used to identify proteins associated with disease development and progression. These candidate proteins might provide novel targets for new therapeutic agents or aid the development of assays for disease biomarkers. This review provides an overview of the current proteomic techniques available and focuses on their application in the search for novel therapeutic targets for the treatment of disease.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

A number of new and newly improved methods for predicting protein structure developed by the Jones–University College London group were used to make predictions for the CASP6 experiment. Structures were predicted with a combination of fold recognition methods (mGenTHREADER, nFOLD, and THREADER) and a substantially enhanced version of FRAGFOLD, our fragment assembly method. Attempts at automatic domain parsing were made using DomPred and DomSSEA, which are based on a secondary structure parsing algorithm and, in the case of DomPred, also on a simple local sequence alignment scoring function. Disorder prediction was carried out using a new SVM-based version of DISOPRED. Attempts were also made at domain docking and “microdomain” folding in order to build complete chain models for some targets.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

The accurate prediction of the biochemical function of a protein is becoming increasingly important, given the unprecedented growth of both structural and sequence databanks. Consequently, computational methods are required to analyse such data in an automated manner to ensure genomes are annotated accurately. Protein structure prediction methods, for example, are capable of generating approximate structural models on a genome-wide scale. However, the detection of functionally important regions in such crude models, as well as structural genomics targets, remains an extremely important problem. The method described in the current study, MetSite, represents a fully automatic approach for the detection of metal-binding residue clusters applicable to protein models of moderate quality. The method involves using sequence profile information in combination with approximate structural data. Several neural network classifiers are shown to be able to distinguish metal sites from non-sites with a mean accuracy of 94.5%. The method was demonstrated to identify metal-binding sites correctly in LiveBench targets where no obvious metal-binding sequence motifs were detectable using InterPro. Accurate detection of metal sites was shown to be feasible for low-resolution predicted structures generated using mGenTHREADER where no side-chain information was available. High-scoring predictions were observed for a recently solved hypothetical protein from Haemophilus influenzae, indicating a putative metal-binding site.