895 resultados para sequence data mining
Resumo:
Human immunodeficiency virus type 1 (HIV-1) variants resistant to protease (PR) and reverse transcriptase (RT) inhibitors may display impaired infectivity and replication capacity. The individual contributions of mutated HIV-1 PR and RT to infectivity, replication, RT activity, and protein maturation (herein referred to as "fitness") in recombinant viruses were investigated by separately cloning PR, RT, and PR-RT cassettes from drug-resistant mutant viral isolates into the wild-type NL4-3 background. Both mutant PR and RT contributed to measurable deficits in fitness of viral constructs. In peripheral blood mononuclear cells, replication rates (means +/- standard deviations) of RT recombinants were 72.5% +/- 27.3% and replication rates of PR recombinants were 60.5% +/- 33.6% of the rates of NL4-3. PR mutant deficits were enhanced in CEM T cells, with relative replication rates of PR recombinants decreasing to 15.8% +/- 23.5% of NL4-3 replication rates. Cloning of the cognate RT improved fitness of some PR mutant clones. For a multidrug-resistant virus transmitted through sexual contact, RT constructs displayed a marked infectivity and replication deficit and diminished packaging of Pol proteins (RT content in virions diminished by 56.3% +/- 10.7%, and integrase content diminished by 23.3% +/- 18.4%), a novel mechanism for a decreased-fitness phenotype. Despite the identified impairment of recombinant clones, fitness of two of the three drug-resistant isolates was comparable to that of wild-type, susceptible viruses, suggestive of extensive compensation by genomic regions away from PR and RT. Only limited reversion of mutated positions to wild-type amino acids was observed for the native isolates over 100 viral replication cycles in the absence of drug selective pressure. These data underscore the complex relationship between PR and RT adaptive changes and viral evolution in antiretroviral drug-resistant HIV-1.
Resumo:
Résumé de la thèse L'évolution des systèmes policiers donne une place prépondérante à l'information et au renseignement. Cette transformation implique de développer et de maintenir un ensemble de processus permanent d'analyse de la criminalité, en particulier pour traiter des événements répétitifs ou graves. Dans une organisation aux ressources limitées, le temps consacré au recueil des données, à leur codification et intégration, diminue le temps disponible pour l'analyse et la diffusion de renseignements. Les phases de collecte et d'intégration restent néanmoins indispensables, l'analyse n'étant pas possible sur des données volumineuses n'ayant aucune structure. Jusqu'à présent, ces problématiques d'analyse ont été abordées par des approches essentiellement spécialisées (calculs de hot-sports, data mining, ...) ou dirigées par un seul axe (par exemple, les sciences comportementales). Cette recherche s'inscrit sous un angle différent, une démarche interdisciplinaire a été adoptée. L'augmentation continuelle de la quantité de données à analyser tend à diminuer la capacité d'analyse des informations à disposition. Un bon découpage (classification) des problèmes rencontrés permet de délimiter les analyses sur des données pertinentes. Ces classes sont essentielles pour structurer la mémoire du système d'analyse. Les statistiques policières de la criminalité devraient déjà avoir répondu à ces questions de découpage de la délinquance (classification juridique). Cette décomposition a été comparée aux besoins d'un système de suivi permanent dans la criminalité. La recherche confirme que nos efforts pour comprendre la nature et la répartition du crime se butent à un obstacle, à savoir que la définition juridique des formes de criminalité n'est pas adaptée à son analyse, à son étude. Depuis près de vingt ans, les corps de police de Suisse romande utilisent et développent un système de classification basé sur l'expérience policière (découpage par phénomène). Cette recherche propose d'interpréter ce système dans le cadre des approches situationnelles (approche théorique) et de le confronter aux données « statistiques » disponibles pour vérifier sa capacité à distinguer les formes de criminalité. La recherche se limite aux cambriolages d'habitations, un délit répétitif fréquent. La théorie des opportunités soutien qu'il faut réunir dans le temps et dans l'espace au minimum les trois facteurs suivants : un délinquant potentiel, une cible intéressante et l'absence de gardien capable de prévenir ou d'empêcher le passage à l'acte. Ainsi, le délit n'est possible que dans certaines circonstances, c'est-à-dire dans un contexte bien précis. Identifier ces contextes permet catégoriser la criminalité. Chaque cas est unique, mais un groupe de cas montre des similitudes. Par exemple, certaines conditions avec certains environnements attirent certains types de cambrioleurs. Deux hypothèses ont été testées. La première est que les cambriolages d'habitations ne se répartissent pas uniformément dans les classes formées par des « paramètres situationnels » ; la deuxième que des niches apparaissent en recoupant les différents paramètres et qu'elles correspondent à la classification mise en place par la coordination judiciaire vaudoise et le CICOP. La base de données vaudoise des cambriolages enregistrés entre 1997 et 2006 par la police a été utilisée (25'369 cas). Des situations spécifiques ont été mises en évidence, elles correspondent aux classes définies empiriquement. Dans une deuxième phase, le lien entre une situation spécifique et d'activité d'un auteur au sein d'une même situation a été vérifié. Les observations réalisées dans cette recherche indiquent que les auteurs de cambriolages sont actifs dans des niches. Plusieurs auteurs sériels ont commis des délits qui ne sont pas dans leur niche, mais le nombre de ces infractions est faible par rapport au nombre de cas commis dans la niche. Un système de classification qui correspond à des réalités criminelles permet de décomposer les événements et de mettre en place un système d'alerte et de suivi « intelligent ». Une nouvelle série dans un phénomène sera détectée par une augmentation du nombre de cas de ce phénomène, en particulier dans une région et à une période donnée. Cette nouvelle série, mélangée parmi l'ensemble des délits, ne serait pas forcément détectable, en particulier si elle se déplace. Finalement, la coopération entre les structures de renseignement criminel opérationnel en Suisse romande a été améliorée par le développement d'une plateforme d'information commune et le système de classification y a été entièrement intégré.
Resumo:
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits. isb-sib.ch).
Resumo:
STAT transcription factors are expressed in many cell types and bind to similar sequences. However, different STAT gene knock-outs show very distinct phenotypes. To determine whether differences between the binding specificities of STAT proteins account for these effects, we compared the sequences bound by STAT1, STAT5A, STAT5B, and STAT6. One sequence set was selected from random oligonucleotides by recombinant STAT1, STAT5A, or STAT6. For another set including many weak binding sites, we quantified the relative affinities to STAT1, STAT5A, STAT5B, and STAT6. We compared the results to the binding sites in natural STAT target genes identified by others. The experiments confirmed the similar specificity of different STAT proteins. Detailed analysis indicated that STAT5A specificity is more similar to that of STAT6 than that of STAT1, as expected from the evolutionary relationships. The preference of STAT6 for sites in which the half-palindromes (TTC) are separated by four nucleotides (N(4)) was confirmed, but analysis of weak binding sites showed that STAT6 binds fairly well to N(3) sites. As previously reported, STAT1 and STAT5 prefer N(3) sites; however, STAT5A, but not STAT1, weakly binds N(4) sites. None of the STATs bound to half-palindromes. There were no specificity differences between STAT5A and STAT5B.
Resumo:
New anti-cancer agents are being developed that specifically recognise tumour cells. Recognition is dependent upon the enhanced expression of antigenic determinants on the surface of tumour cells. The tumour exposure and the extracellular accessibility of the mucin MUC-1 make this marker a suitable target for tumour diagnosis and therapy. We isolated and characterised six human scFv antibody fragments that bound to the MUC-1 core protein, by selecting a large naive human phage display library directly on a MUC-1-expressing breast carcinoma cell line. Their binding characteristics have been studied by ELISA, FACS and indirect immunofluorescence. The human scFv antibody fragments were specific for the tandem repeat region of MUC-1 and their binding is inhibited by soluble antigen. Four human scFv antibody fragments (M2, M3, M8, M12) recognised the hydrophilic PDTRP region of the MUC-1 core protein, which is thought to be an immunodominant region. The human scFv antibody fragments were stable in human serum at 37 degrees C and retained their binding specificity. For imaging or targeting to tumours over-expressing MUC-1, it might be feasible to use these human scFv, or multivalent derivatives, as vehicles to deliver anti-cancer agents.
Resumo:
Ligands of the tumor necrosis factor superfamily (TNFSF) (4-1BBL, APRIL, BAFF, CD27L, CD30L, CD40L, EDA1, EDA2, FasL, GITRL, LIGHT, lymphotoxin alpha, lymphotoxin alphabeta, OX40L, RANKL, TL1A, TNF, TWEAK, and TRAIL) bind members of the TNF receptor superfamily (TNFRSF). A comprehensive survey of ligand-receptor interactions was performed using a flow cytometry-based assay. All ligands engaged between one and five receptors, whereas most receptors only bound one to three ligands. The receptors DR6, RELT, TROY, NGFR, and mouse TNFRH3 did not interact with any of the known TNFSF ligands, suggesting that they either bind other types of ligands, function in a ligand-independent manner, or bind ligands that remain to be identified. The study revealed that ligand-receptor pairs are either cross-reactive between human and mouse (e.g. Tweak/Fn14, RANK/RANKL), strictly species-specific (GITR/GITRL), or partially species-specific (e.g. OX40/OX40L, CD40/CD40L). Interestingly, the receptor binding patterns of lymphotoxin alpha and alphabeta are redundant in the human but not in the mouse system. Ligand oligomerization allowed detection of weak interactions, such as that of human TNF with mouse TNFR2. In addition, mouse APRIL exists as two different splice variants differing by a single amino acid. Although human APRIL does not interact with BAFF-R, the shorter variant of mouse APRIL exhibits weak but detectable binding to mouse BAFF-R.
Resumo:
The ribonucleotide reductase gene tandem bnrdE/bnrdF in SPbeta-related prophages of different Bacillus spp. isolates presents different configurations of intervening sequences, comprising one to three of six non-homologous splicing elements. Insertion sites of group I introns and intein DNA are clustered in three relatively short segments encoding functionally important domains of the ribonucleotide reductase. Comparison of the bnrdE homologs reveals mutual exclusion of a group I intron and an intein coding sequence flanking the codon that specifies a conserved cysteine. In vivo splicing was demonstrated for all introns. However, for two of them a part of the mRNA precursor molecules remains unspliced. Intergenic bnrdE-bnrdF regions are unexpectedly long, comprising between 238 and 541 nt. The longest encodes a putative polypeptide related to HNH homing endonucleases.
Resumo:
Human Fas ligand (L) (CD95L) and tumor necrosis factor (TNF)-alpha undergo metalloproteinase-mediated proteolytic processing in their extracellular domains resulting in the release of soluble trimeric ligands (soluble [s]FasL, sTNF-alpha) which, in the case of sFasL, is thought to be implicated in diseases such as hepatitis and AIDS. Here we show that the processing of sFasL occurs between Ser126 and Leu127. The apoptotic-inducing capacity of naturally processed sFasL was reduced by >1,000-fold compared with membrane-bound FasL, and injection of high doses of recombinant sFasL in mice did not induce liver failure. However, soluble FasL retained its capacity to interact with Fas, and restoration of its cytotoxic activity was achieved both in vitro and in vivo with the addition of cross-linking antibodies. Similarly, the marginal apoptotic activity of recombinant soluble TNF-related apoptosis-inducing ligand (sTRAIL), another member of the TNF ligand family, was greatly increased upon cross-linking. These results indicate that the mere trimerization of the Fas and TRAIL receptors may not be sufficient to trigger death signals. Thus, the observation that sFasL is less cytotoxic than membrane-bound FasL may explain why in certain types of cancer, systemic tissue damage is not detected, even though the levels of circulating sFasL are high.
Resumo:
Glucagon-like peptide 1 (GLP-1) is a hormone derived from the preproglucagon molecule and is secreted by intestinal L cells. It is the most potent stimulator of glucose-induced insulin secretion and also suppresses in vivo acid secretion by gastric glands. A cDNA for the GLP-1 receptor was isolated by transient expression of a rat pancreatic islet cDNA library into COS cells; this was followed by binding of radiolabeled GLP-1 and screening by photographic emulsion autoradiography. The receptor transfected into COS cells binds GLP-1 with high affinity and is coupled to activation of adenylate cyclase. The receptor binds specifically GLP-1 and does not bind peptides of related structure and similar function, such as glucagon, gastric inhibitory peptide, vasoactive intestinal peptide, or secretin. The receptor is 463 amino acids long and contains seven transmembrane domains. Sequence homology is found only with the receptors for secretin, calcitonin, and parathyroid hormone, which form a newly characterized family of G-coupled receptors.
Resumo:
Over the past decades, several sensitive post-electrophoretic stains have been developed for an identification of proteins in general, or for a specific detection of post-translational modifications such as phosphorylation, glycosylation or oxidation. Yet, for a visualization and quantification of protein differences, the differential two-dimensional gel electrophoresis, termed DIGE, has become the method of choice for a detection of differences in two sets of proteomes. The goal of this review is to evaluate the use of the most common non-covalent and covalent staining techniques in 2D electrophoresis gels, in order to obtain maximal information per electrophoresis gel and for an identification of potential biomarkers. We will also discuss the use of detergents during covalent labeling, the identification of oxidative modifications and review influence of detergents on finger prints analysis and MS/MS identification in relation to 2D electrophoresis.
Resumo:
Neuronal development is the result of a multitude of neural migrations, which require extensive cell-cell communication. These processes are modulated by extracellular matrix components, such as heparan sulfate (HS) polysaccharides. HS is molecularly complex as a result of nonrandom modifications of the sugar moieties, including sulfations in specific positions. We report here mutations in HS 6-O-sulfotransferase 1 (HS6ST1) in families with idiopathic hypogonadotropic hypogonadism (IHH). IHH manifests as incomplete or absent puberty and infertility as a result of defects in gonadotropin-releasing hormone neuron development or function. IHH-associated HS6ST1 mutations display reduced activity in vitro and in vivo, suggesting that HS6ST1 and the complex modifications of extracellular sugars are critical for normal development in humans. Genetic experiments in Caenorhabditis elegans reveal that HS cell-specifically regulates neural branching in vivo in concert with other IHH-associated genes, including kal-1, the FGF receptor, and FGF. These findings are consistent with a model in which KAL1 can act as a modulatory coligand with FGF to activate the FGF receptor in an HS-dependent manner.
Resumo:
The siderophore pyochelin of Pseudomonas aeruginosa is derived from one molecule of salicylate and two molecules of cysteine. Two cotranscribed genes, pchEF, encoding peptide synthetases have been identified and characterized. pchE was required for the conversion of salicylate to dihydroaeruginoate (Dha), the condensation product of salicylate and one cysteine residue and pchF was essential for the synthesis of pyochelin from Dha. The deduced PchE (156 kDa) and PchF (197 kDa) proteins had adenylation, thiolation and condensation/cyclization motifs arranged as modules which are typical of those peptide synthetases forming thiazoline rings. The pchEF genes were coregulated with the pchDCBA operon, which provides enzymes for the synthesis (PchBA) and activation (PchD) of salicylate as well as a putative thioesterase (PchC). Expression of a translational pchE'-'lacZ fusion was strictly dependent on the PchR regulator and was induced by extracellular pyochelin, the end product of the pathway. Iron replete conditions led to Fur (ferric uptake regulator)-dependent repression of the pchE'-'lacZ fusion. A translational pchD'-'lacZ fusion was also positively regulated by PchR and pyochelin and repressed by Fur and iron. Thus, autoinduction by pyochelin (or ferric pyochelin) and repression by iron ensure a sensitive control of the pyochelin pathway in P. aeruginosa.
Resumo:
Somatic copy number aberrations (CNA) represent a mutation type encountered in the majority of cancer genomes. Here, we present the 2014 edition of arrayMap (http://www.arraymap.org), a publicly accessible collection of pre-processed oncogenomic array data sets and CNA profiles, representing a vast range of human malignancies. Since the initial release, we have enhanced this resource both in content and especially with regard to data mining support. The 2014 release of arrayMap contains more than 64,000 genomic array data sets, representing about 250 tumor diagnoses. Data sets included in arrayMap have been assembled from public repositories as well as additional resources, and integrated by applying custom processing pipelines. Online tools have been upgraded for a more flexible array data visualization, including options for processing user provided, non-public data sets. Data integration has been improved by mapping to multiple editions of the human reference genome, with the majority of the data now being available for the UCSC hg18 as well as GRCh37 versions. The large amount of tumor CNA data in arrayMap can be freely downloaded by users to promote data mining projects, and to explore special events such as chromothripsis-like genome patterns.
Resumo:
We present a novel steered molecular dynamics scheme to induce the dissociation of large protein-protein complexes. We apply this scheme to study the interaction of a T cell receptor (TCR) with a major histocompatibility complex (MHC) presenting a peptide (p). Two TCR-pMHC complexes are considered, which only differ by the mutation of a single amino acid on the peptide; one is a strong agonist that produces T cell activation in vivo, while the other is an antagonist. We investigate the interaction mechanism from a large number of unbinding trajectories by analyzing van der Waals and electrostatic interactions and by computing energy changes in proteins and solvent. In addition, dissociation potentials of mean force are calculated with the Jarzynski identity, using an averaging method developed for our steering scheme. We analyze the convergence of the Jarzynski exponential average, which is hampered by the large amount of dissipative work involved and the complexity of the system. The resulting dissociation free energies largely underestimate experimental values, but the simulations are able to clearly differentiate between wild-type and mutated TCR-pMHC and give insights into the dissociation mechanism.