30 resultados para Distant supervision
em Indian Institute of Science - Bangalore - Índia
Resumo:
Distant repeats between a pair of protein sequences can be exploited to study the various aspects of proteins such as structure-function relationship, disorders due to protein malfunction, evolutionary analysis, etc. An in-depth analysis of the distant repeats would facilitate to establish a stable evolutionary relation of the repeats with respect to their three-dimensional structure. To this effect, an algorithm has been devised to identify the distant repeats in a pair of protein sequences by essentially using the scores of PAM (Percent Accepted Mutation) matrices. The proposed algorithm will be of much use to researchers involved in the comparative study of various organisms based on the amino-acid repeats in protein sequences. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
Lack of supervision in clustering algorithms often leads to clusters that are not useful or interesting to human reviewers. We investigate if supervision can be automatically transferred for clustering a target task, by providing a relevant supervised partitioning of a dataset from a different source task. The target clustering is made more meaningful for the human user by trading-off intrinsic clustering goodness on the target task for alignment with relevant supervised partitions in the source task, wherever possible. We propose a cross-guided clustering algorithm that builds on traditional k-means by aligning the target clusters with source partitions. The alignment process makes use of a cross-task similarity measure that discovers hidden relationships across tasks. When the source and target tasks correspond to different domains with potentially different vocabularies, we propose a projection approach using pivot vocabularies for the cross-domain similarity measure. Using multiple real-world and synthetic datasets, we show that our approach improves clustering accuracy significantly over traditional k-means and state-of-the-art semi-supervised clustering baselines, over a wide range of data characteristics and parameter settings.
Resumo:
Background: In the post-genomic era where sequences are being determined at a rapid rate, we are highly reliant on computational methods for their tentative biochemical characterization. The Pfam database currently contains 3,786 families corresponding to ``Domains of Unknown Function'' (DUF) or ``Uncharacterized Protein Family'' (UPF), of which 3,087 families have no reported three-dimensional structure, constituting almost one-fourth of the known protein families in search for both structure and function. Results: We applied a `computational structural genomics' approach using five state-of-the-art remote similarity detection methods to detect the relationship between uncharacterized DUFs and domain families of known structures. The association with a structural domain family could serve as a start point in elucidating the function of a DUF. Amongst these five methods, searches in SCOP-NrichD database have been applied for the first time. Predictions were classified into high, medium and low-confidence based on the consensus of results from various approaches and also annotated with enzyme and Gene ontology terms. 614 uncharacterized DUFs could be associated with a known structural domain, of which high confidence predictions, involving at least four methods, were made for 54 families. These structure-function relationships for the 614 DUF families can be accessed on-line at http://proline.biochem.iisc.ernet.in/RHD_DUFS/. For potential enzymes in this set, we assessed their compatibility with the associated fold and performed detailed structural and functional annotation by examining alignments and extent of conservation of functional residues. Detailed discussion is provided for interesting assignments for DUF3050, DUF1636, DUF1572, DUF2092 and DUF659. Conclusions: This study provides insights into the structure and potential function for nearly 20 % of the DUFs. Use of different computational approaches enables us to reliably recognize distant relationships, especially when they converge to a common assignment because the methods are often complementary. We observe that while pointers to the structural domain can offer the right clues to the function of a protein, recognition of its precise functional role is still `non-trivial' with many DUF domains conserving only some of the critical residues. It is not clear whether these are functional vestiges or instances involving alternate substrates and interacting partners. Reviewers: This article was reviewed by Drs Eugene Koonin, Frank Eisenhaber and Srikrishna Subramanian.
Resumo:
The hydromagnetic spinup or spindown of an incompressible, rotating, electrically conducting fluid over an infinite insulated disk with an applied magnetic field is studied when the impulsive motion is imparted either to the fluid or to the disk. The nonlinear partial differential equations governing the flow are solved numerically using an implicit finite-difference scheme. It is found that the spinup (or spindown) time due to impulsive motion of the disk is much shorter than the spinup (or spindown) time due to the impulsive motion of the distant fluid. The spinup (or spindown) time for the hydromagnetic case is comparatively smaller than the corresponding nonmagnetic case. Spindown is not merely a mirror reflection of spinup. Physics of Fluids is copyrighted by The American Institute of Physics.
Resumo:
The product of the bglG gene of Escherichia coli was among the first bacterial antiterminators to be identified and characterized. Since the elucidation ten years ago of its role in the regulation of the bgl operon of E. coli,a large number of homologies have been discovered in both Gram-positive and Gram-negative bacteria. Often the homologues of BglG in other organisms are also involved in regulating β-glucoside utilization. Surprisingly, in many cases, they mediate antitermination to regulate a variety of other catabolic functions. Because of the high degree of conservation of the cis-acting regulatory elements, antiterminators from one organism can function in another. Generally the antiterminator protein itself is negatively regulated by phosphorylation by a component of the phosphotransferase system. This family of proteins thus represents a highly evolved regulatory system that is conserved across evolutionarily distant genuses.
Resumo:
A measure of stability of a given epitope is an important parameter in the exploration of the utility of a desired MAb. It defines the conditions necessary for using MAbs as an investigative tool in several research methodologies and therapeutic protocols. Despite these obvious interests the lack of simple and rapid assay systems for quantitating MAb-Ag interactions has largely hampered these studies. A single step MAb-Solid Phase Radioimmunoassay (SS-SPRIA), is described which eliminates errors that may arise with multistep sandwich assays. SS-SPRIA has been used to demonstrate the differential stability of the assembled epitopes on gonadotropins. Differential stability towards specific reagents can be exploited to identify aminoacid residues at the epitopic site. Inactivation of an epitopic region is indicative of the presence of the group modified, provided conformational relaxations are not induced due to modifications at distant sites. Here we provide evidence to validate these conclusions.
Resumo:
The crystal structure of copper ammonium oxalate dihydrate (space group P1̃) has been derived from a refinement of the two-dimensional (hk0) and (0kl) x-ray data using the atomic coordinateis of the isomorphous salt CuK 2(C2O4)2.2H2O as the starting point of the analysis. In contrast to the chromium complexes of oxalic acid the C-C bonds in both the two nonequivalent oxalate ions in the unit cell are single bonds (1.58 and 1.61 Å) consistent with the conclusion of Jeffrey and Parry that the carboxyl groups of the oxalate ion are separated by a pure a bond with little or no π conjugation across the molecule. Both the oxalate ions are slightly nonplanar. The copper ions occupy the special positions (0, 0, 0) and 0, 1/2, 0) and their coordination is of the distorted octahedral type with four nearest oxygen neighbors ( ≃ 2 Å) at the corners of a square and two more distant atoms along the octahedral bond direction. The environment of the NH4+ ions consists of eight nearest oxygen atoms at a mean distance of 3 Å.
Resumo:
Overexpression of the epidermal growth factor receptor family genes, which include ErbB-1, 2, 3 and 4, has been implicated in a number of cancers. We have studied the extent of ErbB-2 overexpression among Indian women with sporadic breast cancer. Methods: Immmunohistochemistry and genomic polymerase chain reaction (PCR) were used to study the ErbB2 overexpression. ErbB2 status was correlated with other clinico-pathological parameters, including patient survival. Results: ErbB-2 overexpression was detected in 43.2% (159/368) of the cases by immunohistochemistry. For a sub-set of patients (n = 55) for whom total DNA was available, ErbB-2 gene amplification was detected in 25.5% (14/55) of the cases by genomic PCR. While the ErbB2 overexpression was significantly higher in patients with lymphnode (χ2 = 12.06, P≤ 0.001), larger tumor size (χ2 = 8.22, P = 0.042) and ductal carcinoma (χ2 = 15.42, P ≤ 0.001), it was lower in patients with disease-free survival (χ2 = 22.13, P ≤ 0.001). Survival analysis on a sub-set of patients for whom survival data were available (n = 179) revealed that ErbB-2 status (χ2 =25.94, P ≤ 0.001), lymphnode status (χ2 = 12.68, P ≤ 0.001), distant metastasis (χ2 = 19.49, P ≤ 0.001) and stage of the disease (χ2 = 28.04, P ≤0.001) were markers of poor prognosis. Conclusions: ErbB-2 overexpression was significantly greater compared with the Western literature, but comparable to other Indian studies. Significant correlation was found between ErbB-2 status and lymphnode status, tumor size and ductal carcinoma. ErbB-2 status, lymph node status, distant metastasis and stage of the disease were found to be prognostic indicators.
Resumo:
Owing to high evolutionary divergence, it is not always possible to identify distantly related protein domains by sequence search techniques. Intermediate sequences possess sequence features of more than one protein and facilitate detection of remotely related proteins. We have demonstrated recently the employment of Cascade PSI-BLAST where we perform PSI-BLAST for many 'generations', initiating searches from new homologues as well. Such a rigorous propagation through generations of PSI-BLAST employs effectively the role of intermediates in detecting distant similarities between proteins. This approach has been tested on a large number of folds and its performance in detecting superfamily level relationships is similar to 35% better than simple PSI-BLAST searches. We present a web server for this search method that permits users to perform Cascade PSI-BLAST searches against the Pfam, SCOP and SwissProt databases. The URL for this server is http://crick.mbu.iisc.ernet.in/similar to CASCADE/CascadeBlast.html.
Resumo:
Diabetes is a chronic disease requiring continuous medical supervision and patient education to prevent acute secondary complications. In this study, we have harnessed the inherent property of insulin to aggregate into an oligomeric intermediate on the pathway to amyloid formation, to generate a form that exhibits controlled and sustained release for extended periods. Administration of a single dose of the insulin oligomer, defined here as the supramolecular insulin assembly II (SIA-II), to experimental animals rendered diabetic by streptozotocin or alloxan, released the hormone capable of maintaining physiologic glucose levels for > 120 days for bovine and > 140 days for recombinant human insulin without fasting hypoglycemia. Moreover, the novel SIA-II described here not only improved the glycemic control, but also reduced the extent of secondary diabetic complications.
Resumo:
The energy input to giant molecular clouds is recalculated, using the proper linearized equations of motion, including the Coriolis force and allowing for changes in the guiding center. Perturbation theory yields a result in the limit of distant encounters and small initial epicyclic amplitudes. Direct integration of the motion equations allows the strong encounter regime to be studied. The present perturbation theory result differs by a factor of order unity from that of Jog and Ostriker (1988). The result of present numerical integrations for the 2D (planar) velocity dispersion is presented. The accretion rate for a molecular cloud in the Galactic disk is calculated.
Resumo:
In a typical sensor network scenario a goal is to monitor a spatio-temporal process through a number of inexpensive sensing nodes, the key parameter being the fidelity at which the process has to be estimated at distant locations. We study such a scenario in which multiple encoders transmit their correlated data at finite rates to a distant, common decoder over a discrete time multiple access channel under various side information assumptions. In particular, we derive an achievable rate region for this communication problem.
Resumo:
We have consider ed the transient motion of art electrically conducting viscous compressible fluid which is in contact with an insulated infinite disk. The initial motion is considered to be due to the uniform rotation of the disk in an otherwise stationary fluid or due to the uniform rigid rotation of the fluid over a stationary disk. Different cases of transient motion due to finite impulse imparted either to the disk or to the distant fluid have been investigated. Effects of the imposed axial magnetic field and the disk temperature on the transient flow are included. The nonlinear partial differential equations governing the motion are solved numerically using an implicit finite-difference scheme along with the Newton's linearisation technique.
Resumo:
Photoluminescence (PL) of high quality GaN epitaxial layer grown on beta-Si3N4/Si (1 1 1) substrate using nitridation-annealing-nitridation method by plasma-assisted molecular beam epitaxy (PA-MBE) was investigated in the range of 5-300 K. Crystallinity of GaN epilayers was evaluated by high resolution X-ray diffraction (HRXRD) and surface morphology by Atomic Force Microscopy (AFM) and high resolution scanning electron microscopy (HRSEM). The temperature-dependent photoluminescence spectra showed an anomalous behaviour with an `S-like' shape of free exciton (FX) emission peaks. Distant shallow donor-acceptor pair (DAP) line peak at approximately 3.285 eV was also observed at 5 K, followed by LO replica sidebands separated by 91 meV. The activation energy of the free exciton for GaN epilayers was also evaluated to be similar to 27.8 +/- 0.7 meV from the temperature-dependent PL studies. Low carrier concentrations were observed similar to 4.5 +/- 2 x 10(17) Cm-3 by measurements and it indicates the silicon nitride layer, which not only acts as a growth buffer layer, but also effectively prevents Si diffusion from the substrate to GaN epilayers. The absence of yellow band emission at around 2.2 eV signifies the high quality of film. The tensile stress in GaN film calculated by the thermal stress model agrees very well with that derived from Raman spectroscopy. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
NSP3, an acidic nonstructural protein, encoded by gene 7 has been implicated as the key player in the assembly of the 11 viral plus-strand RNAs into the early replication intermediates during rotavirus morphogenesis. To date, the sequence or NSP3 from only three animal rotaviruses (SA11, SA114F, and bovine UK) has been determined and that from a human strain has not been reported. To determine the genetic diversity among gene 7 alleles from group A rotaviruses, the nucleotide sequence of the NSP3 gene from 13 strains belonging to nine different G serotypes, from both humans and animals, has been determined. Based on the amino acid sequence identity as well as phylogenetic analysis, NSP3 from group A rotaviruses falls into three evolutionarily related groups, i.e., the SA11 group, the Wa group, and the S2 group. The SA 11/SA114F gene appears to have a distant ancestral origin from that of the others and codes for a polypeptide of 315 amino acids (aa) in length. NSP3 from all other group A rotaviruses is only 313 aa in length because of a 2-amino-acid deletion near the carboxy-terminus, While the SA114F gene has the longest 3' untranslated region (UTR) of 132 nucleotides, that from other strains suffered deletions of varying lengths at two positions downstream of the translational termination codon. In spite of the divergence of the nucleotide (nt) sequence in the protein coding region, a stretch of about 80 nt in the 3' UTR is highly conserved in the NSP3 gene from all the strains. This conserved sequence in the 3' UTR might play an important role in the regulation of expression of the NSP3 gene. (C) 1995 Academic Press, Inc.