973 resultados para similarity


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background: In the post-genomic era where sequences are being determined at a rapid rate, we are highly reliant on computational methods for their tentative biochemical characterization. The Pfam database currently contains 3,786 families corresponding to ``Domains of Unknown Function'' (DUF) or ``Uncharacterized Protein Family'' (UPF), of which 3,087 families have no reported three-dimensional structure, constituting almost one-fourth of the known protein families in search for both structure and function. Results: We applied a `computational structural genomics' approach using five state-of-the-art remote similarity detection methods to detect the relationship between uncharacterized DUFs and domain families of known structures. The association with a structural domain family could serve as a start point in elucidating the function of a DUF. Amongst these five methods, searches in SCOP-NrichD database have been applied for the first time. Predictions were classified into high, medium and low-confidence based on the consensus of results from various approaches and also annotated with enzyme and Gene ontology terms. 614 uncharacterized DUFs could be associated with a known structural domain, of which high confidence predictions, involving at least four methods, were made for 54 families. These structure-function relationships for the 614 DUF families can be accessed on-line at http://proline.biochem.iisc.ernet.in/RHD_DUFS/. For potential enzymes in this set, we assessed their compatibility with the associated fold and performed detailed structural and functional annotation by examining alignments and extent of conservation of functional residues. Detailed discussion is provided for interesting assignments for DUF3050, DUF1636, DUF1572, DUF2092 and DUF659. Conclusions: This study provides insights into the structure and potential function for nearly 20 % of the DUFs. Use of different computational approaches enables us to reliably recognize distant relationships, especially when they converge to a common assignment because the methods are often complementary. We observe that while pointers to the structural domain can offer the right clues to the function of a protein, recognition of its precise functional role is still `non-trivial' with many DUF domains conserving only some of the critical residues. It is not clear whether these are functional vestiges or instances involving alternate substrates and interacting partners. Reviewers: This article was reviewed by Drs Eugene Koonin, Frank Eisenhaber and Srikrishna Subramanian.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

An energy approach within the framework of thermodynamics is used to model the fatigue process in plain concrete. Fatigue crack growth is an irreversible process associated with an irreversible entropy gain. A closed-form expression for entropy generated during fatigue in terms of energy dissipated is derived using principles of dimensional analysis and self-similarity. An increase in compliance is considered as a measure of damage accumulated during fatigue. The entropy at final fatigue failure is shown to be independent of loading and geometry and is proposed as a material property. A relationship between energy dissipated and number of cycles of fatigue loading is obtained. (C) 2015 American Society of Civil Engineers.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Clustering techniques which can handle incomplete data have become increasingly important due to varied applications in marketing research, medical diagnosis and survey data analysis. Existing techniques cope up with missing values either by using data modification/imputation or by partial distance computation, often unreliable depending on the number of features available. In this paper, we propose a novel approach for clustering data with missing values, which performs the task by Symmetric Non-Negative Matrix Factorization (SNMF) of a complete pair-wise similarity matrix, computed from the given incomplete data. To accomplish this, we define a novel similarity measure based on Average Overlap similarity metric which can effectively handle missing values without modification of data. Further, the similarity measure is more reliable than partial distances and inherently possesses the properties required to perform SNMF. The experimental evaluation on real world datasets demonstrates that the proposed approach is efficient, scalable and shows significantly better performance compared to the existing techniques.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we propose a technique for video object segmentation using patch seams across frames. Typically, seams, which are connected paths of low energy, are utilised for retargeting, where the primary aim is to reduce the image size while preserving the salient image contents. Here, we adapt the formulation of seams for temporal label propagation. The energy function associated with the proposed video seams provides temporal linking of patches across frames, to accurately segment the object. The proposed energy function takes into account the similarity of patches along the seam, temporal consistency of motion and spatial coherency of seams. Label propagation is achieved with high fidelity in the critical boundary regions, utilising the proposed patch seams. To achieve this without additional overheads, we curtail the error propagation by formulating boundary regions as rough-sets. The proposed approach out-perform state-of-the-art supervised and unsupervised algorithms, on benchmark datasets.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Motivated by multi-distribution divergences, which originate in information theory, we propose a notion of `multipoint' kernels, and study their applications. We study a class of kernels based on Jensen type divergences and show that these can be extended to measure similarity among multiple points. We study tensor flattening methods and develop a multi-point (kernel) spectral clustering (MSC) method. We further emphasize on a special case of the proposed kernels, which is a multi-point extension of the linear (dot-product) kernel and show the existence of cubic time tensor flattening algorithm in this case. Finally, we illustrate the usefulness of our contributions using standard data sets and image segmentation tasks.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Image inpainting is the process of filling the unwanted region in an image marked by the user. It is used for restoring old paintings and photographs, removal of red eyes from pictures, etc. In this paper, we propose an efficient inpainting algorithm which takes care of false edge propagation. We use the classical exemplar based technique to find out the priority term for each patch. To ensure that the edge content of the nearest neighbor patch found by minimizing L-2 distance between patches, we impose an additional constraint that the entropy of the patches be similar. Entropy of the patch acts as a good measure of edge content. Additionally, we fill the image by considering overlapping patches to ensure smoothness in the output. We use structural similarity index as the measure of similarity between ground truth and inpainted image. The results of the proposed approach on a number of examples on real and synthetic images show the effectiveness of our algorithm in removing objects and thin scratches or text written on image. It is also shown that the proposed approach is robust to the shape of the manually selected target. Our results compare favorably to those obtained by existing techniques

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A lectin from phloem exudates of Luffa acutangula (ridge gourd) was purified on chitin affinity chromatography and characterized for its amino acid sequence and to study the role of tryptophan in its activity. The purified lectin was subjected to various proteolytic digestions, and the resulting peptides were analyzed by liquid chromatography coupled electrospray ionization ion trap mass spectrometer. The peptide precursor ions were fragmented by collision-induced dissociation or electron transfer dissociation experiments, and a manual interpretation of MS/MS was performed to deduce amino acid sequence. This gave rise to almost complete sequence coverage of the lectin which showed high-sequence similarity with deduced sequences of phloem lectins present in the database. Chemical modification of lysine, tyrosine, histidine, arginine, aspartic acid, and glutamic acid residues did not inhibit the hemagglutinating activity. However, the modification of tryptophan residues using N-bromosuccinimide showed the loss of hemagglutinating activity. Additionally, the mapping of tryptophan residues was performed to determine the extent and number of residues modified, which revealed that six residues per molecule were oxidized suggesting their accessibility. The retention of the lectin activity was seen when the modifications were performed in the presence of chitooligosaccharides due to protection of a tryptophan residue (W-102) in the protein. These studies taken together have led to the identification of a particular tryptophan residue (W-102) in the activity of the lectin. (c) 2015 IUBMB Life, 67(12):943-953, 2015

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we try to establish the equivalence or similarity in the thermal and physiochemical changes in precursor droplets (cerium nitrate) in convective and radiative fields. The radiative field is created through careful heating of the droplet using a monochromatic light source (CO2 laser). The equivalence is also established for different modes of convection like droplet injected into a high-speed flow and droplet experiencing a convective flow due to acoustic streaming (levitated) only. The thermophysical changes are studied in an aqueous cerium nitrate droplet, and the dissociation of cerium nitrate to ceria is modeled using modified Kramers' reaction rate formulation. It is observed that vaporization, species accumulation, and chemical characteristics obtained in a convectively heated droplet are retained in a radiatively heated droplet by careful adjustment of the laser intensity. The timescales and ceria yield match reasonably well for both the cases. It is also noted that similar conclusions are drawn in both levitated droplet and a nonlevitated droplet.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Eleven coupled model intercomparison project 3 based global climate models are evaluated for the case study of Upper Malaprabha catchment, India for precipitation rate. Correlation coefficient, normalised root mean square deviation, and skill score are considered as performance indicators for evaluation in fuzzy environment and assumed to have equal impact on the global climate models. Fuzzy technique for order preference by similarity to an ideal solution is used to rank global climate models. Top three positions are occupied by MIROC3, GFDL2.1 and GISS with relative closeness of 0.7867, 0.7070, and 0.7068. IPSL-CM4, NCAR-PCMI occupied the tenth and eleventh positions with relative closeness of 0.4959 and 0.4562.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We develop a new dictionary learning algorithm called the l(1)-K-svp, by minimizing the l(1) distortion on the data term. The proposed formulation corresponds to maximum a posteriori estimation assuming a Laplacian prior on the coefficient matrix and additive noise, and is, in general, robust to non-Gaussian noise. The l(1) distortion is minimized by employing the iteratively reweighted least-squares algorithm. The dictionary atoms and the corresponding sparse coefficients are simultaneously estimated in the dictionary update step. Experimental results show that l(1)-K-SVD results in noise-robustness, faster convergence, and higher atom recovery rate than the method of optimal directions, K-SVD, and the robust dictionary learning algorithm (RDL), in Gaussian as well as non-Gaussian noise. For a fixed value of sparsity, number of dictionary atoms, and data dimension, l(1)-K-SVD outperforms K-SVD and RDL on small training sets. We also consider the generalized l(p), 0 < p < 1, data metric to tackle heavy-tailed/impulsive noise. In an image denoising application, l(1)-K-SVD was found to result in higher peak signal-to-noise ratio (PSNR) over K-SVD for Laplacian noise. The structural similarity index increases by 0.1 for low input PSNR, which is significant and demonstrates the efficacy of the proposed method. (C) 2015 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this letter, we submit our comment on the following recently published papers by Kalidas Das: (1) ``Influence of chemical reaction and viscous dissipation on MHD mixed convection flow,'' Journal of Mechanical Science and Technology 28 (5) (2014) 1881-1885; and (2) ``Cu-water nanofluid flow and heat transfer over a shrinking sheet,'' Journal of Mechanical Science and Technology 28 (12) (2014) 5089-5094. The authors attempt to present the similarity solutions in both papers. We comment that the similarity transformations considered in Refs. 1, 2] are incorrect. Thus, the results presented by Kalidas Das lead to invalid conclusions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Antifolates are competitive inhibitors of dihydrofolate reductase ( DHFR), a conserved enzyme that is central to metabolism and widely targeted in pathogenic diseases, cancer and autoimmune disorders. Although most clinically used antifolates are known to be target specific, some display a fair degree of cross-reactivity with DHFRs from other species. A method that enables identification of determinants of affinity and specificity in target DHFRs from different species and provides guidelines for the design of antifolates is currently lacking. To address this, we first captured the potential druggable space of a DHFR in a substructure called the `supersite' and classified supersites of DHFRs from 56 species into 16 `site-types' based on pairwise structural similarity. Analysis of supersites across these site-types revealed that DHFRs exhibit varying extents of dissimilarity at structurally equivalent positions in and around the binding site. We were able to explain the pattern of affinities towards chemically diverse antifolates exhibited by DHFRs of different site-types based on these structural differences. We then generated an antifolate-DHFR network by mapping known high-affinity antifolates to their respective supersites and used this to identify antifolates that can be repurposed based on similarity between supersites or antifolates. Thus, we identified 177 human-specific and 458 pathogen-specific antifolates, a large number of which are supported by available experimental data. Thus, in the light of the clinical importance of DHFR, we present a novel approach to identifying differences in the druggable space of DHFRs that can be utilized for rational design of antifolates.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We propose a completely automatic approach for recognizing low resolution face images captured in uncontrolled environment. The approach uses multidimensional scaling to learn a common transformation matrix for the entire face which simultaneously transforms the facial features of the low resolution and the high resolution training images such that the distance between them approximates the distance had both the images been captured under the same controlled imaging conditions. Stereo matching cost is used to obtain the similarity of two images in the transformed space. Though this gives very good recognition performance, the time taken for computing the stereo matching cost is significant. To overcome this limitation, we propose a reference-based approach in which each face image is represented by its stereo matching cost from a few reference images. Experimental evaluation on the real world challenging databases and comparison with the state-of-the-art super-resolution, classifier based and cross modal synthesis techniques show the effectiveness of the proposed algorithm.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Direct numerical simulation is carried out for a spatially evolving supersonic turbulent boundary layer at free-stream Mach number 6. To overcome numerical instability, the seventh-order WENO scheme is used for the convection terms of Navier-Stokes equations, and fine mesh is adopted to minimize numerical dissipation. Compressibilty effects on the near-wall turbulent kinetic energy budget are studied. The cross-stream extended self-similarity and scaling exponents including the near-wall region are studied. In high Mach number flows, the coherence vortex structures are arranged to be smoother and streamwised, and the hair-pin vortices are less likely to occur.