5 results for Objective function values

at Duke University


Relevance:

80.00%

Publisher:

Abstract:

We study the problem of supervised linear dimensionality reduction, taking an information-theoretic viewpoint. The linear projection matrix is designed by maximizing the mutual information between the projected signal and the class label. By harnessing a recent theoretical result on the gradient of mutual information, the above optimization problem can be solved directly using gradient descent, without requiring simplification of the objective function. Theoretical analysis and empirical comparison are made between the proposed method and two closely related methods, and comparisons are also made with a method in which Rényi entropy is used to define the mutual information (in this case the gradient may be computed simply, under a special parameter setting). Relative to these alternative approaches, the proposed method achieves promising results on real datasets. Copyright 2012 by the author(s)/owner(s).

Relevance:

80.00%

Publisher:

Abstract:

Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.

We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.
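The idea of maximizing a correlation-based objective with a Markov chain Monte Carlo search can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the one-parameter Gaussian "occupancy" model in `predict`, the Metropolis acceptance rule with a fixed `temperature`, and all function names are hypothetical and are not taken from the dissertation, whose thermodynamic model is far richer.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def predict(center, positions, width=10.0):
    """Hypothetical stand-in model: a Gaussian-shaped bump centered at `center`."""
    return [math.exp(-0.5 * ((p - center) / width) ** 2) for p in positions]


def mcmc_maximize(observed, positions, start, steps=5000,
                  step_size=5.0, temperature=0.02, seed=0):
    """Random-walk Metropolis search for the center that maximizes the correlation."""
    rng = random.Random(seed)
    current = start
    current_score = pearson(predict(current, positions), observed)
    best, best_score = current, current_score
    for _ in range(steps):
        proposal = current + rng.gauss(0.0, step_size)
        score = pearson(predict(proposal, positions), observed)
        # Always accept improvements; accept a worse proposal with
        # probability exp((score - current_score) / temperature).
        if score >= current_score or \
                rng.random() < math.exp((score - current_score) / temperature):
            current, current_score = proposal, score
            if score > best_score:
                best, best_score = proposal, score
    return best, best_score


if __name__ == "__main__":
    positions = list(range(100))
    noise = random.Random(1)
    # Synthetic "data": a bump at position 60 plus small Gaussian noise.
    observed = [v + noise.gauss(0.0, 0.05) for v in predict(60.0, positions)]
    best, best_score = mcmc_maximize(observed, positions, start=20.0)
    print(f"best center = {best:.2f}, correlation = {best_score:.3f}")
```

Because the sampler keeps track of the best state visited, the returned score can never be lower than the score of the starting point; how close it gets to the true optimum depends on the step size, temperature, and number of steps chosen.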

We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.

Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.

This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.

Relevance:

80.00%

Publisher:

Abstract:

Dynamics of biomolecules over various spatial and time scales are essential for biological functions such as molecular recognition, catalysis and signaling. However, reconstruction of biomolecular dynamics from experimental observables requires the determination of a conformational probability distribution. Unfortunately, these distributions cannot be fully constrained by the limited information from experiments, making the problem an ill-posed one in the terminology of Hadamard. The ill-posed nature of the problem comes from the fact that it has no unique solution. Multiple or even an infinite number of solutions may exist. To avoid the ill-posed nature, the problem needs to be regularized by making assumptions, which inevitably introduce biases into the result.

Here, I present two continuous probability density function approaches to solve an important inverse problem called the RDC trigonometric moment problem. By focusing on interdomain orientations, we reduced the problem to the determination of a distribution on the 3D rotational space from residual dipolar couplings (RDCs). We derived an analytical equation that relates the alignment tensors of adjacent domains, which serves as the foundation of the two methods. In the first approach, the ill-posed nature of the problem was avoided by introducing a continuous distribution model that imposes a smoothness assumption. To find the optimal solution for the distribution, we also designed an efficient branch-and-bound algorithm that exploits the mathematical structure of the analytical solutions. The algorithm is guaranteed to find the distribution that best satisfies the analytical relationship. We observed good performance of the method when tested under various levels of experimental noise and when applied to two protein systems. The second approach avoids the use of any model by employing maximum entropy principles. This 'model-free' approach delivers the least biased result, which represents our state of knowledge. In this approach, the solution is an exponential function of Lagrange multipliers. To determine the multipliers, a convex objective function is constructed. Consequently, the maximum entropy solution can be found easily by gradient descent methods. Both algorithms can be applied to biomolecular RDC data in general, including data from RNA and DNA molecules.
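The maximum entropy construction described above (an exponential-family solution whose Lagrange multipliers are found by minimizing a convex objective with gradient descent) can be sketched on a toy discrete problem. This is only an illustration under stated assumptions: it uses a six-sided die with a prescribed mean instead of a distribution over 3D rotations constrained by RDCs, and the function names, learning rate, and step count are hypothetical choices, not the author's implementation.

```python
import math


def _probabilities(features, lam):
    """p_i proportional to exp(sum_k lam[k] * features[i][k]), computed stably."""
    logits = [sum(l * fk for l, fk in zip(lam, f)) for f in features]
    top = max(logits)
    weights = [math.exp(z - top) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]


def maxent_distribution(features, targets, lr=0.1, steps=5000):
    """Maximum entropy distribution whose expected features equal `targets`.

    Minimizes the convex dual objective  log Z(lam) - lam . targets,
    whose gradient with respect to lam is  E_p[features] - targets.
    """
    k = len(targets)
    lam = [0.0] * k
    for _ in range(steps):
        p = _probabilities(features, lam)
        expected = [sum(pi * f[j] for pi, f in zip(p, features)) for j in range(k)]
        # Plain gradient descent step on the dual objective.
        lam = [l - lr * (e - t) for l, e, t in zip(lam, expected, targets)]
    return lam, _probabilities(features, lam)


if __name__ == "__main__":
    faces = [1, 2, 3, 4, 5, 6]
    # One constraint: the mean face value must equal 4.5.
    lam, p = maxent_distribution([[float(x)] for x in faces], [4.5])
    mean = sum(x * pi for x, pi in zip(faces, p))
    print("multiplier:", lam[0])
    print("probabilities:", [round(pi, 4) for pi in p])
    print("mean:", mean)
```

With no constraint active (all multipliers zero) the result is the uniform distribution, which is the sense in which the maximum entropy solution adds no information beyond the measured moments.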

Relevance:

30.00%

Publisher:

Abstract:

OBJECTIVE: A study was undertaken to determine whether better cognitive functioning at midlife among more physically fit individuals reflects neuroprotection, by which fitness protects against age-related cognitive decline, or neuroselection, by which children with higher cognitive functioning select more active lifestyles. METHODS: Children in the Dunedin Longitudinal Study (N = 1,037) completed the Wechsler Intelligence Scales and the Trail Making, Rey Delayed Recall, and Grooved Pegboard tasks as children and again at midlife (age = 38 years). Adult cardiorespiratory fitness was assessed using a submaximal exercise test to estimate maximum oxygen consumption adjusted for body weight in milliliters/minute/kilogram. We tested whether more fit individuals had better cognitive functioning than their less fit counterparts (which could be consistent with neuroprotection), and whether better childhood cognitive functioning predisposed to better adult cardiorespiratory fitness (neuroselection). Finally, we examined possible mechanisms of neuroselection. RESULTS: Participants with better cardiorespiratory fitness had higher cognitive test scores at midlife. However, fitness-associated advantages in cognitive functioning were already present in childhood. After accounting for childhood baseline performance on the same cognitive tests, there was no association between cardiorespiratory fitness and midlife cognitive functioning. Socioeconomic and health advantages in childhood and healthier lifestyles during young adulthood explained most of the association between childhood cognitive functioning and adult cardiorespiratory fitness. INTERPRETATION: We found no evidence for a neuroprotective effect of cardiorespiratory fitness as of midlife. Instead, children with better cognitive functioning are selecting healthier lives. Fitness interventions may enhance cognitive functioning. However, observational and experimental studies testing neuroprotective effects of physical fitness should consider confounding by neuroselection.

Relevance:

30.00%

Publisher:

Abstract:

As a psychological principle, the golden rule represents an ethic of universal empathic concern. It is, surprisingly, present in the sacred texts of virtually all religions, and in philosophical works across eras and continents. Building on the literature demonstrating a positive impact of prosocial behavior on well-being, the present study investigates the psychological function of universal empathic concern in Indian Hindus, Christians, Muslims and Sikhs.

I develop a measure of the centrality of the golden rule-based ethic, within an individual’s understanding of his or her religion, that is applicable to all theistic religions. I then explore the consistency of its relationships with psychological well-being and other variables across religious groups.

Results indicate that this construct, named Moral Concern Religious Focus, can be reliably measured in disparate religious groups, and consistently predicts well-being across them. With measures of Intrinsic, Extrinsic and Quest religious orientations in the model, only Moral Concern and religiosity predict well-being. Moral Concern alone mediates the relationship between religiosity and well-being, and explains more variance in well-being than religiosity alone. The relationship between Moral Concern and well-being is mediated by increased preference for prosocial values, more satisfying interpersonal relationships, and greater meaning in life. In addition, across religious groups Moral Concern is associated with better self-reported physical and mental health, and more compassionate attitudes toward oneself and others.

Two additional types of religious focus are identified: Personal Gain, representing the motive to use religion to improve one’s life, and Relationship with God. Personal Gain is found to predict reduced preference for prosocial values, less meaning in life, and lower quality of relationships. It is associated with greater interference of pain and physical or mental health problems with daily activities, and lower self-compassion. Relationship with God is found to be associated primarily with religious variables and greater meaning in life.

I conclude that individual differences in the centrality of the golden rule and its associated ethic of universal empathic concern may play an important role in explaining the variability in associations between religion, prosocial behavior and well-being noted in the literature.