6 resultados para Algorithmic Solution

em Helda - Digital Repository of University of Helsinki


Relevância:

20.00% 20.00%

Publicador:

Resumo:

NMR spectroscopy enables the study of biomolecules from peptides and carbohydrates to proteins at atomic resolution. The technique uniquely allows for structure determination of molecules in solution-state. It also gives insights into dynamics and intermolecular interactions important for determining biological function. Detailed molecular information is entangled in the nuclear spin states. The information can be extracted by pulse sequences designed to measure the desired molecular parameters. Advancement of pulse sequence methodology therefore plays a key role in the development of biomolecular NMR spectroscopy. A range of novel pulse sequences for solution-state NMR spectroscopy are presented in this thesis. The pulse sequences are described in relation to the molecular information they provide. The pulse sequence experiments represent several advances in NMR spectroscopy with particular emphasis on applications for proteins. Some of the novel methods are focusing on methyl-containing amino acids which are pivotal for structure determination. Methyl-specific assignment schemes are introduced for increasing the size range of 13C,15N labeled proteins amenable to structure determination without resolving to more elaborate labeling schemes. Furthermore, cost-effective means are presented for monitoring amide and methyl correlations simultaneously. Residual dipolar couplings can be applied for structure refinement as well as for studying dynamics. Accurate methods for measuring residual dipolar couplings in small proteins are devised along with special techniques applicable when proteins require high pH or high temperature solvent conditions. Finally, a new technique is demonstrated to diminish strong-coupling induced artifacts in HMBC, a routine experiment for establishing long-range correlations in unlabeled molecules. The presented experiments facilitate structural studies of biomolecules by NMR spectroscopy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Analyzing statistical dependencies is a fundamental problem in all empirical science. Dependencies help us understand causes and effects, create new scientific theories, and invent cures to problems. Nowadays, large amounts of data is available, but efficient computational tools for analyzing the data are missing. In this research, we develop efficient algorithms for a commonly occurring search problem - searching for the statistically most significant dependency rules in binary data. We consider dependency rules of the form X->A or X->not A, where X is a set of positive-valued attributes and A is a single attribute. Such rules describe which factors either increase or decrease the probability of the consequent A. A classical example are genetic and environmental factors, which can either cause or prevent a disease. The emphasis in this research is that the discovered dependencies should be genuine - i.e. they should also hold in future data. This is an important distinction from the traditional association rules, which - in spite of their name and a similar appearance to dependency rules - do not necessarily represent statistical dependencies at all or represent only spurious connections, which occur by chance. Therefore, the principal objective is to search for the rules with statistical significance measures. Another important objective is to search for only non-redundant rules, which express the real causes of dependence, without any occasional extra factors. The extra factors do not add any new information on the dependence, but can only blur it and make it less accurate in future data. The problem is computationally very demanding, because the number of all possible rules increases exponentially with the number of attributes. In addition, neither the statistical dependency nor the statistical significance are monotonic properties, which means that the traditional pruning techniques do not work. As a solution, we first derive the mathematical basis for pruning the search space with any well-behaving statistical significance measures. The mathematical theory is complemented by a new algorithmic invention, which enables an efficient search without any heuristic restrictions. The resulting algorithm can be used to search for both positive and negative dependencies with any commonly used statistical measures, like Fisher's exact test, the chi-squared measure, mutual information, and z scores. According to our experiments, the algorithm is well-scalable, especially with Fisher's exact test. It can easily handle even the densest data sets with 10000-20000 attributes. Still, the results are globally optimal, which is a remarkable improvement over the existing solutions. In practice, this means that the user does not have to worry whether the dependencies hold in future data or if the data still contains better, but undiscovered dependencies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cereal arabinoxylans, guar galactomannans, and dextrans produced by lactic acid bacteria(LAB) are a structurally diverse group of branched polysaccharides with nutritional and industrial functions. In this thesis, the effect of the chemical structure on the dilute solution properties of these polysaccharides was investigated using size-exclusion chromatography(SEC) and asymmetric flow field-flow fractionation (AsFlFFF) with multiple-detection. The chemical structures of arabinoxylans were determined, whereas galactomannan and dextran structures were studied in previous investigations. Characterization of arabinoxylans revealed differences in the chemical structures of cereal arabinoxylans. Although arabinoxylans from wheat, rye, and barley fiber contained similar amounts of arabinose side units, the substitution pattern of arabinoxylans from different cereals varied. Arabinoxylans from barley husks and commercial low-viscosity wheat arabinoxylan contained a lower number of arabinose side units. Structurally different dextrans were obtained from different LAB. The structural effects on the solution properties could be studied in detail by modifying pure wheat and rye arabinoxylans and guar galactomannan with specific enzymes. The solution characterization of arabinoxylans, enzymatically modified galactomannans, and dextrans revealed the presence of aggregates in aqueous polysaccharide solutions. In the case of arabinoxylans and dextrans, the comparison of molar mass data from aqueous and organic SEC analyses was essential in confirming aggregation, which could not be observed only from the peak or molar mass distribution shapes obtained with aqueous SEC. The AsFlFFF analyses gave further evidence of aggregation. Comparison of molar mass and intrinsic viscosity data of unmodified and partially debranched guar galactomannan, on the other hand, revealed the aggregation of native galactomannan. The arabinoxylan and galactomannan samples with low or enzymatically extensively decreased side unit content behaved similarly in aqueous solution: lower molar mass samples stayed in solution but formed large aggregates, whereas the water solubility of the higher-molar-mass samples decreased significantly. Due to the restricted solubility of galactomannans in organic solvents, only aqueous galactomannan solutions were studied. The SEC and AsFlFFF results differed for the wheat arabinoxylan and dextran samples. Column matrix effects and possible differences in the separation parameters are discussed, and a problem related to the non-established relationship between the separation parameters of the two separation techniques is highlighted. This thesis shows that complementary approaches in the solution characterization of chemically heterogeneous polysaccharides are needed to comprehensively investigate macromolecular behavior in solution. These results may also be valuable when characterizing other branched polysaccharides.