967 results for Crossed Classification Models





The term Congenital Nystagmus (Early Onset Nystagmus or Infantile Nystagmus Syndrome) refers to a pathology characterised by an involuntary movement of the eyes, which often seriously reduces a subject’s vision. Congenital Nystagmus (CN) is a specific kind of nystagmus within the wider classification of infantile nystagmus, which can best be recognized and classified by a combination of clinical investigations and motility analysis; in some cases, eye movement recording and analysis are indispensable for diagnosis. However, the interpretation of eye movement recordings is still not fully reliable; new analysis techniques and the precise identification of concise parameters directly related to visual acuity are therefore needed to support physicians’ decisions. To this end, this thesis proposes an index computed from eye movement recordings and related to the visual acuity of a subject. The estimator is based on two parameters: the time a subject spends effectively viewing a target (foveation time, Tf) and the standard deviation of eye position (SDp). Moreover, since previous studies have shown that visual acuity largely depends on SDp, a pilot data collection study was also conducted to identify any slow rhythmic component in the eye position and to characterise SDp in more detail. The results are presented in this thesis. In addition, some oculomotor system models are reviewed, and a new approach to those models, namely the recovery of periodic orbits of the oculomotor system in patients with CN, is tested on real patient data. In conclusion, the results of this research make it possible to characterise completely and reliably the slow rhythmic component sometimes present in eye position recordings of CN subjects and to better classify the different kinds of CN waveforms. These findings can support clinicians in therapy planning and in evaluating treatment outcomes.
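
The two parameters named in this abstract can be illustrated with a minimal sketch. The abstract does not give the foveation criteria, so the position window (0.5 degrees), the velocity limit (4 degrees per second), the target at 0 degrees, and the choice to compute SDp over foveating samples only are all assumptions made for illustration; the function name `foveation_metrics` is hypothetical.

```python
import math

def foveation_metrics(position_deg, sample_rate_hz,
                      pos_window_deg=0.5, vel_limit_deg_s=4.0):
    """Return (Tf in seconds, SDp in degrees) for one eye-position trace.

    A sample counts as foveating when the eye is within pos_window_deg of the
    target (assumed to sit at 0 degrees) and moves slower than vel_limit_deg_s.
    Both thresholds are illustrative assumptions, not values from the thesis.
    The first sample is skipped because it has no backward-difference velocity.
    """
    dt = 1.0 / sample_rate_hz
    foveating = []
    for i in range(1, len(position_deg)):
        velocity = (position_deg[i] - position_deg[i - 1]) / dt
        if abs(position_deg[i]) <= pos_window_deg and abs(velocity) <= vel_limit_deg_s:
            foveating.append(position_deg[i])
    tf = len(foveating) * dt
    if len(foveating) < 2:
        return tf, float("nan")
    mean = sum(foveating) / len(foveating)
    # Sample standard deviation of eye position over the foveating samples.
    variance = sum((p - mean) ** 2 for p in foveating) / (len(foveating) - 1)
    return tf, math.sqrt(variance)
```

How Tf and SDp are combined into the proposed acuity index is not stated in the abstract, so the sketch stops at the two raw parameters.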





The TGFbeta/BMP signal transduction cascade is important for many developmental processes in almost all embryonic and extraembryonic tissues, and it is equally essential for maintaining homeostasis in the adult organism. Many mouse models and cell culture experiments have shown that ligands of this signalling pathway are involved in various stages of cartilage and bone development. BMPs, for example, play a major part in the early condensation and formation of cartilage and, later, in the proliferation and hypertrophy of chondrocytes. BMPs can induce ectopic bone formation, and the expression pattern of the ligands and their specific receptors in the growth plate points to an important role for BMPs there. Targeted knockout of the BMP receptors Bmpr1a and Bmpr1b in proliferating chondrocytes leads to a generalized chondrodysplasia. Smad1, Smad5 and Smad8 are the mediators of the BMP signalling cascade. The aim of the present work was to investigate the role and function of the Smad1 and Smad5 proteins in the growth plate. To this end, conditional Smad1 knockout mice were crossed with a transgenic mouse line that expresses Cre recombinase specifically in proliferating chondrocytes. These mice were characterized with and without a heterozygous Smad5 background. Knockout of Smad1 alone produced a slight shortening of the growth plate, with the prehypertrophic and hypertrophic zones equally affected. This phenotype was more pronounced in mice with an additional heterozygous Smad5 background. A reduced proliferation rate was detected together with decreased Ihh expression. In addition, radiographs revealed disorganization of the nasal region and a missing nasal septum. Production and mineralization of the extracellular matrix were not impaired.
To compare the roles of the BMP and TGFbeta signalling cascades during endochondral ossification, transgenic mice were generated in which the TGFbeta signalling cascade was disrupted specifically in proliferating chondrocytes. Two mouse lines showing similar phenotypes were examined. Esl1 is a TGFbeta-binding protein that is thought to inhibit the TGFbeta signalling cascade. Esl1 knockout mice are smaller than wild-type mice, and overexpression of Esl1 in proliferating chondrocytes leads to an elongated growth plate and an increased proliferation rate. Cartilage markers such as Col2a1 and Sox9 are downregulated in these mice, while Col10a1 and Ihh, markers of the hypertrophic and prehypertrophic zones, were downregulated. This suggests that more cells enter terminal differentiation. In transgenic mice overexpressing a dominant-negative (dn) TGFbeta receptor in proliferating chondrocytes, an elongated prehypertrophic zone, increased Ihh expression and an increased proliferation rate were observed. In addition, homozygous animals showed a craniofacial phenotype that caused feeding problems and thus severe growth impairment. The BMP and TGFbeta signalling cascades may have antagonistic effects in the growth plate. Whereas the loss of BMP in proliferating chondrocytes shortened the growth plate because of a reduced proliferation rate, mice with a disrupted TGFbeta signalling cascade show increased proliferation in a correspondingly elongated growth plate. A further aim of this work was to generate a transgenic mouse line that expresses Cre recombinase specifically in hypertrophic chondrocytes.
Promoter studies in transgenic mice indicate that a putative AP1 element, located about 4 kb upstream of the first exon of Col10a1, is important for specific expression in hypertrophic chondrocytes. A construct containing four copies of this element and the basal promoter was used to express Cre recombinase specifically. This mouse line is currently being tested, and initial data point to specific expression of Cre recombinase in hypertrophic chondrocytes.





Tracking activities during daily life and assessing movement parameters is essential for complementing the information gathered in confined environments, such as clinical and physical activity laboratories, for the assessment of mobility. Inertial measurement units (IMUs) are used to monitor human movement for prolonged periods of time and without space limitations. The focus of this study was to provide a robust, low-cost and unobtrusive solution for evaluating human motion using a single IMU. The first part of the study focused on the monitoring and classification of daily life activities. A simple method that analyses the variations in the signal was developed to distinguish two types of activity intervals: active and inactive. A neural classifier was used to classify active intervals; the angle with respect to gravity was used to classify inactive intervals. The second part of the study focused on the extraction of gait parameters using a single IMU attached to the pelvis. Two complementary methods were proposed for gait parameter estimation. The first was a wavelet-based method developed for the estimation of gait events. The second was developed for estimating step and stride length during level walking, using the estimates of the first method. A special integration algorithm was extended to operate on each gait cycle using a specially designed Kalman filter. The developed methods were also applied in various scenarios. The activity monitoring method was used in a PRIN’07 project to assess the mobility levels of individuals living in an urban area. The same method was applied to volleyball players to analyze their fitness levels by monitoring their daily life activities. The methods proposed in these studies provided a simple, unobtrusive and low-cost solution for monitoring and assessing activities outside controlled environments.
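
The active/inactive split and the gravity-angle step described above can be sketched as follows. This is an illustration under stated assumptions, not the study's implementation: the window length, the standard-deviation threshold, the use of the acceleration magnitude as the "variation" signal, and the choice of the sensor z axis as the reference are all hypothetical, as is the function name `classify_intervals`. The neural classifier for active intervals is not reproduced.

```python
import math
import statistics

def classify_intervals(accel, window=50, std_threshold=0.05):
    """Label fixed-length windows of 3-axis accelerometer samples (units of g).

    Returns one (label, tilt_deg) tuple per window. A window is "active" when
    the standard deviation of the acceleration magnitude exceeds std_threshold
    (an assumed value); otherwise it is "inactive" and tilt_deg is the angle
    between the sensor z axis and the mean acceleration (gravity) vector.
    """
    results = []
    for start in range(0, len(accel) - window + 1, window):
        chunk = accel[start:start + window]
        magnitudes = [math.sqrt(x * x + y * y + z * z) for x, y, z in chunk]
        if statistics.pstdev(magnitudes) > std_threshold:
            results.append(("active", None))
            continue
        # At rest the mean acceleration is dominated by gravity.
        mx = sum(s[0] for s in chunk) / window
        my = sum(s[1] for s in chunk) / window
        mz = sum(s[2] for s in chunk) / window
        norm = math.sqrt(mx * mx + my * my + mz * mz)
        if norm == 0.0:
            results.append(("inactive", None))
            continue
        cos_tilt = max(-1.0, min(1.0, mz / norm))
        results.append(("inactive", math.degrees(math.acos(cos_tilt))))
    return results
```

With a pelvis-mounted sensor, a tilt near 0 degrees and a tilt near 90 degrees would separate upright from lying postures, provided the z axis is aligned with the trunk; that alignment is again an assumption.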





The diagnosis, grading and classification of tumours have benefited considerably from the development of DCE-MRI, which is now essential to the adequate clinical management of many tumour types because of its ability to detect active angiogenesis. Several strategies have been proposed for DCE-MRI evaluation. Visual inspection of contrast agent concentration curves versus time is a very simple yet operator-dependent procedure; more objective approaches have therefore been developed to facilitate comparison between studies. In so-called model-free approaches, descriptive or heuristic information extracted from raw time-series data is used for tissue classification. The main issue with these schemes is that they lack a direct interpretation in terms of the physiological properties of the tissues. Model-based investigations, on the other hand, typically involve compartmental tracer kinetic modelling and pixel-by-pixel estimation of kinetic parameters via non-linear regression applied to regions of interest suitably selected by the physician. This approach has the advantage of providing parameters directly related to the pathophysiological properties of the tissue, such as vessel permeability, local regional blood flow, extraction fraction, and the concentration gradient between plasma and the extravascular-extracellular space. However, non-linear modelling is computationally demanding, and the accuracy of the estimates can be affected by the signal-to-noise ratio and by the initial solutions. The principal aim of this thesis is to investigate the use of semi-quantitative and quantitative parameters for the segmentation and classification of breast lesions.
The objectives can be subdivided as follows: to describe the principal techniques for evaluating time-intensity curves in DCE-MRI, with a focus on the kinetic models proposed in the literature; to evaluate the influence of the parametrization choice for a classic bi-compartmental kinetic model; to evaluate the performance of a method for simultaneous tracer kinetic modelling and pixel classification; and to evaluate the performance of machine learning techniques trained for the segmentation and classification of breast lesions.
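
As an illustration of pixel-wise kinetic fitting, the sketch below assumes the standard Tofts model, C_t(t) = Ktrans * integral of C_p(tau) * exp(-kep * (t - tau)) d tau. The abstract does not say which bi-compartmental model or optimiser the thesis uses, so both are assumptions. Instead of an iterative optimiser, the sketch scans a grid of kep values and solves for Ktrans in closed form at each one (the model is linear in Ktrans), which avoids the dependence on initial solutions mentioned above. The names `unit_response` and `fit_tofts` are hypothetical.

```python
import math

def unit_response(times, plasma, kep):
    """Trapezoidal estimate of the convolution of C_p with exp(-kep * t)."""
    out = [0.0]
    for i in range(1, len(times)):
        total = 0.0
        for j in range(1, i + 1):
            dt = times[j] - times[j - 1]
            left = plasma[j - 1] * math.exp(-kep * (times[i] - times[j - 1]))
            right = plasma[j] * math.exp(-kep * (times[i] - times[j]))
            total += 0.5 * dt * (left + right)
        out.append(total)
    return out

def fit_tofts(times, plasma, tissue, kep_grid):
    """Return (ktrans, kep, sse) minimising the squared error over kep_grid."""
    best = None
    for kep in kep_grid:
        basis = unit_response(times, plasma, kep)
        denom = sum(b * b for b in basis)
        if denom == 0.0:
            continue
        # For fixed kep the least-squares Ktrans is a simple projection.
        ktrans = sum(b * c for b, c in zip(basis, tissue)) / denom
        sse = sum((c - ktrans * b) ** 2 for b, c in zip(basis, tissue))
        if best is None or sse < best[2]:
            best = (ktrans, kep, sse)
    return best
```

Under this model the alternative parametrization (Ktrans, v_e) follows from v_e = Ktrans / kep, which is one example of the kind of parametrization choice the objectives refer to.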





The Thermodynamic Bethe Ansatz (TBA) analysis is carried out for the extended-CP^N class of integrable 2-dimensional Non-Linear Sigma Models related to the low-energy limit of the AdS_4xCP^3 type IIA superstring theory. The principal aim of this program is to obtain a further non-perturbative consistency check of the S-matrix proposed to describe the scattering processes between the fundamental excitations of the theory, by analyzing the structure of the Renormalization Group flow. As a noteworthy byproduct, we obtain a novel class of TBA models which fits into the known classification but with several important differences. The TBA framework allows the evaluation of some exact quantities related to the conformal UV limit of the model: the effective central charge, the conformal dimension of the perturbing operator and the field content of the underlying CFT. Knowledge of these physical quantities has made it possible to conjecture a perturbed CFT realization of the integrable models in terms of coset Kac-Moody CFTs. The set of numerical tools and programs developed ad hoc to solve the problem at hand is also discussed in some detail, with references to the code.





Information is nowadays a key resource: machine learning and data mining techniques have been developed to extract high-level information from large amounts of data. As most data come in the form of unstructured text in natural languages, research on text mining is currently very active and deals with practical problems. Among these, text categorization deals with the automatic organization of large quantities of documents into predefined taxonomies of topic categories, possibly arranged in large hierarchies. In commonly proposed machine learning approaches, classifiers are automatically trained from pre-labeled documents: they can perform very accurate classification, but often require a sizeable training set and notable computational effort. Methods for cross-domain text categorization have been proposed, which make it possible to leverage a set of labeled documents from one domain to classify those of another. Most methods use advanced statistical techniques, usually involving parameter tuning. A first contribution presented here is a method based on nearest centroid classification, where profiles of categories are generated from the known domain and then iteratively adapted to the unknown one. Despite being conceptually simple and having easily tuned parameters, this method achieves state-of-the-art accuracy on most benchmark datasets with fast running times. A second, deeper contribution involves the design of a domain-independent model to distinguish the degree and type of relatedness between arbitrary documents and topics, inferred from the different types of semantic relationships between their respective representative words, identified by specific search algorithms. The application of this model is tested on both flat and hierarchical text categorization, where it potentially allows the efficient addition of new categories during classification.
Results show that classification accuracy still requires improvement, but that models generated from one domain can be effectively reused in a different one.
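
The first contribution, nearest centroid classification with iteratively adapted category profiles, can be sketched roughly as below. The abstract gives no details of the weighting scheme, similarity measure or stopping rule, so the raw term counts, cosine similarity, complete replacement of each profile by the centroid of the target documents assigned to it, and the stop-when-stable rule are all assumptions; the function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def unit_vector(weights):
    """Scale a term-weight mapping to unit Euclidean length."""
    norm = math.sqrt(sum(w * w for w in weights.values()))
    return {t: w / norm for t, w in weights.items()} if norm else {}

def centroid(vectors):
    """Unit-length sum of a list of term-weight vectors."""
    total = defaultdict(float)
    for vec in vectors:
        for term, w in vec.items():
            total[term] += w
    return unit_vector(total)

def cosine(a, b):
    """Cosine similarity of two unit-length term-weight vectors."""
    return sum(w * b.get(term, 0.0) for term, w in a.items())

def adapt_centroids(source_docs, source_labels, target_docs, max_iter=10):
    """Classify target_docs (lists of tokens) using profiles built from the
    labeled source domain and re-estimated on the target domain each round."""
    by_label = defaultdict(list)
    for doc, label in zip(source_docs, source_labels):
        by_label[label].append(unit_vector(Counter(doc)))
    profiles = {label: centroid(vecs) for label, vecs in by_label.items()}
    targets = [unit_vector(Counter(doc)) for doc in target_docs]
    assigned = None
    for _ in range(max_iter):
        new_assigned = [
            max(sorted(profiles), key=lambda label: cosine(vec, profiles[label]))
            for vec in targets
        ]
        if new_assigned == assigned:
            break  # assignments are stable, so the profiles would not change
        assigned = new_assigned
        groups = defaultdict(list)
        for vec, label in zip(targets, assigned):
            groups[label].append(vec)
        for label, vecs in groups.items():
            profiles[label] = centroid(vecs)
    return assigned
```

In the sketch a category that receives no target documents keeps its previous profile, and ties are broken by alphabetical label order; both are arbitrary design choices.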





Membrane interactions of porphyrinic photosensitizers (PSs) are known to play a crucial role for PS efficiency in photodynamic therapy (PDT). In the current paper, the interactions between 15 different porphyrinic PSs with various hydrophilic/lipophilic properties and phospholipid bilayers were probed by NMR spectroscopy. Unilamellar vesicles consisting of dioleoyl-phosphatidyl-choline (DOPC) were used as membrane models. PS-membrane interactions were deduced from analysis of the main DOPC ¹H-NMR resonances (choline and lipid chain signals). Initial membrane adsorption of the PSs was indicated by induced changes to the DOPC choline signal, i.e. a split into inner and outer choline peaks. Based on this parameter, the PSs could be classified into two groups: Type-A PSs, which cause a split, and Type-B PSs, which cause no split. A further classification into two subgroups each (A1, A2 and B1, B2) was based on the observed time-dependent changes of the main DOPC NMR signals following initial PS adsorption. Four different time-correlated patterns were found, indicating different levels and rates of PS penetration into the hydrophobic membrane interior. The type of interaction was mainly affected by the amphiphilicity and the overall lipophilicity of the applied PS structures. In conclusion, the NMR data provided valuable structural and dynamic insights into the PS-membrane interactions, which allow deriving the structural constraints for high membrane affinity and high membrane penetration of a given PS. © 2011 Elsevier B.V. All rights reserved.





Advances in computational biology have made it possible to monitor thousands of features simultaneously. High-throughput technologies not only provide a much richer information context in which to study various aspects of gene function, but also present the challenge of analyzing data with a large number of covariates and few samples. As an integral part of machine learning, classification of samples into two or more categories is almost always of interest to scientists. In this paper, we address the question of classification in this setting by extending partial least squares (PLS), a popular dimension reduction tool in chemometrics, to the context of generalized linear regression, building on a previous approach, Iteratively ReWeighted Partial Least Squares, i.e. IRWPLS (Marx, 1996). We compare our results with two-stage PLS (Nguyen and Rocke, 2002A; Nguyen and Rocke, 2002B) and other classifiers. We show that by phrasing the problem in a generalized linear model setting and by applying bias correction to the likelihood to avoid (quasi)separation, we often get lower classification error rates.
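
For orientation, the iteratively reweighted step that IRWPLS builds on can be written out for the logistic case. The block below is the textbook iteratively reweighted least squares update, not a formula taken from the paper, and the penalized likelihood shown last is Firth's correction, which is assumed here as one common way to handle (quasi)separation; the abstract does not name the correction it uses.

```latex
\begin{align}
\eta &= X\beta, \qquad \mu_i = \frac{1}{1 + e^{-\eta_i}}, \\
W &= \operatorname{diag}\bigl(\mu_i (1 - \mu_i)\bigr), \qquad
z = \eta + W^{-1}(y - \mu), \\
\beta^{\text{new}} &= (X^{\top} W X)^{-1} X^{\top} W z, \\
\ell^{*}(\beta) &= \ell(\beta) + \tfrac{1}{2} \log \det\bigl(X^{\top} W X\bigr).
\end{align}
```

When covariates far outnumber samples, $X^{\top} W X$ is singular and the weighted least-squares solve in the third line is not available; in IRWPLS that solve is replaced by a weighted PLS regression of the working response $z$ on $X$ with a small number of components.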





Suppose that we are interested in establishing simple, but reliable rules for predicting future t-year survivors via censored regression models. In this article, we present inference procedures for evaluating such binary classification rules based on various prediction precision measures quantified by the overall misclassification rate, sensitivity and specificity, and positive and negative predictive values. Specifically, under various working models we derive consistent estimators for the above measures via substitution and cross validation estimation procedures. Furthermore, we provide large sample approximations to the distributions of these nonsmooth estimators without assuming that the working model is correctly specified. Confidence intervals, for example, for the difference of the precision measures between two competing rules can then be constructed. All the proposals are illustrated with two real examples and their finite sample properties are evaluated via a simulation study.
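
The precision measures listed in this abstract have simple definitions when every subject's t-year status is known. The sketch below covers only that uncensored special case; handling censored observations, cross-validation and the large-sample inference are the article's actual contribution and are not reproduced here. The function name `precision_measures` is hypothetical.

```python
def precision_measures(predicted, actual):
    """Compute basic accuracy measures for a binary classification rule.

    predicted and actual are equal-length sequences of truthy/falsy values,
    where truthy means "survives beyond t years". Censoring is ignored.
    """
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    fp = sum(1 for p, a in pairs if p and not a)
    tn = sum(1 for p, a in pairs if not p and not a)
    fn = sum(1 for p, a in pairs if not p and a)

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else float("nan")

    return {
        "misclassification_rate": ratio(fp + fn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }
```

Comparing two competing rules then amounts to differencing the corresponding entries; the confidence intervals for such differences require the article's distributional results.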





BACKGROUND: Low-grade gliomas (LGGs) are rare brain neoplasms, with survival spanning up to a few decades. Thus, accurate evaluations of how biomarkers impact survival among patients with LGG require long-term studies on samples prospectively collected over a long period. METHODS: The 210 adult LGGs collected in our databank were screened for IDH1 and IDH2 mutations (IDHmut), MGMT gene promoter methylation (MGMTmet), 1p/19q loss of heterozygosity (1p19qloh), and nuclear TP53 immunopositivity (TP53pos). Multivariate survival analyses with multiple imputation of missing data were performed using either histopathology or molecular markers. Both models were compared using Akaike's information criterion (AIC). The molecular model was reduced by stepwise model selection to filter out the most critical predictors. A third model was generated to assess various marker combinations. RESULTS: Molecular parameters were better survival predictors than histology (ΔAIC = 12.5, P < .001). Forty-five percent of studied patients died. MGMTmet was positively associated with IDHmut (P < .001). In the molecular model with marker combinations, IDHmut/MGMTmet combined status had a favorable impact on overall survival, compared with IDHwt (hazard ratio [HR] = 0.33, P < .01), and even more so the triple combination, IDHmut/MGMTmet/1p19qloh (HR = 0.18, P < .001). Furthermore, the IDHmut/MGMTmet/TP53pos triple combination was a significant risk factor for malignant transformation (HR = 2.75, P < .05). CONCLUSION: By integrating networks of activated molecular glioma pathways, the model based on genotype better predicts prognosis than histology and, therefore, provides a more reliable tool for standardizing future treatment strategies.





Random Forests™ is reported to be one of the most accurate classification algorithms in complex data analysis. It shows excellent performance even when most predictors are noisy and the number of variables is much larger than the number of observations. In this thesis, Random Forests was applied to a large-scale lung cancer case-control study. A novel way of automatically selecting prognostic factors was proposed, and a synthetic positive control was used to validate the Random Forests method. Throughout this study we showed that Random Forests can deal with a large number of weak input variables without overfitting. It can account for non-additive interactions between these input variables. Random Forests can also be used for variable selection without being adversely affected by collinearities.
Random Forests can deal with large-scale data sets without rigorous data preprocessing, and it has a robust variable importance ranking measure. We propose a novel variable selection method in the context of Random Forests that uses the data noise level as the cut-off value to determine the subset of important predictors. This new approach enhanced the ability of the Random Forests algorithm to automatically identify important predictors for complex data. The cut-off value can also be adjusted based on the results of the synthetic positive control experiments.
When the data set had a high variables-to-observations ratio, Random Forests complemented the established logistic regression. This study suggests that Random Forests is well suited to such high-dimensional data. One can use Random Forests to select the important variables and then use logistic regression or Random Forests itself to estimate the effect size of the predictors and to classify new observations.
We also found that the mean decrease in accuracy is a more reliable variable ranking measure than the mean decrease in Gini.
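
The "mean decrease in accuracy" ranking mentioned at the end of this abstract is a permutation importance: a variable matters to the extent that shuffling its values degrades prediction accuracy. The sketch below shows the idea for an arbitrary fitted model passed in as a `predict` function. It is a generic illustration: Random Forests computes the measure per tree on out-of-bag samples, which is not reproduced here, and the thesis's noise-level cut-off is not specified in the abstract, so it is left out. The function names and the `n_repeats` and `seed` parameters are hypothetical.

```python
import random

def accuracy(predict, rows, labels):
    """Fraction of rows for which predict(row) equals the true label."""
    return sum(predict(r) == y for r, y in zip(rows, labels)) / len(labels)

def mean_decrease_accuracy(predict, rows, labels, n_repeats=5, seed=0):
    """Return one importance score per column of rows (a list of lists).

    Each score is the average drop in accuracy after shuffling that column,
    which breaks its association with the labels while keeping its
    marginal distribution.
    """
    rng = random.Random(seed)
    baseline = accuracy(predict, rows, labels)
    importances = []
    for col in range(len(rows[0])):
        drops = []
        for _ in range(n_repeats):
            shuffled = [r[col] for r in rows]
            rng.shuffle(shuffled)
            permuted = [r[:col] + [v] + r[col + 1:] for r, v in zip(rows, shuffled)]
            drops.append(baseline - accuracy(predict, permuted, labels))
        importances.append(sum(drops) / n_repeats)
    return importances
```

A variable the model never uses receives a score of exactly zero, which is the property a noise-based cut-off can exploit.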





It is well accepted that tumorigenesis is a multi-step process involving aberrant functioning of genes regulating cell proliferation, differentiation, apoptosis, genome stability, angiogenesis and motility. To obtain a full understanding of tumorigenesis, it is necessary to collect information on all aspects of cell activity. Recent advances in high-throughput technologies allow biologists to generate massive amounts of data, more than might have been imagined decades ago. These advances have made it possible to launch comprehensive projects such as The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC), which systematically characterize the molecular fingerprints of cancer cells using gene expression, methylation, copy number, microRNA and SNP microarrays, as well as next-generation sequencing assays interrogating somatic mutations, insertions, deletions, translocations and structural rearrangements. Given the massive amount of data, a major challenge is to integrate information from multiple sources and formulate testable hypotheses. This thesis focuses on developing methodologies for integrative analyses of genomic assays profiled on the same set of samples. We have developed several novel methods for integrative biomarker identification and cancer classification. We introduce a regression-based approach to identify biomarkers predictive of therapy response or survival by integrating multiple assays, including gene expression, methylation and copy number data, through penalized regression. To identify key cancer-specific genes accounting for multiple mechanisms of regulation, we have developed the integIRTy software, which provides robust and reliable inferences about gene alteration by automatically adjusting for sample heterogeneity as well as technical artifacts using Item Response Theory.
To cope with the increasing need for accurate cancer diagnosis and individualized therapy, we have developed a robust and powerful algorithm called SIBER to systematically identify bimodally expressed genes using next-generation RNAseq data. We have shown that prediction models built from these bimodal genes have the same accuracy as models built from all genes. Further, prediction models with gene expression measurements dichotomized according to their bimodal shapes still perform well. The effectiveness of outcome prediction using discretized signals paves the way for more accurate and interpretable cancer classification by integrating signals from multiple sources.





The geometries of a catchment constitute the basis for distributed, physically based numerical modelling in different geoscientific disciplines. In this paper, results from ground-penetrating radar (GPR) measurements are presented in the form of a 3D model of total sediment thickness and active layer thickness in a periglacial catchment in western Greenland. Using the topography, the thickness and distribution of sediments are calculated. Vegetation classification and GPR measurements are used to scale active layer thickness from local measurements to catchment-scale models. Annual maximum active layer thickness varies from 0.3 m in wetlands to 2.0 m in barren areas and areas of exposed bedrock. Maximum sediment thickness is estimated to be 12.3 m in the major valleys of the catchment. A method to correlate surface vegetation with active layer thickness is also presented. By using relatively simple methods, such as probing and vegetation classification, it is possible to upscale local point measurements to catchment-scale models in areas where the upper subsurface is relatively homogeneous. The resulting spatial model of active layer thickness can be used in combination with the sediment model as a geometrical input to further studies of subsurface mass transport and hydrological flow paths in the periglacial catchment through numerical modelling.





We present a novel approach that uses both sustained vowels and connected speech to detect obstructive sleep apnea (OSA) cases within a homogeneous group of speakers. The proposed scheme is based on state-of-the-art GMM-based classifiers and specifically takes into account the way in which acoustic models are trained on standard databases, as well as the complexity of the resulting models and their adaptation to specific data. Our experimental database contains a suitable number of utterances and sustained speech from healthy (i.e., control) and OSA Spanish speakers. Finally, a 25.1% relative reduction in classification error is achieved when fusing continuous and sustained speech classifiers. Index Terms: obstructive sleep apnea (OSA), Gaussian mixture models (GMMs), background model (BM), classifier fusion.