856 resultados para Data Driven Clustering
Resumo:
ABSTRACT: BACKGROUND: Cardiovascular magnetic resonance (CMR) has favorable characteristics for diagnostic evaluation and risk stratification of patients with known or suspected CAD. CMR utilization in CAD detection is growing fast. However, data on its cost-effectiveness are scarce. The goal of this study is to compare the costs of two strategies for detection of significant coronary artery stenoses in patients with suspected coronary artery disease (CAD): 1) Performing CMR first to assess myocardial ischemia and/or infarct scar before referring positive patients (defined as presence of ischemia and/or infarct scar to coronary angiography (CXA) versus 2) a hypothetical CXA performed in all patients as a single test to detect CAD. METHODS: A subgroup of the European CMR pilot registry was used including 2,717 consecutive patients who underwent stress-CMR. From these patients, 21% were positive for CAD (ischemia and/or infarct scar), 73% negative, and 6% uncertain and underwent additional testing. The diagnostic costs were evaluated using invoicing costs of each test performed. Costs analysis was performed from a health care payer perspective in German, United Kingdom, Swiss, and United States health care settings. RESULTS: In the public sectors of the German, United Kingdom, and Swiss health care systems, cost savings from the CMR-driven strategy were 50%, 25% and 23%, respectively, versus outpatient CXA. If CXA was carried out as an inpatient procedure, cost savings were 46%, 50% and 48%, respectively. In the United States context, cost savings were 51% when compared with inpatient CXA, but higher for CMR by 8% versus outpatient CXA. CONCLUSION: This analysis suggests that from an economic perspective, the use of CMR should be encouraged as a management option for patients with suspected CAD.
Resumo:
PSIP1 (PC4 and SFRS1 interacting protein 1) encodes two splice variants: lens epithelium-derived growth factor or p75 (LEDGF/p75) and p52. PSIP1 gene products were shown to be involved in transcriptional regulation, affecting a plethora of cellular processes, including cell proliferation, cell survival, and stress response. Furthermore, LEDGF/p75 has implications for various diseases and infections, including autoimmunity, leukemia, embryo development, psoriasis, and human immunodeficiency virus integration. Here, we reported the first characterization of the PSIP1 promoter. Using 5' RNA ligase-mediated rapid amplification of cDNA ends, we identified novel transcription start sites in different cell types. Using a luciferase reporter system, we identified regulatory elements controlling the expression of LEDGF/p75 and p52. These include (i) minimal promoters (-112/+59 and +609/+781) that drive the basal expression of LEDGF/p75 and of the shorter splice variant p52, respectively; (ii) a sequence (+319/+397) that may control the ratio of LEDGF/p75 expression to p52 expression; and (iii) a strong enhancer (-320/-207) implicated in the modulation of LEDGF/p75 transcriptional activity. Computational, biochemical, and genetic approaches enabled us to identify the transcription factor Sp1 as a key modulator of the PSIP1 promoter, controlling LEDGF/p75 transcription through two binding sites at -72/-64 and -46/-36. Overall, our results provide initial data concerning LEDGF/p75 promoter regulation, giving new insights to further understand its biological function and opening the door for new therapeutic strategies in which LEDGF/p75 is involved.
Resumo:
This study proposes a theoretical model describing the electrostatically driven step of the alpha 1 b-adrenergic receptor (AR)-G protein recognition. The comparative analysis of the structural-dynamics features of functionally different receptor forms, i.e., the wild type (ground state) and its constitutively active mutants D142A and A293E, was instrumental to gain insight on the receptor-G protein electrostatic and steric complementarity. Rigid body docking simulations between the different forms of the alpha 1 b-AR and the heterotrimeric G alpha q, G alpha s, G alpha i1, and G alpha t suggest that the cytosolic crevice shared by the active receptor and including the second and the third intracellular loops as well as the cytosolic extension of helices 5 and 6, represents the receptor surface with docking complementarity with the G protein. On the other hand, the G protein solvent-exposed portions that recognize the intracellular loops of the activated receptors are the N-terminal portion of alpha 3, alpha G, the alpha G/alpha 4 loop, alpha 4, the alpha 4/beta 6 loop, alpha 5, and the C-terminus. Docking simulations suggest that the two constitutively active mutants D142A and A293E recognize different G proteins with similar selectivity orders, i.e., G alpha q approximately equal to G alpha s > G alpha i > G alpha t. The theoretical models herein proposed might provide useful suggestions for new experiments aiming at exploring the receptor-G protein interface.
Resumo:
The hypothalamic damage induced by neonatal treatment with monosodium l-glutamate (MSG) induces several metabolic abnormalities, resulting in a rat hyperleptinemic-hyperadipose phenotype. This study was conducted to explore the impact of the neonatal MSG treatment, in the adult (120 days old) female rat on: (a) the in vivo and in vitro mineralocorticoid responses to ACTH and angiotensin II (AII); (b) the effect of leptin on ACTH- and AII-stimulated mineralocorticoid secretions by isolated corticoadrenal cells; and (c) abdominal adiposity characteristics. Our data indicate that, compared with age-matched controls, MSG rats displayed: (1) enhanced and reduced mineralocorticoid responses to ACTH and AII treatments, respectively, effects observed in both in vivo and in vitro conditions; (2) adrenal refractoriness to the inhibitory effect of exogenous leptin on ACTH-stimulated aldosterone output by isolated adrenocortical cells; and (3) distorted omental adiposity morphology and function. This study supports that the adult hyperleptinemic MSG female rat is characterized by enhanced ACTH-driven mineralocorticoid function, impaired adrenal leptin sensitivity, and disrupted abdominal adiposity function. MSG rats could counteract undesirable effects of glucocorticoid excess, by developing a reduced AII-driven mineralocorticoid function. Thus, chronic hyperleptinemia could play a protective role against ACTH-mediated allostatic loads in the adrenal leptin resistant, MSG female rat phenotype.
Resumo:
In an earlier investigation (Burger et al., 2000) five sediment cores near the RodriguesTriple Junction in the Indian Ocean were studied applying classical statistical methods(fuzzy c-means clustering, linear mixing model, principal component analysis) for theextraction of endmembers and evaluating the spatial and temporal variation ofgeochemical signals. Three main factors of sedimentation were expected by the marinegeologists: a volcano-genetic, a hydro-hydrothermal and an ultra-basic factor. Thedisplay of fuzzy membership values and/or factor scores versus depth providedconsistent results for two factors only; the ultra-basic component could not beidentified. The reason for this may be that only traditional statistical methods wereapplied, i.e. the untransformed components were used and the cosine-theta coefficient assimilarity measure.During the last decade considerable progress in compositional data analysis was madeand many case studies were published using new tools for exploratory analysis of thesedata. Therefore it makes sense to check if the application of suitable data transformations,reduction of the D-part simplex to two or three factors and visualinterpretation of the factor scores would lead to a revision of earlier results and toanswers to open questions . In this paper we follow the lines of a paper of R. Tolosana-Delgado et al. (2005) starting with a problem-oriented interpretation of the biplotscattergram, extracting compositional factors, ilr-transformation of the components andvisualization of the factor scores in a spatial context: The compositional factors will beplotted versus depth (time) of the core samples in order to facilitate the identification ofthe expected sources of the sedimentary process.Kew words: compositional data analysis, biplot, deep sea sediments
Resumo:
This article presents recent WMR (wheeled mobile robot) navigation experiences using local perception knowledge provided by monocular and odometer systems. A local narrow perception horizon is used to plan safety trajectories towards the objective. Therefore, monocular data are proposed as a way to obtain real time local information by building two dimensional occupancy grids through a time integration of the frames. The path planning is accomplished by using attraction potential fields, while the trajectory tracking is performed by using model predictive control techniques. The results are faced to indoor situations by using the lab available platform consisting in a differential driven mobile robot
Resumo:
Estudi, disseny i implementació de diferents tècniques d’agrupament defibres (clustering) per tal d’integrar a la plataforma DTIWeb diferentsalgorismes de clustering i tècniques de visualització de clústers de fibres de forma quefaciliti la interpretació de dades de DTI als especialistes
Resumo:
Despite their limited proliferation capacity, regulatory T cells (T(regs)) constitute a population maintained over the entire lifetime of a human organism. The means by which T(regs) sustain a stable pool in vivo are controversial. Using a mathematical model, we address this issue by evaluating several biological scenarios of the origins and the proliferation capacity of two subsets of T(regs): precursor CD4(+)CD25(+)CD45RO(-) and mature CD4(+)CD25(+)CD45RO(+) cells. The lifelong dynamics of T(regs) are described by a set of ordinary differential equations, driven by a stochastic process representing the major immune reactions involving these cells. The model dynamics are validated using data from human donors of different ages. Analysis of the data led to the identification of two properties of the dynamics: (1) the equilibrium in the CD4(+)CD25(+)FoxP3(+)T(regs) population is maintained over both precursor and mature T(regs) pools together, and (2) the ratio between precursor and mature T(regs) is inverted in the early years of adulthood. Then, using the model, we identified three biologically relevant scenarios that have the above properties: (1) the unique source of mature T(regs) is the antigen-driven differentiation of precursors that acquire the mature profile in the periphery and the proliferation of T(regs) is essential for the development and the maintenance of the pool; there exist other sources of mature T(regs), such as (2) a homeostatic density-dependent regulation or (3) thymus- or effector-derived T(regs), and in both cases, antigen-induced proliferation is not necessary for the development of a stable pool of T(regs). This is the first time that a mathematical model built to describe the in vivo dynamics of regulatory T cells is validated using human data. The application of this model provides an invaluable tool in estimating the amount of regulatory T cells as a function of time in the blood of patients that received a solid organ transplant or are suffering from an autoimmune disease.
Resumo:
HEMOLIA (a project under European community’s 7th framework programme) is a new generation Anti-Money Laundering (AML) intelligent multi-agent alert and investigation system which in addition to the traditional financial data makes extensive use of modern society’s huge telecom data source, thereby opening up a new dimension of capabilities to all Money Laundering fighters (FIUs, LEAs) and Financial Institutes (Banks, Insurance Companies, etc.). This Master-Thesis project is done at AIA, one of the partners for the HEMOLIA project in Barcelona. The objective of this thesis is to find the clusters in a network drawn by using the financial data. An extensive literature survey has been carried out and several standard algorithms related to networks have been studied and implemented. The clustering problem is a NP-hard problem and several algorithms like K-Means and Hierarchical clustering are being implemented for studying several problems relating to sociology, evolution, anthropology etc. However, these algorithms have certain drawbacks which make them very difficult to implement. The thesis suggests (a) a possible improvement to the K-Means algorithm, (b) a novel approach to the clustering problem using the Genetic Algorithms and (c) a new algorithm for finding the cluster of a node using the Genetic Algorithm.
Resumo:
OBJECTIVE: This study assessed clustering of multiple risk behaviors (i.e., low leisure-time physical activity, low fruits/vegetables intake, and high alcohol consumption) with level of cigarette consumption. METHODS: Data from the 2002 Swiss Health Survey, a population-based cross-sectional telephone survey assessing health and self-reported risk behaviors, were used. 18,005 subjects (8052 men and 9953 women) aged 25 years old or more participated. RESULTS: Smokers more frequently had low leisure time physical activity, low fruits/vegetables intake, and high alcohol consumption than non- and ex-smokers. Frequency of each risk behavior increased steadily with cigarette consumption. Clustering of risk behaviors increased with cigarette consumption in both men and women. For men, the odds ratios of multiple (> or =2) risk behaviors other than smoking, adjusted for age, nationality, and educational level, were 1.14 (95% confidence interval: 0.97, 1.33) for ex-smokers, 1.24 (0.93, 1.64) for light smokers (1-9 cigarettes/day), 1.72 (1.36, 2.17) for moderate smokers (10-19 cigarettes/day), and 3.07 (2.59, 3.64) for heavy smokers (> or =20 cigarettes/day) versus non-smokers. Similar odds ratios were found for women for corresponding groups, i.e., 1.01 (0.86, 1.19), 1.26 (1.00, 1.58), 1.62 (1.33, 1.98), and 2.75 (2.30, 3.29). CONCLUSIONS: Counseling and intervention with smokers should take into account the strong clustering of risk behaviors with level of cigarette consumption.
Resumo:
INTRODUCTION Finding therapeutic alternatives to carbapenems in infections caused by extended-spectrum β-lactamase-producing Escherichia coli (ESBL-EC) is imperative. Although fosfomycin was discovered more than 40 years ago, it was not investigated in accordance with current standards and so is not used in clinical practice except in desperate situations. It is one of the so-called neglected antibiotics of high potential interest for the future. METHODS AND ANALYSIS The main objective of this project is to demonstrate the clinical non-inferiority of intravenous fosfomycin with regard to meropenem for treating bacteraemic urinary tract infections (UTI) caused by ESBL-EC. This is a 'real practice' multicentre, open-label, phase III randomised controlled trial, designed to compare the clinical and microbiological efficacy, and safety of intravenous fosfomycin (4 g/6 h) and meropenem (1 g/8 h) as targeted therapy for this infection; a change to oral therapy is permitted after 5 days in both arms, in accordance with predetermined options. The study design follows the latest recommendations for designing trials investigating new options for multidrug-resistant bacteria. Secondary objectives include the study of fosfomycin concentrations in plasma and the impact of both drugs on intestinal colonisation by multidrug-resistant Gram-negative bacilli. ETHICS AND DISSEMINATION Ethical approval was obtained from the Andalusian Coordinating Institutional Review Board (IRB) for Biomedical Research (Referral Ethics Committee), which obtained approval from the local ethics committees at all participating sites in Spain (22 sites). Data will be presented at international conferences and published in peer-reviewed journals. DISCUSSION This project is proposed as an initial step in the investigation of an orphan antimicrobial of low cost with high potential as a therapeutic alternative in common infections such as UTI in selected patients. These results may have a major impact on the use of antibiotics and the development of new projects with this drug, whether as monotherapy or combination therapy. TRIAL REGISTRATION NUMBER NCT02142751. EudraCT no: 2013-002922-21. Protocol V.1.1 dated 14 March 2014.
Resumo:
Globalization involves several facility location problems that need to be handled at large scale. Location Allocation (LA) is a combinatorial problem in which the distance among points in the data space matter. Precisely, taking advantage of the distance property of the domain we exploit the capability of clustering techniques to partition the data space in order to convert an initial large LA problem into several simpler LA problems. Particularly, our motivation problem involves a huge geographical area that can be partitioned under overall conditions. We present different types of clustering techniques and then we perform a cluster analysis over our dataset in order to partition it. After that, we solve the LA problem applying simulated annealing algorithm to the clustered and non-clustered data in order to work out how profitable is the clustering and which of the presented methods is the most suitable
Resumo:
BACKGROUND: It is unknown why patients with extensive ulcerative colitis (UC) have a higher risk of colorectal cancer compared with patients with left-sided UC. This study characterizes the inflammatory processes in left-sided UC, pancolitis, and UC-associated dysplasia at the transcriptional level to identify potential biomarkers and transcripts of importance for the carcinogenic behavior of chronic inflammation. METHODS: The Affymetrix GeneChip Human Genome U133 Plus 2.0 was applied on colonic biopsies from UC patients with left-sided UC, pancolitis, dysplasia, and controls. Reverse transcription polymerase chain reaction and immunohistochemistry were performed for validating selected transcripts in the initial cohort and in 2 independent cohorts of patients with UC. Microarray data were analyzed by principal component analysis, and reverse transcription polymerase chain reaction and immunohistochemistry data by the Wilcoxon's rank-sum test. RESULTS: The principal component analysis results revealed separate clusters for left-sided UC, pancolitis, dysplasia, and controls. Close clustering of dysplastic and pancolitic samples indicated similarities in gene expression. Indeed, 101 and 656 parallel upregulated and downregulated transcripts, respectively, were identified in specimens from dysplasia and pancolitis. Validation of selected transcripts hereof identified insulin receptor alpha (INSRA) and MAP kinase interacting serine/threonine kinase 2 (MKNK2) with an enhanced expression in dysplasia compared with left-sided UC and controls, whereas laminin γ2 (LAMC2) was found with a lower expression in dysplasia compared with the remaining 3 groups. CONCLUSIONS: This study demonstrates pancolitis and left-sided UC as distinct inflammatory processes at the transcriptional level, and identifies INSRA, MKNK2, and LAMC2 as potential critical transcripts in the inflammation-driven preneoplastic process of UC.
Resumo:
BACKGROUND: Solexa/Illumina short-read ultra-high throughput DNA sequencing technology produces millions of short tags (up to 36 bases) by parallel sequencing-by-synthesis of DNA colonies. The processing and statistical analysis of such high-throughput data poses new challenges; currently a fair proportion of the tags are routinely discarded due to an inability to match them to a reference sequence, thereby reducing the effective throughput of the technology. RESULTS: We propose a novel base calling algorithm using model-based clustering and probability theory to identify ambiguous bases and code them with IUPAC symbols. We also select optimal sub-tags using a score based on information content to remove uncertain bases towards the ends of the reads. CONCLUSION: We show that the method improves genome coverage and number of usable tags as compared with Solexa's data processing pipeline by an average of 15%. An R package is provided which allows fast and accurate base calling of Solexa's fluorescence intensity files and the production of informative diagnostic plots.
Resumo:
Background: To enhance our understanding of complex biological systems like diseases we need to put all of the available data into context and use this to detect relations, pattern and rules which allow predictive hypotheses to be defined. Life science has become a data rich science with information about the behaviour of millions of entities like genes, chemical compounds, diseases, cell types and organs, which are organised in many different databases and/or spread throughout the literature. Existing knowledge such as genotype - phenotype relations or signal transduction pathways must be semantically integrated and dynamically organised into structured networks that are connected with clinical and experimental data. Different approaches to this challenge exist but so far none has proven entirely satisfactory. Results: To address this challenge we previously developed a generic knowledge management framework, BioXM™, which allows the dynamic, graphic generation of domain specific knowledge representation models based on specific objects and their relations supporting annotations and ontologies. Here we demonstrate the utility of BioXM for knowledge management in systems biology as part of the EU FP6 BioBridge project on translational approaches to chronic diseases. From clinical and experimental data, text-mining results and public databases we generate a chronic obstructive pulmonary disease (COPD) knowledge base and demonstrate its use by mining specific molecular networks together with integrated clinical and experimental data. Conclusions: We generate the first semantically integrated COPD specific public knowledge base and find that for the integration of clinical and experimental data with pre-existing knowledge the configuration based set-up enabled by BioXM reduced implementation time and effort for the knowledge base compared to similar systems implemented as classical software development projects. The knowledgebase enables the retrieval of sub-networks including protein-protein interaction, pathway, gene - disease and gene - compound data which are used for subsequent data analysis, modelling and simulation. Pre-structured queries and reports enhance usability; establishing their use in everyday clinical settings requires further simplification with a browser based interface which is currently under development.