936 resultados para Cluster Analysis of Variables


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The number of sequences generated by genome projects has increased exponentially, but gene characterization has not followed at the same rate. Sequencing and analysis of full-length cDNAs is an important step in gene characterization that has been used nowadays by several research groups. In this work, we have selected Schistosoma mansoni clones for full-length sequencing, using an algorithm that investigates the presence of the initial methionine in the parasite sequence based on the positions of alignment start between two sequences. BLAST searches to produce such alignments have been performed using parasite expressed sequence tags produced by Minas Gerais Genome Network against sequences from the database Eukaryotic Cluster of Orthologous Groups (KOG). This procedure has allowed the selection of clones representing 398 proteins which have not been deposited as S. mansoni complete CDS in any public database. Dedicated sequencing of 96 of such clones with reads from both 5' and 3' ends has been performed. These reads have been assembled using PHRAP, resulting in the production of 33 full-length sequences that represent novel S. mansoni proteins. These results shall contribute to construct a more complete view of the biology of this important parasite.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

AIMS: To investigate empirically the hypothesized relationship between counsellor motivational interviewing (MI) skills and patient change talk (CT) by analysing the articulation between counsellor behaviours and patient language during brief motivational interventions (BMI) addressing at-risk alcohol consumption. DESIGN: Sequential analysis of psycholinguistic codes obtained by two independent raters using the Motivational Interviewing Skill Code (MISC), version 2.0. SETTING: Secondary analysis of data from a randomized controlled trial evaluating the effectiveness of BMI in an emergency department. PARTICIPANTS: A total of 97 patients tape-recorded when receiving BMI. MEASUREMENTS: MISC variables were categorized into three counsellor behaviours (MI-consistent, MI-inconsistent and 'other') and three kinds of patient language (CT, counter-CT (CCT) and utterances not linked with the alcohol topic). Observed transition frequencies, conditional probabilities and significance levels based on odds ratios were computed using sequential analysis software. FINDINGS: MI-consistent behaviours were the only counsellor behaviours that were significantly more likely to be followed by patient CT. Those behaviours were significantly more likely to be followed by patient change exploration (CT and CCT) while MI-inconsistent behaviours and 'other' counsellor behaviours were significantly more likely to be followed by utterances not linked with the alcohol topic and significantly less likely to be followed by CT. MI-consistent behaviours were more likely after change exploration, whereas 'other' counsellor behaviours were more likely only after utterances not linked with the alcohol topic. CONCLUSIONS: Findings lend support to the hypothesized relationship between MI-consistent behaviours and CT, highlight the importance of patient influence on counsellor behaviour and emphasize the usefulness of MI techniques and spirit during brief interventions targeting change enhancement.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

At CoDaWork'03 we presented work on the analysis of archaeological glass composi-tional data. Such data typically consist of geochemical compositions involving 10-12variables and approximates completely compositional data if the main component, sil-ica, is included. We suggested that what has been termed `crude' principal componentanalysis (PCA) of standardized data often identi ed interpretable pattern in the datamore readily than analyses based on log-ratio transformed data (LRA). The funda-mental problem is that, in LRA, minor oxides with high relative variation, that maynot be structure carrying, can dominate an analysis and obscure pattern associatedwith variables present at higher absolute levels. We investigate this further using sub-compositional data relating to archaeological glasses found on Israeli sites. A simplemodel for glass-making is that it is based on a `recipe' consisting of two `ingredients',sand and a source of soda. Our analysis focuses on the sub-composition of componentsassociated with the sand source. A `crude' PCA of standardized data shows two clearcompositional groups that can be interpreted in terms of di erent recipes being used atdi erent periods, reected in absolute di erences in the composition. LRA analysis canbe undertaken either by normalizing the data or de ning a `residual'. In either case,after some `tuning', these groups are recovered. The results from the normalized LRAare di erently interpreted as showing that the source of sand used to make the glassdi ered. These results are complementary. One relates to the recipe used. The otherrelates to the composition (and presumed sources) of one of the ingredients. It seemsto be axiomatic in some expositions of LRA that statistical analysis of compositionaldata should focus on relative variation via the use of ratios. Our analysis suggests thatabsolute di erences can also be informative

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A joint distribution of two discrete random variables with finite support can be displayed as a two way table of probabilities adding to one. Assume that this table hasn rows and m columns and all probabilities are non-null. This kind of table can beseen as an element in the simplex of n · m parts. In this context, the marginals areidentified as compositional amalgams, conditionals (rows or columns) as subcompositions. Also, simplicial perturbation appears as Bayes theorem. However, the Euclideanelements of the Aitchison geometry of the simplex can also be translated into the tableof probabilities: subspaces, orthogonal projections, distances.Two important questions are addressed: a) given a table of probabilities, which isthe nearest independent table to the initial one? b) which is the largest orthogonalprojection of a row onto a column? or, equivalently, which is the information in arow explained by a column, thus explaining the interaction? To answer these questionsthree orthogonal decompositions are presented: (1) by columns and a row-wise geometric marginal, (2) by rows and a columnwise geometric marginal, (3) by independenttwo-way tables and fully dependent tables representing row-column interaction. Animportant result is that the nearest independent table is the product of the two (rowand column)-wise geometric marginal tables. A corollary is that, in an independenttable, the geometric marginals conform with the traditional (arithmetic) marginals.These decompositions can be compared with standard log-linear models.Key words: balance, compositional data, simplex, Aitchison geometry, composition,orthonormal basis, arithmetic and geometric marginals, amalgam, dependence measure,contingency table

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sphingomonas wittichii RW1 is a bacterium isolated for its ability to degrade the xenobiotic compounds dibenzodioxin and dibenzofuran (DBF). A number of genes involved in DBF degradation have been previously characterized, such as the dxn cluster, dbfB, and the electron transfer components fdx1, fdx3, and redA2. Here we use a combination of whole genome transcriptome analysis and transposon library screening to characterize RW1 catabolic and other genes implicated in the reaction to or degradation of DBF. To detect differentially expressed genes upon exposure to DBF, we applied three different growth exposure experiments, using either short DBF exposures to actively growing cells or growing them with DBF as sole carbon and energy source. Genome-wide gene expression was examined using a custom-made microarray. In addition, proportional abundance determination of transposon insertions in RW1 libraries grown on salicylate or DBF by ultra-high throughput sequencing was used to infer genes whose interruption caused a fitness loss for growth on DBF. Expression patterns showed that batch and chemostat growth conditions, and short or long exposure of cells to DBF produced very different responses. Numerous other uncharacterized catabolic gene clusters putatively involved in aromatic compound metabolism increased expression in response to DBF. In addition, only very few transposon insertions completely abolished growth on DBF. Some of those (e.g., in dxnA1) were expected, whereas others (in a gene cluster for phenylacetate degradation) were not. Both transcriptomic data and transposon screening suggest operation of multiple redundant and parallel aromatic pathways, depending on DBF exposure. In addition, increased expression of other non-catabolic genes suggests that during initial exposure, S. wittichii RW1 perceives DBF as a stressor, whereas after longer exposure, the compound is recognized as a carbon source and metabolized using several pathways in parallel.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Acute myeloid leukemia arising from chronic myelomonocytic leukemia is currently classified as acute myeloid leukemia with myelodysplasia-related changes, a high-risk subtype. However, the specific features of these cases have not been well described. We studied 38 patients with chronic myelomonocytic leukemia who progressed to acute myeloid leukemia. We compared the clinicopathologic and genetic features of these cases with 180 patients with de novo acute myeloid leukemia and 34 patients with acute myeloid leukemia following myelodysplastic syndromes. We also examined features associated with progression from chronic myelomonocytic leukemia to acute myeloid leukemia by comparing the progressed chronic myelomonocytic leukemia cases with a cohort of chronic myelomonocytic leukemia cases that did not transform to acute myeloid leukemia. Higher white blood cell count, marrow cellularity, karyotype risk score, and Revised International Prognostic Scoring System score were associated with more rapid progression from chronic myelomonocytic leukemia to acute myeloid leukemia. Patients with acute myeloid leukemia ex chronic myelomonocytic leukemia were older (P<0.01) and less likely to receive aggressive treatment (P=0.02) than de novo acute myeloid leukemia patients. Most cases showed monocytic differentiation and fell into the intermediate acute myeloid leukemia karyotype risk group; 55% had normal karyotype and 17% had NPM1 mutation. Median overall survival was 6 months, which was inferior to de novo acute myeloid leukemia (17 months, P=0.002) but similar to post myelodysplastic syndrome acute myeloid leukemia. On multivariate analysis of all acute myeloid leukemia patients, only age and karyotype were independent prognostic variables for overall survival. Our findings indicate that acute myeloid leukemia following chronic myelomonocytic leukemia displays aggressive behavior and support placement of these cases within the category of acute myeloid leukemia with myelodysplasia-related changes. The poor prognosis of these patients may be related to an older population and lack of favorable-prognosis karyotypes that characterize many de novo acute myeloid leukemia cases.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Previously published scientific papers have reported a negative correlation between drinking water hardness and cardiovascular mortality. Some ecologic and case-control studies suggest the protective effect of calcium and magnesium concentration in drinking water. In this article we present an analysis of this protective relationship in 538 municipalities of Comunidad Valenciana (Spain) from 1991-1998. We used the Spanish version of the Rapid Inquiry Facility (RIF) developed under the European Environment and Health Information System (EUROHEIS) research project. The strategy of analysis used in our study conforms to the exploratory nature of the RIF that is used as a tool to obtain quick and flexible insight into epidemiologic surveillance problems. This article describes the use of the RIF to explore possible associations between disease indicators and environmental factors. We used exposure analysis to assess the effect of both protective factors--calcium and magnesium--on mortality from cerebrovascular (ICD-9 430-438) and ischemic heart (ICD-9 410-414) diseases. This study provides statistical evidence of the relationship between mortality from cardiovascular diseases and hardness of drinking water. This relationship is stronger in cerebrovascular disease than in ischemic heart disease, is more pronounced for women than for men, and is more apparent with magnesium than with calcium concentration levels. Nevertheless, the protective nature of these two factors is not clearly established. Our results suggest the possibility of protectiveness but cannot be claimed as conclusive. The weak effects of these covariates make it difficult to separate them from the influence of socioeconomic and environmental factors. We have also performed disease mapping of standardized mortality ratios to detect clusters of municipalities with high risk. Further standardization by levels of calcium and magnesium in drinking water shows changes in the maps when we remove the effect of these covariates.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Hydrogeological research usually includes some statistical studies devised to elucidate mean background state, characterise relationships among different hydrochemical parameters, and show the influence of human activities. These goals are achieved either by means of a statistical approach or by mixing modelsbetween end-members. Compositional data analysis has proved to be effective with the first approach, but there is no commonly accepted solution to the end-member problem in a compositional framework.We present here a possible solution based on factor analysis of compositions illustrated with a case study.We find two factors on the compositional bi-plot fitting two non-centered orthogonal axes to the most representative variables. Each one of these axes defines a subcomposition, grouping those variables thatlay nearest to it. With each subcomposition a log-contrast is computed and rewritten as an equilibrium equation. These two factors can be interpreted as the isometric log-ratio coordinates (ilr) of three hiddencomponents, that can be plotted in a ternary diagram. These hidden components might be interpreted as end-members.We have analysed 14 molarities in 31 sampling stations all along the Llobregat River and its tributaries, with a monthly measure during two years. We have obtained a bi-plot with a 57% of explained totalvariance, from which we have extracted two factors: factor G, reflecting geological background enhanced by potash mining; and factor A, essentially controlled by urban and/or farming wastewater. Graphicalrepresentation of these two factors allows us to identify three extreme samples, corresponding to pristine waters, potash mining influence and urban sewage influence. To confirm this, we have available analysisof diffused and widespread point sources identified in the area: springs, potash mining lixiviates, sewage, and fertilisers. Each one of these sources shows a clear link with one of the extreme samples, exceptfertilisers due to the heterogeneity of their composition.This approach is a useful tool to distinguish end-members, and characterise them, an issue generally difficult to solve. It is worth note that the end-member composition cannot be fully estimated but only characterised through log-ratio relationships among components. Moreover, the influence of each endmember in a given sample must be evaluated in relative terms of the other samples. These limitations areintrinsic to the relative nature of compositional data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this project, we have investigated new ways of modelling and analysis of human vasculature from Medical images. The research was divided in two main areas: cerebral vasculature analysis and coronary arteries modeling. Regarding cerebral vasculature analysis, we have studed cerebral aneurysms, internal carotid and the Circle of Willis (CoW). Aneurysms are abnormal vessel enlargements that can rupture causing important cerebral damages or death. The understanding of this pathology, together with its virtual treatment, and image diagnosis and prognosis, includes identification and detailed measurement of the aneurysms. In this context, we have proposed two automatic aneurysm isolation method, to separate the abnormal part of the vessel from the healthy part, to homogenize and speed-up the processing pipeline usually employed to study this pathology, [Cardenes2011TMI, arrabide2011MedPhys]. The results obtained from both methods have been also compared and validatied in [Cardenes2012MBEC]. A second important task here the analysis of the internal carotid [Bogunovic2011Media] and the automatic labelling of the CoW, Bogunovic2011MICCAI, Bogunovic2012TMI]. The second area of research covers the study of coronary arteries, specially coronary bifurcations because there is where the formation of atherosclerotic plaque is more common, and where the intervention is more challenging. Therefore, we proposed a novel modelling method from Computed Tomography Angiography (CTA) images, combined with Conventional Coronary Angiography (CCA), to obtain realistic vascular models of coronary bifurcations, presented in [Cardenes2011MICCAI], and fully validated including phantom experiments in [Cardene2013MedPhys]. The realistic models obtained from this method are being used to simulate stenting procedures, and to investigate the hemodynamic variables in coronary bifurcations in the works submitted in [Morlachi2012, Chiastra2012]. Additionally, another preliminary work has been done to reconstruct the coronary tree from rotational angiography, and published in [Cardenes2012ISBI].

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we examine some of the economic forces that underlie economic growth at the county level. In an effort to describe a much more comprehensive regional economic growth model, we address a variety of different growth hypotheses by introducing a large number of growth related variables. When formulating our hypotheses and specifying our growth model we make liberal use of GIS (geographical information systems) mapping software to “paint” a picture of where growth spots exist. Our empirical estimation indicates that amenities, state and local tax burdens, population, amount of primary agriculture activity, and demographics have important impacts on economic growth.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The transcription factor NFAT5/TonEBP regulates the response of mammalian cells to hypertonicity. However, little is known about the physiopathologic tonicity thresholds that trigger its transcriptional activity in primary cells. Wilkins et al. recently developed a transgenic mouse carrying a luciferase reporter (9xNFAT-Luc) driven by a cluster of NFAT sites, that was activated by calcineurin-dependent NFATc proteins. Since the NFAT site of this reporter was very similar to an optimal NFAT5 site, we tested whether this reporter could detect the activation of NFAT5 in transgenic cells.Results: The 9xNFAT-Luc reporter was activated by hypertonicity in an NFAT5-dependent manner in different types of non-transformed transgenic cells: lymphocytes, macrophages and fibroblasts. Activation of this reporter by the phorbol ester PMA plus ionomycin was independent of NFAT5 and mediated by NFATc proteins. Transcriptional activation of NFAT5 in T lymphocytes was detected at hypertonic conditions of 360–380 mOsm/kg (isotonic conditions being 300 mOsm/kg) and strongly induced at 400 mOsm/kg. Such levels have been recorded in plasma in patients with osmoregulatory disorders and in mice deficient in aquaporins and vasopressin receptor. The hypertonicity threshold required to activate NFAT5 was higher in bone marrow-derived macrophages (430 mOsm/kg) and embryonic fibroblasts (480 mOsm/kg). Activation of the 9xNFAT-Luc reporter by hypertonicity in lymphocytes was insensitive to the ERK inhibitor PD98059, partially inhibited by the PI3-kinase inhibitor wortmannin (0.5 μM) and the PKA inhibitor H89, and substantially downregulated by p38 inhibitors (SB203580 and SB202190) and by inhibition of PI3-kinase-related kinases with 25 μM LY294002. Sensitivity of the reporter to FK506 varied among cell types and was greater in primary T cells than in fibroblasts and macrophages.Conclusion: Our results indicate that NFAT5 is a sensitive responder to pathologic increases in extracellular tonicity in T lymphocytes. Activation of NFAT5 by hypertonicity in lymphocytes was mediated by a combination of signaling pathways that differed from those required in other cell types. We propose that the 9xNFAT-Luc transgenic mouse model might be useful to study the physiopathological regulation of both NFAT5 and NFATc factors in primary cells.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However, this leads to undesired changes, such as mealiness and browning, which affect the quality of the fruit. In this study, a 2-D DIGE approach was designed to screen for differentially accumulated proteins in peach fruit during normal softening as well as under conditions that led to fruit chilling injury. Results:The analysis allowed us to identify 43 spots -representing about 18% of the total number analyzed- that show statistically significant changes. Thirty-nine of the proteins could be identified by mass spectrometry. Some of the proteins that changed during postharvest had been related to peach fruit ripening and cold stress in the past. However, we identified other proteins that had not been linked to these processes. A graphical display of the relationship between the differentially accumulated proteins was obtained using pairwise average-linkage cluster analysis and principal component analysis. Proteins such as endopolygalacturonase, catalase, NADP-dependent isocitrate dehydrogenase, pectin methylesterase and dehydrins were found to be very important for distinguishing between healthy and chill injured fruit. A categorization of the differentially accumulated proteins was performed using Gene Ontology annotation. The results showed that the 'response to stress', 'cellular homeostasis', 'metabolism of carbohydrates' and 'amino acid metabolism' biological processes were affected the most during the postharvest. Conclusions: Using a comparative proteomic approach with 2-D DIGE allowed us to identify proteins that showed stage-specific changes in their accumulation pattern. Several proteins that are related to response to stress, cellular homeostasis, cellular component organization and carbohydrate metabolism were detected as being differentially accumulated. Finally, a significant proportion of the proteins identified had not been associated with softening, cold storage or chilling injury-altered fruit before; thus, comparative proteomics has proven to be a valuable tool for understanding fruit softening and postharvest.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Splenic marginal zone lymphoma (SMZL) is an indolent B-cell lymphoproliferative disorder characterised by 7q32 deletion, but the target genes of this deletion remain unknown. In order to elucidate the genetic target of this deletion, we performed an integrative analysis of the genetic, epigenetic, transcriptomic and miRNomic data. High resolution array comparative genomic hybridization of 56 cases of SMZL delineated a minimally deleted region (2.8 Mb) at 7q32, but showed no evidence of any cryptic homozygous deletion or recurrent breakpoint in this region. Integrated transcriptomic analysis confirmed significant under-expression of a number of genes in this region in cases of SMZL with deletion, several of which showed hypermethylation. In addition, a cluster of 8 miRNA in this region showed under-expression in cases with the deletion, and three (miR-182/96/183) were also significantly under-expressed (P<0.05) in SMZL relative to other lymphomas. Genomic sequencing of these miRNA and IRF5, a strong candidate gene, did not show any evidence of somatic mutation in SMZL. These observations provide valuable guidance for further characterisation of 7q deletion.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

When continuous data are coded to categorical variables, two types of coding are possible: crisp coding in the form of indicator, or dummy, variables with values either 0 or 1; or fuzzy coding where each observation is transformed to a set of "degrees of membership" between 0 and 1, using co-called membership functions. It is well known that the correspondence analysis of crisp coded data, namely multiple correspondence analysis, yields principal inertias (eigenvalues) that considerably underestimate the quality of the solution in a low-dimensional space. Since the crisp data only code the categories to which each individual case belongs, an alternative measure of fit is simply to count how well these categories are predicted by the solution. Another approach is to consider multiple correspondence analysis equivalently as the analysis of the Burt matrix (i.e., the matrix of all two-way cross-tabulations of the categorical variables), and then perform a joint correspondence analysis to fit just the off-diagonal tables of the Burt matrix - the measure of fit is then computed as the quality of explaining these tables only. The correspondence analysis of fuzzy coded data, called "fuzzy multiple correspondence analysis", suffers from the same problem, albeit attenuated. Again, one can count how many correct predictions are made of the categories which have highest degree of membership. But here one can also defuzzify the results of the analysis to obtain estimated values of the original data, and then calculate a measure of fit in the familiar percentage form, thanks to the resultant orthogonal decomposition of variance. Furthermore, if one thinks of fuzzy multiple correspondence analysis as explaining the two-way associations between variables, a fuzzy Burt matrix can be computed and the same strategy as in the crisp case can be applied to analyse the off-diagonal part of this matrix. In this paper these alternative measures of fit are defined and applied to a data set of continuous meteorological variables, which are coded crisply and fuzzily into three categories. Measuring the fit is further discussed when the data set consists of a mixture of discrete and continuous variables.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The general objective of the study was to empirically test a reciprocal model of job satisfaction and life satisfaction while controlling for some social demographic variables. 827 employees working in 34 car dealerships in Northern Quebec (56% responses rate) were surveyed. The multiple item questionnaires were analysed using correlation analysis, chi square and ANOVAs. Results show interesting patterns emerging for the relationships between job and life satisfaction of which 49.2% of all individuals have spillover, 43.5% compensation, and 7.3% segmentation type of relationships. Results, nonetheless, are far richer and the model becomes much more refined when social demographic indicators are taken into account. Globally, social demographic variables demonstrate some effects on each satisfaction individually but also on the interrelation (nature of the relations) between life and work satisfaction.