93 resultados para speaker clustering
Resumo:
Atherogenic dyslipidemia, manifest by low HDL-cholesterol and high TG levels, is an important component of ATP-III defined metabolic syndrome. Here, we dissected the phenotypic and genetic architecture of these traits by assessing their relationships with other metabolically relevant measures, including plasma adipo-cytokines, highly sensitive C-reactive protein (hsCRP) and LDL particle size, in a large family data set (n=2800) and in an independent set of dyslipidemic cases (n=716) and normolipidemic controls (n=1073). We explored the relationships among these phenotypes using variable clustering and then estimated their genetic heritabilities and cross-trait correlations. In families, four clusters explained 61% of the total variance, with one adiposity-related cluster (including hsCRP), one BP-related cluster, and two lipid-related clusters (HDL-C, TG, adiponectin and LDL particle size; apoB and non-HDL-C). A similar structure was observed in dyslipidemic cases and normolipidemic controls. The genetic correlations in the families largely paralleled the phenotype clustering results, suggesting that common genes having pleiotropic effects contributed to the correlations observed. In summary, our analyses support a model of metabolic syndrome with two major components, body fat and lipids, each with two subcomponents, and quantifies their degree of overlap with each other and with metabolic-syndrome related measures (adipokines, LDL particle size and hsCRP).
Resumo:
BACKGROUND: It is unknown why patients with extensive ulcerative colitis (UC) have a higher risk of colorectal cancer compared with patients with left-sided UC. This study characterizes the inflammatory processes in left-sided UC, pancolitis, and UC-associated dysplasia at the transcriptional level to identify potential biomarkers and transcripts of importance for the carcinogenic behavior of chronic inflammation. METHODS: The Affymetrix GeneChip Human Genome U133 Plus 2.0 was applied on colonic biopsies from UC patients with left-sided UC, pancolitis, dysplasia, and controls. Reverse transcription polymerase chain reaction and immunohistochemistry were performed for validating selected transcripts in the initial cohort and in 2 independent cohorts of patients with UC. Microarray data were analyzed by principal component analysis, and reverse transcription polymerase chain reaction and immunohistochemistry data by the Wilcoxon's rank-sum test. RESULTS: The principal component analysis results revealed separate clusters for left-sided UC, pancolitis, dysplasia, and controls. Close clustering of dysplastic and pancolitic samples indicated similarities in gene expression. Indeed, 101 and 656 parallel upregulated and downregulated transcripts, respectively, were identified in specimens from dysplasia and pancolitis. Validation of selected transcripts hereof identified insulin receptor alpha (INSRA) and MAP kinase interacting serine/threonine kinase 2 (MKNK2) with an enhanced expression in dysplasia compared with left-sided UC and controls, whereas laminin γ2 (LAMC2) was found with a lower expression in dysplasia compared with the remaining 3 groups. CONCLUSIONS: This study demonstrates pancolitis and left-sided UC as distinct inflammatory processes at the transcriptional level, and identifies INSRA, MKNK2, and LAMC2 as potential critical transcripts in the inflammation-driven preneoplastic process of UC.
Resumo:
We consider electroencephalograms (EEGs) of healthy individuals and compare the properties of the brain functional networks found through two methods: unpartialized and partialized cross-correlations. The networks obtained by partial correlations are fundamentally different from those constructed through unpartial correlations in terms of graph metrics. In particular, they have completely different connection efficiency, clustering coefficient, assortativity, degree variability, and synchronization properties. Unpartial correlations are simple to compute and they can be easily applied to large-scale systems, yet they cannot prevent the prediction of non-direct edges. In contrast, partial correlations, which are often expensive to compute, reduce predicting such edges. We suggest combining these alternative methods in order to have complementary information on brain functional networks.
Resumo:
BACKGROUND: Jeune asphyxiating thoracic dystrophy (JATD) is a rare, often lethal, recessively inherited chondrodysplasia characterised by shortened ribs and long bones, sometimes accompanied by polydactyly, and renal, liver and retinal disease. Mutations in intraflagellar transport (IFT) genes cause JATD, including the IFT dynein-2 motor subunit gene DYNC2H1. Genetic heterogeneity and the large DYNC2H1 gene size have hindered JATD genetic diagnosis. AIMS AND METHODS: To determine the contribution to JATD we screened DYNC2H1 in 71 JATD patients JATD patients combining SNP mapping, Sanger sequencing and exome sequencing. RESULTS AND CONCLUSIONS: We detected 34 DYNC2H1 mutations in 29/71 (41%) patients from 19/57 families (33%), showing it as a major cause of JATD especially in Northern European patients. This included 13 early protein termination mutations (nonsense/frameshift, deletion, splice site) but no patients carried these in combination, suggesting the human phenotype is at least partly hypomorphic. In addition, 21 missense mutations were distributed across DYNC2H1 and these showed some clustering to functional domains, especially the ATP motor domain. DYNC2H1 patients largely lacked significant extra-skeletal involvement, demonstrating an important genotype-phenotype correlation in JATD. Significant variability exists in the course and severity of the thoracic phenotype, both between affected siblings with identical DYNC2H1 alleles and among individuals with different alleles, which suggests the DYNC2H1 phenotype might be subject to modifier alleles, non-genetic or epigenetic factors. Assessment of fibroblasts from patients showed accumulation of anterograde IFT proteins in the ciliary tips, confirming defects similar to patients with other retrograde IFT machinery mutations, which may be of undervalued potential for diagnostic purposes.
Resumo:
BACKGROUND: Solexa/Illumina short-read ultra-high throughput DNA sequencing technology produces millions of short tags (up to 36 bases) by parallel sequencing-by-synthesis of DNA colonies. The processing and statistical analysis of such high-throughput data poses new challenges; currently a fair proportion of the tags are routinely discarded due to an inability to match them to a reference sequence, thereby reducing the effective throughput of the technology. RESULTS: We propose a novel base calling algorithm using model-based clustering and probability theory to identify ambiguous bases and code them with IUPAC symbols. We also select optimal sub-tags using a score based on information content to remove uncertain bases towards the ends of the reads. CONCLUSION: We show that the method improves genome coverage and number of usable tags as compared with Solexa's data processing pipeline by an average of 15%. An R package is provided which allows fast and accurate base calling of Solexa's fluorescence intensity files and the production of informative diagnostic plots.
Resumo:
Overweight and obesity are associated with arterial hypertension. Given the large increase in the obesity prevalence worldwide, the number of obese patients with hypertension is likely to increase substantially in the near future. Overweight and obese patients are exposed to an important metabolic and cardiovascular risk. The understanding of the mechanisms linking obesity to hypertension is important for specific prevention and therapy in this population. There is some evidence that obesity is associated with an increased aldosterone level. To date, 2 mechanisms may explain the interaction of fat tissue with the renin-angiotensin-aldosterone system, and therefore explain, in part, obesity-related hypertension. First, human adipose tissue produces several components of the renin-angiotensin-aldosterone system, mainly adipose tissue-derived angiotensinogen. Second, increased fatty acid production in the obese patient, especially nonesterified fatty acids, might stimulate aldosterone production, independent of renin. A better understanding of these mechanisms might have implications for the management of hypertension in overweight and obese patients. Because aldosterone also is associated with blood glucose and blood lipids, selective aldosterone blockade may represent a particularly attractive therapeutic strategy in obese patients with a clustering of cardiovascular risk factors.
Resumo:
Although Leontopodium alpinum is considered to be threatened in many countries, only limited scientific information about its autecology is available. In this study, we aim to define the most important ecological factors which influence the distribution of L. alpinum in the Swiss Alps. These were assessed at the national scale using species distribution models based on topoclimatic predictors and at the community scale using exhaustive plant inventories. The latter were analysed using hierarchical clustering and principal component analysis, and the results were interpreted using ecological indicator values. L. alpinum was found almost exclusively on base-rich bedrocks (limestone and ultramaphic rocks). The species distribution models showed that the available moisture (dry regions, mostly in the Inner Alps), elevation (mostly above 2000 m.a.s.l.) and slope (mostly >30°) were the most important predictors. The relevés showed that L. alpinum is present in a wide range of plant communities, all subalpine-alpine open grasslands, with a low grass cover. As a light-demanding and short species, L. alpinum requires light at ground level; hence, it can only grow in open, nutrient-poor grasslands. These conditions are met in dry conditions (dry, summer-warm climate, rocky and draining soil, south-facing aspect and/or steep slope), at high elevations, on oligotrophic soils and/or on windy ridges. Base-rich soils appear to also be essential, although it is still unclear if this corresponds to physiological or ecological (lower competition) requirements.
Resumo:
Background/Aims. Recently, peripheral blood mononuclear cell transcriptome analysis has identified genes that are upregulated in relapsing minimal-change nephrotic syndrome (MCNS). In order to investigate protein expression in peripheral blood mononuclear cells (PBMC) from relapsing MCNS patients, we performed proteomic comparisons of PBMC from patients with MCNS in relapse and controls. METHODS: PBMC from a total of 20 patients were analysed. PBMC were taken from five patients with relapsing MCNS, four in remission, five patients with other glomerular diseases and six controls. Two dimensional electrophoresis was performed and proteome patterns were compared. RESULTS: Automatic heuristic clustering analysis allowed us to pool correctly the gels from the MCNS patients in the relapse and in the control groups. Using hierarchical population matching, nine spots were found to be increased in PBMC from MCNS patients in relapse. Four spots were identified by mass spectrometry. Three of the four proteins identified (L-plastin, alpha-tropomyosin and annexin III) were cytoskeletal-associated proteins. Using western blot and immunochemistry, L-plastin and alpha-tropomyosin 3 concentrations were found to be enhanced in PBMC from MCNS patients in relapse. Conclusions. These data indicate that a specific proteomic profile characterizes PBMC from MCNS patients in relapse. Proteins involved in PBMC cytoskeletal rearrangement are increased in relapsing MCNS. We hypothesize that T-cell cytoskeletal rearrangement may play a role in the pathogenesis of MCNS by altering the expression of cell surface receptors and by modifying the interaction of these cells with glomerular cells.
Resumo:
BACKGROUND & AIMS: Regulation of gene expression in the follicle-associated epithelium (FAE) over Peyer's patches is largely unknown. CCL20, a chemokine that recruits immature dendritic cells, is one of the few FAE-specific markers described so far. Lymphotoxin beta (LTalpha1beta2) expressed on the membrane of immune cells triggers CCL20 expression in enterocytes. In this study, we measured expression profiles of LTalpha1beta2-treated intestinal epithelial cells and selected CCL20 -coregulated genes to identify new FAE markers. METHODS: Genomic profiles of T84 and Caco-2 cell lines treated with either LTalpha1beta2, flagellin, or tumor necrosis factor alpha were measured using the Affymetrix GeneChip U133A. Clustering analysis was used to select CCL20 -coregulated genes, and laser dissection microscopy and real-time polymerase chain reaction on human biopsy specimens was used to assess the expression of the selected markers. RESULTS: Applying a 2-way analysis of variance, we identified regulated genes upon the different treatments. A subset of genes involved in inflammation and related to the nuclear factor kappaB pathway was coregulated with CCL20 . Among these genes, the antiapoptotic factor TNFAIP3 was highly expressed in the FAE. CCL23 , which was not coregulated in vitro with CCL20 , was also specifically expressed in the FAE. CONCLUSIONS: We have identified 2 novel human FAE specifically expressed genes. Most of the CCL20 -coregulated genes did not show FAE-specific expression, suggesting that other signaling pathways are critical to modulate FAE-specific gene expression.
Resumo:
In this study we investigated the effect of medial temporal lobe epilepsy (MTLE) on the global characteristics of brain connectivity estimated by topological measures. We used DSI (Diffusion Spectrum Imaging) to construct a connectivity matrix where the nodes represents the anatomical ROIs and the edges are the connections between any pair of ROIs weighted by the mean GFA/FA values. A significant difference was found between the patient group vs control group in characteristic path length, clustering coefficient and small-worldness. This suggests that the MTLE network is less efficient compared to the network of the control group.
Resumo:
The scenario considered here is one where brain connectivity is represented as a network and an experimenter wishes to assess the evidence for an experimental effect at each of the typically thousands of connections comprising the network. To do this, a univariate model is independently fitted to each connection. It would be unwise to declare significance based on an uncorrected threshold of α=0.05, since the expected number of false positives for a network comprising N=90 nodes and N(N-1)/2=4005 connections would be 200. Control of Type I errors over all connections is therefore necessary. The network-based statistic (NBS) and spatial pairwise clustering (SPC) are two distinct methods that have been used to control family-wise errors when assessing the evidence for an experimental effect with mass univariate testing. The basic principle of the NBS and SPC is the same as supra-threshold voxel clustering. Unlike voxel clustering, where the definition of a voxel cluster is unambiguous, 'clusters' formed among supra-threshold connections can be defined in different ways. The NBS defines clusters using the graph theoretical concept of connected components. SPC on the other hand uses a more stringent pairwise clustering concept. The purpose of this article is to compare the pros and cons of the NBS and SPC, provide some guidelines on their practical use and demonstrate their utility using a case study involving neuroimaging data.
Resumo:
The importance of competition between similar species in driving community assembly is much debated. Recently, phylogenetic patterns in species composition have been investigated to help resolve this question: phylogenetic clustering is taken to imply environmental filtering, and phylogenetic overdispersion to indicate limiting similarity between species. We used experimental plant communities with random species compositions and initially even abundance distributions to examine the development of phylogenetic pattern in species abundance distributions. Where composition was held constant by weeding, abundance distributions became overdispersed through time, but only in communities that contained distantly related clades, some with several species (i.e., a mix of closely and distantly related species). Phylogenetic pattern in composition therefore constrained the development of overdispersed abundance distributions, and this might indicate limiting similarity between close relatives and facilitation/complementarity between distant relatives. Comparing the phylogenetic patterns in these communities with those expected from the monoculture abundances of the constituent species revealed that interspecific competition caused the phylogenetic patterns. Opening experimental communities to colonization by all species in the species pool led to convergence in phylogenetic diversity. At convergence, communities were composed of several distantly related but species-rich clades and had overdispersed abundance distributions. This suggests that limiting similarity processes determine which species dominate a community but not which species occur in a community. Crucially, as our study was carried out in experimental communities, we could rule out local evolutionary or dispersal explanations for the patterns and identify ecological processes as the driving force, underlining the advantages of studying these processes in experimental communities. Our results show that phylogenetic relations between species provide a good guide to understanding community structure and add a new perspective to the evidence that niche complementarity is critical in driving community assembly.
Resumo:
Salmonid populations of many rivers are rapidly declining. One possible explanation is that habitat fragmentation increases genetic drift and reduces the populations' potential to adapt to changing environmental conditions. We measured the genetic and eco-morphological diversity of brown trout (Salmo trutta) in a Swiss stream system, using multivariate statistics and Bayesian clustering. We found large genetic and phenotypic variation within only 40 km of stream length. Eighty-eight percent of all pairwise F(ST) comparisons and 50% of the population comparisons in body shape were significant. High success rates of population assignment tests confirmed the distinctiveness of populations in both genotype and phenotype. Spatial analysis revealed that divergence increased with waterway distance, the number of weirs, and stretches of poor habitat between sampling locations, but effects of isolation-by-distance and habitat fragmentation could not be fully disentangled. Stocking intensity varied between streams but did not appear to erode genetic diversity within populations. A lack of association between phenotypic and genetic divergence points to a role of local adaptation or phenotypically plastic responses to habitat heterogeneity. Indeed, body shape could be largely explained by topographic stream slope, and variation in overall phenotype matched the flow regimes of the respective habitats.
Resumo:
This paper proposes a novel approach for the analysis of illicit tablets based on their visual characteristics. In particular, the paper concentrates on the problem of ecstasy pill seizure profiling and monitoring. The presented method extracts the visual information from pill images and builds a representation of it, i.e. it builds a pill profile based on the pill visual appearance. Different visual features are used to build different image similarity measures, which are the basis for a pill monitoring strategy based on both discriminative and clustering models. The discriminative model permits to infer whether two pills come from the same seizure, while the clustering models groups of pills that share similar visual characteristics. The resulting clustering structure allows to perform a visual identification of the relationships between different seizures. The proposed approach was evaluated using a data set of 621 Ecstasy pill pictures. The results demonstrate that this is a feasible and cost effective method for performing pill profiling and monitoring.
Resumo:
Previous microarray studies on breast cancer identified multiple tumour classes, of which the most prominent, named luminal and basal, differ in expression of the oestrogen receptor alpha gene (ER). We report here the identification of a group of breast tumours with increased androgen signalling and a 'molecular apocrine' gene expression profile. Tumour samples from 49 patients with large operable or locally advanced breast cancers were tested on Affymetrix U133A gene expression microarrays. Principal components analysis and hierarchical clustering split the tumours into three groups: basal, luminal and a group we call molecular apocrine. All of the molecular apocrine tumours have strong apocrine features on histological examination (P=0.0002). The molecular apocrine group is androgen receptor (AR) positive and contains all of the ER-negative tumours outside the basal group. Kolmogorov-Smirnov testing indicates that oestrogen signalling is most active in the luminal group, and androgen signalling is most active in the molecular apocrine group. ERBB2 amplification is commoner in the molecular apocrine than the other groups. Genes that best split the three groups were identified by Wilcoxon test. Correlation of the average expression profile of these genes in our data with the expression profile of individual tumours in four published breast cancer studies suggest that molecular apocrine tumours represent 8-14% of tumours in these studies. Our data show that it is possible with microarray data to divide mammary tumour cells into three groups based on steroid receptor activity: luminal (ER+ AR+), basal (ER- AR-) and molecular apocrine (ER- AR+).