971 resultados para Genetic clustering analysis


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Abstract Background Transcript enumeration methods such as SAGE, MPSS, and sequencing-by-synthesis EST "digital northern", are important high-throughput techniques for digital gene expression measurement. As other counting or voting processes, these measurements constitute compositional data exhibiting properties particular to the simplex space where the summation of the components is constrained. These properties are not present on regular Euclidean spaces, on which hybridization-based microarray data is often modeled. Therefore, pattern recognition methods commonly used for microarray data analysis may be non-informative for the data generated by transcript enumeration techniques since they ignore certain fundamental properties of this space. Results Here we present a software tool, Simcluster, designed to perform clustering analysis for data on the simplex space. We present Simcluster as a stand-alone command-line C package and as a user-friendly on-line tool. Both versions are available at: http://xerad.systemsbiology.net/simcluster. Conclusion Simcluster is designed in accordance with a well-established mathematical framework for compositional data analysis, which provides principled procedures for dealing with the simplex space, and is thus applicable in a number of contexts, including enumeration-based gene expression data.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

BACKGROUND: Recurrent airway obstruction (RAO) is a severe chronic respiratory disease affecting horses worldwide, though mostly in the Northern hemisphere. Environmental as well as genetic factors strongly influence the course and prognosis of the disease. Research has been focused on characterization of immunologic factors contributing to inflammatory responses, on genetic linkage analysis, and, more recently, on proteomic analysis of airway secretions from affected horses. The goal of this study was to investigate the interactions between eight candidate genes previously identified in a genetic linkage study and proteins expressed in bronchoalveolar lavage fluid (BALF) collected from healthy and RAO-affected horses. The analysis was carried out with Ingenuity Pathway Analysis(R) bioinformatics software. RESULTS: The gene with the greatest number of indirect interactions with the set of proteins identified is Interleukin 4 Receptor (IL-4R), whose protein has also been detected in BALF. Interleukin 21 receptor and chemokine (C-C motif) ligand 24 also showed a large number of interactions with the group of detected proteins. Protein products of other genes like that of SOCS5, revealed direct interactions with the IL-4R protein. The interacting proteins NOD2, RPS6KA5 and FOXP3 found in several pathways are reported regulators of the NFkappaB pathway. CONCLUSIONS: The pathways generated with IL-4R highlight possible important intracellular signaling cascades implicating, for instance, NFkappaB. Furthermore, the proposed interaction between SOCS5 and IL-4R could explain how different genes can lead to identical clinical RAO phenotypes, as observed in two Swiss Warmblood half sibling families because these proteins interact upstream of an important cascade where they may act as a functional unit.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Thirty microsatellite markers were analysed in 1426 goats from 45 traditional or rare breeds in 15 European and Middle Eastern countries. In all populations inbreeding was indicated by heterozygosity deficiency (mean FIS = 0.10). Genetic differentiation between breeds was moderate with a mean FST value of 0.07, but for most (c. 71%) northern and central European breeds, individuals could be assigned to their breeds with a success rate of more than 80%. Bayesian-based clustering analysis of allele frequencies and multivariate analysis revealed at least four discrete clusters: eastern Mediterranean (Middle East), central Mediterranean, western Mediterranean and central/northern Europe. About 41% of the genetic variability among the breeds could be explained by their geographical origin. A decrease in genetic diversity from the south-east to the north-west was accompanied by an increase in the level of differentiation at the breed level. These observations support the hypothesis that domestic livestock migrated from the Middle East towards western and northern Europe and indicate that breed formation was more systematic in north-central Europe than in the Middle East. We propose that breed differentiation and molecular diversity are independent criteria for conservation.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

BACKGROUND Follicular variant of papillary thyroid carcinoma (FVPTC) shares features of papillary (PTC) and follicular (FTC) thyroid carcinomas on a clinical, morphological, and genetic level. MicroRNA (miRNA) deregulation was extensively studied in PTCs and FTCs. However, very limited information is available for FVPTC. The aim of this study was to assess miRNA expression in FVPTC with the most comprehensive miRNA array panel and to correlate it with the clinicopathological data. METHODS Forty-four papillary thyroid carcinomas (17 FVPTC, 27 classic PTC) and eight normal thyroid tissue samples were analyzed for expression of 748 miRNAs using Human Microarray Assays on the ABI 7900 platform (Life Technologies, Carlsbad, CA). In addition, an independent set of 61 tumor and normal samples was studied for expression of novel miRNA markers detected in this study. RESULTS Overall, the miRNA expression profile demonstrated similar trends between FVPTC and classic PTC. Fourteen miRNAs were deregulated in FVPTC with a fold change of more than five (up/down), including miRNAs known to be upregulated in PTC (miR-146b-3p, -146-5p, -221, -222 and miR-222-5p) and novel miRNAs (miR-375, -551b, 181-2-3p, 99b-3p). However, the levels of miRNA expression were different between these tumor types and some miRNAs were uniquely dysregulated in FVPTC allowing separation of these tumors on the unsupervised hierarchical clustering analysis. Upregulation of novel miR-375 was confirmed in a large independent set of follicular cell derived neoplasms and benign nodules and demonstrated specific upregulation for PTC. Two miRNAs (miR-181a-2-3p, miR-99b-3p) were associated with an adverse outcome in FVPTC patients by a Kaplan-Meier (p < 0.05) and multivariate Cox regression analysis (p < 0.05). CONCLUSIONS Despite high similarity in miRNA expression between FVPTC and classic PTC, several miRNAs were uniquely expressed in each tumor type, supporting their histopathologic differences. Highly upregulated miRNA identified in this study (miR-375) can serve as a novel marker of papillary thyroid carcinoma, and miR-181a-2-3p and miR-99b-3p can predict relapse-free survival in patients with FVPTC thus potentially providing important diagnostic and predictive value.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Lyme disease Borrelia can infect humans and animals for months to years, despite the presence of an active host immune response. The vls antigenic variation system, which expresses the surface-exposed lipoprotein VlsE, plays a major role in B. burgdorferi immune evasion. Gene conversion between vls silent cassettes and the vlsE expression site occurs at high frequency during mammalian infection, resulting in sequence variation in the VlsE product. In this study, we examined vlsE sequence variation in B. burgdorferi B31 during mouse infection by analyzing 1,399 clones isolated from bladder, heart, joint, ear, and skin tissues of mice infected for 4 to 365 days. The median number of codon changes increased progressively in C3H/HeN mice from 4 to 28 days post infection, and no clones retained the parental vlsE sequence at 28 days. In contrast, the decrease in the number of clones with the parental vlsE sequence and the increase in the number of sequence changes occurred more gradually in severe combined immunodeficiency (SCID) mice. Clones containing a stop codon were isolated, indicating that continuous expression of full-length VlsE is not required for survival in vivo; also, these clones continued to undergo vlsE recombination. Analysis of clones with apparent single recombination events indicated that recombinations into vlsE are nonselective with regard to the silent cassette utilized, as well as the length and location of the recombination event. Sequence changes as small as one base pair were common. Fifteen percent of recovered vlsE variants contained "template-independent" sequence changes, which clustered in the variable regions of vlsE. We hypothesize that the increased frequency and complexity of vlsE sequence changes observed in clones recovered from immunocompetent mice (as compared with SCID mice) is due to rapid clearance of relatively invariant clones by variable region-specific anti-VlsE antibody responses.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

PURPOSE: The present study defines genomic loci underlying coordinate changes in gene expression following retinal injury. METHODS: A group of acute phase genes expressed in diverse nervous system tissues was defined by combining microarray results from injury studies from rat retina, brain, and spinal cord. Genomic loci regulating the brain expression of acute phase genes were identified using a panel of BXD recombinant inbred (RI) mouse strains. Candidate upstream regulators within a locus were defined using single nucleotide polymorphism databases and promoter motif databases. RESULTS: The acute phase response of rat retina, brain, and spinal cord was dominated by transcription factors. Three genomic loci control transcript expression of acute phase genes in brains of BXD RI mouse strains. One locus was identified on chromosome 12 and was highly correlated with the expression of classic acute phase genes. Within the locus we identified the inhibitor of DNA binding 2 (Id2) as a candidate upstream regulator. Id2 was upregulated as an acute phase transcript in injury models of rat retina, brain, and spinal cord. CONCLUSIONS: We defined a group of transcriptional changes associated with the retinal acute injury response. Using genetic linkage analysis of natural transcript variation, we identified regulatory loci and candidate regulators that control transcript levels of acute phase genes.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Prostate cancer (PC) is a significant economic and health burden in the U.S. and Europe but its causes are largely unknown. The most significant risk factors (after gender) are age and family history of the disease. A gene with high penetrance but low frequency on chromosome 1q, HPC 1, has been suggested to cause a proportion of the familial aggregation of PC but other more common genes, conferring less risk, are also thought to contribute to disease predisposition. We have pursued a strategy to study both types of genetic risk in PC. To identify high penetrance genes, affected men from thirteen families have been genotyped for genetic linkage analysis at six microsatellite markers spanning 45 cM of 1q24-25. Both LOD score and non-parametric statistics provide no significant support for HPC1 in this genomic region, although 3 of the families did combine to produce a LOD score of 0.9. These families will be included in a genome wide search for other PC predisposition genes as part of a multinational collaboration.^ For study of common genetic factors in PC development, leukocyte DNA samples from an unselected series of 55 patients and 67 controls have been examined for genetic differences in two other candidate genes, the androgen receptor gene, hAR, at Xq11-12, and the vitamin D receptor gene, hVDR, at 12q12-14. hAR was typed for two trinucleotide repeat length polymorphisms, (CAG)$\rm\sb{n}$ and (GGC)$\rm\sb{n},$ encoding polyglutamine and polyglycine tracts, respectively, which have been implicated in PC susceptibility. These data, combined with similarly processed patients and controls from the U.K. show no consistent association of allele length with PC risk. A novel finding, however, has been a significant association between the number of GGC repeats and the length of time between diagnosis and relapse in stage T1-T4 Caucasian patients irrespective of therapy and age of the patient. Of 49 patients who relapsed out of 108 entering the study, those with 16 or fewer GGC repeats had an average relapse-free-period of 101 (+/$-$7.7) months while for those with more than 16 repeats the period averaged 48 (+/$-$2.9) months, a difference of 2.1 fold or 4.4 years.^ The second gene, hVDR, was genotyped at two polymorphisms, a synonymous C/T substitution in exon 9 identified by differential TaqI enzymatic digestion and a variable length polyA tract in the 3$\sp\prime$ UTR. Although these polymorphisms are in strong linkage disequilibrium only the polyA region showed a possible association with PC risk. Men homozygous for alleles with fewer than 18 A's had an increased risk (OR = 3.0, p = 0.0578) compared to controls. This result is opposite to the findings of others and may either indicate off-setting random errors which together balance out to no significant overall effect or reflect more complex genetic and/or environmental associations.^ Overall, this research suggests that single gene familial predisposition may be less prominent in PC than in other cancers and that the characteristics of PC pathology may be useful in identifying the effects of common genetic factors. ^

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We combine multi-wavelength data in the AEGIS-XD and C-COSMOS surveys to measure the typical dark matter halo mass of X-ray selected active galactic nuclei (AGN) [L_X(2–10 keV) > 10^42 erg s^− 1] in comparison with far-infrared selected star-forming galaxies detected in the Herschel/PEP survey (PACS Evolutionary Probe; L_IR > 10^11 L_⊙) and quiescent systems at z ≈ 1. We develop a novel method to measure the clustering of extragalactic populations that uses photometric redshift probability distribution functions in addition to any spectroscopy. This is advantageous in that all sources in the sample are used in the clustering analysis, not just the subset with secure spectroscopy. The method works best for large samples. The loss of accuracy because of the lack of spectroscopy is balanced by increasing the number of sources used to measure the clustering. We find that X-ray AGN, far-infrared selected star-forming galaxies and passive systems in the redshift interval 0.6 < z < 1.4 are found in haloes of similar mass, log M_DMH/(M_⊙ h^−1) ≈ 13.0. We argue that this is because the galaxies in all three samples (AGN, star-forming, passive) have similar stellar mass distributions, approximated by the J-band luminosity. Therefore, all galaxies that can potentially host X-ray AGN, because they have stellar masses in the appropriate range, live in dark matter haloes of log M_DMH/(M_⊙ h^−1) ≈ 13.0 independent of their star formation rates. This suggests that the stellar mass of X-ray AGN hosts is driving the observed clustering properties of this population. We also speculate that trends between AGN properties (e.g. luminosity, level of obscuration) and large-scale environment may be related to differences in the stellar mass of the host galaxies.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Both long-term environmental changes such as those driven by the glacial cycles and more recent anthropogenic impacts have had major effects on the past demography in wild organisms. Within species, these changes are reflected in the amount and distribution of neutral genetic variation. In this thesis, mitochondrial and microsatellite DNA was analysed to investigate how environmental and anthropogenic factors have affected genetic diversity and structure in four ecologically different animal species. Paper I describes the post-glacial recolonisation history of the speckled-wood butterfly (Pararge aegeria) in Northern Europe. A decrease in genetic diversity with latitude and a marked population structure were uncovered, consistent with a hypothesis of repeated founder events during the postglacial recolonisation. Moreover, Approximate Bayesian Computation analyses indicate that the univoltine populations in Scandinavia and Finland originate from recolonisations along two routes, one on each side of the Baltic. Paper II aimed to investigate how past sea-level rises affected the population history of the convict surgeonfish (Acanthurus triostegus) in the Indo-Pacific. Assessment of the species’ demographic history suggested a population expansion that occurred approximately at the end of the last glaciation. Moreover, the results demonstrated an overall lack of phylogeographic structure, probably due to the high dispersal rates associated with the species’ pelagic larval stage. Populations at the species’ eastern range margin were significantly differentiated from other populations, which likely is a consequence of their geographic isolation. In Paper III, we assessed the effect of human impact on the genetic variation of European moose (Alces alces) in Sweden. Genetic analyses revealed a spatial structure with two genetic clusters, one in northern and one in southern Sweden, which were separated by a narrow transition zone. Moreover, demographic inference suggested a recent population bottleneck. The inferred timing of this bottleneck coincided with a known reduction in population size in the 19th and early 20th century due to high hunting pressure. In Paper IV, we examined the effect of an indirect but well-described human impact, via environmental toxic chemicals (PCBs), on the genetic variation of Eurasian otters (Lutra lutra) in Sweden. Genetic clustering assignment revealed differentiation between otters in northern and southern Sweden, but also in the Stockholm region. ABC analyses indicated a decrease in effective population size in both northern and southern Sweden. Moreover, comparative analyses of historical and contemporary samples demonstrated a more severe decline in genetic diversity in southern Sweden compared to northern Sweden, in agreement with the levels of PCBs found.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Background. Whether current criteria used to define nicotine dependence are informative for genetic research is an important empirical question. The authors used items of the DSM-IV and of the Heaviness of Smoking Index to characterize the nicotine dependence phenotype and to identify salient symptoms in a genetically informative community sample of Australian young adult female and mate twins. Method. Phenotypic and genetic factor analyses were performed on nine dependence symptoms (the seven DSM-IV substance dependence criteria and the two Heaviness of Smoking Index (HSI) items derived from the Fagerstrom Tolerance Questionnaire, time to first cigarette in the morning and number of cigarettes smoked per day). Phenotypic and genetic analyses were restricted to ever smokers. Results. Phenotypic nicotine dependence symptom covariation was best captured by two factors with a similar pattern of factor loadings for women and men. In genetic factor analysis item covariation was best captured by two genetic but one shared environmental factor for both women and men; however, item factor loadings differed by gender. All nicotine dependence symptoms were substantially heritable, except for the DSM-IV criterion of 'giving up or reducing important activities in order to smoke', which was weakly familial. Conclusions. The salient behavioral indices of nicotine dependence are similar for women and men. DSM-IV criteria of tolerance, withdrawal, and experiencing difficulty quitting and HSI items time to first cigarette in the morning and number of cigarettes smoked per day may represent the most highly heritable symptoms of nicotine dependence for both women and men.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Seven years of multi-environment yield trials of navy bean (Phaseolus vulgaris L.) grown in Queensland were examined. As is common with plant breeding evaluation trials, test entries and locations varied between years. Grain yield data were analysed for each year using cluster and ordination analyses (pattern analyses). These methods facilitate descriptions of genotype performance across environments and the discrimination among genotypes provided by the environments. The observed trends for genotypic yield performance across environments were partly consistent with agronomic and disease reactions at specific environments and also partly explainable by breeding and selection history. In some cases, similarities in discrimination among environments were related to geographic proximity, in others management practices, and in others similarities occurred between geographically widely separated environments which differed in management practices. One location was identified as having atypical line discrimination. The analysis indicated that the number of test locations was below requirements for adequate representation of line x environment interaction. The pattern analyses methods used were an effective aid in describing the patterns in data for each year and illustrated the variations in adaptive patterns from year to year. The study has implications for assessing the number and location of test sites for plant breeding multi-environment trials, and for the understanding of genetic traits contributing to line x environment interactions.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

2000 Mathematics Subject Classification: 62H30

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Our sleep timing preference, or chronotype, is a manifestation of our internal biological clock. Variation in chronotype has been linked to sleep disorders, cognitive and physical performance, and chronic disease. Here we perform a genome-wide association study of self-reported chronotype within the UK Biobank cohort (n=100,420). We identify 12 new genetic loci that implicate known components of the circadian clock machinery and point to previously unstudied genetic variants and candidate genes that might modulate core circadian rhythms or light-sensing pathways. Pathway analyses highlight central nervous and ocular systems and fear-response-related processes. Genetic correlation analysis suggests chronotype shares underlying genetic pathways with schizophrenia, educational attainment and possibly BMI. Further, Mendelian randomization suggests that evening chronotype relates to higher educational attainment. These results not only expand our knowledge of the circadian system in humans but also expose the influence of circadian characteristics over human health and life-history variables such as educational attainment.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Osteoarthritis (OA) is the most common form of arthritis with a high socioeconomic burden, with an incompletely understood etiology. Evidence suggests a role for the transforming growth factor beta (TGF-ß) signalling pathway and epigenomics in OA. The aim of this thesis was to understand the involvement of the TGF-ß pathway in OA and to determine the DNA methylation patterns of OA-affected cartilage as compared to the OA-free cartilage. First, I found that a common SNP in the BMP2 gene, a ligand in the Bone morphogenetic protein (BMP) subunit of TGF-ß pathway, was associated with OA in the Newfoundland population. I also showed a genetic association between SMAD3 - a signal transducer in the TGF-ß subunit of the TGF-ß signalling pathway - and the total radiographic burden of OA. I further demonstrated that SMAD3 is over-expressed in OA cartilage, suggesting an over activation of the TGF-ß signalling in OA. Next, I examined the connection of these genes in the regulation of matrix metallopeptidase 13 (MMP13) - an enzyme known to destroy extracellular matrix in OA cartilage - in the context of the TGF-ß signalling. The analyses showed that TGF-ß, MMP13, and SMAD3 were overexpressed in OA cartilage, whereas the expression of BMP2 was significantly reduced. The expression of TGF-ß was positively correlated with that of SMAD3 and MMP13, suggesting that TGF-ß signalling is involved in up-regulation of MMP13. This regulation, however, appears not to be controlled by SMAD3 signals, possibly due to the involvement of collateral signalling, and to be suppressed by BMP regulation in healthy cartilage, whose levels were reduced in end-stage OA. In a genome-wide DNA methylation analysis, I reported CpG sites differentially methylated in OA and showed that the cartilage methylome has a potential to distinguish between OA-affected and non-OA cartilage. Functional clustering analysis of the genes harbouring differentially methylated loci revealed that they are enriched in the skeletal system morphogenesis pathway, which could be a potential candidate for further OA studies. Overall, the findings from the present thesis provide evidence that the TGF-ß signalling pathway is associated with the development of OA, and epigenomics might be involved as a potential mechanism in OA.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

With the development of information technology, the theory and methodology of complex network has been introduced to the language research, which transforms the system of language in a complex networks composed of nodes and edges for the quantitative analysis about the language structure. The development of dependency grammar provides theoretical support for the construction of a treebank corpus, making possible a statistic analysis of complex networks. This paper introduces the theory and methodology of the complex network and builds dependency syntactic networks based on the treebank of speeches from the EEE-4 oral test. According to the analysis of the overall characteristics of the networks, including the number of edges, the number of the nodes, the average degree, the average path length, the network centrality and the degree distribution, it aims to find in the networks potential difference and similarity between various grades of speaking performance. Through clustering analysis, this research intends to prove the network parameters’ discriminating feature and provide potential reference for scoring speaking performance.