14 results for Linkage

at Duke University


Relevance: 20.00%

Abstract:

We propose a novel unsupervised approach for linking records across arbitrarily many files, while simultaneously detecting duplicate records within files. Our key innovation is to represent the pattern of links between records as a bipartite graph, in which records are directly linked to latent true individuals, and only indirectly linked to other records. This flexible new representation of the linkage structure naturally allows us to estimate the attributes of the unique observable people in the population, calculate k-way posterior probabilities of matches across records, and propagate the uncertainty of record linkage into later analyses. Our linkage structure lends itself to an efficient, linear-time, hybrid Markov chain Monte Carlo algorithm, which overcomes many obstacles encountered by previously proposed methods of record linkage, despite the high-dimensional parameter space. We assess our results on real and simulated data.
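The bipartite representation described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `coreferent_groups`, `matched_pairs`, and `match_probability`, the record keys, and the toy linkage samples are all assumptions made for illustration. Each record points to one latent individual, two records match only when they share that individual, and a k-way match probability is approximated as the fraction of sampled linkage structures in which all k records share one.

```python
from collections import defaultdict
from itertools import combinations


def coreferent_groups(linkage):
    """Group record ids by the latent individual each record is linked to.

    `linkage` maps a record id to a latent-individual id (hypothetical layout).
    """
    groups = defaultdict(list)
    for record, individual in linkage.items():
        groups[individual].append(record)
    return dict(groups)


def matched_pairs(linkage):
    """Record-to-record matches are only implied: two records match
    when they are linked to the same latent individual."""
    pairs = set()
    for records in coreferent_groups(linkage).values():
        for a, b in combinations(sorted(records), 2):
            pairs.add((a, b))
    return pairs


def match_probability(samples, records):
    """Fraction of sampled linkage structures in which all `records`
    share one latent individual (a k-way match for k = len(records))."""
    hits = sum(1 for s in samples if len({s[r] for r in records}) == 1)
    return hits / len(samples)


# Toy example: records are (file, index) pairs; individual ids are arbitrary.
linkage = {
    ("A", 0): 1, ("A", 1): 2,
    ("B", 0): 1, ("C", 0): 1,
    ("C", 1): 2, ("B", 1): 3,
}
```

Because every record carries a single pointer, linking across files and de-duplicating within a file fall out of the same structure, and transitivity of matches holds by construction.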

Relevance: 10.00%

Abstract:

Multiple functions of the beta2-adrenergic receptor (ADRB2) and angiotensin-converting enzyme (ACE) genes warrant studies of their associations with aging-related phenotypes. We focus on multimarker analyses and analyses of the effects of compound genotypes of two polymorphisms in the ADRB2 gene, rs1042713 and rs1042714, and 11 polymorphisms of the ACE gene, on the risk of such an aging-associated phenotype as myocardial infarction (MI). We used the data from a genotyped sample of the Framingham Heart Study Offspring (FHSO) cohort (n = 1500) followed for about 36 years with six examinations. The ADRB2 rs1042714 (C→G) polymorphism and two moderately correlated (r² = 0.77) ACE polymorphisms, rs4363 (A→G) and rs12449782 (A→G), were significantly associated with risks of MI in this aging cohort in multimarker models. Predominantly linked ACE genotypes exhibited opposite effects on MI risks, e.g., the AA (rs12449782) genotype had a detrimental effect, whereas the predominantly linked AA (rs4363) genotype exhibited a protective effect. This trade-off occurs as a result of the opposite effects of rare compound genotypes of the ACE polymorphisms with a single dose of the AG heterozygote. This genetic trade-off is further augmented by the selective modulating effect of the rs1042714 ADRB2 polymorphism. The associations were not altered by adjustment for common MI risk factors. The results suggest that effects of single specific genetic variants of the ADRB2 and ACE genes on MI can be readily altered by gene-gene and/or gene-environment interactions, especially in large heterogeneous samples. Multimarker genetic analyses should benefit studies of complex aging-associated phenotypes.

Relevance: 10.00%

Abstract:

Plants exhibit different developmental strategies than animals; these are characterized by a tight linkage between environmental conditions and development. As plants have neither specialized sensory organs nor a nervous system, intercellular regulators are essential for their development. Recently, major advances have been made in understanding how intercellular regulation is achieved in plants on a molecular level. Plants use a variety of molecules for intercellular regulation: hormones are used as systemic signals that are interpreted at the individual-cell level; receptor peptide-ligand systems regulate local homeostasis; moving transcriptional regulators act in a switch-like manner over small and large distances. Together, these mechanisms coherently coordinate developmental decisions with resource allocation and growth.

Relevance: 10.00%

Abstract:

BACKGROUND: Ganglioside biosynthesis occurs through a multi-enzymatic pathway which at the lactosylceramide step is branched into several biosynthetic series. Lc3 synthase utilizes a variety of galactose-terminated glycolipids as acceptors by establishing a glycosidic bond in the beta-1,3-linkage to GlcNAc to extend the lacto- and neolacto-series gangliosides. In order to examine the lacto-series ganglioside functions in mice, we used gene knockout technology to generate Lc3 synthase gene B3gnt5-deficient mice by two different strategies and compared the phenotypes of the two null mouse groups with each other and with their wild-type counterparts. RESULTS: B3gnt5 gene knockout mutant mice appeared normal in the embryonic stage and, if they survived delivery, remained normal during early life. However, about 9% developed early-stage growth retardation, 11% died postnatally in less than 2 months, and adults tended to die in 5-15 months, demonstrating splenomegaly and notably enlarged lymph nodes. Without lacto-neolacto series gangliosides, both homozygous and heterozygous mice gradually displayed fur loss or obesity, and breeding mice demonstrated reproductive defects. Furthermore, B3gnt5 gene knockout disrupted the functional integrity of B cells, as manifested by a decrease in B-cell numbers in the spleen, germinal center disappearance, and reduced proliferation efficiency in hybridoma fusion. CONCLUSIONS: These novel results demonstrate unequivocally that lacto-neolacto series gangliosides are essential to multiple physiological functions, especially the control of reproductive output, and spleen B-cell abnormality. We also report the generation of anti-IgG response against the lacto-series gangliosides 3'-isoLM1 and 3',6'-isoLD1.

Relevance: 10.00%

Abstract:

Tumor microenvironmental stresses, such as hypoxia and lactic acidosis, play important roles in tumor progression. Although gene signatures reflecting the influence of these stresses are powerful approaches to link expression with phenotypes, they do not fully reflect the complexity of human cancers. Here, we describe the use of latent factor models to further dissect the stress gene signatures in a breast cancer expression dataset. The genes in these latent factors are coordinately expressed in tumors and depict distinct, interacting components of the biological processes. The genes in several latent factors are highly enriched in chromosomal locations. When these factors are analyzed in independent datasets with gene expression and array CGH data, the expression values of these factors are highly correlated with copy number alterations (CNAs) of the corresponding BAC clones in both the cell lines and tumors. Therefore, variation in the expression of these pathway-associated factors is at least partially caused by variation in gene dosage and CNAs among breast cancers. We have also found the expression of two latent factors without any chromosomal enrichment is highly associated with 12q CNA, likely an instance of "trans"-variations in which CNA leads to the variations in gene expression outside of the CNA region. In addition, we have found that factor 26 (1q CNA) is negatively correlated with HIF-1alpha protein and hypoxia pathways in breast tumors and cell lines. This agrees with, and for the first time links, known good prognosis associated with both a low hypoxia signature and the presence of CNA in this region. Taken together, these results suggest the possibility that tumor segmental aneuploidy makes significant contributions to variation in the lactic acidosis/hypoxia gene signatures in human cancers and demonstrate that latent factor analysis is a powerful means to uncover such a linkage.

Relevance: 10.00%

Abstract:

We present a novel strategy that uses high-throughput methods of isolating and mapping C. elegans mutants susceptible to pathogen infection. We show that C. elegans mutants that exhibit an enhanced pathogen accumulation (epa) phenotype can be rapidly identified and isolated using a sorting system that allows automation of the analysis, sorting, and dispensing of C. elegans by measuring fluorescent bacteria inside the animals. Furthermore, we validate the use of Amplifluor as a new single nucleotide polymorphism (SNP) mapping technique in C. elegans. We show that a set of 9 SNPs allows the linkage of C. elegans mutants to a 5-8 megabase sub-chromosomal region.

Relevance: 10.00%

Abstract:

BACKGROUND: Microsporidia are obligate intracellular, eukaryotic pathogens that infect a wide range of animals from nematodes to humans, and in some cases, protists. The preponderance of evidence as to the origin of the microsporidia reveals a close relationship with the fungi, either within the kingdom or as a sister group to it. Recent phylogenetic studies and gene order analysis suggest that microsporidia share a particularly close evolutionary relationship with the zygomycetes. METHODOLOGY/PRINCIPAL FINDINGS: Here we expanded this analysis and also examined a putative sex-locus for variability between microsporidian populations. Whole genome inspection reveals a unique syntenic gene pair (RPS9-RPL21) present in the vast majority of fungi and the microsporidians but not in other eukaryotic lineages. Two other unique gene fusions (glutamyl-prolyl tRNA synthetase and ubiquitin-ribosomal subunit S30) that are present in metazoans, choanoflagellates, and filasterean opisthokonts are unfused in the fungi and microsporidians. One locus previously found to be conserved in many microsporidian genomes is similar to the sex locus of zygomycetes in gene order and architecture. Both sex-related and sex loci harbor TPT, HMG, and RNA helicase genes forming a syntenic gene cluster. We sequenced and analyzed the sex-related locus in 11 different Encephalitozoon cuniculi isolates and the sibling species E. intestinalis (3 isolates) and E. hellem (1 isolate). There was no evidence for an idiomorphic sex-related locus in this Encephalitozoon species sample. According to sequence-based phylogenetic analyses, the TPT and RNA helicase genes flanking the HMG genes are paralogous rather than orthologous between zygomycetes and microsporidians. 
CONCLUSION/SIGNIFICANCE: The unique genomic hallmarks shared by microsporidia and fungi are independent of sequence-based phylogenetic comparisons; they further help define the borders of the fungal kingdom and support the classification of microsporidia as unusual derived fungi. The sex/sex-related loci also appear to have been subject to frequent gene conversion and translocations in microsporidia and zygomycetes.

Relevance: 10.00%

Abstract:

The Bateson-Dobzhansky-Muller model posits that hybrid incompatibilities result from genetic changes that accumulate during population divergence. Indeed, much effort in recent years has been devoted to identifying genes associated with hybrid incompatibilities, often with limited success, suggesting that hybrid sterility and inviability are frequently caused by complex interactions between multiple loci and not by single or a small number of gene pairs. Our previous study showed that the nature of epistasis between sterility-conferring QTL in the Drosophila persimilis-D. pseudoobscura bogotana species pair is highly specific. Here, we further dissect one of the three QTL underlying hybrid male sterility between these species and provide evidence for multiple factors within this QTL. This result indicates that the number of loci thought to contribute to hybrid dysfunction may have been underestimated, and we discuss how linkage and complex epistasis may be characteristic of the genetics of hybrid incompatibilities. We further pinpoint the location of one locus that confers hybrid male sterility when homozygous, dubbed "mule-like", to roughly 250 kilobases.

Relevance: 10.00%

Abstract:

The Haemophilus influenzae HMW1 adhesin is a high-molecular weight protein that is secreted by the bacterial two-partner secretion pathway and mediates adherence to respiratory epithelium, an essential early step in the pathogenesis of H. influenzae disease. In recent work, we discovered that HMW1 is a glycoprotein and undergoes N-linked glycosylation at multiple asparagine residues with simple hexose units rather than N-acetylated hexose units, revealing an unusual N-glycosidic linkage and suggesting a new glycosyltransferase activity. Glycosylation protects HMW1 against premature degradation during the process of secretion and facilitates HMW1 tethering to the bacterial surface, a prerequisite for HMW1-mediated adherence. In the current study, we establish that the enzyme responsible for glycosylation of HMW1 is a protein called HMW1C, which is encoded by the hmw1 gene cluster and shares homology with a group of bacterial proteins that are generally associated with two-partner secretion systems. In addition, we demonstrate that HMW1C is capable of transferring glucose and galactose to HMW1 and is also able to generate hexose-hexose bonds. Our results define a new family of bacterial glycosyltransferases.

Relevance: 10.00%

Abstract:

Complex diseases will have multiple functional sites, and it will be invaluable to understand the cross-locus interaction in terms of linkage disequilibrium (LD) between those sites (epistasis) in addition to the haplotype-LD effects. We investigated the statistical properties of a class of matrix-based statistics to assess this epistasis. These statistical methods include two LD contrast tests (Zaykin et al., 2006) and partial least squares regression (Wang et al., 2008). To estimate Type 1 error rates and power, we simulated multiple two-variant disease models using the SIMLA software package. SIMLA allows for the joint action of up to two disease genes in the simulated data with all possible multiplicative interaction effects between them. Our goal was to detect an interaction between multiple disease-causing variants by means of their LD patterns with other markers. We measured the effects that marginal disease effect size, haplotype LD, disease prevalence, and minor allele frequency have on cross-locus interaction (epistasis). In the setting of strong allele effects and strong interaction, the correlation between the two disease genes was weak (r = 0.2). In a complex system with multiple correlations (both marginal and interaction), it was difficult to determine the source of a significant result. Despite these complications, the partial least squares and modified LD contrast methods maintained adequate power to detect the epistatic effects; however, for many of the analyses we often could not separate interaction from a strong marginal effect. While we did not exhaust the entire parameter space of possible models, we do provide guidance on the effects that population parameters have on cross-locus interaction.
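For readers unfamiliar with the correlation measure quoted above, the following sketch computes the standard pairwise LD statistics (D, r, and r²) between two biallelic loci from haplotype and allele frequencies. It illustrates only the textbook definition; it does not reproduce the LD contrast tests or the partial least squares method evaluated in the abstract, and the function name `ld_stats` and its parameters are assumptions introduced here.

```python
import math


def ld_stats(p_ab, p_a, p_b):
    """Return (D, r, r_squared) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1 and B at locus 2
    p_a:  frequency of allele A at locus 1
    p_b:  frequency of allele B at locus 2
    """
    for p in (p_a, p_b):
        if not 0.0 < p < 1.0:
            raise ValueError("allele frequencies must lie strictly between 0 and 1")
    # D measures how far the haplotype frequency departs from independence.
    d = p_ab - p_a * p_b
    # r rescales D to a correlation coefficient between the two loci.
    r = d / math.sqrt(p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
    return d, r, r * r
```

When the two alleles are independent, `p_ab` equals `p_a * p_b`, so D and r are both zero; when they always occur together with equal frequencies, r reaches 1.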

Relevance: 10.00%

Abstract:

BACKGROUND: We have previously shown that a functional polymorphism of the UGT2B15 gene (rs1902023) was associated with increased risk of prostate cancer (PC). Novel functional polymorphisms of the UGT2B17 and UGT2B15 genes have been recently characterized by in vitro assays but have not been evaluated in epidemiologic studies. METHODS: Fifteen functional SNPs of the UGT2B17 and UGT2B15 genes, including cis-acting UGT2B gene SNPs, were genotyped in African American and Caucasian men (233 PC cases and 342 controls). Regression models were used to analyze the association between SNPs and PC risk. RESULTS: After adjusting for race, age and BMI, we found that six UGT2B15 SNPs (rs4148269, rs3100, rs9994887, rs13112099, rs7686914 and rs7696472) were associated with an increased risk of PC in log-additive models (p < 0.05). A SNP cis-acting on UGT2B17 and UGT2B15 expression (rs17147338) was also associated with increased risk of prostate cancer (OR = 1.65, 95% CI = 1.00-2.70); while a stronger association among men with high Gleason sum was observed for SNPs rs4148269 and rs3100. CONCLUSIONS: Although small sample size limits inference, we report novel associations between UGT2B15 and UGT2B17 variants and PC risk. These associations with PC risk in men with high Gleason sum, more frequently found in African American men, support the relevance of genetic differences in the androgen metabolism pathway, which could explain, in part, the high incidence of PC among African American men. Larger studies are required.

Relevance: 10.00%

Abstract:

BACKGROUND: The wealth of phenotypic descriptions documented in the published articles, monographs, and dissertations of phylogenetic systematics is traditionally reported in a free-text format, and it is therefore largely inaccessible for linkage to biological databases for genetics, development, and phenotypes, and difficult to manage for large-scale integrative work. The Phenoscape project aims to represent these complex and detailed descriptions with rich and formal semantics that are amenable to computation and integration with phenotype data from other fields of biology. This entails reconceptualizing the traditional free-text characters into the computable Entity-Quality (EQ) formalism using ontologies. METHODOLOGY/PRINCIPAL FINDINGS: We used ontologies and the EQ formalism to curate a collection of 47 phylogenetic studies on ostariophysan fishes (including catfishes, characins, minnows, knifefishes) and their relatives with the goal of integrating these complex phenotype descriptions with information from an existing model organism database (zebrafish, http://zfin.org). We developed a curation workflow for the collection of character, taxonomic and specimen data from these publications. A total of 4,617 phenotypic characters (10,512 states) for 3,449 taxa, primarily species, were curated into EQ formalism (for a total of 12,861 EQ statements) using anatomical and taxonomic terms from teleost-specific ontologies (Teleost Anatomy Ontology and Teleost Taxonomy Ontology) in combination with terms from a quality ontology (Phenotype and Trait Ontology). Standards and guidelines for consistently and accurately representing phenotypes were developed in response to the challenges that were evident from two annotation experiments and from feedback from curators. 
CONCLUSIONS/SIGNIFICANCE: The challenges we encountered and many of the curation standards and methods for improving consistency that we developed are generally applicable to any effort to represent phenotypes using ontologies. This is because an ontological representation of the detailed variations in phenotype, whether between mutant and wildtype, among individual humans, or across the diversity of species, requires a process by which a precise combination of terms from domain ontologies is selected and organized according to logical relations. The efficiencies that we have developed in this process will be useful for any attempt to annotate complex phenotypic descriptions using ontologies. We also discuss some ramifications of EQ representation for the domain of systematics.
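To make the Entity-Quality idea concrete, here is a minimal sketch of how one free-text character state might be held as a structured EQ statement. The class names `Term` and `EQStatement`, the field layout, and the example taxon are hypothetical; the Phenoscape data model is richer than this, and no real ontology identifiers are given because the abstract does not list any.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Term:
    """A term drawn from a named ontology (label only; ids omitted on purpose)."""
    ontology: str
    label: str


@dataclass(frozen=True)
class EQStatement:
    """One phenotype assertion: a taxon exhibits an entity with a quality."""
    taxon: Term
    entity: Term
    quality: Term

    def render(self):
        # Human-readable form of the structured statement.
        return f"{self.taxon.label}: {self.entity.label} is {self.quality.label}"


# Illustrative only: the free-text character state "dorsal fin absent"
# recast as entity + quality for a placeholder taxon.
statement = EQStatement(
    taxon=Term("Teleost Taxonomy Ontology", "example species"),
    entity=Term("Teleost Anatomy Ontology", "dorsal fin"),
    quality=Term("Phenotype and Trait Ontology", "absent"),
)
```

Holding the entity and the quality as separate ontology terms is what lets statements from different publications be compared or queried by shared terms instead of by matching free text.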

Relevance: 10.00%

Abstract:

Associating genetic variation with quantitative measures of gene regulation offers a way to bridge the gap between genotype and complex phenotypes. In order to identify quantitative trait loci (QTLs) that influence the binding of a transcription factor in humans, we measured binding of the multifunctional transcription and chromatin factor CTCF in 51 HapMap cell lines. We identified thousands of QTLs in which genotype differences were associated with differences in CTCF binding strength, hundreds of them confirmed by directly observable allele-specific binding bias. The majority of QTLs were either within 1 kb of the CTCF binding motif, or in linkage disequilibrium with a variant within 1 kb of the motif. On the X chromosome we observed three classes of binding sites: a minority class bound only to the active copy of the X chromosome, the majority class bound to both the active and inactive X, and a small set of female-specific CTCF sites associated with two non-coding RNA genes. In sum, our data reveal extensive genetic effects on CTCF binding, both direct and indirect, and identify a diversity of patterns of CTCF binding on the X chromosome.