686 resultados para haplotype
Genome-Wide Analyses Suggest Mechanisms Involving Early B-Cell Development in Canine IgA Deficiency.
Resumo:
Immunoglobulin A deficiency (IgAD) is the most common primary immune deficiency disorder in both humans and dogs, characterized by recurrent mucosal tract infections and a predisposition for allergic and other immune mediated diseases. In several dog breeds, low IgA levels have been observed at a high frequency and with a clinical resemblance to human IgAD. In this study, we used genome-wide association studies (GWAS) to identify genomic regions associated with low IgA levels in dogs as a comparative model for human IgAD. We used a novel percentile groups-approach to establish breed-specific cut-offs and to perform analyses in a close to continuous manner. GWAS performed in four breeds prone to low IgA levels (German shepherd, Golden retriever, Labrador retriever and Shar-Pei) identified 35 genomic loci suggestively associated (p <0.0005) to IgA levels. In German shepherd, three genomic regions (candidate genes include KIRREL3 and SERPINA9) were genome-wide significantly associated (p <0.0002) with IgA levels. A ~20kb long haplotype on CFA28, significantly associated (p = 0.0005) to IgA levels in Shar-Pei, was positioned within the first intron of the gene SLIT1. Both KIRREL3 and SLIT1 are highly expressed in the central nervous system and in bone marrow and are potentially important during B-cell development. SERPINA9 expression is restricted to B-cells and peaks at the time-point when B-cells proliferate into antibody-producing plasma cells. The suggestively associated regions were enriched for genes in Gene Ontology gene sets involving inflammation and early immune cell development.
Resumo:
Leopard Complex spotting occurs in several breeds of horses and is caused by an incompletely dominant allele (LP). Homozygosity for LP is also associated with congenital stationary night blindness (CSNB) in Appaloosa horses. Previously, LP was mapped to a 6 cm region on ECA1 containing the candidate gene TRPM1 (Transient Receptor Potential Cation Channel, Subfamily M, Member 1) and decreased expression of this gene, measured by qRT-PCR, was identified as the likely cause of both spotting and ocular phenotypes. This study describes investigations for a mutation causing or associated with the Leopard Complex and CSNB phenotype in horses. Re-sequencing of the gene and associated splice sites within the 105 624 bp genomic region of TRPM1 led to the discovery of 18 SNPs. Most of the SNPs did not have a predictive value for the presence of LP. However, one SNP (ECA1:108,249,293 C>T) found within intron 11 had a strong (P < 0.0005), but not complete, association with LP and CSNB and thus is a good marker but unlikely to be causative. To further localize the association, 70 SNPs spanning over two Mb including the TRPM1 gene were genotyped in 192 horses from three different breeds segregating for LP. A single 173 kb haplotype associated with LP and CSNB (ECA1: 108,197,355- 108,370,150) was identified. Illumina sequencing of 300 kb surrounding this haplotype revealed 57 SNP variants. Based on their localization within expressed sequences or regions of high sequence conservation across mammals, six of these SNPs were considered to be the most likely candidate mutations. While the precise function of TRPM1 remains to be elucidated, this work solidifies its functional role in both pigmentation and night vision. Further, this work has identified several potential regulatory elements of the TRPM1 gene that should be investigated further in this and other species.
Resumo:
Hypothyroidism is a complex clinical condition found in both humans and dogs, thought to be caused by a combination of genetic and environmental factors. In this study we present a multi-breed analysis of predisposing genetic risk factors for hypothyroidism in dogs using three high-risk breeds-the Gordon Setter, Hovawart and the Rhodesian Ridgeback. Using a genome-wide association approach and meta-analysis, we identified a major hypothyroidism risk locus shared by these breeds on chromosome 12 (p = 2.1x10-11). Further characterisation of the candidate region revealed a shared ~167 kb risk haplotype (4,915,018-5,081,823 bp), tagged by two SNPs in almost complete linkage disequilibrium. This breed-shared risk haplotype includes three genes (LHFPL5, SRPK1 and SLC26A8) and does not extend to the dog leukocyte antigen (DLA) class II gene cluster located in the vicinity. These three genes have not been identified as candidate genes for hypothyroid disease previously, but have functions that could potentially contribute to the development of the disease. Our results implicate the potential involvement of novel genes and pathways for the development of canine hypothyroidism, raising new possibilities for screening, breeding programmes and treatments in dogs. This study may also contribute to our understanding of the genetic etiology of human hypothyroid disease, which is one of the most common endocrine disorders in humans.
Resumo:
Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to resolve phylogenetic trees, but the diploid nature of sequence data remains analytically challenging. Here, we performed a phylogenomic reconstruction of the evolutionary history of the common vole (Microtus arvalis) with a focus on the influence of heterozygosity on the estimation of intraspecific divergence times. We used genome-wide sequence information from 15 voles distributed across the European range. We provide a novel approach to integrate heterozygous information in existing phylogenetic programs by repeated random haplotype sampling from sequences with multiple unphased heterozygous sites. We evaluated the impact of the use of full, partial, or no heterozygous information for tree reconstructions on divergence time estimates. All results consistently showed four deep and strongly supported evolutionary lineages in the vole data. These lineages undergoing divergence processes split only at the end or after the last glacial maximum based on calibration with radiocarbon-dated paleontological material. However, the incorporation of information from heterozygous sites had a significant impact on absolute and relative branch length estimations. Ignoring heterozygous information led to an overestimation of divergence times between the evolutionary lineages of M. arvalis. We conclude that the exclusion of heterozygous sites from evolutionary analyses may cause biased and misleading divergence time estimates in closely related taxa.
Resumo:
The identification of quantitative trait loci (QTL) such as height and their underlying causative variants is still challenging and often requires large sample sizes. In humans hundreds of loci with small effects control the heritable portion of height variability. In domestic animals, typically only a few loci with comparatively large effects explain a major fraction of the heritability. We investigated height at withers in Shetland ponies and mapped a QTL to ECA 6 by genome-wide association (GWAS) using a small cohort of only 48 animals and the Illumina equine SNP70 BeadChip. Fine-mapping revealed a shared haplotype block of 793 kb in small Shetland ponies. The HMGA2 gene, known to be associated with height in horses and many other species, was located in the associated haplotype. After closing a gap in the equine reference genome we identified a non-synonymous variant in the first exon of HMGA2 in small Shetland ponies. The variant was predicted to affect the functionally important first AT-hook DNA binding domain of the HMGA2 protein (c.83G>A; p.G28E). We assessed the functional impact and found impaired DNA binding of a peptide with the mutant sequence in an electrophoretic mobility shift assay. This suggests that the HMGA2 variant also affects DNA binding in vivo and thus leads to reduced growth and a smaller stature in Shetland ponies. The identified HMGA2 variant also segregates in several other pony breeds but was not found in regular-sized horse breeds. We therefore conclude that we identified a quantitative trait nucleotide for height in horses.
Resumo:
INTRODUCTION Known genetic variants with reference to preeclampsia only explain a proportion of the heritable contribution to the development of this condition. The association between preeclampsia and the risk of cardiovascular disease later in life has encouraged the study of genetic variants important in thrombosis and vascular inflammation also in relation to preeclampsia. The von Willebrand factor-cleaving protease, ADAMTS13, plays an important role in micro vascular thrombosis, and partial deficiencies of this enzyme have been observed in association with cardiovascular disease and preeclampsia. However, it remains unknown whether decreased ADAMTS13 levels represent a cause or an effect of the event in placental and cardiovascular disease. METHODS We studied the distribution of three functional genetic variants of ADAMTS13, c.1852C>G (rs28647808), c.4143_4144dupA (rs387906343), and c.3178C>T (rs142572218) in women with preeclampsia and their controls in a nested case-control study from the second Nord-Trøndelag Health Study (HUNT2). We also studied the association between ADAMTS13 activity and preeclampsia, in serum samples procured unrelated in time of the preeclamptic pregnancy. RESULTS No differences were observed in genotype, allele or haplotype frequencies of the different ADAMTS13 variants when comparing cases and controls, and no association to preeclampsia was found with lower levels of ADAMTS13 activity. CONCLUSION Our findings indicate that ADAMTS13 variants and ADAMTS13 activity do not contribute to an increased risk of preeclampsia in the general population.
Resumo:
Retinitis pigmentosa (RP) is a genetically heterogeneous group of retinal degenerations that affects over one million people worldwide. To date, 11 autosomal dominant, 13 autosomal recessive, and 5 X-linked forms of retinitis pigmentosa have been identified through linkage analysis, but the disease-causing genes and mutations have been found for only half of these loci. My research uses a positional candidate cloning approach to identify the gene and mutations responsible for one type of autosomal dominant retinitis pigmentosa, RP10. The premise is that identifying the genes and mutations responsible for disease will provide insight into disease mechanisms and provide treatment options. Previous research mapped the RP10 locus to a 5cM region on chromosome 7q31 between markers D7S686 and D7S530. Linkage and fine-point haplotype analysis was used to reduce and refine the RP10 disease interval to a 4cM region located between D7S2471 and a new marker located 45,000bp telomeric of D7S461. In order to identify genes located in the RP10 interval, an extensive EST map was created of this region. Five EST clusters from this map were analyzed to determine if mutations in these genes cause the RP10 form of retinitis pigmentosa. The genomic structure of a known metabotrophic glutamate receptor, GRMS8, was determined first. DNA sequencing of GRM8 in RP10 family members did not identify any disease-causing mutations. Four other EST clusters (A170, A173, A189, and A258) were characterized and determined to be part of the same gene, UBNL1 (ubinuclein-like 1). The full-length mRNA sequence and genomic structure of UBNL1 was determined and then screened in patients. No disease-causing mutations were identified in any of the RP10 family members tested. Recent data made available with the release of the public and Celera genome assemblies indicates that UBNL1 is outside of the RP10 disease region. Despite this complication, characterization of UBNL1 is still important in the understanding of normal visual processes and it is possible that mutations in UBNL1 could cause other forms of retinopathy. The EST map and list of RP10 candidates will continue to aid others in the search for the RP10 gene and mutations. ^
Resumo:
With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r2 measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. We show that an empirical genotype based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies. ^
Resumo:
Apolipoprotein E (ApoE) plays a major role in the metabolism of high density and low density lipoproteins (HDL and LDL). Its common protein isoforms (E2, E3, E4) are risk factors for coronary artery disease (CAD) and explain between 16 to 23% of the inter-individual variation in plasma apoE levels. Linkage analysis has been completed for plasma apoE levels in the GENOA study (Genetic Epidemiology Network of Atherosclerosis). After stratification of the population by lipoprotein levels and body mass index (BMI) to create more homogeneity with regard to biological context for apoE levels, Hispanic families showed significant linkage on chromosome 17q for two strata (LOD=2.93 at 104 cM for a low cholesterol group, LOD=3.04 at 111 cM for a low cholesterol, high HDLC group). Replication of 17q linkage was observed for apoB and apoE levels in the unstratified Hispanic and African-American populations, and for apoE levels in African-American families. Replication of this 17q linkage in different populations and strata provides strong support for the presence of gene(s) in this region with significant roles in the determination of inter-individual variation in plasma apoE levels. Through a positional and functional candidate gene approach, ten genes were identified in the 17q linked region, and 62 polymorphisms in these genes were genotyped in the GENOA families. Association analysis was performed with FBAT, GEE, and variance-component based tests followed by conditional linkage analysis. Association studies with partial coverage of TagSNPs in the gene coding for apolipoprotein H (APOH) were performed, and significant results were found for 2 SNPs (APOH_20951 and APOH_05407) in the Hispanic low cholesterol strata accounting for 3.49% of the inter-individual variation in plasma apoE levels. Among the other candidate genes, we identified a haplotype block in the ACE1 gene that contains two major haplotypes associated with apoE levels as well as total cholesterol, apoB and LDLC levels in the unstratified Hispanic population. Identifying genes responsible for the remaining 60% of inter-individual variation in plasma apoE level, will yield new insights into the understanding of genetic interactions involved in the lipid metabolism, and a more precise understanding of the risk factors leading to CAD. ^
Resumo:
Hypertension is usually defined as having values of systolic blood pressure ≥140 mmHg, diastolic blood pressure ≥90 mmHg. Hypertension is one of the main adverse effects of glucocorticoid on the cardiovascular system. Glucocorticoids are essential hormones, secreted from adrenal glands in circadian fashion. Glucocorticoid's effect on blood pressure is conveyed by the glucocorticoid receptor (NR3C1), an omnipresent nuclear transcription factor. Although polymorphisms in this gene have long been implicated to be a causal factor for cardiovascular diseases such as hypertension, no study has yet thoroughly interrogated the gene's polymorphisms for their effect on blood pressure levels. Therefore, I have first resequenced ∼30 kb of the gene, encompassing all exons, promoter regions, 5'/3' UTRs as well as at least 1.5 kb of the gene's flanking regions from 114 chromosome 5 monosomic cell lines, comprised of three major American ethnic groups—European American, African American and Mexican American. I observed 115 polymorphisms and 14 common molecularly phased haplotypes. A subset of markers was chosen for genotyping study populations of GENOA (Genetic Epidemiology Network of Atherosclerosis; 1022 non-Hispanic whites, 1228 African Americans and 954 Mexican Americans). Since these study populations include sibships, the family-based association test was performed on 4 blood pressure-related quantitative variables—pulse, systolic blood pressure, diastolic blood pressure and mean arterial pressure. Using these analyses, multiple correlated SNPs are significantly protective against high systolic blood pressure in non-Hispanic whites, which includes rsb198, a SNP formerly associated with beneficial body compositions. Haplotype association analysis also supports this finding and all p-values remained significant after permutation tests. I therefore conclude that multiple correlated SNPs on the gene may confer protection against high blood pressure in non-Hispanic whites. ^
Resumo:
To identify genetic susceptibility loci for severe diabetic retinopathy, 286 Mexican-Americans with type 2 diabetes from Starr County, Texas completed detailed physical and ophthalmologic examinations including fundus photography for diabetic retinopathy grading. 103 individuals with moderate-to-severe non-proliferative diabetic retinopathy or proliferative diabetic retinopathy were defined as cases for this study. DNA samples extracted from study subjects were genotyped using the Affymetrix GeneChip® Human Mapping 100K Set, which includes 116,204 single nucleotide polymorphisms (SNPs) across the whole genome. Single-marker allelic tests and 2- to 8-SNP sliding-window Haplotype Trend Regression implemented in HelixTreeTM were first performed with these direct genotypes to identify genes/regions contributing to the risk of severe diabetic retinopathy. An additional 1,885,781 HapMap Phase II SNPs were imputed from the direct genotypes to expand the genomic coverage for a more detailed exploration of genetic susceptibility to diabetic retinopathy. The average estimated allelic dosage and imputed genotypes with the highest posterior probabilities were subsequently analyzed for associations using logistic regression and Fisher's Exact allelic tests, respectively. To move beyond these SNP-based approaches, 104,572 directly genotyped and 333,375 well-imputed SNPs were used to construct genetic distance matrices based on 262 retinopathy candidate genes and their 112 related biological pathways. Multivariate distance matrix regression was then used to test hypotheses with genes and pathways as the units of inference in the context of susceptibility to diabetic retinopathy. This study provides a framework for genome-wide association analyses, and implicated several genes involved in the regulation of oxidative stress, inflammatory processes, histidine metabolism, and pancreatic cancer pathways associated with severe diabetic retinopathy. Many of these loci have not previously been implicated in either diabetic retinopathy or diabetes. In summary, CDC73, IL12RB2, and SULF1 had the best evidence as candidates to influence diabetic retinopathy, possibly through novel biological mechanisms related to VEGF-mediated signaling pathway or inflammatory processes. While this study uncovered some genes for diabetic retinopathy, a comprehensive picture of the genetic architecture of diabetic retinopathy has not yet been achieved. Once fully understood, the genetics and biology of diabetic retinopathy will contribute to better strategies for diagnosis, treatment and prevention of this disease.^
Resumo:
Clubfoot is a common, complex birth defect affecting 4,000 newborns in the United States and 135,000 world-wide each year. The clubfoot deformity is characterized by inward and rigid downward displacement of one or both feet, along with persistent calf muscle hypoplasia. Despite strong evidence for a genetic liability, there is a limited understanding of the genetic and environmental factors contributing to the etiology of clubfoot. The studies described in this dissertation were performed to identify variants and/or genes associated with clubfoot. Genome-wide linkage scan performed on ten multiplex clubfoot families identified seven new chromosomal regions that provide new areas to search for clubfoot genes. Troponin C (TNNC2) the strongest candidate gene, located in 20q12-q13.11, is involved in muscle contraction. Exon sequencing of TNNC2 did not identify any novel coding variants. Interrogation of fifteen muscle contraction genes found strong associations with SNPs located in potential regulatory regions of TPM1 (rs4075583 and rs3805965), TPM2 (rs2025126 and rs2145925) and TNNC2 (rs383112 and rs437122). In previous studies, a strong association was found with rs3801776 located in the basal promoter of HOXA9, a gene also involved in muscle development and patterning. Altogether, this data suggests that SNPs located in potential regulatory regions of genes involved in muscle development and function could alter transcription factor binding leading to changes in gene expression. Functional analysis of 3801776/HOXA9, rs2025126/TPM2 and rs2145925/TPM2 showed altered protein binding, which significantly influenced promoter activity. Although the ancestral allele (G) of rs4075583/TPM1 creates a DNA-protein complex, it did not affect TPM1 promoter activity. However and importantly, in the context of a haplotype, rs4075583/G significantly decreased TPM1 promoter activity. These results suggest dysregulation of multiple skeletal muscle genes, TPM1, TPM2, TNNC2 and HOXA9, working in concert may contribute to clubfoot. However, specific allelic combinations involving these four regulatory SNPs did not confer a significantly higher risk for clubfoot. Other combinations of these variants are being evaluated. Moreover, these variants may interact with yet to be discovered variants in other genes to confer a higher clubfoot risk. Collectively, we show novel evidence for the role of skeletal muscle genes in clubfoot indicating that there are multiple genetic factors contributing to this complex birth defect.
Resumo:
Despite the key importance of altered oceanic mantle as a repository and carrier of light elements (B, Li, and Be) to depth, its inventory of these elements has hardly been explored and quantified. In order to constrain the systematics and budget of these elements we have studied samples of highly serpentinized (>50%) spinel harzburgite drilled at the Mid-Atlantic Ridge (Fifteen-Twenty Fracture zone, ODP Leg 209, Sites 1272A and 1274A). In-situ analysis by secondary ion mass spectrometry reveals that the B, Li and Be contents of mantle minerals (olivine, orthopyroxene, and clinopyroxene) remain unchanged during serpentinization. B and Li abundances largely correspond to those of unaltered mantle minerals whereas Be is close to the detection limit. The Li contents of clinopyroxene are slightly higher (0.44-2.8 µg/g) compared to unaltered mantle clinopyroxene, and olivine and clinopyroxene show an inverse Li partitioning compared to literature data. These findings along with textural observations and major element composition obtained from microprobe analysis suggest reaction of the peridotites with a mafic silicate melt before serpentinization. Serpentine minerals are enriched in B (most values between 10 and 100 µg/g), depleted in Li (most values below 1 µg/g) compared to the primary phases, with considerable variation within and between samples. Be is at the detection limit. Analysis of whole rock samples by prompt gamma activation shows that serpentinization tends to increase B (10.4-65.0 µg/g), H2O and Cl contents and to lower Li contents (0.07-3.37 µg/g) of peridotites, implying that-contrary to alteration of oceanic crust-B is fractionated from Li and that the B and Li inventory should depend essentially on rock-water ratios. Based on our results and on literature data, we calculate the inventory of B and Li contained in the oceanic lithosphere, and its partitioning between crust and mantle as a function of plate characteristics. We model four cases, an ODP Leg 209-type lithosphere with almost no igneous crust, and a Semail-type lithosphere with a thick igneous crust, both at 1 and 75 Ma, respectively. The results show that the Li contents of the oceanic lithosphere are highly variable (17-307 kg in a column of 1 m * 1 m * thickness of the lithosphere (kg/col)). They are controlled by the primary mantle phases and by altered crust, whereas the B contents (25-904 kg/col) depend entirely on serpentinization. In all cases, large quantities of B reside in the uppermost part of the plate and could hence be easily liberated during slab dehydration. The most prominent input of Li into subduction zones is to be expected from Semail-type lithosphere because most of the Li is stored at shallow levels in the plate. Subducting an ODP Leg 209-type lithosphere would mean only very little Li contribution from the slab. Serpentinized mantle thus plays an important role in B recycling in subduction zones, but it is of lesser importance for Li.
Population genetic and dispersal modeling data for Bathymodiolus mussels from the Mid-Atlantic Ridge
Resumo:
The zip folder comprises a text file and a gzipped tar archive. 1) The text file contains individual genotype data for 90 SNPs, 9 microsatellites and the mitochondrial ND4 gene that were determined in deep-sea hydrothermal vent mussels from the Mid-Atlantic Ridge (genus Bathymodiolus). Mussel specimens are grouped according to the population (pop)/location from which they have been sampled (first column). The remaining columns contain the respective allele/haplotype codes for the different genetic loci (names in the header line). The data file is in CONVERT format and can be directly transformed into different input files for population genetic statistics. 2) The tar archive contains NetCDF files with larval dispersal probabilities for simulated annual larval releases between 1998 and 2007. For each simulated vent location (Menez Gwen, Lucky Strike, Rainbow, Vent 1-10) two NetCDF files are given, one for an assumed pelagic larval duration of 1 year and the other one for an assumed pelagic larval duration of 6 months (6m).