977 resultados para genetic mapping
Resumo:
This dissertation has three separate parts: the first part deals with the general pedigree association testing incorporating continuous covariates; the second part deals with the association tests under population stratification using the conditional likelihood tests; the third part deals with the genome-wide association studies based on the real rheumatoid arthritis (RA) disease data sets from Genetic Analysis Workshop 16 (GAW16) problem 1. Many statistical tests are developed to test the linkage and association using either case-control status or phenotype covariates for family data structure, separately. Those univariate analyses might not use all the information coming from the family members in practical studies. On the other hand, the human complex disease do not have a clear inheritance pattern, there might exist the gene interactions or act independently. In part I, the new proposed approach MPDT is focused on how to use both the case control information as well as the phenotype covariates. This approach can be applied to detect multiple marker effects. Based on the two existing popular statistics in family studies for case-control and quantitative traits respectively, the new approach could be used in the simple family structure data set as well as general pedigree structure. The combined statistics are calculated using the two statistics; A permutation procedure is applied for assessing the p-value with adjustment from the Bonferroni for the multiple markers. We use simulation studies to evaluate the type I error rates and the powers of the proposed approach. Our results show that the combined test using both case-control information and phenotype covariates not only has the correct type I error rates but also is more powerful than the other existing methods. For multiple marker interactions, our proposed method is also very powerful. Selective genotyping is an economical strategy in detecting and mapping quantitative trait loci in the genetic dissection of complex disease. When the samples arise from different ethnic groups or an admixture population, all the existing selective genotyping methods may result in spurious association due to different ancestry distributions. The problem can be more serious when the sample size is large, a general requirement to obtain sufficient power to detect modest genetic effects for most complex traits. In part II, I describe a useful strategy in selective genotyping while population stratification is present. Our procedure used a principal component based approach to eliminate any effect of population stratification. The paper evaluates the performance of our procedure using both simulated data from an early study data sets and also the HapMap data sets in a variety of population admixture models generated from empirical data. There are one binary trait and two continuous traits in the rheumatoid arthritis dataset of Problem 1 in the Genetic Analysis Workshop 16 (GAW16): RA status, AntiCCP and IgM. To allow multiple traits, we suggest a set of SNP-level F statistics by the concept of multiple-correlation to measure the genetic association between multiple trait values and SNP-specific genotypic scores and obtain their null distributions. Hereby, we perform 6 genome-wide association analyses using the novel one- and two-stage approaches which are based on single, double and triple traits. Incorporating all these 6 analyses, we successfully validate the SNPs which have been identified to be responsible for rheumatoid arthritis in the literature and detect more disease susceptibility SNPs for follow-up studies in the future. Except for chromosome 13 and 18, each of the others is found to harbour susceptible genetic regions for rheumatoid arthritis or related diseases, i.e., lupus erythematosus. This topic is discussed in part III.
Resumo:
The development of a completely annotated sheep genome sequence is a key need for understanding the phylogenetic relationships and genetic diversity among the many different sheep breeds worldwide and for identifying genes controlling economically and physiologically important traits. The ovine genome sequence assembly will be crucial for developing optimized breeding programs based on highly productive, healthy sheep phenotypes that are adapted to modern breeding and production conditions. Scientists and breeders around the globe have been contributing to this goal by generating genomic and cDNA libraries, performing genome-wide and trait-associated analyses of polymorphism, expression analysis, genome sequencing, and by developing virtual and physical comparative maps. The International Sheep Genomics Consortium (ISGC), an informal network of sheep genomics researchers, is playing a major role in coordinating many of these activities. In addition to serving as an essential tool for monitoring chromosome abnormalities in specific sheep populations, ovine molecular cytogenetics provides physical anchors which link and order genome regions, such as sequence contigs, genes and polymorphic DNA markers to ovine chromosomes. Likewise, molecular cytogenetics can contribute to the process of defining evolutionary breakpoints between related species. The selective expansion of the sheep cytogenetic map, using loci to connect maps and identify chromosome bands, can substantially contribute to improving the quality of the annotated sheep genome sequence and will also accelerate its assembly. Furthermore, identifying major morphological chromosome anomalies and micro-rearrangements, such as gene duplications or deletions, that might occur between different sheep breeds and other Ovis species will also be important to understand the diversity of sheep chromosome structure and its implications for cross-breeding. To date, 566 loci have been assigned to specific chromosome regions in sheep and the new cytogenetic map is presented as part of this review. This review will also summarize the current cytogenomic status of the sheep genome, describe current activities in the sheep cytogenomics research sector, and will discuss the cytogenomics data in context with other major sheep genomics projects.
Resumo:
Electrophoretic variants at four additional enzyme loci--two esterases (Est-2, Est-3), retinal lactate dehydrogenase (LDH-1) and mannose phosphate isomerase (MPI)--among three species and four subspecies of fish of the genus Xiphophorus were observed. Electrophoretic patterns in F1 hybrid heterozygotes confirmed the monomeric structures of MPI and the esterase and the tetrametric structure of LDH in these fishes. Variant alleles of all four loci displayed normal Mendelian segregation in backcross and F2 hybrids. Recombination data from backcross hybrids mapped with Haldane's mapping function indicate the four loci to be linked as Est-2--0.43--Est3--0.26--LDH-1--0.19--MPI. Significant interference was detected and apparently concentrated in the Est-3 to MPI region. No significant sex-specific differences in recombination were observed. This group (designated linkage group II) was shown to assort independently from the three loci of linkage group I (adenosine deaminase, glucose-6-phosphate dehydrogenase, and 6-phosphogluconate dehydrogenase) and from glyceraldehyde-3-phosphate dehydrogenase and two isocitrate dehydrogenase loci. Evidence for conservation of the linkage group, at least in part, in other vertebrate species is presented.
Resumo:
OBJECTIVE: To identify systemic sclerosis (SSc) susceptibility loci via a genome-wide association study. METHODS: A genome-wide association study was performed in 137 patients with SSc and 564 controls from Korea using the Affymetrix Human SNP Array 5.0. After fine-mapping studies, the results were replicated in 1,107 SSc patients and 2,747 controls from a US Caucasian population. RESULTS: The single-nucleotide polymorphisms (SNPs) (rs3128930, rs7763822, rs7764491, rs3117230, and rs3128965) of HLA-DPB1 and DPB2 on chromosome 6 formed a distinctive peak with log P values for association with SSc susceptibility (P=8.16x10(-13)). Subtyping analysis of HLA-DPB1 showed that DPB1*1301 (P=7.61x10(-8)) and DPB1*0901 (P=2.55x10(-5)) were the subtypes most susceptible to SSc in Korean subjects. In US Caucasians, 2 pairs of SNPs, rs7763822/rs7764491 and rs3117230/rs3128965, showed strong association with SSc patients who had either circulating anti-DNA topoisomerase I (P=7.58x10(-17)/4.84x10(-16)) or anticentromere autoantibodies (P=1.12x10(-3)/3.2x10(-5)), respectively. CONCLUSION: The results of our genome-wide association study in Korean subjects indicate that the region of HLA-DPB1 and DPB2 contains the loci most susceptible to SSc in a Korean population. The confirmatory studies in US Caucasians indicate that specific SNPs of HLA-DPB1 and/or DPB2 are strongly associated with US Caucasian patients with SSc who are positive for anti-DNA topoisomerase I or anticentromere autoantibodies.
Resumo:
Human x rodent somatic cell hybrids have played an important role in human genetics research. They have been especially useful for assigning genes to chromosomes and isolating DNA markers from specific regions of the human genome.^ By employing a combination of somatic cell genetic, recombinant DNA, and cytogenetic techniques, human DNA excision repair gene ERCC4 was mapped regionally to human 16p13.13-13.2, even though the gene has not been cloned. Human x Chinese hamster ovary (CHO) cell hybrids selected for human ERCC4 activity and containing 16p13.1-p13.3 as the only human genetic material were identified. These hybrids were used to order DNA markers located in 16p13.1-p13.3. New DNA markers physically close to ERCC4 were isolated from such hybrids. Using amplified human DNA from the hybrids as probe in fluorescent in situ hybridization, the short arm breakpoint in the chromosome 16 inversion associated with acute myelomonocytic leukemia (AMML) was found to be physically close to the ERCC4 gene. The physical mapping and eventually, the cloning of the ERCC4 gene, will benefit the understanding of the DNA repair system and the study of other important biomedical problems such as tumorigenesis.^ To facilitate the cloning of ERCC4 gene and, in general, the cloning of genes from any defined regions of the human genome, a method was developed for the direct isolation of human transcribed genes ffom somatic cell hybrids. cDNA was prepared from human x rodent hybrid by using consensus 5$\sp\prime$ splice site sequences as primers. These primers were designed to select immature, unspliced messenger RNA (still retaining species specific repeat sequences) as templates. Screening of a derived cDNA library for human repeat sequences resulted in the isolation of human clones at the anticipated frequency with characteristics expected of exons of transcribed human genes. The usefulness of the splice site specific primers was analyzed and the cDNA synthesis conditions with these primers were optimized. The procedure was shown to be sensitive enough to clone weakly expressed genes. Studying the expression of the represented genes with the isolated clones was shown to be feasible. Such regional specific human gene fragments will be very valuable for many human genetic studies such as the search of inherited disease genes and the construction of a cDNA map of the human genome. ^
Resumo:
BACKGROUND & AIMS: Recently, genetic variations in MICA (lead single nucleotide polymorphism [SNP] rs2596542) were identified by a genome-wide association study (GWAS) to be associated with hepatitis C virus (HCV)-related hepatocellular carcinoma (HCC) in Japanese patients. In the present study, we sought to determine whether this SNP is predictive of HCC development in the Caucasian population as well. METHODS: An extended region around rs2596542 was genotyped in 1924 HCV-infected patients from the Swiss Hepatitis C Cohort Study (SCCS). Pair-wise correlation between key SNPs was calculated both in the Japanese and European populations (HapMap3: CEU and JPT). RESULTS: To our surprise, the minor allele A of rs2596542 in proximity of MICA appeared to have a protective impact on HCC development in Caucasians, which represents an inverse association as compared to the one observed in the Japanese population. Detailed fine-mapping analyses revealed a new SNP in HCP5 (rs2244546) upstream of MICA as strong predictor of HCV-related HCC in the SCCS (univariable p=0.027; multivariable p=0.0002, odds ratio=3.96, 95% confidence interval=1.90-8.27). This newly identified SNP had a similarly directed effect on HCC in both Caucasian and Japanese populations, suggesting that rs2244546 may better tag a putative true variant than the originally identified SNPs. CONCLUSIONS: Our data confirms the MICA/HCP5 region as susceptibility locus for HCV-related HCC and identifies rs2244546 in HCP5 as a novel tagging SNP. In addition, our data exemplify the need for conducting meta-analyses of cohorts of different ethnicities in order to fine map GWAS signals.
Resumo:
White markings and spotting patterns in animal species are thought to be a result of the domestication process. They often serve for the identification of individuals but sometimes are accompanied by complex pathological syndromes. In the Swiss Franches-Montagnes horse population, white markings increased vastly in size and occurrence during the past 30 years, although the breeding goal demands a horse with as little depigmented areas as possible. In order to improve selection and avoid more excessive depigmentation on the population level, we estimated population parameters and breeding values for white head and anterior and posterior leg markings. Heritabilities and genetic correlations for the traits were high (h(2) > 0.5). A strong positive correlation was found between the chestnut allele at the melanocortin-1-receptor gene locus and the extent of white markings. Segregation analysis revealed that our data fit best to a model including a polygenic effect and a biallelic locus with a dominant-recessive mode of inheritance. The recessive allele was found to be the white trait-increasing allele. Multilocus linkage disequilibrium analysis allowed the mapping of the putative major locus to a chromosomal region on ECA3q harboring the KIT gene.
Resumo:
Nitrate reductase in Escherichia coli is a membrane-bound anaerobic enzyme that is repressed by oxygen and induced by nitrate. The genetic organization of the structural genes for the two larger subunits of nitrate reductase ((alpha) and (beta)) was determined by immunoprecipitation analysis of the formation of these proteins in nitrate reductase-deficient mutants resulting from transposon Tn5 mutagenesis. The results suggested that the genes encoding the (alpha) and (beta) subunits (narG and H) were arranged in an operon with transcription in the direction promoter(--->)(alpha)(--->)(beta). Segments of the chromosome containing the Tn5 inserts from several of the mutants were cloned into plasmid pBR322 and the positions of the transposons determined by restriction mapping. The Tn5 insertion sites were localized on two contiguous EcoRI fragments spanning about 6.6 kilobases of DNA. The narI gene (proposed to encode the (gamma) subunit) was positioned immediately downstream from the (beta)-gene (narH) by Southern analysis of Tn10 insertions into the narI locus. A Tn10 insertion into the narK locus, proposed to encode a nitrate-sensitive repressor of other anaerobic enzymes, was located about 1.5 kilobases upstream from the narGHI operon promoter. The narL locus, proposed to encode a nitrate-sensitive positive regulator of the narGHI operon and known to be genetically linked to the other nar genes, was demonstrated to lie outside a 19.3-kilobase region of the chromosome which encompasses the other nar genes. The physical limit of the narGHI promoter was defined by studying the effect of Tn5 insertions into a hybrid plasmid containing the functional operon. The points of origin of the coding regions for the (alpha) and (beta) genes were deduced by alignment of the chromosomal map of Tn5 insertion sites with the sizes of (alpha) and (beta) subunit fragments produced by plasmids carrying these Tn5 inserts in the nar operon. The coding region for the (alpha) subunit (143,000 daltons) begins about 250 nucleotides downstream from the deduced limit of the promoter region and includes about 4.0 kilobases of DNA; the region encoding (beta) (60,000 daltons) lies immediately downstream from the (alpha)-gene and is approximately 1.6 kilobases in length. The adjacent region encoding the (gamma) subunit (19,000 daltons) is approximately 0.5 kilobase in length. ^
Resumo:
Renal cell carcinoma (RCC) is the most common malignant tumor of the kidney. Characterization of RCC tumors indicates that the most frequent genetic event associated with the initiation of tumor formation involves a loss of heterozygosity or cytogenetic aberration on the short arm of human chromosome 3. A tumor suppressor locus Nonpapillary Renal Carcinoma-1 (NRC-1, OMIM ID 604442) has been previously mapped to a 5–7 cM region on chromosome 3p12 and shown to induce rapid tumor cell death in vivo, as demonstrated by functional complementation experiments. ^ To identify the gene that accounts for the tumor suppressor activities of NRC-1, fine-scale physical mapping was conducted with a novel real-time quantitative PCR based method developed in this study. As a result, NRC-1 was mapped within a 4.6-Mb region defined by two unique sequences within UniGene clusters Hs.41407 and Hs.371835 (78,545Kb–83,172Kb in the NCBI build 31 physical map). The involvement of a putative tumor suppressor gene Robo1/Dutt1 was excluded as a candidate for NRC-1. Furthermore, a transcript map containing eleven candidate genes was established for the 4.6-Mb region. Analyses of gene expression patterns with real-time quantitative RT-PCR assays showed that one of the eleven candidate genes in the interval (TSGc28) is down-regulated in 15 out of 20 tumor samples compared with matched normal samples. Three exons of this gene have been identified by RACE experiments, although additional exon(s) seem to exist. Further gene characterization and functional studies are required to confirm the gene as a true tumor suppressor gene. ^ To study the cellular functions of NRC-1, gene expression profiles of three tumor suppressive microcell hybrids, each containing a functional copy of NRC-1, were compared with those of the corresponding parental tumor cell lines using 16K oligonucleotide microarrays. Differentially expressed genes were identified. Analyses based on the Gene Ontology showed that introduction of NRC-1 into tumor cell lines activates genes in multiple cellular pathways, including cell cycle, signal transduction, cytokines and stress response. NRC-1 is likely to induce cell growth arrest indirectly through WEE1. ^
Resumo:
Linkage and association studies are major analytical tools to search for susceptibility genes for complex diseases. With the availability of large collection of single nucleotide polymorphisms (SNPs) and the rapid progresses for high throughput genotyping technologies, together with the ambitious goals of the International HapMap Project, genetic markers covering the whole genome will be available for genome-wide linkage and association studies. In order not to inflate the type I error rate in performing genome-wide linkage and association studies, multiple adjustment for the significant level for each independent linkage and/or association test is required, and this has led to the suggestion of genome-wide significant cut-off as low as 5 × 10 −7. Almost no linkage and/or association study can meet such a stringent threshold by the standard statistical methods. Developing new statistics with high power is urgently needed to tackle this problem. This dissertation proposes and explores a class of novel test statistics that can be used in both population-based and family-based genetic data by employing a completely new strategy, which uses nonlinear transformation of the sample means to construct test statistics for linkage and association studies. Extensive simulation studies are used to illustrate the properties of the nonlinear test statistics. Power calculations are performed using both analytical and empirical methods. Finally, real data sets are analyzed with the nonlinear test statistics. Results show that the nonlinear test statistics have correct type I error rates, and most of the studied nonlinear test statistics have higher power than the standard chi-square test. This dissertation introduces a new idea to design novel test statistics with high power and might open new ways to mapping susceptibility genes for complex diseases. ^
Resumo:
To identify more mutations that can affect the early development of Myxococcus xanthus, the synthetic transposon TnT41 was designed and constructed. By virtue of its special features, it can greatly facilitate the processes of mutation screening/selection, mapping, cloning and DNA sequencing. In addition, it allows for the systematic discovery of genes in regulatory hierarchies using their target promoters. In this study, the minimal regulatory region of the early developmentally regulated gene 4521 was used as a reporter in the TnT41 mutagenesis. Both positive (P) mutations and negative (N) mutations were isolated based on their effects on 4521 expression.^ Four of these mutations, i.e. N1, N2, P52 and P54 were analyzed in detail. Mutations N1 and N2 are insertion mutations in a gene designated sasB. The sasB gene is also identified in this study by genetic and molecular analysis of five UV-generated 4521 suppressor mutations. The sasB gene encodes a protein without meaningful homology in the databases. The sasB gene negatively regulates 4521 expression possibly through the SasS-SasR two component system. A wild-type sasB gene is required for normal M. xanthus fruiting body formation and sporulation.^ Cloning and sequencing analysis of the P52 mutation led to the identification of an operon that encodes the M. xanthus high-affinity branched-chain amino acid transporter system. This liv operon consists of five genes designated livK, livH, livM, livC, and livF, respectively. The Liv proteins are highly similar to their counterparts from other bacteria in both amino acid sequences, functional motifs and predicted secondary structures. This system is required for development since liv null mutations cause abnormality in fruiting body formation and a 100-fold decrease in sporulation efficiency.^ Mutation P54 is a TnT41 insertion in the sscM gene of the ssc chemotaxis system, which has been independently identified by Dr. Shi's lab. The sscM gene encodes a MCP (methyl-accepting chemotaxis protein) homologue. The SscM protein is predicted to contain two transmembrane domains, a signaling domain and at least one putative methylation site. Null mutations of this gene abolish the aggregation of starving cells at a very early stage, though the sporulation levels of the mutant can reach 10% that of wild-type cells. ^
Resumo:
A mapping F2 population from the cross ‘Piel de Sapo’ × PI124112 was selectively genotyped to study the genetic control of morphological fruit traits by QTL (Quantitative Trait Loci) analysis. Ten QTL were identified, five for FL (Fruit Length), two for FD (Fruit Diameter) and three for FS (Fruit Shape). At least one robust QTL per character was found, flqs8.1 (LOD = 16.85, R2 = 34%), fdqs12.1 (LOD = 3.47, R2 = 11%) and fsqs8.1 (LOD = 14.85, R2 = 41%). flqs2.1 and fsqs2.1 cosegregate with gene a (andromonoecious), responsible for flower sex determination and with pleiotropic effects on FS. They display a positive additive effect (a) value, so the PI124112 allele causes an increase in FL and FS, producing more elongated fruits. Conversely, the negative a value for flqs8.1 and fsqs8.1 indicates a decrease in FL and FS, what results in rounder fruits, even if PI124112 produces very elongated melons. This is explained by a significant epistatic interaction between fsqs2.1 and fsqs8.1, where the effects of the alleles at locus a are attenuated by the additive PI124112 allele at fsqs8.1. Roundest fruits are produced by homozygous for PI124112 at fsqs8.1 that do not carry any dominant A allele at locus a (PiPiaa). A significant interaction between fsqs8.1 and fsqs12.1 was also detected, with the alleles at fsqs12.1 producing more elongated fruits. fsqs8.1 seems to be allelic to QTL discovered in other populations where the exotic alleles produce elongated fruits. This model has been validated in assays with backcross lines along 3 years and ultimately obtaining a fsqs8.1-NIL (Near Isogenic Line) in ‘Piel de Sapo’ background which yields round melons.
Resumo:
Linkage disequilibrium analysis can provide high resolution in the mapping of disease genes because it incorporates information on recombinations that have occurred during the entire period from the mutational event to the present. A circumstance particularly favorable for high-resolution mapping is when a single founding mutation segregates in an isolated population. We review here the population structure of Finland in which a small founder population some 100 generations ago has expanded into 5.1 million people today. Among the 30-odd autosomal recessive disorders that are more prevalent in Finland than elsewhere, several appear to have segregated for this entire period in the “panmictic” southern Finnish population. Linkage disequilibrium analysis has allowed precise mapping and determination of genetic distances at the 0.1-cM level in several of these disorders. Estimates of genetic distance have proven accurate, but previous calculations of the confidence intervals were too small because sampling variation was ignored. In the north and east of Finland the population can be viewed as having been “founded” only after 1500. Disease mutations that have undergone such a founding bottleneck only 20 or so generations ago exhibit linkage disequilibrium and haplotype sharing over long genetic distances (5–15 cM). These features have been successfully exploited in the mapping and cloning of many genes. We review the statistical issues of fine mapping by linkage disequilibrium and suggest that improved methodologies may be necessary to map diseases of complex etiology that may have arisen from multiple founding mutations.
Comparative mapping of Andropogoneae: Saccharum L. (sugarcane) and its relation to sorghum and maize
Resumo:
Comparative genetic maps of Papuan Saccharum officinarum L. (2n = 80) and S. robustum (2n = 80) were constructed by using single-dose DNA markers (SDMs). SDM-framework maps of S. officinarum and S. robustum were compared with genetic maps of sorghum and maize by way of anchor restriction fragment length polymorphism probes. The resulting comparisons showed striking colinearity between the sorghum and Saccharum genomes. There were no differences in marker order between S. officinarum and sorghum. Furthermore, there were no alterations in SDM order between S. officinarum and S. robustum. The S. officinarum and S. robustum maps also were compared with the map of the polysomic octoploid S. spontaneum ‘SES 208’ (2n = 64, x = 8), thus permitting relations to homology groups (“chromosomes”) of S. spontaneum to be studied. Investigation of transmission genetics in S. officinarum and S. robustum confirmed preliminary results that showed incomplete polysomy in these species. Because of incomplete polysomy, multiple-dose markers could not be mapped for lack of a genetic model for their segregation. To coalesce S. officinarum and S. robustum linkage groups into homology groups (composed of homologous pairing partners), they were compared with sorghum (2n = 20), which functioned as a synthetic diploid. Groupings suggested by comparative mapping were found to be highly concordant with groupings based on highly polymorphic restriction fragment length polymorphism probes detecting multiple SDMs. The resulting comparative maps serve as bridges to allow information from one Andropogoneae to be used by another, for breeding, ecology, evolution, and molecular biology.
Resumo:
The region of human chromosome 22q11 is prone to rearrangements. The resulting chromosomal abnormalities are involved in Velo-cardio-facial and DiGeorge syndromes (VCFS and DGS) (deletions), “cat eye” syndrome (duplications), and certain types of tumors (translocations). As a prelude to the development of mouse models for VCFS/DGS by generating targeted deletions in the mouse genome, we examined the organization of genes from human chromosome 22q11 in the mouse. Using genetic linkage analysis and detailed physical mapping, we show that genes from a relatively small region of human 22q11 are distributed on three mouse chromosomes (MMU6, MMU10, and MMU16). Furthermore, although the region corresponding to about 2.5 megabases of the VCFS/DGS critical region is located on mouse chromosome 16, the relative organization of the region is quite different from that in humans. Our results show that the instability of the 22q11 region is not restricted to humans but may have been present throughout evolution. The results also underscore the importance of detailed comparative mapping of genes in mice and humans as a prerequisite for the development of mouse models of human diseases involving chromosomal rearrangements.