979 results for Genomic Regions
Abstract:
Intellectual disability (ID) is a condition defined as significantly impaired intellectual functioning, expressed together with limitations in at least two areas of adaptive behavior that manifest before 18 years of age. The estimated prevalence of ID in the general population is 2-3%, and a substantial number of cases remain without a definitive diagnosis. There is a general consensus that ID is more common in males than in females. One explanation for this excess is the concentration of genes specific to cognitive ability on the X chromosome. MicroRNAs (miRNAs) are small non-coding RNA molecules that modulate the post-transcriptional expression of target messenger RNAs. Recent studies have demonstrated the essential importance of miRNAs for brain development and function, and the X chromosome is known to have a high density of miRNA genes. In this context, miRNAs are potential candidates as genetic factors involved in X-linked intellectual disability (XLID). In this study, the genomic regions of 17 brain-expressed miRNA genes located on the X chromosome were analyzed, with the aim of investigating the possible involvement of sequence variants in these miRNAs in XLID. To this end, we selected genomic DNA samples (peripheral blood) from 135 male individuals with ID suggestive of XLID, drawn from a group of more than 1,100 patients with ID referred to the Human Genetics Service (Serviço de Genética Humana) of UERJ. The inclusion criterion was that probands have one or more male relatives affected by ID and related through the maternal line. Patient DNA samples were amplified by polymerase chain reaction, followed by purification and direct Sanger sequencing of the amplified fragments.
To assess the conservation of the 17 miRNAs, an in silico phylogenetic analysis was performed that included sequences of the selected miRNAs from humans and from 8 other closely related primate species. No sequence changes were found in the 17 miRNA genes analyzed, even given the highly heterogeneous genetic background of the Brazilian population. In addition, the phylogenetic analysis of these miRNAs revealed high conservation among the species compared. Considering the role of miRNAs as regulators of gene expression, the absence of changes and the high conservation among primates suggest strong selective pressure on these molecules, reinforcing their functional importance for the organism as a whole. Although we found no sequence variants in the miRNAs studied, the involvement of miRNAs in ID cannot be completely ruled out. Changes outside the precursor miRNA molecule, in processing factors, or in target sites, as well as copy number variation of miRNA genes, may alter miRNA expression and, consequently, the functionality of the mature miRNA. A systematic analysis of miRNA expression in patients with XLID is therefore urgently needed in order to uncover new genes and molecular mechanisms related to this condition.
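Abstract:
Qingke is the name used by residents of China's Tibetan areas for naked (hulless) barley. It is not only the main staple food, fuel and livestock feed of Tibetans, but also a raw material for the production of beer, medicines and health products. Qingke has contributed greatly to the health and economic development of people in Tibetan areas and is important for human health and sustainable socioeconomic development. The Qinghai-Tibet Plateau is the region with the largest distribution and cultivated area of qingke in China and the world, and its resources are extremely rich. Although the genetic diversity of Chinese barley has been studied with approaches ranging from classical genetics to molecular markers, the methods and the number of studies remain insufficient, and the available information is very limited and cannot adequately meet the needs of barley genetic research and breeding. In particular, research on the genetic diversity of cultivated barley in Tibet has only just begun, and there are few reports on polymorphism in cultivated qingke. This study used two types of techniques, SSR markers and protein electrophoresis, to comprehensively evaluate the genetic diversity of cultivated qingke from the Qinghai-Tibet Plateau in four respects: SSR marker loci, monomeric prolamins, B-hordeins and starch granule-bound proteins (SGPs).
SSR markers have many advantages, including wide genomic distribution, abundance, high polymorphism, ease of detection, co-dominance, stable and reliable results, good reproducibility, simple operation, low cost and suitability for high-throughput analysis. They are regarded as highly effective molecular markers for studies of genetic diversity, cultivar identification, phylogeny, genetic relationships and origin. This study used SSR markers to analyze the genetic diversity of 64 cultivated qingke accessions from the Qinghai-Tibet Plateau and assessed the potential of SSR markers for barley breeding and cultivar identification in China. Thirty SSR markers of known map position were selected, 25 of which are tightly linked to loci controlling important traits. Of the 30 markers, 5 did not yield good amplification products and 3 were monomorphic. At the 22 polymorphic SSR loci, 2 to 15 alleles were detected per locus, for a total of 132 alleles and an average of 6.0 per locus. Each polymorphic locus showed 2 to 11 genotypes, with locus HVM33 showing the most. The polymorphism information content (PIC) of the polymorphic loci ranged from 0.16 to 0.91, with a mean of 0.65. Based on PIC values, 13 SSR markers with PIC above 0.6 were selected for genotype identification of cultivated qingke from the Qinghai-Tibet Plateau. Combining PIC values and genotype differences, 8 highly informative SSR markers were selected to construct an efficient fingerprint map that fully distinguishes all 64 accessions.
Electrophoretic analysis of storage proteins is a highly effective way to study polymorphism in the genes that encode them. Barley monomeric prolamins correspond to wheat gliadins, are highly polymorphic, and can be used to study barley genetic diversity, cultivar identification and population evolution. This study used A-PAGE to examine monomeric prolamin polymorphism in 84 cultivated qingke accessions from the Qinghai-Tibet Plateau. The barley monomeric prolamin pattern resembles the wheat gliadin electrophoretic pattern, with the separated proteins clearly divided into ω, γ, β and α regions. Monomeric prolamins were highly polymorphic: the 84 accessions showed 43 different protein bands and 75 band patterns, of which 67 were unique to a single accession and the other 8 each comprised 2 to 3 accessions. Each accession had 6 to 16 prolamin bands, most commonly 6 to 10. Monomeric prolamin polymorphism differed between the Tibet and Sichuan populations and was region-specific. The Tibet accessions showed 40 different protein bands, 3 specific bands and 46 band patterns; the Sichuan accessions showed 40 different protein bands, 26 band patterns and 3 specific bands. Clustering based on monomeric prolamin polymorphism was correlated to some extent with the origin of the accessions. A-PAGE monomeric prolamins are highly polymorphic and can serve as markers for genetic research and cultivar identification.
Hordein is the main storage protein of the barley grain. It is closely related to the nutritional and processing quality of barley, is highly polymorphic, and is widely used for cultivar identification, germplasm screening, and studies of genetic diversity and genetic relationships. B-hordein is the major hordein fraction, accounting for about 80% of total hordein, and is highly polymorphic. This study used SDS-PAGE to analyze the genetic diversity of B-hordein in 72 cultivated qingke accessions from the Qinghai-Tibet Plateau. B-hordein was highly polymorphic: the 72 accessions showed 15 protein bands and 30 band patterns, of which 15 were unique to a single accession and the other 15 each comprised 2 to 10 accessions. Each accession had 4 to 8 B-hordein bands, most commonly 5 or 6. B-hordein composition differed between populations of different origin: Tibetan qingke had 26 band patterns, 19 of them specific, while the Sichuan population had 11 band patterns, 4 of them specific. Rare bands occurred in both populations. Cluster analysis divided the accessions into three groups, with no clear correlation between clustering and place of origin.
Starch granule proteins (SGPs) are minor proteins bound to starch granules. Some SGPs act as the main enzymes of starch biosynthesis, and variation in them affects starch content and properties and hence the uses of the starch. No study of the SGP composition of Chinese barley had previously been reported; this is the first study of starch granule-bound proteins in Chinese barley. SDS-PAGE was used to study the SGP composition of cultivated qingke from the Qinghai-Tibet Plateau, to analyze differences in starch content among SGP patterns, and to make a preliminary exploration of the relationship between the separated SGPs and starch synthesis. Ten major SGPs were separated from 66 qingke accessions, with apparent molecular weights of 40-100 kDa; 7 SGP bands were below 60 kDa, and there were 16 band patterns in total. The frequencies of individual SGPs and of band patterns differed, showing that the SGP composition of qingke from the Qinghai-Tibet Plateau is polymorphic. SGP composition differed greatly between Tibetan and Sichuan qingke and varied by region: Tibetan qingke had 12 band patterns, 9 of them specific; the Sichuan population had 7 band patterns, 4 of them specific; and only 3 band patterns were shared by the two populations. Based on SGP characteristics, the 66 accessions were divided into three groups (I, II and III), and clustering was correlated to some extent with origin. Significance tests of starch content among accessions with different band patterns showed differences in total starch, amylose and amylopectin content. The total starch and amylopectin content of patterns 2 (SGP1+3+7+9+10) and 8 (SGP1+2+4+6+8) were significantly higher than the total starch content of pattern 3 (SGP1+3+7+10). The amylose content of pattern 7 (SGP1+2+6+8) was significantly lower than that of pattern 11 (SGP1+5+8). SGP2, 3, 4, 5, 6, 7, 8, 9 and 10 may be involved in starch synthesis, and SGP9 may be associated with the synthesis of high amylopectin.
The results for SSR marker loci, monomeric prolamins, B-hordeins and SGPs show that polymorphism in all four is very rich in the Qinghai-Tibet Plateau, consistent with the view that the plateau is a center of diversity of cultivated qingke. The levels of polymorphism of the four marker types differed considerably. SSR markers cover the whole genome and are very highly polymorphic. Monomeric prolamins, B-hordeins and SGPs are traits of great interest in breeding but represent only particular genomic regions or loci, so their polymorphism is comparatively low. Monomeric prolamins were nevertheless highly polymorphic, with 43 different bands and 75 different patterns detected in 84 accessions. Both SSR markers and monomeric prolamin analysis are powerful tools for studying genetic diversity, but monomeric prolamin analysis is not only highly polymorphic but also economical and simple, making it an ideal method for germplasm identification. Clustering the polymorphism data from the different markers gives dendrograms that show the genetic similarity among accessions and can guide the choice of material. However, the correlation between clustering and geographic origin was inconsistent: SSR and B-hordein clustering were unrelated to origin, whereas monomeric prolamin and SGP clustering were correlated with origin to some extent, with the Tibet and Sichuan populations each forming concentrated groups, which may be a side effect of human selection. Populations of different origin differ in genetic diversity and carry region-specific rare alleles; greater exchange and combined use of resources between regions would help increase population genetic diversity and support the breeding of new cultivars. Loci controlling important agronomic traits in cultivated qingke from the plateau, such as malt extract, starch traits, disease and pest resistance, and the naked-grain trait, show abundant variation. This broad genetic base may harbor many different alleles and is a valuable resource for studying the genetics of important traits, mining gene resources, and breeding.
Hulless barley attracts attention because of favorable attributes such as high feed value, good human nutrition, rich dietary fiber and ease of processing. Hulless barley plays a very important role in Tibetan life as an essential food crop, the main animal feed and an important fuel. In addition to tsampa (roasted barley flour), a staple food for Tibetans, hulless barley is also made into cake, soup, porridge and, more recently, naked barley liquor and cornmeal. The Qinghai-Tibet Plateau is one of the few areas in the world where naked barley is widely planted, and it has a long history of cultivation.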
Genetic diversity of the cultivated hulless barley in this region, however, has not been documented. The study of genetic diversity within this population is of particular interest for germplasm identification, preservation, and new cultivar development. This study analyzed the genetic diversity of cultivated naked barley from the Qinghai-Tibet Plateau through the study of SSR marker loci, monomeric prolamins, B-hordein and starch granule proteins. SSRs are abundant in the genomes of higher organisms and have become a popular marker system in plant studies. SSRs offer a number of advantages, such as a high level of polymorphism, locus specificity, co-dominance, reproducibility, ease of use through PCR and random distribution throughout the genome. In barley, several hundred SSRs have been developed and genetically mapped and can therefore be selected from specific genomic regions. The genetic diversity of 64 cultivated naked barley accessions from Tibet and Sichuan was studied with 30 SSRs of known map location. Among the selected SSR markers, 5 yielded no PCR products and 3 marker loci were monomorphic. A total of 132 alleles were identified at 22 polymorphic SSR loci. The number of alleles per locus ranged from 2 to 15, with an average of 6.0. The polymorphism information content (PIC) values for the SSRs ranged from 0.08 to 0.94, with an average of 0.65. Thirteen SSR markers with PIC values >0.6 were selected for discrimination of Qinghai-Tibet naked barley genotypes. A fingerprint map was developed from 7 SSR markers with high PIC values. It could be used as an efficient tool for gene discovery and germplasm identification. Hordeins, the main storage proteins of the barley seed, are composed of monomeric and polymeric prolamins and are divided into A, B, C and D groups in order of decreasing electrophoretic mobility.
Hordeins show high inter-genotypic variation and have been extensively used as markers for cultivar identification and for analyzing genetic diversity. This study analyzed the genetic diversity of B-hordein in 72 naked barley accessions from the Qinghai-Tibet Plateau. Extensive diversity was observed. A total of 15 different bands and 30 distinct patterns were found. Jaccard's coefficient of similarity was calculated, and the accessions were divided into three main groups by cluster analysis using UPGMA. Differentiation among the populations from different collecting regions based on the polymorphism of B-hordein was investigated. Monomeric prolamins show high inter-genotypic variation and have been used as molecular markers for cultivar identification, for analyzing genetic diversity in collections, and for investigating the evolutionary processes and structure of populations. However, the cultivated hulless accessions from the Qinghai-Tibet Plateau in China have never been examined with respect to monomeric prolamins. This study analyzed the genetic diversity of monomeric prolamins (the protein fraction corresponding to wheat gliadins) using the Acid-PAGE technique in eighty-four cultivated hulless barley accessions from the Qinghai-Tibet Plateau in China. Extensive diversity was observed. A total of 43 different bands were found, of which 21 were in the region of the ω group, 8 in the region of γ, 8 in the region of β, and 6 in the region of the α group. Among the 84 accessions, 75 distinct patterns were identified. The number of bands ranged from 6 to 16, depending on the variety. Jaccard's coefficient of similarity was calculated, and the lines were grouped by cluster analysis using UPGMA. A dendrogram was obtained from the analysis of the groups and five main clusters were identified. No relationship between the distribution in the dendrogram and growth habits and origins of the cultivars could be detected.
Starch is the major constituent of the cereal endosperm, comprising approximately 65% of the dry weight of the mature wheat grain. The starch formed in all organs of plants is packaged into starch granules, which vary widely between species and cultivars in size and shape. Wheat endosperm starch granules contain about corresponding to the main biosynthase of starch. This report is the first to deal with intraspecific variation of the major SGPs in cultivated naked barley from the Qinghai-Tibet Plateau. A total of 10 major SGPs were observed in the range of 40-100 kDa, and 16 types of patterns were found. Based on the variation of SGPs, the accessions studied were classified into 3 groups. A geographical cline in the electrophoretic patterns was observed. In addition, significance tests of the differences in starch content among groups and pattern types were performed, and the results indicated that those SGPs could be related to starch content. The dendrogram obtained through cluster analysis revealed the structure of diversity and the genetic relationships among cultivated hulless accessions. In breeding programs, choosing genetically distant parents for hybridization will increase the genetic diversity of the progeny. In conclusion, cultivated naked barley from the Qinghai-Tibet Plateau in China shows high variability with respect to monomeric prolamins, SSR markers, B-hordeins and SGPs. The results of this study support the view that the Qinghai-Tibet Plateau is a center of diversity of cultivated hulless barley; cultivated naked barley can be considered a gene pool with large diversity that could be applied to cereal breeding.
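The marker statistics named in the abstract above (PIC for SSR loci, Jaccard's coefficient for band patterns) can be illustrated with a minimal sketch. It assumes the Botstein et al. (1980) definition of polymorphism information content, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², and the standard Jaccard coefficient on presence/absence band sets. The abstract does not state which PIC formula the authors used, and the function names and example inputs below are hypothetical.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for one locus.

    Assumes the Botstein et al. (1980) formula; allele_freqs must sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Sum of squared allele frequencies
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction term over all unordered pairs of distinct alleles
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross


def jaccard_similarity(bands_a, bands_b):
    """Jaccard coefficient between two sets of observed band identifiers."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        raise ValueError("at least one band is required")
    return len(a & b) / len(union)


# Hypothetical inputs: a two-allele locus and two accessions' band sets
two_allele_pic = pic([0.5, 0.5])
similarity = jaccard_similarity({1, 2, 3}, {2, 3, 4})
```

For the UPGMA step, one option is to convert similarities to distances (1 - similarity) and pass them to `scipy.cluster.hierarchy.linkage` with `method="average"`; whether the authors used that or another tool is not stated.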
Abstract:
Lactococcus lactis is used extensively worldwide for the production of fermented dairy products. Bacteriophages (phages) infecting L. lactis can result in slow or incomplete fermentations, or may even cause total fermentation failure. Therefore, bacteriophages disrupting L. lactis fermentation are of economic concern. This thesis employed a multifaceted approach to investigate various molecular aspects of phage-host interaction in L. lactis. The genome sequence of an Irish dairy starter strain, the prophage-cured L. lactis subsp. cremoris UC509.9, was studied. The 2,250,427 bp circular chromosome represents the smallest among its sequenced lactococcal equivalents. The genome displays clear genetic adaptation to the dairy niche in the form of extensive reductive evolution. Gene prediction identified 2066 protein-encoding genes, including 104 which showed significant homology to transposase-specifying genes. Over 9% of the identified genes appear to be inactivated through stop codons or frameshift mutations. Many pseudogenes were found among genes assigned to the carbohydrate and amino acid transport and metabolism orthologous groups, reflecting the adaptation of L. lactis UC509.9 to the lactose- and casein-rich dairy environment. Sequence analysis of the eight plasmids of L. lactis revealed extensive adaptation to the dairy environment. Key industrial phenotypes were mapped and novel lactococcal plasmid-associated genes highlighted. In addition to chromosomally-encoded bacteriophage resistance systems, six such functional systems were identified, including two abortive infection systems, AbiB and AbiD1, explaining the observed phage resistance of L. lactis UC509.9. Molecular analysis indicates that the constitutive expression of AbiB is not lethal to cells, suggesting the protein is expressed in an un/inactivated form.
Analysis of 936 species phage sk1-escape mutants of AbiB revealed that all such mutants harbour mutations in orf6, which encodes the major capsid protein. Results suggest that the major capsid protein is required for activation of the AbiB system, although this requires further investigation. Temporal transcriptomes of L. lactis UC509.9 undergoing lytic infection with either one of two distinct bacteriophages, Tuc2009 and c2, were determined and compared to the transcriptome of uninfected UC509.9 cells. Whole genome microarrays performed at various time-points post-infection demonstrated a rather modest impact on host transcription. Alterations in the UC509.9 transcriptome during lytic infection appear phage-specific, with a relatively small number of differentially transcribed genes shared between infection with either Tuc2009 or c2. Transcriptional profiles of both bacteriophages during lytic infection were shown to generally correlate with previous studies and allowed the confirmation of previously predicted promoter sequences. Bioinformatic analysis of genomic regions encoding the presumed cell wall polysaccharide (CW PS) biosynthesis gene cluster of several strains of L. lactis was performed. Results demonstrate the presence of three dominant genetic types of this gene cluster, termed type A, B and C. These regions were used for the development of a multiplex PCR to identify the CW PS genotype of various lactococcal strains. Analysis of 936 species phage receptor binding protein (RBP) phylogeny and CW PS genotype revealed an apparent correlation between RBP phylogeny and CW PS type, thereby providing a partial explanation for the observed narrow host range of 936 phages. Further analysis of the genetic locus encompassing the presumed CW PS biosynthesis operon of eight strains identified as belonging to the CW PS C (geno)type revealed the presence of a variable region among the examined strains.
The comparative analysis allowed for the identification of five subgroups of the C type, named C1 to C5. We purified an acidic polysaccharide from the cell wall of L. lactis 3107 (C2 subtype) and confirmed that it is structurally different from the CW PS of the C1 subtype L. lactis MG1363. Combinations of genes from the variable region of the C2 subtype were amplified from L. lactis 3107 and introduced into a mutant of the C1 subtype L. lactis NZ9000 (a direct derivative of MG1363) deficient in CW PS biosynthesis. The resulting recombinant mutant synthesized a CW PS with a composition characteristic of the C2 subtype L. lactis 3107 and not of the wild-type C1 L. lactis NZ9000. The recombinant mutant exhibited a changed phage resistance/sensitivity profile consistent with that of L. lactis 3107, which unambiguously demonstrated that L. lactis 3107 CW PS is the host cell surface receptor of two bacteriophages belonging to the P335 species as well as of phages that are members of the 936 species. The research presented in this thesis has significantly advanced our understanding of L. lactis bacteriophage-host interactions in several ways. Firstly, the examination of plasmid-encoded bacteriophage resistance systems has allowed inferences to be made regarding the mode of action of AbiB, thereby providing a platform for further elucidation of the molecular trigger of this system. Secondly, the phage infection transcriptome data presented, in addition to previous work, has made L. lactis a model organism in terms of transcriptomic studies of bacteriophage-host interactions. And finally, the research described in this thesis has for the first time explicitly revealed the nature of a carbohydrate bacteriophage receptor in L. lactis, while also providing a logical explanation for the observed narrow host ranges exhibited by 936 and P335 phages. Future research in discerning the structures of other L. lactis CW PS, combined with the determination of the molecular interplay between receptor binding proteins of these phages and CW PS, will allow an in-depth understanding of the mechanism by which the most prevalent lactococcal phages identify and adsorb to their specific host.
Abstract:
The population structure of an organism reflects its evolutionary history and influences its evolutionary trajectory. It constrains the combination of genetic diversity and reveals patterns of past gene flow. Understanding it is a prerequisite for detecting genomic regions under selection, predicting the effect of population disturbances, or modeling gene flow. This paper examines the detailed global population structure of Arabidopsis thaliana. Using a set of 5,707 plants collected from around the globe and genotyped at 149 SNPs, we show that while A. thaliana as a species self-fertilizes 97% of the time, there is considerable variation among local groups. This level of outcrossing greatly limits observed heterozygosity but is sufficient to generate considerable local haplotypic diversity. We also find that in its native Eurasian range A. thaliana exhibits continuous isolation by distance at every geographic scale without natural breaks corresponding to classical notions of populations. By contrast, in North America, where it exists as an exotic species, A. thaliana exhibits little or no population structure at a continental scale but local isolation by distance that extends hundreds of km. This suggests a pattern for the development of isolation by distance that can establish itself shortly after an organism fills a new habitat range. It also raises questions about the general applicability of many standard population genetics models. Any model based on discrete clusters of interchangeable individuals will be an uneasy fit to organisms like A. thaliana which exhibit continuous isolation by distance on many scales.
Abstract:
The aim of the 5-year European Union (EU)-Integrated Project GEnetics of Healthy Aging (GEHA), constituted by 25 partners (24 from Europe plus the Beijing Genomics Institute from China), is to identify genes involved in healthy aging and longevity, which allow individuals to survive to advanced old age in good cognitive and physical function and in the absence of major age-related diseases. To achieve this aim a coherent, tightly integrated program of research that unites demographers, geriatricians, geneticists, genetic epidemiologists, molecular biologists, bioinformaticians, and statisticians has been set up. The working plan is to: (a) collect DNA and information on the health status from an unprecedented number of long-lived 90+ sibpairs (n = 2650) and of younger ethnically matched controls (n = 2650) from 11 European countries; (b) perform a genome-wide linkage scan in all the sibpairs (a total of 5300 individuals); this investigation will be followed by linkage disequilibrium mapping (LD mapping) of the candidate chromosomal regions; (c) study in cases (i.e., the 2650 probands of the sibpairs) and controls (2650 younger people), genomic regions (chromosome 4, D4S1564, chromosome 11, 11p15.5) which were identified in previous studies as possible candidates to harbor longevity genes; (d) genotype all recruited subjects for apoE polymorphisms; and (e) genotype all recruited subjects for inherited as well as epigenetic variability of the mitochondrial DNA (mtDNA). The genetic analysis will be performed by 9 high-throughput platforms, within the framework of centralized databases for phenotypic, genetic, and mtDNA data. Additional advanced approaches (bioinformatics, advanced statistics, mathematical modeling, functional genomics and proteomics, molecular biology, molecular genetics) are envisaged to identify the gene variant(s) of interest.
The experimental design will also make it possible (a) to identify gender-specific genes involved in healthy aging and longevity in women and men stratified for ethnic and geographic origin and apoE genotype; (b) to perform a longitudinal survival study to assess the impact of the identified genetic loci on mortality in people aged 90+; and (c) to develop mathematical and statistical models capable of combining genetic data with demographic characteristics, health status, socioeconomic factors, and lifestyle habits.
Abstract:
A combination of linkage analyses and association studies is currently employed to promote the identification of genetic factors contributing to inherited renal disease. We have standardized and merged complex genetic data from disparate sources, creating unique chromosomal maps to enhance genetic epidemiological investigations. This database and novel renal maps effectively summarize genomic regions of suggested linkage, association, or chromosomal abnormalities implicated in renal disease. Chromosomal regions associated with potential intermediate clinical phenotypes have been integrated, adding support for particular genomic intervals. More than 500 reports from medical databases, published scientific literature, and the World Wide Web were interrogated for relevant renal-related information. Chromosomal regions highlighted for prioritized investigation of renal complications include 3q13-26, 6q22-27, 10p11-15, 16p11-13, and 18q22. Combined genetic and physical maps are effective tools to organize genetic data for complex diseases. These renal chromosome maps provide insights into renal phenotype-genotype relationships and act as a template for future genetic investigations into complex renal diseases. New data from individual researchers and/or future publications can be readily incorporated into this resource via a user-friendly web form accessed from the website: www.qub.ac.uk/neph-res/CORGI/index.php.
Abstract:
Purpose. Keratoconus is a progressive disorder of the cornea that can lead to severe visual impairment or blindness. Although several genomic regions have been linked to rare familial forms of keratoconus, no genes have yet been definitively identified for common forms of the disease. Methods. Two genome-wide association scans were undertaken in parallel. The first used pooled DNA from an Australian cohort, followed by typing of top-ranked single-nucleotide polymorphisms (SNPs) in individual DNA samples. The second was conducted in individually genotyped patients and controls from the USA. Tag SNPs around the hepatocyte growth factor (HGF) gene were typed in three additional replication cohorts. Serum levels of HGF protein in normal individuals were assessed with ELISA and correlated with genotype. Results. The only SNP observed to be associated in both the pooled discovery and primary replication cohort was rs1014091, located upstream of the HGF gene. The nearby SNP rs3735520 was found to be associated in the individually typed discovery cohort (P = 6.1 × 10 ). Genotyping of tag SNPs around HGF revealed association at rs3735520 and rs17501108/rs1014091 in four of the five cohorts. Meta-analysis of all five datasets together yielded suggestive P values for rs3735520 (P = 9.9 × 10 ) and rs17501108 (P = 9.9 × 10 ). In addition, SNP rs3735520 was found to be associated with serum HGF level in normal individuals (P = 0.036). Conclusions. Taken together, these results implicate genetic variation at the HGF locus in keratoconus susceptibility. © 2011 The Association for Research in Vision and Ophthalmology, Inc.
Abstract:
Background: Members of the genus Cronobacter are causes of rare but severe illness in neonates and preterm infants following the ingestion of contaminated infant formula. Seven species have been described and two of the species genomes were subsequently published. In this study, we performed comparative genomics on eight strains of Cronobacter, including six that we sequenced (representing six of the seven species) and two previously published, closed genomes.
Results: We identified and characterized the features associated with the core and pan genome of the genus Cronobacter in an attempt to understand the evolution of these bacteria and the genetic content of each species. We identified 84 genomic regions that are present in two or more Cronobacter genomes, along with 45 unique genomic regions. Many potentially horizontally transferred genes, such as lysogenic prophages, were also identified. Most notable among these were several type six secretion system gene clusters, transposons that carried tellurium, copper and/or silver resistance genes, and a novel integrative conjugative element.
Conclusions: Cronobacter have diverged from the most recent common ancestral species into two clusters, one consisting of C. dublinensis and C. muytjensii (Cdub-Cmuy) and the other comprising C. sakazakii, C. malonaticus, C. universalis, and C. turicensis (Csak-Cmal-Cuni-Ctur). While several genetic determinants for plant association and human virulence could be found in the core genome of Cronobacter, the four Cdub-Cmuy clade genomes contained several accessory genomic regions important for survival in a plant-associated environmental niche, while the Csak-Cmal-Cuni-Ctur clade genomes harbored numerous virulence-related genetic traits.
Abstract:
Schizophrenia is an idiopathic mental disorder with a heritable component and a substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases and 6,243 controls) followed by meta-analysis with previous schizophrenia GWAS (8,832 cases and 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls and 581 parent-offspring trios). We identified 22 loci associated at genome-wide significance; 13 of these are new, and 1 was previously implicated in bipolar disorder. Examination of candidate genes at these loci suggests the involvement of neuronal calcium signaling. We estimate that 8,300 independent, mostly common SNPs (95% credible interval of 6,300-10,200 SNPs) contribute to risk for schizophrenia and that these collectively account for at least 32% of the variance in liability. Common genetic variation has an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this disorder.
Abstract:
Background: More accurate coronary heart disease (CHD) prediction, specifically in middle-aged men, is needed to reduce the burden of disease more effectively. We hypothesised that a multilocus genetic risk score could refine CHD prediction beyond classic risk scores and obtain more precise risk estimates using a prospective cohort design.
Methods: Using data from nine prospective European cohorts, including 26,221 men, we selected in a case-cohort setting 4,818 healthy men at baseline, and used Cox proportional hazards models to examine associations between CHD and risk scores based on genetic variants representing 13 genomic regions. Over follow-up (range: 5-18 years), 1,736 incident CHD events occurred. Genetic risk scores were validated in men with at least 10 years of follow-up (632 cases, 1361 non-cases). Genetic risk score 1 (GRS1) combined 11 SNPs and two haplotypes, with effect estimates from previous genome-wide association studies. GRS2 combined 11 SNPs plus 4 SNPs from the haplotypes with coefficients estimated from these prospective cohorts using 10-fold cross-validation. Scores were added to a model adjusted for classic risk factors comprising the Framingham risk score and 10-year risks were derived.
Results: Both scores improved net reclassification (NRI) over the Framingham score (7.5%, p = 0.017 for GRS1, 6.5%, p = 0.044 for GRS2) but GRS2 also improved discrimination (c-index improvement 1.11%, p = 0.048). Subgroup analysis on men aged 50-59 (436 cases, 603 non-cases) improved net reclassification for GRS1 (13.8%) and GRS2 (12.5%). Net reclassification improvement remained significant for both scores when family history of CHD was added to the baseline model for this male subgroup improving prediction of early onset CHD events.
Conclusions: Genetic risk scores add precision to risk estimates for CHD and improve prediction beyond classic risk factors, particularly for middle-aged men.
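The weighted genetic risk scores described in the Methods above can be sketched as a sum of risk-allele counts multiplied by per-variant effect estimates. This is a minimal illustration under stated assumptions, not the authors' implementation: the variant labels, weights and function name are hypothetical, and the haplotype terms used in GRS1 are omitted.

```python
def genetic_risk_score(allele_counts, weights):
    """Weighted sum of risk-allele counts.

    allele_counts: dict mapping variant label -> risk-allele count (0, 1 or 2)
    weights: dict mapping variant label -> effect estimate (e.g. a log hazard ratio)
    """
    if set(allele_counts) != set(weights):
        raise ValueError("allele_counts and weights must cover the same variants")
    for variant, count in allele_counts.items():
        if count not in (0, 1, 2):
            raise ValueError(f"invalid allele count for {variant}: {count}")
    return sum(weights[v] * allele_counts[v] for v in weights)


# Hypothetical weights and genotype for one individual (not values from the study)
example_weights = {"variant_a": 0.10, "variant_b": 0.20, "variant_c": 0.30}
example_counts = {"variant_a": 2, "variant_b": 1, "variant_c": 0}
score = genetic_risk_score(example_counts, example_weights)
```

In the study, scores of this kind were added to a model adjusted for classic risk factors; fitting that Cox proportional hazards model is outside the scope of this sketch.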
Abstract:
The mineral concentrations in cereals are important for human health, especially for individuals who consume a cereal subsistence diet. A number of elements, such as zinc, are required within the diet, while some elements are toxic to humans, for example arsenic. In this study we carry out genome-wide association (GWA) mapping of grain concentrations of arsenic, copper, molybdenum and zinc in brown rice using an established rice diversity panel of ~300 accessions and 36.9 k single nucleotide polymorphisms (SNPs). The study was performed across five environments: one field site in Bangladesh, one in China and two in the US, with one of the US sites repeated over two years. GWA mapping on the whole dataset and on separate subpopulations of rice revealed a large number of loci significantly associated with variation in grain arsenic, copper, molybdenum and zinc. Seventeen of these loci were detected in data obtained from grain cultivated in more than one field location, and six co-localise with previously identified quantitative trait loci. Additionally, a number of candidate genes for the uptake or transport of these elements were located near significantly associated SNPs (within 200 kb, the estimated global linkage disequilibrium previously employed in this rice panel). This analysis highlights a number of genomic regions and candidate genes for further analysis as well as the challenges faced when mapping environmentally variable traits in a highly genetically structured diversity panel.
Abstract:
BACKGROUND: Klebsiella pneumoniae strains are pathogenic to animals and humans, in which they are both a frequent cause of nosocomial infections and a re-emerging cause of severe community-acquired infections. K. pneumoniae isolates of the capsular serotype K2 are among the most virulent. In order to identify novel putative virulence factors that may account for the severity of K2 infections, the genome sequence of the K2 reference strain Kp52.145 was determined and compared to two K1 and K2 strains of low virulence and to the reference strains MGH 78578 and NTUH-K2044.
RESULTS: In addition to diverse functions related to host colonization and virulence encoded in genomic regions common to the four strains, four genomic islands specific to Kp52.145 were identified. These regions encoded genes for the synthesis of colibactin toxin, a putative cytotoxin outer membrane protein, secretion systems, nucleases and eukaryotic-like proteins. In addition, an insertion within a type VI secretion system locus included Sel1 domain-containing proteins and a phospholipase D family protein (PLD1). The pld1 mutant was avirulent in a mouse pneumonia model. The pld1 mRNA was expressed in vivo and the pld1 gene was associated with K. pneumoniae isolates from severe infections. Analysis of the lipid composition of a defective E. coli strain complemented with pld1 suggests an involvement of PLD1 in cardiolipin metabolism.
CONCLUSIONS: Determination of the complete genome of the K2 reference strain identified several genomic islands comprising putative elements of pathogenicity. The role of PLD1 in pathogenesis was demonstrated for the first time and suggests that lipid metabolism is a novel virulence mechanism of K. pneumoniae.
Abstract:
BACKGROUND: Urothelial pathogenesis is a complex process driven by an underlying network of interconnected genes. The identification of novel genomic target regions and gene targets that drive urothelial carcinogenesis is crucial in order to improve our currently limited understanding of urothelial cancer (UC) at the molecular level. The inference of genome-wide gene regulatory networks (GRN) from large-scale gene expression data provides a promising approach for a detailed investigation of the underlying network structure associated with urothelial carcinogenesis.
METHODS: In our study we inferred and compared three GRNs by applying the BC3Net inference algorithm to large-scale transitional cell carcinoma gene expression data sets from Illumina RNAseq (179 samples), Illumina Bead arrays (165 samples) and Affymetrix Oligo microarrays (188 samples). We investigated the structural and functional properties of the GRNs to identify molecular targets associated with urothelial cancer.
RESULTS: We found that the urothelial cancer (UC) GRNs show a significant enrichment of subnetworks that are associated with known cancer hallmarks including cell cycle, immune response, signaling, differentiation and translation. Interestingly, the most prominent subnetworks of co-located genes were found on chromosome regions 5q31.3 (RNAseq), 8q24.3 (Oligo) and 1q23.3 (Bead), which all represent known genomic regions frequently deregulated or aberrant in urothelial cancer and other cancer types. Furthermore, the identified hub genes of the individual GRNs, e.g., HID1/DMC1 (tumor development), RNF17/TDRD4 (cancer antigen) and CYP4A11 (angiogenesis/metastasis), are known cancer-associated markers. The GRNs were highly dataset-specific at the level of interactions between individual genes, but showed large similarities at the level of biological function represented by subnetworks. Remarkably, the RNAseq UC GRN showed twice the proportion of significant functional subnetworks. Based on our analysis of inferential and experimental networks, the Bead UC GRN showed the lowest performance compared to the RNAseq and Oligo UC GRNs.
CONCLUSION: To our knowledge, this is the first study investigating genome-scale UC GRNs. RNAseq-based gene expression data are the platform of choice for GRN inference. Our study offers new avenues for the identification of novel putative diagnostic targets for subsequent studies in bladder tumors.
Abstract:
Background: Interindividual epigenetic variation that occurs systemically must be established prior to gastrulation in the very early embryo and, because it is systemic, can be assessed in easily biopsiable tissues. We employ two independent genome-wide approaches to search for such variants.
Results: First, we screen for metastable epialleles by performing genome-wide bisulfite sequencing in peripheral blood lymphocyte (PBL) and hair follicle DNA from two Caucasian adults. Second, we conduct a genome-wide screen for genomic regions at which PBL DNA methylation is affected by season of conception in rural Gambia. Remarkably, both approaches identify the genomically imprinted VTRNA2-1 as a top environmentally responsive epiallele. We demonstrate systemic and stochastic interindividual variation in DNA methylation at the VTRNA2-1 differentially methylated region in healthy Caucasian and Asian adults and show, in rural Gambians, that periconceptional environment affects offspring VTRNA2-1 epigenotype, which is stable over at least 10 years. This unbiased screen also identifies over 100 additional candidate metastable epialleles, and shows that these are associated with cis genomic features including transposable elements.
Conclusions: The non-coding VTRNA2-1 transcript (also called nc886) is a putative tumor suppressor and modulator of innate immunity. Thus, these data indicating environmentally induced loss of imprinting at VTRNA2-1 constitute a plausible causal pathway linking early embryonic environment, epigenetic alteration, and human disease. More broadly, the list of candidate metastable epialleles provides a resource for future studies of epigenetic variation and human disease.
Abstract:
The glucocorticoid (GC) receptor (GR) and the Krüppel-like factor Klf4 are transcription factors that play major roles in skin homeostasis. However, whether these transcription factors cooperate in binding genomic regulatory regions in epidermal keratinocytes was not known. Here, we show that in dexamethasone-treated keratinocytes GR and Klf4 are recruited to genomic regions containing adjacent GR and KLF binding motifs to control transcription of the anti-inflammatory genes Tsc22d3 and Zfp36. GR and Klf4 loss-of-function experiments showed that GR is fully required, and Klf4 partially required, for full gene induction in response to dexamethasone. In wild-type keratinocytes induced to differentiate, GR and Klf4 protein expression increased concomitantly with Tsc22d3 and Zfp36 up-regulation. In contrast, GR-deficient cells failed to differentiate or to fully induce Klf4, Tsc22d3 and Zfp36, which correlated with increased expression of the epithelium-specific Trp63, a known transcriptional repressor of Klf4. The identified transcriptional cooperation between GR and Klf4 may determine cell-type-specific regulation and have implications for developing therapies for skin diseases.