969 results for variants
Abstract:
Recent advances in high-throughput sequencing and genotyping protocols allow rapid investigation of Mendelian and complex diseases on a scale not previously possible. In my thesis research I took advantage of these modern techniques to study retinitis pigmentosa (RP), a rare inherited disease characterized by progressive loss of photoreceptors and leading to blindness, and hypertension, a common condition affecting 30% of the adult population. First, I compared the performance of different next-generation sequencing (NGS) platforms in sequencing the RP-linked gene PRPF31. The gene contained a mutation in an intronic repetitive element, which presented difficulties for both classic sequencing methods and NGS. We showed that all the NGS platforms tested are powerful tools for identifying rare and common DNA variants, even in more complex sequences. Moreover, we evaluated the features of different NGS platforms that are important in re-sequencing projects. The main focus of my thesis was then to investigate the involvement of pre-mRNA splicing factors in autosomal dominant RP (adRP). I screened 5 candidate genes in a large cohort of patients using long-range PCR as an enrichment step, followed by NGS. We tested two different approaches: in one, all target PCRs from all patients were pooled and sequenced as a single DNA library; in the other, PCRs from each patient were separated within the pool by DNA barcodes. The first solution was more cost-effective, while the second yielded faster and more accurate results; overall, both proved to be effective strategies for screening genes in many samples. We identified novel missense mutations in the SNRNP200 gene, which encodes an RNA helicase essential for splicing catalysis. Interestingly, one of these mutations showed incomplete penetrance in one family with adRP.
Thus, we started to study the possible molecular causes underlying phenotypic differences between asymptomatic and affected members of this family. For the study of hypertension, I joined a European consortium to perform genome-wide association studies (GWAS). Thanks to the use of very informative genotyping arrays and of phenotypically well-characterized cohorts, we identified a novel susceptibility locus for hypertension in the promoter region of the endothelial nitric oxide synthase gene (NOS3). Moreover, we demonstrated the direct causality of the associated SNP using three different methods: 1) targeted resequencing, 2) luciferase assay, and 3) population study.
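The barcoded approach described in this abstract relies on assigning each read back to its patient after pooled sequencing. The following is a minimal sketch of that demultiplexing step, not the pipeline used in the thesis: the 4-nt barcodes, the patient labels, and the function name `demultiplex` are all hypothetical, and real protocols typically use longer barcodes and tolerate sequencing errors.

```python
from collections import defaultdict

# Hypothetical 4-nt barcodes, one per patient (illustrative values only).
BARCODES = {"ACGT": "patient_01", "TGCA": "patient_02"}

def demultiplex(reads, barcodes=BARCODES):
    """Group reads by their leading barcode and strip the barcode from each read."""
    length = len(next(iter(barcodes)))  # assumes all barcodes share one length
    bins = defaultdict(list)
    for read in reads:
        tag, insert = read[:length], read[length:]
        # Reads whose prefix matches no known barcode are kept apart.
        bins[barcodes.get(tag, "unassigned")].append(insert)
    return dict(bins)

# Toy reads: barcode followed by a short insert sequence.
reads = ["ACGTGGATTC", "TGCAGGATAC", "ACGTGGATTC", "NNNNGGATTC"]
result = demultiplex(reads)
for sample, inserts in sorted(result.items()):
    print(sample, inserts)
```

In the unbarcoded design, this step does not exist: variants are called on the pool as a whole, so a finding must be traced back to the carrier by a follow-up assay.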
Abstract:
Summary report: HIV-positive individuals are a population at risk for cardiovascular diseases such as myocardial or cerebral infarction, which result from accelerated atherosclerosis. These conditions are largely explained by the dyslipidemia observed in this population, which is due to external factors such as advanced immunosuppression, uncontrolled viremia, and the effects of antiretroviral therapy. Recently, single-nucleotide polymorphisms (SNPs) associated with dyslipidemia have been identified on a genome-wide basis by genome-wide association studies (GWAS). The main aim of this study is to evaluate and validate the cumulative effect of the SNPs identified in these GWAS on dyslipidemia in HIV-positive patients. In addition, the identification of non-genetic factors that contribute to dyslipidemia demonstrates the importance of the external factors mentioned above, in particular those related to antiretroviral therapy. The study participants come from three groups: 426 people selected for a previous study, 222 people selected arbitrarily from the "Swiss HIV Cohort", and 103 people with new-onset diabetes mellitus identified in previous studies. These individuals contributed more than 34,000 lipid measurements over a mean duration of more than 7 years. For the study, 33 SNPs identified in GWAS and 9 SNPs identified in other published studies not covered by GWAS were included. Genotyping was completed for 745 (99.2%) of the 751 participants. For the statistical analyses, antiretroviral therapies were divided into three groups (weakly, moderately, and strongly promoting dyslipidemia), and three genetic scores were created (favorable profile, moderately favorable profile, and unfavorable profile promoting dyslipidemia).
First, the effect of one or two variant alleles on lipid values was analyzed with a regression model for each SNP, adjusted for the non-genetic variables. Second, the SNPs with a p-value >= 0.2 were carried into a multi-SNP model, also adjusted for the non-genetic variables. Because this study is based on previously identified SNPs, it evaluates only the established association between each SNP and the predefined outcomes: total cholesterol, HDL cholesterol, non-HDL cholesterol, or triglycerides. The results of the study confirm those in the literature. The study shows that SNPs associated with dyslipidemia must be analyzed in the context of antiretroviral therapy, taking into account demographics and HIV-related values (CD4+ count, viremia). These SNPs tend to predict sustained dyslipidemia in an individual. Indeed, a patient on an antiretroviral therapy that promotes dyslipidemia and with an unfavorable genetic background has a 3-fold higher risk of elevated non-HDL cholesterol, a 5-fold higher risk of low HDL cholesterol, and a 4- to 5-fold higher risk of hypertriglyceridemia than a patient on an antiretroviral therapy that weakly promotes dyslipidemia and with a favorable genetic background. Given the correlation between the SNPs and antiretroviral therapy, clinicians should integrate genetic information in order to choose an antiretroviral therapy according to the patient's genetic background.
Abstract:
The basis set superposition error-free second-order Møller-Plesset perturbation theory of intermolecular interactions was studied. The difficulties of the counterpoise (CP) correction in open-shell systems were also discussed. The calculations were performed with a program used for testing the new variants of the theory. It was shown that the CP correction for the diabatic surfaces should be preferred to that for the adiabatic ones.
Abstract:
Aims: The adaptive immune response against hepatitis C virus (HCV) is significantly shaped by the host's composition of HLA alleles. Thus, the HLA phenotype is a critical determinant of viral evolution during adaptive immune pressure. Potential associations of HLA class I alleles with polymorphisms of HCV immune escape variants are largely unknown. Methods: Direct sequence analysis of the genes encoding the HCV proteins E2, NS3 and NS5B was performed in a cohort of 159 patients with chronic HCV genotype 1 infection who were treated with pegylated interferon-alfa 2b and ribavirin for 48 weeks in a prospective controlled trial. HLA class I genotyping was performed by strand-specific reverse hybridization with the INNO-LiPA line probe assays for HLA-A and HLA-B and by strand-specific PCR-SSP. We analyzed each amino acid position of HCV proteins using an extension of Fisher's exact test for associations with HLA alleles. In addition, associations of specific HLA alleles with inflammatory activity, liver fibrosis, HCV RNA viral load and virologic treatment outcome were investigated. Results: Separate analyses of HCV subtype 1a and 1b isolates revealed substantially different patterns of HLA-restricted polymorphisms between subtypes. Only one polymorphism within NS5B (V2758x) was significantly associated with HLA B*15 in HCV genotype 1b infected patients (adjusted p = 0.048). However, a number of HLA class I-restricted polymorphisms within novel putative HCV CD8+ T cell epitopes (genotype 1a: HLA-A*11 GTRTIASPK1086-1094 [NS3], HLA-B*07 WPAPQGARSL1111-1120 [NS3]; genotype 1b: HLA-A*24 HYAPRPCGI488-496 [E2], HLA-B*44 GENETDVLL530-538 [E2], HLA-B*15 RVFTEAMTRY2757-2766 [NS5B]) were observed with high predicted epitope binding scores assessed by the web-based software SYFPEITHI (>21). Most of the identified putative epitopes overlapped with previously published epitopes, indicating high immunogenicity of the corresponding HCV protein region.
In addition, certain HLA class I alleles were associated with inflammatory activity, stage of liver fibrosis, and sustained virologic response to antiviral therapy. Conclusions: HLA class I restricted HCV sequence polymorphisms are rare. HCV polymorphisms identified within putative HCV CD8+ T cell epitopes in the present study differ in their genomic distribution between genotype 1a and 1b isolates, implying divergent adaptation to the host's immune pressure on the HCV subtype level.
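The per-position association testing mentioned in this abstract builds on Fisher's exact test. The sketch below shows only the basic two-sided test on a single 2x2 table, not the extension the authors used; the counts are hypothetical (rows: carriers and non-carriers of an HLA allele; columns: variant and consensus residue at one position), and the function name is an assumption.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum all tables at least as unlikely as the observed one
    # (small tolerance guards against floating-point ties).
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))

# Hypothetical counts: 8 of 10 allele carriers show the variant residue,
# versus 1 of 6 non-carriers.
p_value = fisher_exact_two_sided(8, 2, 1, 5)
print(f"p = {p_value:.4f}")
```

Because one such test is run per amino acid position and per allele, the raw p-values need adjustment for multiple testing, which is why the abstract reports an adjusted p-value.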
Abstract:
Genetic polymorphisms near IL28B are associated with spontaneous and treatment-induced clearance of hepatitis C virus (HCV), two processes that require appropriate activation of host immune responses. Intrahepatic inflammation is believed to mirror such activation, but its relationship with IL28B polymorphisms has yet to be fully characterized. We analyzed the association of IL28B polymorphisms with histological and follow-up features in 2,335 chronically HCV-infected Caucasian patients. Assessable phenotypes before any antiviral treatment included necroinflammatory activity (n = 1,098), fibrosis (n = 1,527), fibrosis progression rate (n = 1,312), and hepatocellular carcinoma development (n = 1,915). Associations of alleles with the phenotypes were evaluated by univariate analysis and multivariate logistic regression, accounting for all relevant covariates. The rare G allele at IL28B marker rs8099917 (previously shown to confer a risk of treatment failure) was associated with lower activity (P = 0.04) and lower fibrosis (P = 0.02), with a trend toward a lower fibrosis progression rate (P = 0.06). When patients were stratified by HCV genotype, the most significant associations were observed in those infected with non-1 genotypes (P = 0.003 for activity, P = 0.001 for fibrosis, and P = 0.02 for fibrosis progression rate), where the odds ratios of having necroinflammation or rapid fibrosis progression for patients with IL28B genotypes TG or GG versus TT were 0.48 (95% confidence interval 0.30-0.78) and 0.56 (0.35-0.92), respectively. IL28B polymorphisms were not predictive of the development of hepatocellular carcinoma. CONCLUSION: In chronic hepatitis C, IL28B variants associated with poor response to interferon therapy may predict slower fibrosis progression, especially in patients infected with non-1 HCV genotypes.
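To make the odds ratios and confidence intervals in this abstract easier to read, here is a minimal sketch of an unadjusted odds ratio with a Wald 95% interval from a 2x2 table. The counts are hypothetical and the function name is an assumption; the reported values come from multivariate logistic regression, so they are adjusted estimates that this calculation does not reproduce.

```python
from math import exp, log, sqrt

def odds_ratio_wald(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for [[a, b], [c, d]].

    a, b: exposed group with / without the outcome
    c, d: reference group with / without the outcome
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of ln(OR)
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts: TG/GG carriers (30 with, 70 without the outcome)
# versus TT carriers (50 with, 50 without).
or_, low, high = odds_ratio_wald(30, 70, 50, 50)
print(f"OR = {or_:.2f} (95% CI {low:.2f}-{high:.2f})")
```

An interval that lies entirely below 1, as in both reported results, indicates that the outcome is less likely in the TG/GG group than in the TT group at the 5% level.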
Abstract:
The extensive variability of individual human genomes contributes to phenotypic variability. Structural genomic variants, and copy number variants (CNVs) in particular, have recently been rediscovered as contributors to the genomic plasticity and evolution and as pathoetiologic elements for both monogenic and complex traits. Herein we review some of the consequences of CNVs in the context of human inherited diseases.
Abstract:
The white Barn Owl subspecies (Tyto alba alba) is found in southern Europe and the reddish-brown subspecies (T. a. guttata) in northern and eastern Europe. In central Europe, the two subspecies interbreed, producing a large range of phenotypic variants. Because of the different ratios of the subspecies in different geographic regions, we predicted that genetic variation should be greater in Switzerland than in Hungary. We tested this hypothesis by measuring genetic variation with the RAPD method. As predicted, the genetic variation within a Swiss population of Barn Owls was significantly greater than the variation within a Hungarian population. This suggests that gene flow is greater in central Europe than at the eastern limit of the Barn Owl distribution in Hungary. In both countries genetic variation was more pronounced in females than in males. As in other birds, this is probably because female Barn Owls are less philopatric than males. The number of migrants between Hungary and Switzerland is ca. 1 individual per generation; if calculated separately for the sexes, it is 0.525 for males and ca. 1 for females (Nm values). The difference in the number of migrants between the sexes is again likely a consequence of higher male philopatry. The sexual differentiation is greater in the Swiss population than in the Hungarian one, and the genetic substructuring of the populations of the species is substantial. The reasons for the considerable population substructuring could be the nonmigratory behavior and socially monogamous pairing of the species, as well as the geographical barriers (Alps) between the populations examined.
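The Nm values in this abstract express the number of migrants per generation. The abstract does not say how they were estimated; one common approach, assumed here purely for illustration, is Wright's island-model approximation Fst ≈ 1 / (4Nm + 1). The sketch below converts between the two quantities, and the function names are hypothetical.

```python
def nm_from_fst(fst):
    """Migrants per generation under Wright's island model: Nm = (1/Fst - 1) / 4."""
    if not 0.0 < fst <= 1.0:
        raise ValueError("Fst must lie in (0, 1]")
    return (1.0 / fst - 1.0) / 4.0

def fst_from_nm(nm):
    """Inverse relation: Fst = 1 / (4 Nm + 1)."""
    if nm < 0.0:
        raise ValueError("Nm must be non-negative")
    return 1.0 / (4.0 * nm + 1.0)

# Differentiation implied by the Nm values quoted in the abstract,
# if (and only if) the island-model relation applies.
fst_overall = fst_from_nm(1.0)
fst_males = fst_from_nm(0.525)
print(f"Nm = 1     -> Fst = {fst_overall:.3f}")
print(f"Nm = 0.525 -> Fst = {fst_males:.3f}")
```

Under this relation a lower Nm corresponds to stronger differentiation, which matches the abstract's reading that males, with the lower value, are the more philopatric sex.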
Abstract:
HAMAP (High-quality Automated and Manual Annotation of Proteins, available at http://hamap.expasy.org/) is a system for the automatic classification and annotation of protein sequences. HAMAP provides annotation of the same quality and detail as UniProtKB/Swiss-Prot, using manually curated profiles for protein sequence family classification and expert-curated rules for functional annotation of family members. HAMAP data and tools are made available through our website and as part of the UniRule pipeline of UniProt, providing annotation for millions of unreviewed sequences of UniProtKB/TrEMBL. Here we report on the growth of HAMAP and updates to the HAMAP system since our last report in the NAR Database Issue of 2013. We continue to augment HAMAP with new family profiles and annotation rules as new protein families are characterized and annotated in UniProtKB/Swiss-Prot; the latest version of HAMAP (as of 3 September 2014) contains 1983 family classification profiles and 1998 annotation rules (up from 1780 and 1720). We demonstrate how the complex logic of HAMAP rules allows for precise annotation of individual functional variants within large homologous protein families. We also describe improvements to our web-based tool HAMAP-Scan, which simplify the classification and annotation of sequences, and the incorporation of an improved sequence-profile search algorithm.
Abstract:
OBJECTIVES: Co-morbidity between depression and anxiety disorders is common. In this study we define a quantitative measure of anxiety by summating four anxiety items from the SCAN interview in a large collection of major depression (MDD) cases to identify genes contributing to this complex phenotype. METHODS: A total of 1522 MDD cases dichotomised according to those with at least one anxiety item scored (n = 1080) and those without anxiety (n = 442) were analysed, and also compared to 1588 healthy controls at a genome-wide level, to identify genes that may contribute to anxiety in MDD. RESULTS: For the quantitative trait, suggestive evidence of association was detected for two SNPs, and for the dichotomous anxiety present/absent ratings for three SNPs at genome-wide level. In the genome-wide analysis of MDD cases with co-morbid anxiety and healthy controls, two SNPs attained P values of < 5 × 10⁻⁶. Analysing candidate genes, P values ≤ 0.0005 were found with three SNPs for the quantitative trait and three SNPs for the dichotomous trait. CONCLUSIONS: This study provides an initial genome-wide assessment of possible genetic contribution to anxiety in MDD. Although suggestive evidence of association was found for several SNPs, our findings suggest that there are no common variants strongly associated with anxious depression.
Abstract:
Background: Searching for associations between genetic variants and complex diseases has been a very active area of research for over two decades. More than 51,000 potential associations have been studied and published, a figure that keeps increasing, especially with the recent explosion of array-based Genome-Wide Association Studies. Even if the number of true associations described so far is high, many of the putative risk variants detected so far have failed to be consistently replicated and are widely considered false positives. Here, we focus on the world-wide patterns of replicability of published association studies. Results: We report three main findings. First, contrary to previous results, genes associated with complex diseases present lower degrees of genetic differentiation among human populations than average genome-wide levels. Second, also contrary to previous results, the differences in replicability of disease-associated loci between Europeans and East Asians are highly correlated with genetic differentiation between these populations. Finally, highly replicated genes present increased levels of high-frequency derived alleles in European and Asian populations when compared to African populations. Conclusions: Our findings highlight the heterogeneous nature of the genetic etiology of complex disease, confirm the importance of the recent evolutionary history of our species in current patterns of disease susceptibility, and could cast doubt on the false-positive status of some associations that have failed to replicate across populations.
Abstract:
Background: Lynch syndrome (LS) is an autosomal dominant inherited cancer syndrome characterized by early-onset cancers of the colorectum, endometrium and other sites. A significant proportion of DNA variants in LS patients are unclassified. Reports on the pathogenicity of the c.1852_1853AA>GC (p.Lys618Ala) variant of the MLH1 gene are conflicting. In this study, we provide new evidence indicating that this variant has no significant implications for LS. Methods: The following approach was used to assess the clinical significance of the p.Lys618Ala variant: frequency in a control population, case-control comparison, co-occurrence of the p.Lys618Ala variant with a pathogenic mutation, co-segregation with the disease and microsatellite instability in tumours from carriers of the variant. We genotyped p.Lys618Ala in 1034 individuals (373 sporadic colorectal cancer [CRC] patients, 250 index subjects from families suspected of having LS [revised Bethesda guidelines] and 411 controls). Three well-characterized LS families that fulfilled the Amsterdam II Criteria and included members with the p.Lys618Ala variant were studied to assess co-occurrence and co-segregation. A subset of colorectal tumour DNA samples from 17 patients carrying the p.Lys618Ala variant was screened for microsatellite instability using five mononucleotide markers. Results: Twenty-seven individuals were heterozygous for the p.Lys618Ala variant; nine had sporadic CRC (2.41%), seven were suspected of having hereditary CRC (2.8%) and 11 were controls (2.68%). There were no significant associations in the case-control and case-case studies. The p.Lys618Ala variant co-existed with pathogenic mutations in two unrelated LS families. In one family, the pathogenic and unclassified variants were in trans; in the other, the pathogenic variant was detected in the MSH6 gene. In both families, only the deleterious variant co-segregated with the disease.
Only two positive cases of microsatellite instability (2/17, 11.8%) were detected in tumours from p.Lys618Ala carriers, indicating that this variant does not play a role in functional inactivation of MLH1 in CRC patients. Conclusions: The p.Lys618Ala variant should be considered a neutral variant for LS. These findings have implications for the clinical management of CRC probands and their relatives.
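The carrier frequencies quoted in this abstract can be checked directly from the reported counts. The short sketch below recomputes them; the counts are taken from the abstract, while the variable and function names are illustrative.

```python
# Carrier counts and group sizes as reported in the abstract.
groups = {
    "sporadic CRC": (9, 373),
    "suspected LS": (7, 250),
    "controls": (11, 411),
}

def percent(carriers, total, digits=2):
    """Carrier frequency as a percentage, rounded to the given number of digits."""
    return round(100.0 * carriers / total, digits)

carrier_pct = {name: percent(c, n) for name, (c, n) in groups.items()}
total_carriers = sum(c for c, _ in groups.values())
total_genotyped = sum(n for _, n in groups.values())
msi_pct = percent(2, 17, digits=1)  # MSI-positive tumours among carriers

for name, pct in carrier_pct.items():
    print(f"{name}: {pct}%")
print(f"carriers: {total_carriers} of {total_genotyped}; MSI-positive: {msi_pct}%")
```

The three group frequencies fall within about 0.4 percentage points of each other, which is consistent with the abstract's statement that the case-control comparison showed no significant association.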
Abstract:
UEV proteins are enzymatically inactive variants of the E2 ubiquitin-conjugating enzymes that regulate noncanonical elongation of ubiquitin chains. In Saccharomyces cerevisiae, UEV is part of the RAD6-mediated error-free DNA repair pathway. In mammalian cells, UEV proteins can modulate c-FOS transcription and the G2-M transition of the cell cycle. Here we show that the UEV genes from phylogenetically distant organisms present a remarkable conservation in their exon–intron structure. We also show that the human UEV1 gene is fused with the previously unknown gene Kua. In Caenorhabditis elegans and Drosophila melanogaster, Kua and UEV are in separate loci, and are expressed as independent transcripts and proteins. In humans, Kua and UEV1 are adjacent genes, expressed either as separate transcripts encoding independent Kua and UEV1 proteins, or as a hybrid Kua–UEV transcript, encoding a two-domain protein. Kua proteins represent a novel class of conserved proteins with juxtamembrane histidine-rich motifs. Experiments with epitope-tagged proteins show that UEV1A is a nuclear protein, whereas both Kua and Kua–UEV localize to cytoplasmic structures, indicating that the Kua domain determines the cytoplasmic localization of Kua–UEV. Therefore, the addition of a Kua domain to UEV in the fused Kua–UEV protein confers new biological properties on this regulator of variant polyubiquitination. [Kua cDNAs isolated by RT-PCR and described in this paper have been deposited in the GenBank data library under accession nos. AF1155120 (H. sapiens) and AF152361 (D. melanogaster). Genomic clones containing UEV genes: S. cerevisiae, YGL087c (accession no. Z72609); S. pombe, c338 (accession no. AL023781); P. falciparum, MAL3P2 (accession no. AL034558); A. thaliana, F26F24 (accession no. AC005292); C. elegans, F39B2 (accession no. Z92834); D. melanogaster, AC014908; and H. sapiens, 1185N5 (accession no. AL034423). Accession numbers for Kua cDNAs in GenBank dbEST: M. musculus, AA7853; T. cruzi, AI612534. Other Kua-containing sequences: A. thaliana genomic clones F10M23 (accession no. AL035440), F19K23 (accession no. AC000375), and T20K9 (accession no. AC004786).]
Abstract:
Eating disorders (EDs) are complex psychiatric diseases that include anorexia nervosa and bulimia nervosa, and have higher than 50% heritability. Previous studies have found an association of BDNF and NTRK2 with ED, while animal models suggest that other neurotrophin genes might also be involved in eating behavior. We performed a family-based association study with 151 TagSNPs covering 10 neurotrophin signaling genes (NGFB, BDNF, NTRK1, NGFR/p75, NTF4/5, NTRK2, NTF3, NTRK3, CNTF and CNTFR) in 371 ED trios of Spanish, French and German origin. Besides several nominal associations, we found a strong significant association after correcting for multiple testing (P = 1.04 × 10⁻⁴) between ED and rs7180942, located in the NTRK3 gene, which followed an overdominant model of inheritance. Interestingly, HapMap unrelated individuals carrying the rs7180942 risk genotypes for ED showed higher levels of expression of NTRK3 in lymphoblastoid cell lines. Furthermore, higher expression of the orthologous murine Ntrk3 gene was also detected in the hypothalamus of the anx/anx mouse model of anorexia. Finally, variants in the NGFB gene appear to modify the risk conferred by the NTRK3 rs7180942 risk genotypes (P = 4.0 × 10⁻⁵), showing a synergistic epistatic interaction. The reported data, in addition to the previously reported findings for BDNF and NTRK2, point to neurotrophin signaling genes as key regulators of eating behavior and to their altered cross-regulation as a susceptibility factor for EDs.
Abstract:
Background: The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manual annotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results. Results: The GENCODE gene features are divided into eight different categories of which only the first two (known and novel coding sequence) are confidently predicted to be protein-coding genes. 5’ rapid amplification of cDNA ends (RACE) and RT-PCR were used to experimentally verify the initial annotation. Of the 420 coding loci tested, 229 RACE products have been sequenced. They supported 5’ extensions of 30 loci and new splice variants in 50 loci. In addition, 46 loci without evidence for a coding sequence were validated, consisting of 31 novel and 15 putative transcripts. We assessed the comprehensiveness of the GENCODE annotation by attempting to validate all the predicted exon boundaries outside the GENCODE annotation. Out of 1,215 tested in a subset of the ENCODE regions, 14 novel exon pairs were validated, only two of them in intergenic regions. Conclusions: In total, 487 loci, of which 434 are coding, have been annotated as part of the GENCODE reference set available from the UCSC browser. Comparison of GENCODE annotation with RefSeq and ENSEMBL shows only 40% of GENCODE exons are contained within the two sets, which is a reflection of the high number of alternative splice forms with unique exons annotated. Over 50% of coding loci have been experimentally verified by 5’ RACE for EGASP and the GENCODE collaboration is continuing to refine its annotation of 1% of the human genome with the aid of experimental validation.
Abstract:
MicroRNAs (miRNA) are recognized posttranscriptional gene repressors involved in the control of almost every biological process. Allelic variants in these regions may be an important source of phenotypic diversity and contribute to disease susceptibility. We analyzed the genomic organization of 325 human miRNAs (release 7.1, miRBase) to construct a panel of 768 single-nucleotide polymorphisms (SNPs) covering approximately 1 Mb of genomic DNA, including 131 isolated miRNAs (40%) and 194 miRNAs arranged in 48 miRNA clusters, as well as their 5-kb flanking regions. Of these miRNAs, 37% were inside known protein-coding genes, which were significantly associated with biological functions regarding neurological, psychological or nutritional disorders. SNP coverage analysis revealed a lower SNP density in miRNAs compared with the average of the genome, with only 24 SNPs located in the 325 miRNAs studied. Further genotyping of 340 unrelated Spanish individuals showed that more than half of the SNPs in miRNAs were either rare or monomorphic, in agreement with the reported selective constraint on human miRNAs. A comparison of the minor allele frequencies between Spanish and HapMap population samples confirmed the applicability of this SNP panel to the study of complex disorders among the Spanish population, and revealed two miRNA regions, hsa-mir-26a-2 in the CTDSP2 gene and hsa-mir-128-1 in the R3HDM1 gene, showing geographical allelic frequency variation among the four HapMap populations, probably because of differences in natural selection. The designed miRNA SNP panel could help to identify still hidden links between miRNAs and human disease.