928 resultados para Genome rearrangements


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Prolificacy is the most important trait influencing the reproductive efficiency of pig production systems. The low heritability and sex-limited expression of prolificacy have hindered to some extent the improvement of this trait through artificial selection. Moreover, the relative contributions of additive, dominant and epistatic QTL to the genetic variance of pig prolificacy remain to be defined. In this work, we have undertaken this issue by performing one-dimensional and bi-dimensional genome scans for number of piglets born alive (NBA) and total number of piglets born (TNB) in a three generation Iberian by Meishan F2 intercross. Results: The one-dimensional genome scan for NBA and TNB revealed the existence of two genome-wide highly significant QTL located on SSC13 (P < 0.001) and SSC17 (P < 0.01) with effects on both traits. This relative paucity of significant results contrasted very strongly with the wide array of highly significant epistatic QTL that emerged in the bi-dimensional genome-wide scan analysis. As much as 18 epistatic QTL were found for NBA (four at P < 0.01 and five at P < 0.05) and TNB (three at P < 0.01 and six at P < 0.05), respectively. These epistatic QTL were distributed in multiple genomic regions, which covered 13 of the 18 pig autosomes, and they had small individual effects that ranged between 3 to 4% of the phenotypic variance. Different patterns of interactions (a × a, a × d, d × a and d × d) were found amongst the epistatic QTL pairs identified in the current work.Conclusions: The complex inheritance of prolificacy traits in pigs has been evidenced by identifying multiple additive (SSC13 and SSC17), dominant and epistatic QTL in an Iberian × Meishan F2 intercross. Our results demonstrate that a significant fraction of the phenotypic variance of swine prolificacy traits can be attributed to first-order gene-by-gene interactions emphasizing that the phenotypic effects of alleles might be strongly modulated by the genetic background where they segregate.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic–stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to ∼2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3′-UTRs. While we estimate a significant false discovery rate of ∼50%–70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The “one-gene, one-protein” rule, coined by Beadle and Tatum, has been fundamental to molecular biology. The rule implies that the genetic complexity of an organism depends essentially on its gene number. The discovery, however, that alternative gene splicing and transcription are widespread phenomena dramatically altered our understanding of the genetic complexity of higher eukaryotic organisms; in these, a limited number of genes may potentially encode a much larger number of proteins. Here we investigate yet another phenomenon that may contribute to generate additional protein diversity. Indeed, by relying on both computational and experimental analysis, we estimate that at least 4%–5% of the tandem gene pairs in the human genome can be eventually transcribed into a single RNA sequence encoding a putative chimeric protein. While the functional significance of most of these chimeric transcripts remains to be determined, we provide strong evidence that this phenomenon does not correspond to mere technical artifacts and that it is a common mechanism with the potential of generating hundreds of additional proteins in the human genome.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Selenocysteine (Sec) is co-translationally inserted into selenoproteins in response to codon UGA with the help of the selenocysteine insertion sequence (SECIS) element. The number of selenoproteins in animals varies, with humans having 25 and mice having 24 selenoproteins. To date, however, only one selenoprotein, thioredoxin reductase, has been detected in Caenorhabditis elegans, and this enzyme contains only one Sec. Here, we characterize the selenoproteomes of C.elegans and Caenorhabditis briggsae with three independent algorithms, one searching for pairs of homologous nematode SECIS elements, another searching for Cys- or Sec-containing homologs of potential nematode selenoprotein genes and the third identifying Sec-containing homologs of annotated nematode proteins. These methods suggest that thioredoxin reductase is the only Sec-containing protein in the C.elegans and C.briggsae genomes. In contrast, we identified additional selenoproteins in other nematodes. Assuming that Sec insertion mechanisms are conserved between nematodes and other eukaryotes, the data suggest that nematode selenoproteomes were reduced during evolution, and that in an extreme reduction case Sec insertion systems probably decode only a single UGA codon in C.elegans and C.briggsae genomes. In addition, all detected genes had a rare form of SECIS element containing a guanosine in place of a conserved adenosine present in most other SECIS structures, suggesting that in organisms with small selenoproteomes SECIS elements may change rapidly.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Despite the continuous production of genome sequence for a number of organisms,reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularlytrue for genomes for which there is not a large collection of known gene sequences, such as therecently published chicken genome. We used the chicken sequence to test comparative andhomology-based gene-finding methods followed by experimental validation as an effective genomeannotation method.Results: We performed experimental evaluation by RT-PCR of three different computational genefinders, Ensembl, SGP2 and TWINSCAN, applied to the chicken genome. A Venn diagram wascomputed and each component of it was evaluated. The results showed that de novo comparativemethods can identify up to about 700 chicken genes with no previous evidence of expression, andcan correctly extend about 40% of homology-based predictions at the 5' end.Conclusions: De novo comparative gene prediction followed by experimental verification iseffective at enhancing the annotation of the newly sequenced genomes provided by standardhomology-based methods.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: An excess of caffeine is cytotoxic to all eukaryotic cell types. We aim to study how cells become tolerant to atoxic dose of this drug, and the relationship between caffeine and oxidative stress pathways.Methodology/Principal Findings: We searched for Schizosaccharomyces pombe mutants with inhibited growth on caffeinecontainingplates. We screened a collection of 2,700 haploid mutant cells, of which 98 were sensitive to caffeine. The genes mutated in these sensitive clones were involved in a number of cellular roles including the H2O2-induced Pap1 and Sty1 stress pathways, the integrity and calcineurin pathways, cell morphology and chromatin remodeling. We have investigated the role of the oxidative stress pathways in sensing and promoting survival to caffeine. The Pap1 and the Sty1 pathways are both required for normal tolerance to caffeine, but only the Sty1 pathway is activated by the drug. Cells lacking Pap1 aresensitive to caffeine due to the decreased expression of the efflux pump Hba2. Indeed, ?hba2 cells are sensitive to caffeine, and constitutive activation of the Pap1 pathway enhances resistance to caffeine in an Hba2-dependent manner. Conclusions/Significance: With our caffeine-sensitive, genome-wide screen of an S. pombe deletion collection, we havedemonstrated the importance of some oxidative stress pathway components on wild-type tolerance to the drug.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Poor understanding of the spliceosomal mechanisms to select intronic 3' ends (3'ss) is a major obstacle to deciphering eukaryotic genomes. Here, we discern the rules for global 3'ss selection in yeast. We show that, in contrast to the uniformity of yeast splicing, the spliceosome uses all available 3'ss within a distance window from the intronic branch site (BS), and that in 70% of all possible 3'ss this is likely to be mediated by pre-mRNA structures. Our results reveal that one of these RNA folds acts as an RNA thermosensor, modulating alternative splicing in response to heat shock by controlling alternate 3'ss availability. Thus, our data point to a deeper role for the pre-mRNA in the control of its own fate, and to a simple mechanism for some alternative splicing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Pneumocystis jirovecii is a fungus belonging to a basal lineage of the Ascomycotina, the Taphrinomycotina subphylum. It is a parasite specific to humans that dwells primarily in the lung and can cause severe pneumonia in individuals with debilitated immune system. Despite its clinical importance, many aspects of its biology remain poorly understood, at least in part because of the lack of a continuous in vitro cultivation system. The present thesis consists in the genome reconstruction and comparative genomics of P. jirovecii. It is made of three parts: (i) the de novo sequencing of P. jirovecii genome starting from a single broncho- alveolar lavage fluid of a single patient (ii) the de novo sequencing of the genome of the plant pathogen Taphrina deformans, a fungus closely related to P. jirovecii, and (iii) the genome scale comparison of P. jirovecii to other Taphrinomycotina members. Enrichment in P. jirovecii cells by immuno-precipitation, whole DNA random amplification, two complementary high throughput DNA sequencing methods, and in silico sorting and assembly of sequences were used for the de novo reconstruction of P. jirovecii genome from the microbiota of a single clinical specimen. An iterative ad hoc pipeline as well as numerical simulations was used to recover P. jirovecii sequences while purging out contaminants and assembly or amplification chimeras. This strategy produced a 8.1 Mb assembly, which encodes 3,898 genes. Homology searches, mapping on biochemical pathways atlases, and manual validations revealed that this genome lacks (i) most of the enzymes dedicated to the amino acids biosyntheses, and (ii) most virulence factors observed in other fungi, e.g. the glyoxylate shunt pathway and specific peptidases involved in the degradation of the host cell membrane. The same analyses applied to the available genomic sequences from Pneumocystis carinii the species infecting rats and Pneumocystis murina the species infecting mice revealed the same deficiencies. The genome sequencing of T. deformans yielded a 13 Mb assembly, which encodes 5,735 genes. T. deformans possesses enzymes involved plant cell wall degradation, secondary metabolism, the glyoxylate cycle, detoxification, sterol biosynthesis, as well as the biosyntheses of plant hormones such as abscisic acid or indole-3-acetic acid. T. deformans also harbors gene subsets that have counterparts in plant saprophytes or pathogens, which is consistent with its alternate saprophytic and pathogenic lifestyles. Mating genes were also identified. The homothallism of this fungus suggests a mating-type switching mechanism. Comparative analyses indicated that 81% of P. jirovecii genes are shared with eight other Taphrinomycotina members, including T. deformans, P. carinii and P. murina. These genes are mostly involved in housekeeping activities. The genes specific to the Pneumocystis genus represent 8%, and are involved in RNA metabolism and signaling. The signaling is known to be crucial for interaction of Pneumocystis spp with their environment. Eleven percent are unique to P. jirovecii and encode mostly proteins of unknown function. These genes in conjunction with other ones (e.g. the major surface glycoproteins) might govern the interaction of P. jirovecii with its human host cells, and potentially be responsible of the host specificity. P. jirovecii exhibits a reduced genome in size with a low GC content, and most probably scavenges vital compounds such as amino acids and cholesterol from human lungs. Consistently, its genome encodes a large set of transporters (ca. 22% of its genes), which may play a pivotal role in the acquisition of these compounds. All these features are generally observed in obligate parasite of various kingdoms (bacteria, protozoa, fungi). Moreover, epidemiological studies failed to evidence a free-living form of the fungus and Pneumocystis spp were shown to co-evolved with their hosts. Given also the lack of virulence factors, our observations strongly suggest that P. jirovecii is an obligate parasite specialized in the colonization of human lungs, and which causes disease only in individuals with compromised immune system. The same conclusion is most likely true for all other Pneumocystis spp in their respective mammalian host. - Pneumocystis jirovecii est un champignon appartenant à ine branche basale des Ascomycotina, le sous-embranchement des Taphrinomycotina. C'est un parasite spécifique aux humains qui réside principalement dans les poumons, et qui peut causer des pneumonies sévères chez des individus ayant un système immunitaire déficient. En dépit de son importance clinique, de nombreux aspects de sa biologie demeurent,largement méconnus, au moins en partie à cause de l'absence d'un système de culture in vitro continu. Cette thèse traite de la reconstruction du génome et de la génomique comparative de P. jirovecii. Elle comporte trois parties: (i) le séquençage de novo du génome de P. jirovecii à partir d'un lavage broncho-alvéolaire provenant d'un seul patient, (ii) le séquençage de novo du génome d'un champignon pathogène de plante Taphrina deformans qui est phylogénétiquement proche de P. jirovecii, et (iii) la comparaison du génome de P. jirovecii à celui d'autres membres du sous-embranchement des Taphrinomycotina. Un enrichissement en cellules de P. jirovecii par immuno-précipitation, une amplification aléatoire des molécules d'ADN, deux méthodes complémentaires de séquençage à haut débit, un tri in silico et un assemblage des séquences ont été utilisés pour reconstruire de novo le génome de P. jirovecii à partir du microbiote d'un seul échantillon clinique. Un pipeline spécifique ainsi que des simulations numériques ont été utilisés pour récupérer les séquences de P. jirovecii tout en éliminant les séquences contaminants et les chimères d'amplification ou d'assemblage. Cette stratégie a produit un assemblage de 8.1 Mb, qui contient 3898 gènes. Les recherches d'homologies, de cartographie des voies métaboliques et des validations manuelles ont révélé que ce génome est dépourvu (i) de la plupart des enzymes dédiées à la biosynthèse des acides aminés, et (ii) de la plupart des facteurs de virulence observés chez d'autres champignons, par exemple, le cycle du glyoxylate ainsi que des peptidases spécifiques impliquées dans la dégradation de la membrane de la cellule hôte. Les analyses appliquées aux données génomiques disponibles de Pneumocystis carinii, l'espèce infectant les rats, et de Pneumocystis murina, l'espèce infectant les souris, ont révélé les mêmes déficiences. Le séquençage du génome de T. deformans a généré un assemblage de 13.3 Mb qui contient 5735 gènes. T. deformans possède les gènes codant pour les enzymes impliquées dans la dégradation des parois cellulaires des plantes, le métabolisme secondaire, le cycle du glyoxylate, la détoxification, la biosynthèse des stérols ainsi que la biosynthèse d'hormones de plantes telles que l'acide abscissique ou l'acide indole 3-acétique. T. deformans possède également des sous-ensembles de gènes présents exclusivement chez des saprophytes ou des pathogènes de plantes, ce qui est consistent avec son mode de vie alternatif saprophyte et pathogène. Des gènes impliqués dans la conjugaison ont été identifiés. L'homothallisme de ce champignon suggère mécanisme de permutation du type conjuguant. Les analyses comparatives ont démontré que 81% des gènes de P. jirovecii sont présent chez les autres membres du sous-embranchement des Taphrinomycotina. Ces gènes sont essentiellement impliqués dans le métabolisme basai. Les gènes spécifiques au genre Pneumocystis représentent 8%, et sont impliqués dans le métabolisme de l'ARN et la signalisation. La signalisation est connue pour être cruciale pour l'interaction des espèces de Pneumocystis avec leur environnement. Les gènes propres à P. jirovecii représentent 11% et codent en majorité pour des protéines dont la fonction est inconnue. Ces gènes en conjonction avec d'autres (par exemple, les glycoprotéines de surface), pourraient être déterminants dans l'interaction de P. jirovecii avec les cellules de l'hôte humain, et être potentiellement responsable de la spécificité d'hôte. P. jirovecii possède un génome de taille réduite à faible pourcentage en GC et récupère très probablement des composés vitaux comme les acides aminés et le cholestérol à partir des poumons humains. De manière consistante, son génome code pour de nombreux transporteurs (22% de ses gènes), qui pourraient jouer un rôle essentiel dans l'acquisition de ces composés. Ces caractéristiques sont généralement observées chez les parasites obligatoires de plusieurs règnes (bactéries, protozoaires, champignons). De plus, les études épidémiologiques n'ont pas réussi à prouver l'existence d'ime forme vivant librement du champignon. Etant donné également l'absence de facteurs de virulence, nos observations suggèrent que P. jirovecii est un parasite obligatoire spécialisé dans la colonisation des poumons humains, ne causant une maladie que chez des individus ayant un système immunitaire compromis. La même conclusion est très probablement applicable à toutes les autres espèces de Pneumocystis dans leur hôte mammifère respectif.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Robertsonian (Rb) fusions received large theoretical support for their role in speciation, but empirical evidence is often lacking. Here, we address the role of Rb rearrangements on the genetic differentiation of the karyotypically diversified group of shrews, Sorex araneus. We compared genetic structure between 'rearranged' and 'common' chromosomes in pairwise comparisons of five karyotypic taxa of the group. Considering all possible comparisons, we found a significantly greater differentiation at rearranged chromosomes, supporting the role of chromosomal rearrangements in the general genetic diversification of this group. Intertaxa structure and distance were larger across rearranged chromosomes for most of the comparisons, although these differences were not significant. This last result could be explained by the large variance observed among microsatellite-based estimates. The differences observed among the pairs of taxa analysed support the role of both the hybrid karyotypic complexity and the level of evolutionary divergence.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report the draft genome sequence of the red harvester ant, Pogonomyrmex barbatus. The genome was sequenced using 454 pyrosequencing, and the current assembly and annotation were completed in less than 1 y. Analyses of conserved gene groups (more than 1,200 manually annotated genes to date) suggest a high-quality assembly and annotation comparable to recently sequenced insect genomes using Sanger sequencing. The red harvester ant is a model for studying reproductive division of labor, phenotypic plasticity, and sociogenomics. Although the genome of P. barbatus is similar to other sequenced hymenopterans (Apis mellifera and Nasonia vitripennis) in GC content and compositional organization, and possesses a complete CpG methylation toolkit, its predicted genomic CpG content differs markedly from the other hymenopterans. Gene networks involved in generating key differences between the queen and worker castes (e.g., wings and ovaries) show signatures of increased methylation and suggest that ants and bees may have independently co-opted the same gene regulatory mechanisms for reproductive division of labor. Gene family expansions (e.g., 344 functional odorant receptors) and pseudogene accumulation in chemoreception and P450 genes compared with A. mellifera and N. vitripennis are consistent with major life-history changes during the adaptive radiation of Pogonomyrmex spp., perhaps in parallel with the development of the North American deserts.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The distribution of transposable elements (TEs) in a genome reflects a balance between insertion rate and selection against new insertions. Understanding the distribution of TEs therefore provides insights into the forces shaping the organization of genomes. Past research has shown that TEs tend to accumulate in genomic regions with low gene density and low recombination rate. However, little is known about the factors modulating insertion rates across the genome and their evolutionary significance. One candidate factor is gene expression, which has been suggested to increase local insertion rate by rendering DNA more accessible. We test this hypothesis by comparing the TE density around germline- and soma-expressed genes in the euchromatin of Drosophila melanogaster. Because only insertions that occur in the germline are transmitted to the next generation, we predicted a higher density of TEs around germline-expressed genes than soma-expressed genes. We show that the rate of TE insertions is greater near germline- than soma-expressed genes. However, this effect is partly offset by stronger selection for genome compactness (against excess noncoding DNA) on germline-expressed genes. We also demonstrate that the local genome organization in clusters of coexpressed genes plays a fundamental role in the genomic distribution of TEs. Our analysis shows that-in addition to recombination rate-the distribution of TEs is shaped by the interaction of gene expression and genome organization. The important role of selection for compactness sheds a new light on the role of TEs in genome evolution. Instead of making genomes grow passively, TEs are controlled by the forces shaping genome compactness, most likely linked to the efficiency of gene expression or its complexity and possibly their interaction with mechanisms of TE silencing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Analysis of the first reported complete genome sequence of Bifidobacterium longum NCC2705, an actinobacterium colonizing the gastrointestinal tract, uncovered its proteomic relatedness to Streptomyces coelicolor and Mycobacterium tuberculosis. However, a rapid scrutiny by genometric methods revealed a genome organization totally different from all so far sequenced high-GC Gram-positive chromosomes. RESULTS: Generally, the cumulative GC- and ORF orientation skew curves of prokaryotic genomes consist of two linear segments of opposite slope: the minimum and the maximum of the curves correspond to the origin and the terminus of chromosome replication, respectively. However, analyses of the B. longum NCC2705 chromosome yielded six, instead of two, linear segments, while its dnaA locus, usually associated with the origin of replication, was not located at the minimum of the curves. Furthermore, the coorientation of gene transcription with replication was very low. Comparison with closely related actinobacteria strongly suggested that the chromosome of B. longum was misassembled, and the identification of two pairs of relatively long homologous DNA sequences offers the possibility for an alternative genome assembly proposed here below. By genometric criteria, this configuration displays all of the characters common to bacteria, in particular to related high-GC Gram-positives. In addition, it is compatible with the partially sequenced genome of DJO10A B. longum strain. Recently, a corrected sequence of B. longum NCC2705, with a configuration similar to the one proposed here below, has been deposited in GenBank, confirming our predictions. CONCLUSION: Genometric analyses, in conjunction with standard bioinformatic tools and knowledge of bacterial chromosome architecture, represent fast and straightforward methods for the evaluation of chromosome assembly.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We performed whole genome sequencing in 16 unrelated patients with autosomal recessive retinitis pigmentosa (ARRP), a disease characterized by progressive retinal degeneration and caused by mutations in over 50 genes, in search of pathogenic DNA variants. Eight patients were from North America, whereas eight were Japanese, a population for which ARRP seems to have different genetic drivers. Using a specific workflow, we assessed both the coding and noncoding regions of the human genome, including the evaluation of highly polymorphic SNPs, structural and copy number variations, as well as 69 control genomes sequenced by the same procedures. We detected homozygous or compound heterozygous mutations in 7 genes associated with ARRP (USH2A, RDH12, CNGB1, EYS, PDE6B, DFNB31, and CERKL) in eight patients, three Japanese and five Americans. Fourteen of the 16 mutant alleles identified were previously unknown. Among these, there was a 2.3-kb deletion in USH2A and an inverted duplication of ∼446 kb in EYS, which would have likely escaped conventional screening techniques or exome sequencing. Moreover, in another Japanese patient, we identified a homozygous frameshift (p.L206fs), absent in more than 2,500 chromosomes from ethnically matched controls, in the ciliary gene NEK2, encoding a serine/threonine-protein kinase. Inactivation of this gene in zebrafish induced retinal photoreceptor defects that were rescued by human NEK2 mRNA. In addition to identifying a previously undescribed ARRP gene, our study highlights the importance of rare structural DNA variations in Mendelian diseases and advocates the need for screening approaches that transcend the analysis of the coding sequences of the human genome.