15 results for Illumina
in BORIS: Bern Open Repository and Information System - Bern - Switzerland
Abstract:
With the advent of high-throughput sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample, metatranscriptomics, by assessing which genes are actively expressed, can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed, from sample preparation to sequence processing, statistical analysis and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model, in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well represented in the datasets. In summary, this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.
Abstract:
The heritability of attention deficit hyperactivity disorder (ADHD) is approximately 0.8. Despite several larger-scale attempts, genome-wide association studies (GWAS) have not led to the identification of significant results. We performed a GWAS based on 495 young German patients with ADHD (according to DSM-IV criteria; Human660W-Quadv1; Illumina, San Diego, CA) and on 1,300 population-based adult controls (HumanHap550v3; Illumina). Some genes neighboring the single nucleotide polymorphisms (SNPs) with the lowest P-values (best P-value: 8.38 × 10⁻⁷) have potential relevance for ADHD (e.g., glutamate receptor, metabotropic 5 gene, GRM5). After quality control, the 30 independent SNPs with the lowest P-values (P-values ≤ 7.57 × 10⁻⁵) were chosen for confirmation. Genotyping of these SNPs in up to 320 independent German families comprising at least one child with ADHD revealed directionally consistent effect-size point estimates for 19 (10 not consistent) of the SNPs. In silico analyses of the 30 SNPs in the largest meta-analysis so far (2,064 trios, 896 cases, and 2,455 controls) revealed directionally consistent effect-size point estimates for 16 SNPs (11 not consistent). None of the combined analyses revealed a genome-wide significant result. SNPs in previously described autosomal candidate genes did not show significantly lower P-values compared to SNPs within random sets of genes of the same size. We did not find genome-wide significant results in a GWAS of German children with ADHD compared to controls. The second-best SNP is located in an intron of GRM5, a gene located within a recently described region with an infrequent copy number variation in patients with ADHD.
Abstract:
Microphthalmia in sheep is an autosomal recessive inherited congenital anomaly found within the Texel breed. It is characterized by extremely small or absent eyes, and affected lambs are completely blind. For the first time, we use a genome-wide ovine SNP array for positional cloning of a Mendelian trait in sheep. Genotyping 23 cases and 23 controls using Illumina's OvineSNP50 BeadChip allowed us to localize the causative mutation for microphthalmia to a 2.4 Mb interval on sheep chromosome 22 by association and homozygosity mapping. The PITX3 gene is located within this interval and encodes a homeodomain-containing transcription factor involved in vertebrate lens formation. An abnormal development of the lens vesicle was shown to be the primary event in ovine microphthalmia. Therefore, we considered PITX3 a positional and functional candidate gene. An ovine BAC clone was sequenced, and after full-length cDNA cloning the PITX3 gene was annotated. Here we show that the ovine microphthalmia phenotype is perfectly associated with a missense mutation (c.338G>C, p.R113P) in the evolutionarily conserved homeodomain of PITX3. Selection against this candidate causative mutation can now be used to eliminate microphthalmia from Texel sheep in production systems. Furthermore, the identification of a naturally occurring PITX3 mutation offers the opportunity to use the Texel as a genetically characterized large animal model for human microphthalmia.
Abstract:
Effective population size is an important parameter for the assessment of genetic diversity within a livestock population and its development over time. If pedigree information is not available, linkage disequilibrium (LD) analysis might offer an alternative perspective for the estimation of effective population size. In this study, 128 individuals of the Swiss Eringer breed were genotyped using the Illumina BovineSNP50 BeadChip. We set bin size at 50 kb for LD analysis, assuming that LD for proximal single nucleotide polymorphism (SNP) pairs reflects distant breeding history, while LD from distal SNP pairs reflects recent history. Recombination rates varied among different regions of the genome. The use of physical distances as an approximation of genetic distances (e.g. setting 1 Mb = 0.01 Morgan) led to an upward bias in LD-based estimates of effective population size for generations beyond 50, while estimates for recent history were unaffected. Correction for restricted sample size did not substantially affect these results. LD-based actual effective population size was estimated in the range of 87-149, whereas the pedigree-based effective population size was 321 individuals. For conservation purposes, which require knowledge of recent history (<50 generations), the approximation assuming a constant recombination rate seemed adequate.
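The link between LD, marker distance and effective population size described above can be illustrated with a minimal sketch. It assumes the widely used approximation E[r²] ≈ 1/(1 + 4·Ne·c) and the convention that a genetic distance c (in Morgans) reflects the population about 1/(2c) generations ago; the abstract does not state which estimator the authors used, so this is an illustration rather than their method. The function name `ne_from_ld`, the sample-size correction (subtracting 1/(2n) from r²) and the default of 0.01 Morgan per Mb are assumptions.

```python
def ne_from_ld(mean_r2, distance_mb, n_individuals=None, morgans_per_mb=0.01):
    """Estimate effective population size (Ne) from the mean r^2 of one distance bin.

    Assumes E[r^2] ~ 1 / (1 + 4 * Ne * c), with c the genetic distance in Morgans,
    and that distance c reflects the population roughly 1 / (2c) generations ago.
    """
    # Convert physical to genetic distance with a constant rate (an approximation).
    c = distance_mb * morgans_per_mb

    r2 = mean_r2
    if n_individuals is not None:
        # Assumed correction for finite sample size: subtract 1 / (2n).
        r2 -= 1.0 / (2 * n_individuals)

    if not 0.0 < r2 < 1.0:
        raise ValueError("r^2 must lie strictly between 0 and 1 after correction")

    ne = (1.0 / r2 - 1.0) / (4.0 * c)
    generations_ago = 1.0 / (2.0 * c)
    return ne, generations_ago
```

Under these assumptions, a bin of SNP pairs 1 Mb apart informs on the population about 50 generations ago, which matches the boundary between recent and distant history mentioned in the abstract, while a 50 kb bin reaches back about 1,000 generations.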
Abstract:
Balkan endemic nephropathy (BEN) is a familial chronic tubulointerstitial disease with insidious onset and slow progression leading to terminal renal failure. The results of molecular biological investigations suggest that BEN is a multifactorial disease with genetic predisposition to environmental risk agents. Exome sequencing of 22 000 genes with the Illumina Nextera Exome Enrichment Kit was performed on 22 DNA samples (11 Bulgarian patients and 11 Serbian patients). Software analysis was performed via NextGene, Provean, and PolyPhen. The frequency of all annotated genetic variants with deleterious/damaging effect was compared with that in European populations. Then we focused on nonannotated variants (with no data available about them and not found in healthy Bulgarian controls). There was no statistically significant difference in annotated variants between BEN patients and European populations. From nonannotated variants with more than 40% frequency in both patient groups, we nominated 3 genes with possible deleterious/damaging variants: CELA1, HSPG2, and KCNK5. These genes encode proteins involved in basement membrane/extracellular matrix and vascular tone, which are tightly connected to the process of angiogenesis. We suggest that an abnormal process of angiogenesis plays a key role in the molecular pathogenesis of BEN.
Abstract:
Stylonychia lemnae is a classical model single-celled eukaryote, and a quintessential ciliate typified by dimorphic nuclei: a small, germline micronucleus and a massive, vegetative macronucleus. The genome within Stylonychia's macronucleus has a very unusual architecture, comprising variably and highly amplified "nanochromosomes," each usually encoding a single gene with a minimal amount of surrounding noncoding DNA. As only a tiny fraction of the Stylonychia genes has been sequenced, and to promote research using this organism, we sequenced its macronuclear genome. We report the analysis of the 50.2-Mb draft S. lemnae macronuclear genome assembly, containing in excess of 16,000 complete nanochromosomes, assembled as fewer than 20,000 contigs. We found considerable conservation of fundamental genomic properties between S. lemnae and its close relative, Oxytricha trifallax, including nanochromosomal gene synteny, alternative fragmentation, and copy number. Protein domain searches in Stylonychia revealed two new telomere-binding protein homologs and the presence of linker histones. Among the diverse histone variants of S. lemnae and O. trifallax, we found divergent, coexpressed variants corresponding to four of the five core nucleosomal proteins (H1.2, H2A.6, H2B.4, and H3.7), suggesting that these ciliates may possess specialized nucleosomes involved in genome processing during nuclear differentiation. The assembly of the S. lemnae macronuclear genome demonstrates that largely complete, well-assembled, highly fragmented genomes of similar size and complexity may be produced from one library and lane of Illumina HiSeq 2000 shotgun sequencing. The provision of the S. lemnae macronuclear genome sets the stage for future detailed experimental studies of chromatin-mediated, RNA-guided developmental genome rearrangements.
Abstract:
The recent development of a goat SNP genotyping microarray enables genome-wide association studies in this important livestock species. We investigated the genetic basis of the black and brown coat colour in Valais Blacknecked and Coppernecked goats. A genome-wide association analysis using goat SNP50 BeadChip genotypes of 22 cases and 23 controls allowed us to map the locus for the brown coat colour to goat chromosome 8. The TYRP1 gene is located within the associated chromosomal region, and TYRP1 variants cause similar coat colour phenotypes in different species. We thus considered TYRP1 a strong positional and functional candidate. We resequenced the caprine TYRP1 gene by Sanger and Illumina sequencing and identified two non-synonymous variants, p.Ile478Thr and p.Gly496Asp, that might have a functional impact on the TYRP1 protein. However, based on the obtained pedigree and genotype data, the brown coat colour in these goats is not due to a single recessive loss-of-function allele. Surprisingly, the genotype distribution and the pedigree data suggest that the 496Asp allele might act in a dominant manner. The 496Asp allele was present in 77 of 81 investigated Coppernecked goats and did not occur in black goats. This strongly suggests heterogeneity underlying the brown coat colour in Coppernecked goats. Functional experiments or targeted matings will be required to verify these unexpected preliminary findings.
Abstract:
BACKGROUND: A novel Gram-negative, non-haemolytic, non-motile, rod-shaped bacterium was discovered in the lungs of a dead parakeet (Melopsittacus undulatus) that was kept in captivity in a pet shop in Basel, Switzerland. The organism is described with a chemotaxonomic profile and the nearly complete genome sequence obtained through the assembly of short sequence reads. RESULTS: Genome sequence analysis and characterization of respiratory quinones, fatty acids, polar lipids, and biochemical phenotype are presented here. Comparison of gene sequences revealed that the most similar species is Pelistega europaea, with BLAST identities of only 93% to the 16S rDNA gene and 76% to the rpoB gene, and a GC content (~43%) similar to that of the organism isolated from the parakeet, DSM 24701 (40%). The closest full genome sequences are those of Bordetella spp. and Taylorella spp. High-throughput sequencing reads from the Illumina-Solexa platform were assembled with the Edena de novo assembler to form 195 contigs comprising the ~2 Mb genome. Genome annotation with RAST, construction of phylogenetic trees with the 16S rDNA (rrs) gene sequence and the rpoB gene, and phylogenetic placement using other highly conserved marker genes with ML Tree all suggest that the bacterial species belongs to the Alcaligenaceae family. Analysis of samples from cages with healthy parakeets suggested that the newly discovered bacterial species is not widespread in parakeet living quarters. CONCLUSIONS: Classification of this organism in the current taxonomy system requires the formation of a new genus and species. We designate the new genus Basilea and the new species psittacipulmonis. The type strain of Basilea psittacipulmonis is DSM 24701 (= CIP 110308 T, 16S rDNA gene sequence GenBank accession number JX412111 and GI 406042063).
Abstract:
BACKGROUND: A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS: We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS: The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS: Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence-level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture.
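Per-animal imputation accuracy of the kind reported above can be measured in more than one way. The sketch below shows two common measures, genotype concordance and the Pearson correlation between true and imputed genotypes, assuming genotypes are coded as alternate-allele counts (0, 1, 2). The abstract does not specify how its percentages were computed, so the function names and the coding scheme are illustrative assumptions.

```python
from math import sqrt


def concordance(true_genotypes, imputed_genotypes):
    """Fraction of markers at which the imputed genotype equals the true one."""
    if len(true_genotypes) != len(imputed_genotypes) or not true_genotypes:
        raise ValueError("genotype lists must be non-empty and of equal length")
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)


def genotype_correlation(true_genotypes, imputed_genotypes):
    """Pearson correlation between true and imputed allele counts (0/1/2)."""
    if len(true_genotypes) != len(imputed_genotypes) or not true_genotypes:
        raise ValueError("genotype lists must be non-empty and of equal length")
    n = len(true_genotypes)
    mean_t = sum(true_genotypes) / n
    mean_i = sum(imputed_genotypes) / n
    cov = sum((t - mean_t) * (i - mean_i)
              for t, i in zip(true_genotypes, imputed_genotypes))
    var_t = sum((t - mean_t) ** 2 for t in true_genotypes)
    var_i = sum((i - mean_i) ** 2 for i in imputed_genotypes)
    if var_t == 0 or var_i == 0:
        raise ValueError("correlation is undefined when one list has no variation")
    return cov / sqrt(var_t * var_i)
```

Concordance is easy to interpret but is inflated at markers with a rare minor allele, which is one reason a correlation is often reported alongside it.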
Abstract:
Leopard Complex spotting occurs in several breeds of horses and is caused by an incompletely dominant allele (LP). Homozygosity for LP is also associated with congenital stationary night blindness (CSNB) in Appaloosa horses. Previously, LP was mapped to a 6 cM region on ECA1 containing the candidate gene TRPM1 (Transient Receptor Potential Cation Channel, Subfamily M, Member 1), and decreased expression of this gene, measured by qRT-PCR, was identified as the likely cause of both spotting and ocular phenotypes. This study describes investigations for a mutation causing or associated with the Leopard Complex and CSNB phenotype in horses. Re-sequencing of the gene and associated splice sites within the 105 624 bp genomic region of TRPM1 led to the discovery of 18 SNPs. Most of the SNPs did not have a predictive value for the presence of LP. However, one SNP (ECA1:108,249,293 C>T) found within intron 11 had a strong (P < 0.0005), but not complete, association with LP and CSNB and thus is a good marker but unlikely to be causative. To further localize the association, 70 SNPs spanning over 2 Mb including the TRPM1 gene were genotyped in 192 horses from three different breeds segregating for LP. A single 173 kb haplotype associated with LP and CSNB (ECA1:108,197,355-108,370,150) was identified. Illumina sequencing of 300 kb surrounding this haplotype revealed 57 SNP variants. Based on their localization within expressed sequences or regions of high sequence conservation across mammals, six of these SNPs were considered to be the most likely candidate mutations. While the precise function of TRPM1 remains to be elucidated, this work solidifies its functional role in both pigmentation and night vision. Further, this work has identified several potential regulatory elements of the TRPM1 gene that should be investigated further in this and other species.
Abstract:
A novel canine muscular dystrophy in Landseer dogs was observed. We had access to five affected dogs from two litters. The clinical signs started at a few weeks of age, and the severe progressive muscle weakness led to euthanasia between 5 and 15 months of age. The pedigrees of the affected dogs suggested a monogenic autosomal recessive inheritance of the trait. Linkage and homozygosity mapping indicated two potential genome segments for the causative variant on chromosomes 10 and 31, harboring a total of 4.8 Mb of DNA or 0.2% of the canine genome. Using Illumina sequencing technology, we obtained a whole genome sequence from one affected Landseer. Variants were called with respect to the dog reference genome and compared to the genetic variants of 170 control dogs from other breeds. The affected Landseer dog was homozygous for a single private non-synonymous variant in the critical intervals, a nonsense variant in the COL6A1 gene (Chr31:39,303,964G>T; COL6A1:c.289G>T; p.E97*). Genotypes at this variant showed perfect concordance with the muscular dystrophy phenotype in all five cases and more than one thousand control dogs. Variants in the human COL6A1 gene cause Bethlem myopathy or Ullrich congenital muscular dystrophy. We therefore conclude that the identified canine COL6A1 variant is most likely causative for the observed muscular dystrophy in Landseer dogs. Based on the nature of the genetic variant in Landseer dogs and their severe clinical phenotype, these dogs represent a model for human Ullrich congenital muscular dystrophy.
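The filtering logic described in this abstract (variants inside the mapped intervals, homozygous in the affected dog, absent from controls, and protein-changing) can be sketched as follows. This is a toy illustration, not the authors' pipeline: the `Variant` record, the effect labels and above all the interval boundaries in `CRITICAL_INTERVALS` are hypothetical, since the abstract gives only the chromosomes and the combined interval size.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    chrom: str
    pos: int
    case_genotype: str     # "hom_alt", "het" or "hom_ref" in the affected dog
    control_carriers: int  # number of control dogs carrying the alternate allele
    effect: str            # e.g. "nonsense", "missense", "synonymous"


# Hypothetical boundaries: the abstract does not report the interval coordinates.
CRITICAL_INTERVALS = {
    "chr10": (1_000_000, 3_000_000),
    "chr31": (38_000_000, 40_800_000),
}

# Assumed set of effect labels treated as protein-changing.
PROTEIN_CHANGING = {"nonsense", "missense", "frameshift", "splice_site"}


def in_critical_interval(variant):
    """True if the variant lies inside one of the mapped intervals."""
    bounds = CRITICAL_INTERVALS.get(variant.chrom)
    return bounds is not None and bounds[0] <= variant.pos <= bounds[1]


def candidate_variants(variants):
    """Keep private, homozygous, protein-changing variants in the intervals."""
    return [
        v for v in variants
        if in_critical_interval(v)
        and v.case_genotype == "hom_alt"
        and v.control_carriers == 0
        and v.effect in PROTEIN_CHANGING
    ]
```

Requiring homozygosity in the case and zero carriers among controls reflects the recessive inheritance suggested by the pedigrees; for a dominant trait the genotype condition would have to be relaxed.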
Abstract:
The identification of quantitative trait loci (QTL) for traits such as height, and of their underlying causative variants, is still challenging and often requires large sample sizes. In humans, hundreds of loci with small effects control the heritable portion of height variability. In domestic animals, typically only a few loci with comparatively large effects explain a major fraction of the heritability. We investigated height at withers in Shetland ponies and mapped a QTL to ECA 6 by genome-wide association study (GWAS) using a small cohort of only 48 animals and the Illumina equine SNP70 BeadChip. Fine-mapping revealed a shared haplotype block of 793 kb in small Shetland ponies. The HMGA2 gene, known to be associated with height in horses and many other species, was located in the associated haplotype. After closing a gap in the equine reference genome, we identified a non-synonymous variant in the first exon of HMGA2 in small Shetland ponies. The variant was predicted to affect the functionally important first AT-hook DNA binding domain of the HMGA2 protein (c.83G>A; p.G28E). We assessed the functional impact and found impaired DNA binding of a peptide with the mutant sequence in an electrophoretic mobility shift assay. This suggests that the HMGA2 variant also affects DNA binding in vivo and thus leads to reduced growth and a smaller stature in Shetland ponies. The identified HMGA2 variant also segregates in several other pony breeds but was not found in regular-sized horse breeds. We therefore conclude that we identified a quantitative trait nucleotide for height in horses.
Abstract:
Whole exome sequencing (WES) is increasingly used in research and diagnostics. WES users expect coverage of the entire coding region of known genes as well as sufficient read depth for the covered regions. It is, however, unknown which recent WES platform is most suitable to meet these expectations. We present insights into the performance of the most recent standard exome enrichment platforms from Agilent, NimbleGen and Illumina applied to six different DNA samples by two sequencing vendors per platform. Our results suggest that both Agilent and NimbleGen overall perform better than Illumina and that the high enrichment performance of Agilent is stable among samples and between vendors, whereas NimbleGen is only able to achieve vendor- and sample-specific best exome coverage. Moreover, the recent Agilent platform overall captures more coding exons with sufficient read depth than NimbleGen and Illumina. Due to considerable gaps in effective exome coverage, however, the three platforms cannot capture all known coding exons alone or in combination, requiring improvement. Our data emphasize the importance of evaluation of updated platform versions and suggest that enrichment-free whole genome sequencing can overcome the limitations of WES in sufficiently covering coding exons, especially GC-rich regions, and in characterizing structural variants.
Abstract:
A complete reference genome of the Apis mellifera Filamentous virus (AmFV) was determined using Illumina HiSeq sequencing. The AmFV genome is a double-stranded DNA molecule of approximately 498,500 nucleotides with a GC content of 50.8%. It encompasses 247 non-overlapping open reading frames (ORFs), equally distributed on both strands, which cover 65% of the genome. While most of the ORFs lacked threshold sequence alignments to reference protein databases, twenty-eight were found to display significant homologies with proteins present in other large double-stranded DNA viruses. Remarkably, 13 ORFs had strong similarity with typical baculovirus domains such as PIFs (per os infectivity factor genes: pif-1, pif-2, pif-3 and p74) and BRO (Baculovirus Repeated Open Reading Frame). The putative AmFV DNA polymerase is of type B, but is only distantly related to those of the baculoviruses. The ORFs encoding proteins involved in nucleotide metabolism had the highest percent identity to viral proteins in GenBank. Other notable features include the presence of several collagen-like, chitin-binding, kinesin and pacifastin domains. Due to the large size of the AmFV genome and the inconsistent affiliation with other large double-stranded DNA virus families infecting invertebrates, AmFV may belong to a new virus family.
Abstract:
Deep polar ice cores provide atmospheric records of nitrous oxide (N₂O) and other trace gases reflecting climate history, along with a parallel archive of microbial cells transported with mineral dust, marine and volcanic aerosols from around the globe. Our interdisciplinary study of 32 samples from different depths of the recently drilled NEEM Greenland ice core addressed the question of whether the identified microorganisms were capable of post-depositional biological production of N₂O in situ. We used high-resolution geochemical and microbiological approaches to examine the N₂O concentrations, the quantitative distributions of dust and of Ca²⁺, NH₄⁺ and NO₃⁻ ions related to N cycle pathways, and the microbial abundance and diversity at specific NEEM core depths from 1758 m to 1867.8 m. Results showed varying concentrations of N₂O (220–271.5 ppb). Microbial abundance fluctuated between 3.3 × 10⁴ and 3.3 × 10⁶ cells mL⁻¹ in direct correlation with dust and Ca²⁺ concentrations, with higher cell numbers deposited during colder periods. The average values of NH₄⁺ and NO₃⁻ indicated that substrates were available for the microorganisms capable of utilizing them. PCR amplification of selected functional genes involved in bacterial and archaeal nitrification and denitrification was not successful. Sanger and Illumina MiSeq sequence analyses of SSU rRNA genes showed variable representation of Alpha-, Beta- and Gammaproteobacteria, Firmicutes, Actinobacteria, chloroplasts and fungi. The metabolic potential of the dominant genera of Proteobacteria and Firmicutes as possible N₂O producers suggested that denitrification activity may have led to in situ production and accumulation of N₂O.