10 resultados para PCR. Sequencing

em Duke University


Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high coverage whole-genome sequencing to those identified by high coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole genome sequencing were captured using RNA-Seq; this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We used ultra-deep sequencing to obtain tens of thousands of HIV-1 sequences from regions targeted by CD8+ T lymphocytes from longitudinal samples from three acutely infected subjects, and modeled viral evolution during the critical first weeks of infection. Previous studies suggested that a single virus established productive infection, but these conclusions were tempered because of limited sampling; now, we have greatly increased our confidence in this observation through modeling the observed earliest sample diversity based on vastly more extensive sampling. Conventional sequencing of HIV-1 from acute/early infection has shown different patterns of escape at different epitopes; we investigated the earliest escapes in exquisite detail. Over 3-6 weeks, ultradeep sequencing revealed that the virus explored an extraordinary array of potential escape routes in the process of evading the earliest CD8 T-lymphocyte responses--using 454 sequencing, we identified over 50 variant forms of each targeted epitope during early immune escape, while only 2-7 variants were detected in the same samples via conventional sequencing. In contrast to the diversity seen within epitopes, non-epitope regions, including the Envelope V3 region, which was sequenced as a control in each subject, displayed very low levels of variation. In early infection, in the regions sequenced, the consensus forms did not have a fitness advantage large enough to trigger reversion to consensus amino acids in the absence of immune pressure. In one subject, a genetic bottleneck was observed, with extensive diversity at the second time point narrowing to two dominant escape forms by the third time point, all within two months of infection. Traces of immune escape were observed in the earliest samples, suggesting that immune pressure is present and effective earlier than previously reported; quantifying the loss rate of the founder virus suggests a direct role for CD8 T-lymphocyte responses in viral containment after peak viremia. Dramatic shifts in the frequencies of epitope variants during the first weeks of infection revealed a complex interplay between viral fitness and immune escape.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A precise molecular identification of transmitted hepatitis C virus (HCV) genomes could illuminate key aspects of transmission biology, immunopathogenesis and natural history. We used single genome sequencing of 2,922 half or quarter genomes from plasma viral RNA to identify transmitted/founder (T/F) viruses in 17 subjects with acute community-acquired HCV infection. Sequences from 13 of 17 acute subjects, but none of 14 chronic controls, exhibited one or more discrete low diversity viral lineages. Sequences within each lineage generally revealed a star-like phylogeny of mutations that coalesced to unambiguous T/F viral genomes. Numbers of transmitted viruses leading to productive clinical infection were estimated to range from 1 to 37 or more (median = 4). Four acutely infected subjects showed a distinctly different pattern of virus diversity that deviated from a star-like phylogeny. In these cases, empirical analysis and mathematical modeling suggested high multiplicity virus transmission from individuals who themselves were acutely infected or had experienced a virus population bottleneck due to antiviral drug therapy. These results provide new quantitative and qualitative insights into HCV transmission, revealing for the first time virus-host interactions that successful vaccines or treatment interventions will need to overcome. Our findings further suggest a novel experimental strategy for identifying full-length T/F genomes for proteome-wide analyses of HCV biology and adaptation to antiviral drug or immune pressures.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Single-molecule sequencing instruments can generate multikilobase sequences with the potential to greatly improve genome and transcriptome assembly. However, the error rates of single-molecule reads are high, which has limited their use thus far to resequencing bacteria. To address this limitation, we introduce a correction algorithm and assembly strategy that uses short, high-fidelity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on reads generated by a PacBio RS instrument from phage, prokaryotic and eukaryotic whole genomes, including the previously unsequenced genome of the parrot Melopsittacus undulatus, as well as for RNA-Seq reads of the corn (Zea mays) transcriptome. Our long-read correction achieves >99.9% base-call accuracy, leading to substantially better assemblies than current sequencing strategies: in the best example, the median contig size was quintupled relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Mitochondria are responsible for producing the vast majority of cellular ATP, and are therefore critical to organismal health [1]. They contain thir own genomes (mtDNA) which encode 13 proteins that are all subunits of the mitochondrial respiratory chain (MRC) and are essential for oxidative phosphorylation [2]. mtDNA is present in multiple copies per cell, usually between 103 and 104 , though this number is reduced during certain developmental stages [3, 4]. The health of the mitochondrial genome is also important to the health of the organism, as mutations in mtDNA lead to human diseases that collectively affect approximately 1 in 4000 people [5, 6]. mtDNA is more susceptible than nuclear DNA (nucDNA) to damage by many environmental pollutants, for reasons including the absence of Nucleotide Excision Repair (NER) in the mitochondria [7]. NER is a highly functionally conserved DNA repair pathway that removes bulky, helix distorting lesions such as those caused by ultraviolet C (UVC) radiation and also many environmental toxicants, including benzo[a]pyrene (BaP) [8]. While these lesions cannot be repaired, they are slowly removed through a process that involves mitochondrial dynamics and autophagy [9, 10]. However, when present during development in C. elegans, this damage reduces mtDNA copy number and ATP levels [11]. We hypothesize that this damage, when present during development, will result in mitochondrial dysfunction and increase the potential for adverse outcomes later in life.

To test this hypothesis, 1st larval stage (L1) C. elegans are exposed to 3 doses of 7.5J/m2 ultraviolet C radiation 24 hours apart, leading to the accumulation of mtDNA damage [9, 11]. After exposure, many mitochondrial endpoints are assessed at multiple time points later in life. mtDNA and nucDNA damage levels and genome copy numbers are measured via QPCR and real-time PCR , respectively, every 2 day for 10 days. Steady state ATP levels are measured via luciferase expressing reporter strains and traditional ATP extraction methods. Oxygen consumption is measured using a Seahorse XFe24 extra cellular flux analyzer. Gene expression changes are measured via real time PCR and targeted metabolomics via LC-MS are used to investigate changes in organic acid, amino acid and acyl-carnitine levels. Lastly, nematode developmental delay is assessed as growth, and measured via imaging and COPAS biosort.

I have found that despite being removed, UVC induced mtDNA damage during development leads to persistent deficits in energy production later in life. mtDNA copy number is permanently reduced, as are ATP levels, though oxygen consumption is increased, indicating inefficient or uncoupled respiration. Metabolomic data and mutant sensitivity indicate a role for NADPH and oxidative stress in these results, and exposed nematodes are more sensitive to the mitochondrial poison rotenone later in life. These results fit with the developmental origin of health and disease hypothesis, and show the potential for environmental exposures to have lasting effects on mitochondrial function.

Lastly, we are currently working to investigate the potential for irreparable mtDNA lesions to drive mutagenesis in mtDNA. Mutations in mtDNA lead to a wide range of diseases, yet we currently do not understand the environmental component of what causes them. In vitro evidence suggests that UVC induced thymine dimers can be mutagenic [12]. We are using duplex sequencing of C. elegans mtDNA to determine mutation rates in nematodes exposed to our serial UVC protocol. Furthermore, by including mutant strains deficient in mitochondrial fission and mitophagy, we hope to determine if deficiencies in these processes will further increase mtDNA mutation rates, as they are implicated in human diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Limited data are available regarding the molecular epidemiology of Mycobacterium tuberculosis (Mtb) strains circulating in Guatemala. Beijing-lineage Mtb strains have gained prevalence worldwide and are associated with increased virulence and drug resistance, but there have been only a few cases reported in Central America. Here we report the first whole genome sequencing of Central American Beijing-lineage strains of Mtb. We find that multiple Beijing-lineage strains, derived from independent founding events, are currently circulating in Guatemala, but overall still represent a relatively small proportion of disease burden. Finally, we identify a specific Beijing-lineage outbreak centered on a poor neighborhood in Guatemala City.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The purpose of this research was to use next generation sequencing to identify mutations in patients with primary immunodeficiency diseases whose pathogenic gene mutations had not been identified. Remarkably, four unrelated patients were found by next generation sequencing to have the same heterozygous mutation in an essential donor splice site of PIK3R1 (NM_181523.2:c.1425 + 1G > A) found in three prior reports. All four had the Hyper IgM syndrome, lymphadenopathy and short stature, and one also had SHORT syndrome. They were investigated with in vitro immune studies, RT-PCR, and immunoblotting studies of the mutation's effect on mTOR pathway signaling. All patients had very low percentages of memory B cells and class-switched memory B cells and reduced numbers of naïve CD4+ and CD8+ T cells. RT-PCR confirmed the presence of both an abnormal 273 base-pair (bp) size and a normal 399 bp size band in the patient and only the normal band was present in the parents. Following anti-CD40 stimulation, patient's EBV-B cells displayed higher levels of S6 phosphorylation (mTOR complex 1 dependent event), Akt phosphorylation at serine 473 (mTOR complex 2 dependent event), and Akt phosphorylation at threonine 308 (PI3K/PDK1 dependent event) than controls, suggesting elevated mTOR signaling downstream of CD40. These observations suggest that amino acids 435-474 in PIK3R1 are important for its stability and also its ability to restrain PI3K activity. Deletion of Exon 11 leads to constitutive activation of PI3K signaling. This is the first report of this mutation and immunologic abnormalities in SHORT syndrome.