19 results for SEQUENCING REVEALS

at Duke University


Relevance:

30.00%

Publisher:

Abstract:

Lymphomas comprise a diverse group of malignancies derived from immune cells. High-throughput sequencing has recently emerged as a powerful and versatile method for analysis of the cancer genome and transcriptome. As these data continue to emerge, the crucial work lies in sorting through the wealth of information to home in on the critical aspects that will give us a better understanding of biology and new insight into how to treat disease. Finding the important signals within these large data sets is one of the major challenges of next-generation sequencing.

In this dissertation, I have developed several complementary strategies to describe the genetic underpinnings of lymphomas. I begin by developing a better method for RNA sequencing that enables strand-specific total RNA sequencing and alternative splicing profiling in the same analysis. I then combine this RNA sequencing technique with whole-exome sequencing to better understand the global landscape of aberrations in these diseases. Finally, I use traditional cell and molecular biology techniques to define the consequences of major genetic alterations in lymphoma.

Through this analysis, I find recurrent silencing mutations in the G alpha binding protein GNA13 and associated focal adhesion proteins. I aim to describe how loss-of-function mutations in GNA13 can be oncogenic in the context of germinal center B cell biology. Using in vitro techniques including liquid chromatography-mass spectrometry and knockdown and overexpression of genes in B cell lymphoma cell lines, I determine protein binding partners and downstream effectors of GNA13. I also develop a transgenic mouse model to study the role of GNA13 in the germinal center in vivo to determine effects of GNA13 deletion on germinal center structure and cell migration.

Thus, I have developed complementary approaches that span the spectrum from discovery to context-dependent gene models that afford a better understanding of the biological function of aberrant events and ultimately result in a better understanding of disease.

Relevance:

20.00%

Publisher:

Abstract:

BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery of host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.
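As a rough illustration of how base-calling quality scores can feed a probabilistic error model, the sketch below converts Phred scores to error probabilities (the standard relation P = 10^(-Q/10)) and combines several overlapping reads into a posterior over the true base at one position. This is not the authors' trace assembler: the function names, the uniform prior, the independence of reads, and the assumption that a miscall is equally likely to be any of the three other bases are all illustrative choices.

```python
import math

def phred_to_error_prob(q):
    """Standard Phred relation: probability that a base call is wrong."""
    return 10 ** (-q / 10)

def base_posterior(observations, alphabet="ACGT"):
    """Posterior over the true base at one position.

    observations: list of (called_base, phred_quality) from overlapping reads.
    Assumptions (illustrative only): uniform prior, independent reads, and
    errors spread evenly over the three alternative bases.
    """
    log_like = {}
    for true_base in alphabet:
        total = 0.0
        for called, q in observations:
            # Clamp so that Q=0 (error prob 1) does not produce log(0).
            e = min(max(phred_to_error_prob(q), 1e-12), 0.75)
            total += math.log(1 - e) if called == true_base else math.log(e / 3)
        log_like[true_base] = total
    # Normalise in log space for numerical stability.
    peak = max(log_like.values())
    weights = {b: math.exp(v - peak) for b, v in log_like.items()}
    norm = sum(weights.values())
    return {b: w / norm for b, w in weights.items()}

if __name__ == "__main__":
    reads = [("A", 30), ("A", 30), ("C", 10)]
    for base, prob in base_posterior(reads).items():
        print(base, round(prob, 6))
```

Two high-quality calls of one base outweigh a single low-quality call of another, which is the basic reason a quality-aware model can recover gene sequences from shallow coverage where a simple majority vote would be unreliable.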

Relevance:

20.00%

Publisher:

Abstract:

BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost-effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high-coverage whole-genome sequencing to those identified by high-coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole-genome sequencing were captured using RNA-Seq, this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.
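The comparison described above can be sketched as a set operation over variant calls. The example below is a minimal illustration, not the study's pipeline: it assumes the whole-genome calls are treated as the reference set, represents each variant as a hypothetical (chromosome, position, ref, alt) tuple, and reports the fraction of reference variants recovered by RNA-Seq together with the fraction of RNA-Seq calls that the reference does not confirm. The toy data are invented for the example.

```python
def compare_calls(wgs_variants, rnaseq_variants):
    """Compare RNA-Seq variant calls against whole-genome calls.

    Both inputs are iterables of hashable variant keys, e.g.
    (chrom, pos, ref, alt). WGS calls are treated as the reference set,
    which is an assumption of this sketch.
    """
    wgs = set(wgs_variants)
    rna = set(rnaseq_variants)
    shared = wgs & rna
    return {
        "shared": len(shared),
        # Fraction of reference variants that RNA-Seq recovered.
        "sensitivity": len(shared) / len(wgs) if wgs else float("nan"),
        # Fraction of RNA-Seq calls absent from the reference set.
        "unconfirmed_fraction": len(rna - wgs) / len(rna) if rna else float("nan"),
    }

if __name__ == "__main__":
    # Toy data, invented for illustration only.
    wgs = [("chr1", 100, "A", "G"), ("chr1", 250, "C", "T"),
           ("chr2", 40, "G", "A"), ("chr2", 900, "T", "C"),
           ("chr3", 7, "A", "T")]
    rna = [("chr1", 100, "A", "G"), ("chr2", 40, "G", "A"),
           ("chr5", 12, "C", "G")]
    print(compare_calls(wgs, rna))
```

Restricting both sets to variants in well-expressed genes before calling the function would mirror the stratified comparison the abstract reports.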

Relevance:

20.00%

Publisher:

Abstract:

BACKGROUND: The superior colliculus (SC) has been shown to play a crucial role in the initiation and coordination of eye- and head-movements. The knowledge about the function of this structure is mainly based on single-unit recordings in animals with relatively few neuroimaging studies investigating eye-movement related brain activity in humans. METHODOLOGY/PRINCIPAL FINDINGS: The present study employed high-field (7 Tesla) functional magnetic resonance imaging (fMRI) to investigate SC responses during endogenously cued saccades in humans. In response to centrally presented instructional cues, subjects either performed saccades away from (centrifugal) or towards (centripetal) the center of straight gaze or maintained fixation at the center position. Compared to central fixation, the execution of saccades elicited hemodynamic activity within a network of cortical and subcortical areas that included the SC, lateral geniculate nucleus (LGN), occipital cortex, striatum, and the pulvinar. CONCLUSIONS/SIGNIFICANCE: Activity in the SC was enhanced contralateral to the direction of the saccade (i.e., greater activity in the right as compared to left SC during leftward saccades and vice versa) during both centrifugal and centripetal saccades, thereby demonstrating that the contralateral predominance for saccade execution that has been shown to exist in animals is also present in the human SC. In addition, centrifugal saccades elicited greater activity in the SC than did centripetal saccades, while also being accompanied by an enhanced deactivation within the prefrontal default-mode network. This pattern of brain activity might reflect the reduced processing effort required to move the eyes toward as compared to away from the center of straight gaze, a position that might serve as a spatial baseline in which the retinotopic and craniotopic reference frames are aligned.

Relevance:

20.00%

Publisher:

Abstract:

We used ultra-deep sequencing to obtain tens of thousands of HIV-1 sequences from regions targeted by CD8+ T lymphocytes from longitudinal samples from three acutely infected subjects, and modeled viral evolution during the critical first weeks of infection. Previous studies suggested that a single virus established productive infection, but these conclusions were tempered because of limited sampling; now, we have greatly increased our confidence in this observation through modeling the observed earliest sample diversity based on vastly more extensive sampling. Conventional sequencing of HIV-1 from acute/early infection has shown different patterns of escape at different epitopes; we investigated the earliest escapes in exquisite detail. Over 3-6 weeks, ultra-deep sequencing revealed that the virus explored an extraordinary array of potential escape routes in the process of evading the earliest CD8 T-lymphocyte responses. Using 454 sequencing, we identified over 50 variant forms of each targeted epitope during early immune escape, while only 2-7 variants were detected in the same samples via conventional sequencing. In contrast to the diversity seen within epitopes, non-epitope regions, including the Envelope V3 region, which was sequenced as a control in each subject, displayed very low levels of variation. In early infection, in the regions sequenced, the consensus forms did not have a fitness advantage large enough to trigger reversion to consensus amino acids in the absence of immune pressure. In one subject, a genetic bottleneck was observed, with extensive diversity at the second time point narrowing to two dominant escape forms by the third time point, all within two months of infection.
Traces of immune escape were observed in the earliest samples, suggesting that immune pressure is present and effective earlier than previously reported; quantifying the loss rate of the founder virus suggests a direct role for CD8 T-lymphocyte responses in viral containment after peak viremia. Dramatic shifts in the frequencies of epitope variants during the first weeks of infection revealed a complex interplay between viral fitness and immune escape.

Relevance:

20.00%

Publisher:

Abstract:

BACKGROUND: Isometric muscle contraction, where force is generated without muscle shortening, is a molecular traffic jam in which the number of actin-attached motors is maximized and all states of motor action are trapped with consequently high heterogeneity. This heterogeneity is a major limitation to deciphering myosin conformational changes in situ. METHODOLOGY: We used multivariate data analysis to group repeat segments in electron tomograms of isometrically contracting insect flight muscle, mechanically monitored, rapidly frozen, freeze substituted, and thin sectioned. Improved resolution reveals the helical arrangement of F-actin subunits in the thin filament enabling an atomic model to be built into the thin filament density independent of the myosin. Actin-myosin attachments can now be assigned as weak or strong by their motor domain orientation relative to actin. Myosin attachments were quantified everywhere along the thin filament including troponin. Strong binding myosin attachments are found on only four F-actin subunits, the "target zone", situated exactly midway between successive troponin complexes. They show an axial lever arm range of 77°/12.9 nm. The lever arm azimuthal range of strong binding attachments has a highly skewed, 127° range compared with X-ray crystallographic structures. Two types of weak actin attachments are described. One type, found exclusively in the target zone, appears to represent pre-working-stroke intermediates. The other, which contacts tropomyosin rather than actin, is positioned M-ward of the target zone, i.e. the position toward which thin filaments slide during shortening. CONCLUSION: We present a model for the weak to strong transition in the myosin ATPase cycle that incorporates azimuthal movements of the motor domain on actin. Stress/strain in the S2 domain may explain azimuthal lever arm changes in the strong binding attachments. 
The results support previous conclusions that the weak attachments preceding force generation are very different from strong binding attachments.

Relevance:

20.00%

Publisher:

Abstract:

Phosphorylation of GTP-binding-regulatory (G)-protein-coupled receptors by specific G-protein-coupled receptor kinases (GRKs) is a major mechanism responsible for agonist-mediated desensitization of signal transduction processes. However, to date, studies of the specificity of these enzymes have been hampered by the difficulty of preparing the purified and reconstituted receptor preparations required as substrates. Here we describe an approach that obviates this problem by utilizing highly purified membrane preparations from Sf9 and 293 cells overexpressing G-protein-coupled receptors. We use this technique to demonstrate specificity of several GRKs with respect to both receptor substrates and the enhancing effects of G-protein beta gamma subunits on phosphorylation. Enriched membrane preparations of the beta 2- and alpha 2-C2-adrenergic receptors (ARs, where alpha 2-C2-AR refers to the AR whose gene is located on human chromosome 2) prepared by sucrose density gradient centrifugation from Sf9 or 293 cells contain the receptor at 100-300 pmol/mg of protein and serve as efficient substrates for agonist-dependent phosphorylation by beta-AR kinase 1 (GRK2), beta-AR kinase 2 (GRK3), or GRK5. Stoichiometries of agonist-mediated phosphorylation of the receptors by GRK2 (beta-AR kinase 1), in the absence and presence of G beta gamma, are 1 and 3 mol/mol, respectively. The rate of phosphorylation of the membrane receptors is 3 times faster than that of purified and reconstituted receptors. While phosphorylation of the beta 2-AR by GRK2, -3, and -5 is similar, the activity of GRK2 and -3 is enhanced by G beta gamma whereas that of GRK5 is not. In contrast, whereas GRK2 and -3 efficiently phosphorylate alpha 2-C2-AR, GRK5 is quite weak. The availability of a simple direct phosphorylation assay applicable to any cloned G-protein-coupled receptor should greatly facilitate elucidation of the mechanisms of regulation of these receptors by the expanding family of GRKs.

Relevance:

20.00%

Publisher:

Abstract:

Light-dependent deactivation of rhodopsin as well as homologous desensitization of beta-adrenergic receptors involves receptor phosphorylation that is mediated by the highly specific protein kinases rhodopsin kinase (RK) and beta-adrenergic receptor kinase (beta ARK), respectively. We report here the cloning of a complementary DNA for RK. The deduced amino acid sequence shows a high degree of homology to beta ARK. In a phylogenetic tree constructed by comparing the catalytic domains of several protein kinases, RK and beta ARK are located on a branch close to, but separate from the cyclic nucleotide-dependent protein kinase and protein kinase C subfamilies. From the common structural features we conclude that both RK and beta ARK are members of a newly delineated gene family of guanine nucleotide-binding protein (G protein)-coupled receptor kinases that may function in diverse pathways to regulate the function of such receptors.

Relevance:

20.00%

Publisher:

Abstract:

A precise molecular identification of transmitted hepatitis C virus (HCV) genomes could illuminate key aspects of transmission biology, immunopathogenesis and natural history. We used single genome sequencing of 2,922 half or quarter genomes from plasma viral RNA to identify transmitted/founder (T/F) viruses in 17 subjects with acute community-acquired HCV infection. Sequences from 13 of 17 acute subjects, but none of 14 chronic controls, exhibited one or more discrete low diversity viral lineages. Sequences within each lineage generally revealed a star-like phylogeny of mutations that coalesced to unambiguous T/F viral genomes. Numbers of transmitted viruses leading to productive clinical infection were estimated to range from 1 to 37 or more (median = 4). Four acutely infected subjects showed a distinctly different pattern of virus diversity that deviated from a star-like phylogeny. In these cases, empirical analysis and mathematical modeling suggested high multiplicity virus transmission from individuals who themselves were acutely infected or had experienced a virus population bottleneck due to antiviral drug therapy. These results provide new quantitative and qualitative insights into HCV transmission, revealing for the first time virus-host interactions that successful vaccines or treatment interventions will need to overcome. Our findings further suggest a novel experimental strategy for identifying full-length T/F genomes for proteome-wide analyses of HCV biology and adaptation to antiviral drug or immune pressures.

Relevance:

20.00%

Publisher:

Abstract:

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Relevance:

20.00%

Publisher:

Abstract:

Single-molecule sequencing instruments can generate multikilobase sequences with the potential to greatly improve genome and transcriptome assembly. However, the error rates of single-molecule reads are high, which has limited their use thus far to resequencing bacteria. To address this limitation, we introduce a correction algorithm and assembly strategy that uses short, high-fidelity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on reads generated by a PacBio RS instrument from phage, prokaryotic and eukaryotic whole genomes, including the previously unsequenced genome of the parrot Melopsittacus undulatus, as well as for RNA-Seq reads of the corn (Zea mays) transcriptome. Our long-read correction achieves >99.9% base-call accuracy, leading to substantially better assemblies than current sequencing strategies: in the best example, the median contig size was quintupled relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.

Relevance:

20.00%

Publisher:

Abstract:

The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described.

Relevance:

20.00%

Publisher:

Abstract:

BACKGROUND: Penguins are flightless aquatic birds widely distributed in the Southern Hemisphere. The distinctive morphological and physiological features of penguins allow them to live an aquatic life, and some of them have successfully adapted to the hostile environments in Antarctica. To study the phylogenetic and population history of penguins and the molecular basis of their adaptations to Antarctica, we sequenced the genomes of the two Antarctic dwelling penguin species, the Adélie penguin [Pygoscelis adeliae] and emperor penguin [Aptenodytes forsteri]. RESULTS: Phylogenetic dating suggests that early penguins arose ~60 million years ago, coinciding with a period of global warming. Analysis of effective population sizes reveals that the two penguin species experienced population expansions from ~1 million years ago to ~100 thousand years ago, but responded differently to the climatic cooling of the last glacial period. Comparative genomic analyses with other available avian genomes identified molecular changes in genes related to epidermal structure, phototransduction, lipid metabolism, and forelimb morphology. CONCLUSIONS: Our sequencing and initial analyses of the first two penguin genomes provide insights into the timing of penguin origin, fluctuations in effective population sizes of the two penguin species over the past 10 million years, and the potential associations between these biological patterns and global climate change. The molecular changes compared with other avian genomes reflect both shared and diverse adaptations of the two penguin species to the Antarctic environment.

Relevance:

20.00%

Publisher:

Abstract:

Most biological reactions rely on interplay between binding and changes in both macromolecular structure and dynamics. Practical understanding of this interplay requires detection of critical intermediates and determination of their binding and conformational characteristics. However, many of these species are only transiently present and they have often been overlooked in mechanistic studies of reactions that couple binding to conformational change. We monitored the kinetics of ligand-induced conformational changes in a small protein using six different ligands. We analyzed the kinetic data to simultaneously determine both binding affinities for the conformational states and the rate constants of conformational change. The approach we used is sufficiently robust to determine the affinities of three conformational states and detect even modest differences in the protein's affinities for relatively similar ligands. Ligand binding favors higher-affinity conformational states by increasing forward conformational rate constants and/or decreasing reverse conformational rate constants. The amounts by which forward rate constants increase and reverse rate constants decrease are proportional to the ratio of affinities of the conformational states. We also show that both the affinity ratio and another parameter, which quantifies the changes in conformational rate constants upon ligand binding, are strong determinants of the mechanism (conformational selection and/or induced fit) of molecular recognition. Our results highlight the utility of analyzing the kinetics of conformational changes to determine affinities that cannot be determined from equilibrium experiments. Most importantly, they demonstrate an inextricable link between conformational dynamics and the binding affinities of conformational states.
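The link between affinity ratio and conformational rate constants follows from closing a thermodynamic cycle: if a protein interconverts between states A and B and binds ligand with dissociation constants Kd_A and Kd_B, the conformational equilibrium constant of the bound protein equals that of the free protein multiplied by Kd_A / Kd_B. The sketch below illustrates this for two states. The partition parameter `phi`, which splits the effect between the forward and reverse rate constants, is a hypothetical stand-in for the abstract's second parameter and is not taken from the paper; all names and numbers are assumptions.

```python
def bound_equilibrium_constant(k_forward, k_reverse, kd_a, kd_b):
    """Conformational equilibrium constant [B.L]/[A.L] of the ligand-bound protein.

    Closing the thermodynamic cycle gives K_bound = K_free * (Kd_A / Kd_B),
    where K_free = k_forward / k_reverse for the free protein.
    """
    return (k_forward / k_reverse) * (kd_a / kd_b)

def bound_rate_constants(k_forward, k_reverse, kd_a, kd_b, phi=0.5):
    """Split the affinity ratio between forward and reverse rate constants.

    phi is a hypothetical partition parameter (0 <= phi <= 1), introduced
    here for illustration: phi = 1 puts the whole effect on the forward
    rate, phi = 0 puts it all on the reverse rate.
    """
    ratio = kd_a / kd_b
    k_forward_bound = k_forward * ratio ** phi
    k_reverse_bound = k_reverse * ratio ** (phi - 1)
    return k_forward_bound, k_reverse_bound

if __name__ == "__main__":
    # Invented numbers: state B binds ligand 10-fold more tightly than state A.
    kf, kr = 1.0, 10.0          # free protein, per second
    kd_a, kd_b = 10.0, 1.0      # micromolar
    print(bound_equilibrium_constant(kf, kr, kd_a, kd_b))
    print(bound_rate_constants(kf, kr, kd_a, kd_b, phi=0.5))
```

Whatever value `phi` takes, the ratio of the two bound rate constants reproduces the cycle-closure result, so the equilibrium shift toward the higher-affinity state is fixed by the affinities while the kinetic route to it is not.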

Relevance:

20.00%

Publisher:

Abstract:

Genome-wide association studies (GWASs) have characterized 13 loci associated with melanoma, which only account for a small part of melanoma risk. To identify new genes with too small an effect to be detected individually but which collectively influence melanoma risk and/or show interactive effects, we used a two-step analysis strategy including pathway analysis of genome-wide SNP data, in a first step, and epistasis analysis within significant pathways, in a second step. Pathway analysis, using the gene-set enrichment analysis (GSEA) approach and the gene ontology (GO) database, was applied to the outcomes of MELARISK (3,976 subjects) and MDACC (2,827 subjects) GWASs. Cross-gene SNP-SNP interaction analysis within melanoma-associated GOs was performed using the INTERSNP software. Five GO categories were significantly enriched in genes associated with melanoma (false discovery rate ≤ 5% in both studies): response to light stimulus, regulation of mitotic cell cycle, induction of programmed cell death, cytokine activity and oxidative phosphorylation. Epistasis analysis, within each of the five significant GOs, showed significant evidence for interaction for one SNP pair at TERF1 and AFAP1L2 loci (p_meta-int = 2.0 × 10⁻⁷, which met both the pathway and overall multiple-testing corrected thresholds that are equal to 9.8 × 10⁻⁷ and 2.0 × 10⁻⁷, respectively) and suggestive evidence for another pair involving correlated SNPs at the same loci (p_meta-int = 3.6 × 10⁻⁶). This interaction has important biological relevance given the key role of TERF1 in telomere biology and the reported physical interaction between TERF1 and AFAP1L2 proteins. This finding brings a novel piece of evidence for the emerging role of telomere dysfunction in melanoma development.