966 results for de novo genome assembly
Abstract:
The complete nucleotide sequence of genome segment S4 of rice ragged stunt oryzavirus (RRSV, Thai isolate) was determined. The 3823 bp sequence contains two large open reading frames (ORFs). ORF1, spanning nucleotides 12 to 3776, is capable of encoding a protein of M(r) 141,380 (P4a). The P4a amino acid sequence predicted from the nucleotide sequence contains sequence motifs conserved in RNA-dependent RNA polymerases (RDRPs). When compared for evolutionary relationships with RDRPs of other reoviruses using the amino acid sequences around the conserved GDD motif, P4a was shown to be more closely related to Nilaparvata lugens reovirus and reovirus serotype 3 than to rice dwarf phytoreovirus, bovine rotavirus or bluetongue virus. ORF2, spanning nucleotides 491 to 1468, is out of frame with ORF1 and is capable of encoding a protein of M(r) 36,920 (P4b). Coupled in vitro transcription-translation from cloned ORF2 in wheat germ extract confirmed the existence of ORF2, but the in vivo production and possible function of P4b are yet to be determined.
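The frame relationship stated in this abstract can be checked with simple coordinate arithmetic. The sketch below is illustrative only: it assumes the quoted coordinates are 1-based, inclusive, and include the stop codon, and the helper name `orf_summary` is hypothetical.

```python
def orf_summary(start, end):
    """Summarise an ORF given 1-based inclusive coordinates.

    Returns (length_nt, codons, frame), where frame is 0, 1 or 2
    relative to the first nucleotide of the segment.
    Assumption: the coordinates include the stop codon.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    frame = (start - 1) % 3
    return length, length // 3, frame


# Coordinates as quoted in the abstract for RRSV segment S4.
orf1 = orf_summary(12, 3776)
orf2 = orf_summary(491, 1468)

print("ORF1:", orf1)
print("ORF2:", orf2)
print("Same frame?", orf1[2] == orf2[2])
```

Under these assumptions both ORFs span a whole number of codons and fall in different frames, which is consistent with the statement that ORF2 is out of frame with ORF1.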
Abstract:
The nucleotide sequences of genome segments S7 and S10 of a Thai isolate of rice ragged stunt virus (RRSV) were determined. The 1 938 bp S7 sequence contains a single large open reading frame (ORF) spanning nucleotides 20 to 1 843 that is predicted to encode a protein of M(r) 68 025. The 1 162 bp S10 sequence has a major ORF spanning nucleotides 142 to 1 032 that is predicted to encode a protein of M(r) 32 364. This S10 ORF is preceded by a small ORF (nt 20-55) which is probably a minicistron. Coupled in vitro transcription-translation from the two major ORFs gave protein products of the expected sizes. However, no protein was visualised from S10 when the small ORF sequence was included. Proteins were expressed in Escherichia coli from the full-length ORF of S7 (P7) and from a segment of the S10 ORF (P10) fused to the ORF of glutathione S-transferase (GST). Neither fusion protein was recognised by polyclonal antibodies raised against RRSV particles. Furthermore, polyclonal antibodies raised against the GST-P7 fusion protein did not recognise any virion structural polypeptides. These data strongly suggest that the proteins P7 and P10 do not form part of the RRSV particle. This is further supported by observed sequence homology (though very weak) of predicted.
Abstract:
The nucleotide sequence of DNA complementary to rice ragged stunt oryzavirus (RRSV) genome segment 8 (S8) of an isolate from Thailand was determined. RRSV S8 is 1 914 bp in size and contains a single large open reading frame (ORF) spanning nucleotides 23 to 1 810 which is capable of encoding a protein of M(r) 67 348. The N-terminal amino acid sequence of a ~43K virion polypeptide matched that inferred for an internal region of the S8 coding sequence. These data suggest that the 43K protein is encoded by S8 and is derived by proteolytic cleavage. Predicted polypeptide sizes from this possible cleavage of S8 protein are 26K and 42K. Polyclonal antibodies raised against a maltose binding protein (MBP)-S8 fusion polypeptide (expressed in Escherichia coli) recognised four RRSV particle-associated polypeptides of M(r) 67K, 46K, 43K and 26K, and all except the 26K polypeptide were also highly immunoreactive to polyclonal antibodies raised against purified RRSV particles. Cleavage of the MBP-S8 fusion polypeptide with protease Factor X produced the expected 40K MBP and two polypeptides of apparent M(r) 46K and 26K. Antibodies to purified RRSV particles reacted strongly with the intact fusion protein and the 46K cleavage product but weakly with the 26K product. Furthermore, in vitro transcription and translation of the S8 coding region revealed a post-translational self-cleavage of the 67K polypeptide to 46K and 26K products. These data indicate that S8 encodes a structural polypeptide, the majority of which is autocatalytically cleaved to 26K and 46K proteins. The data also suggest that the 26K protein is the self-cleaving protease and that the 46K product is further processed or undergoes stable conformational changes to a ~43K major capsid protein.
Abstract:
The complete nucleotide sequence of genome segment 5 (S5) of a Thai isolate of rice ragged stunt virus (RRSV) was determined. The 2682 nucleotide sequence contains a single long open reading frame capable of encoding a polypeptide with a molecular mass of ~91 kDa. Polypeptides encoded by various truncated cDNAs of S5 were expressed using the pGEX fusion protein vector, and the highest level of fusion protein was obtained from a construct encoding a hydrophilic region of S5 protein. Antibodies raised against this fusion protein recognized a minor polypeptide, with a molecular mass of ~91 kDa, that was present in purified preparations of RRSV particles, infected insect vectors and infected rice plants. This indicates that RRSV S5 encodes a minor structural protein. Comparing the RRSV S5 sequence with sequences of other reoviruses did not reveal any significant sequence similarities.
Abstract:
The genomic sequence of an Australian isolate of carrot mottle umbravirus (CMoV-A) was determined from cDNA generated from dsRNA. This provides the first data on the genome organization and phylogeny of an umbravirus. The 4201-nucleotide genome contains four major open reading frames (ORFs). Analysis suggests that ORF2 encodes an RNA-dependent RNA polymerase, that ORF4 encodes a movement protein, and that the virus has no coat protein gene. The functions of ORFs 1 and 3 remain unknown. ORF2 is probably translated following ribosomal frameshifting. ORFs 3 and 4 are probably translated from a subgenomic mRNA. Sequence comparisons showed CMoV-A to be closely related to pea enation mosaic RNA2 (PEMV RNA2), but also to have affinities with the Bromoviridae. These findings shed light on the relationships between the luteoviruses, PEMV, and the umbraviruses and on the relationships between the carmo-like viruses and the Bromoviridae.
Abstract:
Complementary DNAs covering the entire RNA genome of soybean dwarf luteovirus (SDV) were cloned and sequenced. Computer analysis of the 5861 nucleotide sequence revealed five major open reading frames (ORFs) possessing conservation of sequence and organisation with known luteovirus sequences. Comparative analyses of the genome structure show that SDV shares sequence homology and features of gene organisation with barley yellow dwarf virus (PAV isolate) in the 5' half of the genome, yet is more closely related to potato leafroll virus in its 3' coding regions. In addition, SDV differs from other known luteoviruses in possessing an exceptionally long 3' terminal sequence with no apparent coding capacity. We conclude from these data that the SDV genome represents a third variant genome type in the luteovirus group.
Abstract:
Recent studies suggest that genetic and environmental factors do not account for all schizophrenia risk and that epigenetics also plays a role in disease susceptibility. DNA methylation is a heritable epigenetic modification that can regulate gene expression. Genome-wide DNA methylation analysis was performed on post-mortem human brain tissue from 24 patients with schizophrenia and 24 unaffected controls. DNA methylation was assessed at over 485 000 CpG sites using the Illumina Infinium HumanMethylation450 BeadChip. After adjusting for age and post-mortem interval (PMI), 4 641 probes corresponding to 2 929 unique genes were found to be differentially methylated. Of those genes, 1 291 were located in a CpG island and 817 were in a promoter region. These include NOS1, AKT1, DTNBP1, DNMT1, PPP3CC and SOX10, which have previously been associated with schizophrenia. More than 100 of these genes overlap with a previous DNA methylation study of peripheral blood from schizophrenia patients in which 27 000 CpG sites were analysed. Unsupervised clustering analysis of the top 3 000 most variable probes revealed two distinct groups, with significantly more people with schizophrenia in cluster one compared to controls (p = 1.74 × 10⁻⁴). The first cluster was composed of 88% patients with schizophrenia and only 12% controls, while the second cluster was composed of 27% patients with schizophrenia and 73% controls. These results strongly suggest that differential DNA methylation is important in schizophrenia etiology and add support for the use of DNA methylation profiles as a future prognostic indicator of schizophrenia.
Abstract:
Genomic instability underlies the transformation of host cells toward malignancy, promotes development of invasion and metastasis and shapes the response of established cancer to treatment. In this review, we discuss recent advances in our understanding of genomic stability in squamous cell carcinoma of the head and neck (HNSCC), with an emphasis on DNA repair pathways. HNSCC is characterized by distinct profiles in genome stability between similarly staged cancers that are reflected in risk, treatment response and outcomes. Defective DNA repair generates chromosomal derangement that can cause subsequent alterations in gene expression, and is a hallmark of progression toward carcinoma. Variable functionality of an increasing spectrum of repair gene polymorphisms is associated with increased cancer risk, while aetiological factors such as human papillomavirus, tobacco and alcohol induce significantly different behaviour in induced malignancy, underpinned by differences in genomic stability. Targeted inhibition of signalling receptors has proven to be a clinically-validated therapy, and protein expression of other DNA repair and signalling molecules associated with cancer behaviour could potentially provide a more refined clinical model for prognosis and treatment prediction. Development and expansion of current genomic stability models is furthering our understanding of HNSCC pathophysiology and uncovering new, promising treatment strategies. © 2013 Glenn Jenkins et al.
Abstract:
To characterize aphid mitochondrial genome (mitogenome) features, we sequenced the complete mitogenome of the Russian wheat aphid, Diuraphis noxia. The 15,784-bp mitogenome, with a high A + T content (84.76%) and strong C skew (−0.26), was arranged in the same gene order as that of the ancestral insect. Unlike typical insect mitogenomes, D. noxia possessed a large tandem repeat region (644 bp) located between trnE and trnF. Sequencing of the partial mitogenome of the cotton aphid (Aphis gossypii) further confirmed the presence of the large repeat region in aphids, but with different repeat length and copy number. Another motif (58 bp) was tandemly repeated 2.3 times in the control region of D. noxia. All repeat units in D. noxia could be folded into stem-loop secondary structures, which could further promote an increase in copy numbers. Characterization of the D. noxia mitogenome revealed distinct mitogenome architectures, thus advancing our understanding of insect mitogenomic diversity and evolution.
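As a rough illustration of the composition statistics quoted in this abstract, the sketch below computes A + T content and GC skew for a sequence. It assumes the conventional definition GC skew = (G − C)/(G + C), under which a negative value indicates an excess of C; the abstract does not state which formula was used. The helper name `composition` and the toy sequence are hypothetical and are not taken from the D. noxia mitogenome.

```python
def composition(seq):
    """Return (at_percent, gc_skew) for a DNA sequence.

    Assumed definitions (not confirmed by the abstract):
      at_percent = 100 * (A + T) / (A + C + G + T)
      gc_skew    = (G - C) / (G + C)
    Characters other than A, C, G and T are ignored.
    """
    seq = seq.upper()
    a, c, g, t = (seq.count(base) for base in "ACGT")
    total = a + c + g + t
    if total == 0 or g + c == 0:
        raise ValueError("sequence needs at least one G or C to define GC skew")
    at_percent = 100.0 * (a + t) / total
    gc_skew = (g - c) / (g + c)
    return at_percent, gc_skew


# Toy sequence for illustration only.
at_percent, gc_skew = composition("AATTAATTCCCG")
print(f"A+T content: {at_percent:.2f}%  GC skew: {gc_skew:+.2f}")
```

Applied to a full mitogenome sequence, the same function would return figures directly comparable to the 84.76% and −0.26 reported above, provided the authors used these definitions.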
Abstract:
Sorghum is a food and feed cereal crop adapted to heat and drought and a staple for 500 million of the world’s poorest people. Its small diploid genome and phenotypic diversity make it an ideal C4 grass model as a complement to C3 rice. Here we present high-coverage (16–45×) resequenced genomes of 44 sorghum lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group. We also report the first resequenced genome of S. propinquum, identifying 8 M high-quality SNPs, 1.9 M indels and specific gene loss and gain events in S. bicolor. We observe strong racial structure and a complex domestication history involving at least two distinct domestication events. These assembled genomes enable the leveraging of existing cereal functional genomics data against the novel diversity available in sorghum, providing an unmatched resource for the genetic improvement of sorghum and other grass species.
Abstract:
Bats account for one-fifth of mammalian species, are the only mammals with powered flight, and are among the few animals that echolocate. The insect-eating Brandt’s bat (Myotis brandtii) is the longest-lived bat species known to date (lifespan exceeds 40 years) and, at 4–8 g adult body weight, is the most extreme mammal with regard to disparity between body mass and longevity. Here we report sequencing and analysis of the Brandt’s bat genome and transcriptome, which suggest adaptations consistent with echolocation and hibernation, as well as altered metabolism, reproduction and visual function. Unique sequence changes in growth hormone and insulin-like growth factor 1 receptors are also observed. The data suggest that an altered growth hormone/insulin-like growth factor 1 axis, which may be common to other long-lived bat species, together with adaptations such as hibernation and low reproductive rate, contribute to the exceptional lifespan of the Brandt’s bat.
Abstract:
This study investigated the population genetics, demographic history and pathway of invasion of the Russian wheat aphid (RWA) from its native range in Central Asia, the Middle East and Europe to South Africa and the Americas. We screened microsatellite markers, mitochondrial DNA and endosymbiont genes in 504 RWA clones from nineteen populations worldwide. Following pathway analyses of microsatellite and endosymbiont data, we postulate that Turkey and Syria were the most likely sources of invasion to Kenya and South Africa, respectively. Furthermore, we found that one clone transferred between South Africa and the Americas was most likely responsible for the New World invasion. Finally, endosymbiont DNA was found to be a high resolution population genetic marker, extremely useful for studies of invasion over a relatively short evolutionary history time frame. This study has provided valuable insights into the factors that may have facilitated the recent global invasion by this damaging pest.
Abstract:
The expression of transgenes in plant genomes can be inhibited by either transcriptional gene silencing or posttranscriptional gene silencing (PTGS). Overexpression of the chalcone synthase-A (CHS-A) transgene triggers PTGS of CHS-A and thus results in loss of flower pigmentation in petunia. We previously demonstrated that epigenetic inactivation of CHS-A transgene transcription leads to a reversion of the PTGS phenotype. Although neomycin phosphotransferase II (nptII), a marker gene co-introduced into the genome with the CHS-A transgene, is not normally silenced in petunia, even when CHS-A is silenced, here we found that nptII was silenced in a petunia line in which CHS-A PTGS was induced, but not in the revertant plants that had no PTGS of CHS-A. Transcriptional activity, accumulation of short interfering RNAs, and restoration of mRNA level after infection with viruses that had suppressor proteins of gene silencing indicated that the mechanism for nptII silencing was posttranscriptional. Read-through transcripts of the CHS-A gene toward the nptII gene were detected. Deep-sequencing analysis revealed a striking difference between the predominant size class of small RNAs produced from the read-through transcripts (22 nt) and that from the CHS-A RNAs (21 nt). These results implicate the involvement of read-through transcription and distinct phases of RNA degradation in the coincident PTGS of linked transgenes and provide new insights into the destabilization of transgene expression.
Abstract:
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus × domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP per 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.
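The density and polymorphism figures in this abstract can be related to one another with a short calculation. The sketch below uses only the numbers quoted above; the variable names are illustrative, and the "implied sequence length" is simply SNP count multiplied by spacing, not a genome size reported by the authors.

```python
# Figures quoted in the abstract.
snps_detected = 2_113_120   # SNPs found by re-sequencing 27 cultivars
bp_per_snp = 288            # reported average spacing
array_snps = 7867           # SNPs on the IRSC apple 8K array v1
polymorphic_snps = 5554     # array SNPs found to be polymorphic

# Sequence length implied by the count and the spacing (derived, not reported).
implied_bp = snps_detected * bp_per_snp

# Share of array SNPs that were polymorphic in the evaluation material.
polymorphic_pct = 100.0 * polymorphic_snps / array_snps

print(f"Implied sequence length: {implied_bp / 1e6:.1f} Mb")
print(f"Polymorphic array SNPs: {polymorphic_pct:.1f}%")
```

On these figures, roughly seven in ten SNPs placed on the array proved polymorphic in the evaluated families and germplasm.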