116 resultados para Transcriptomes
Resumo:
Background: Tef (Eragrostis tef), an indigenous cereal critical to food security in the Horn of Africa, is rich in minerals and protein, resistant to many biotic and abiotic stresses and safe for diabetics as well as sufferers of immune reactions to wheat gluten. We present the genome of tef, the first species in the grass subfamily Chloridoideae and the first allotetraploid assembled de novo. We sequenced the tef genome for marker-assisted breeding, to shed light on the molecular mechanisms conferring tef's desirable nutritional and agronomic properties, and to make its genome publicly available as a community resource. Results: The draft genome contains 672 Mbp representing 87% of the genome size estimated from flow cytometry. We also sequenced two transcriptomes, one from a normalized RNA library and another from unnormalized RNASeq data. The normalized RNA library revealed around 38000 transcripts that were then annotated by the SwissProt group. The CoGe comparative genomics platform was used to compare the tef genome to other genomes, notably sorghum. Scaffolds comprising approximately half of the genome size were ordered by syntenic alignment to sorghum producing tef pseudo-chromosomes, which were sorted into A and B genomes as well as compared to the genetic map of tef. The draft genome was used to identify novel SSR markers, investigate target genes for abiotic stress resistance studies, and understand the evolution of the prolamin family of proteins that are responsible for the immune response to gluten. Conclusions: It is highly plausible that breeding targets previously identified in other cereal crops will also be valuable breeding targets in tef. The draft genome and transcriptome will be of great use for identifying these targets for genetic improvement of this orphan crop that is vital for feeding 50 million people in the Horn of Africa.
Resumo:
Cichlid fishes are famous for large, diverse and replicated adaptive radiations in the Great Lakes of East Africa. To understand the molecular mechanisms underlying cichlid phenotypic diversity, we sequenced the genomes and transcriptomes of five lineages of African cichlids: the Nile tilapia (Oreochromis niloticus), an ancestral lineage with low diversity; and four members of the East African lineage: Neolamprologus brichardi/pulcher (older radiation, Lake Tanganyika), Metriaclima zebra (recent radiation, Lake Malawi), Pundamilia nyererei (very recent radiation, Lake Victoria), and Astatotilapia burtoni (riverine species around Lake Tanganyika). We found an excess of gene duplications in the East African lineage compared to tilapia and other teleosts, an abundance of non-coding element divergence, accelerated coding sequence evolution, expression divergence associated with transposable element insertions, and regulation by novel microRNAs. In addition, we analysed sequence data from sixty individuals representing six closely related species from Lake Victoria, and show genome-wide diversifying selection on coding and regulatory variants, some of which were recruited from ancient polymorphisms. We conclude that a number of molecular mechanisms shaped East African cichlid genomes, and that amassing of standing variation during periods of relaxed purifying selection may have been important in facilitating subsequent evolutionary diversification.
Resumo:
Around 14 distinct virus species-complexes have been detected in honeybees, each with one or more strains or sub-species. Here we present the initial characterization of an entirely new virus species-complex discovered in honeybee (Apis mellifera L.) and varroa mite (Varroa destructor) samples from Europe and the USA. The virus has a naturally poly-adenylated RNA genome of about 6500 nucleotides with a genome organization and sequence similar to the Tymoviridae (Tymovirales; Tymoviridae), a predominantly plant-infecting virus family. Literature and laboratory analyses indicated that the virus had not previously been described. The virus is very common in French apiaries, mirroring the results from an extensive Belgian survey, but could not be detected in equally-extensive Swedish and Norwegian bee disease surveys. The virus appears to be closely linked to varroa, with the highest prevalence found in varroa samples and a clear seasonal distribution peaking in autumn, coinciding with the natural varroa population development. Sub-genomic RNA analyses show that bees are definite hosts, while varroa is a possible host and likely vector. The tentative name of Bee Macula-like virus (BeeMLV) is therefore proposed. A second, distantly related Tymoviridae-like virus was also discovered in varroa transcriptomes, tentatively named Varroa Tymo-like virus (VTLV).
Resumo:
Root-knot nematodes (RKNs) induce giant cells (GCs) from root vascular cells inside the galls. Accompanying molecular changes as a function of infection time and across different species, and their functional impact, are still poorly understood. Thus, the transcriptomes of tomato galls and laser capture microdissected (LCM) GCs over the course of parasitism were compared with those of Arabidopsis, and functional analysis of a repressed gene was performed. Microarray hybridization with RNA from galls and LCM GCs, infection-reproduction tests and quantitative reverse transcription-polymerase chain reaction (qRT-PCR) transcriptional profiles in susceptible and resistant (Mi-1) lines were performed in tomato. Tomato GC-induced genes include some possibly contributing to the epigenetic control of GC identity. GC-repressed genes are conserved between tomato and Arabidopsis, notably those involved in lignin deposition. However, genes related to the regulation of gene expression diverge, suggesting that diverse transcriptional regulators mediate common responses leading to GC formation in different plant species. TPX1, a cell wall peroxidase specifically involved in lignification, was strongly repressed in GCs/galls, but induced in a nearly isogenic Mi-1 resistant line on nematode infection. TPX1 overexpression in susceptible plants hindered nematode reproduction and GC expansion. Time-course and cross-species comparisons of gall and GC transcriptomes provide novel insights pointing to the relevance of gene repression during RKN establishment.
Resumo:
The European chestnut (Castanea sativa Mill.) is a multipurpose species that has been widely cultivated around the Mediterranean basin since ancient times. New varieties were brought to the Iberian Peninsula during the Roman Empire, which coexist since then with native populations that survived the last glaciation. The relevance of chestnut cultivation has being steadily growing since the Middle Ages, until the rural decline of the past century put a stop to this trend. Forest fires and diseases were also major factors. Chestnut cultivation is gaining momentum again due to its economic (wood, fruits) and ecologic relevance, and represents currently an important asset in many rural areas of Europe. In this Thesis we apply different molecular tools to help improve current management strategies. For this study we have chosen El Bierzo (Castile and Leon, NW Spain), which has a centenary tradition of chestnut cultivation and management, and also presents several unique features from a genetic perspective (next paragraph). Moreover, its nuts are widely appreciated in Spain and abroad for their organoleptic properties. We have focused our experimental work on two major problems faced by breeders and the industry: the lack of a fine-grained genetic characterization and the need for new strategies to control blight disease. To characterize with sufficient detail the genetic diversity and structure of El Bierzo orchards, we analyzed DNA from 169 trees grafted for nut production covering the entire region. We also analyzed 62 nuts from all traditional varieties. El Bierzo constitutes an outstanding scenario to study chestnut genetics and the influence of human management because: (i) it is located at one extreme of the distribution area; (ii) it is a major glacial refuge for the native species; (iii) it has a long tradition of human management (since Roman times, at least); and (iv) its geographical setting ensures an unusual degree of genetic isolation. Thirteen microsatellite markers provided enough informativeness and discrimination power to genotype at the individual level. Together with an unexpected level of genetic variability, we found evidence of genetic structure, with three major gene pools giving rise to the current population. High levels of genetic differentiation between groups supported this organization. Interestingly, genetic structure does not match with spatial boundaries, suggesting that the exchange of material and cultivation practices have strongly influenced natural gene flow. The microsatellite markers selected for this study were also used to classify a set of 62 samples belonging to all traditional varieties. We identified several cases of synonymies and homonymies, evidencing the need to substitute traditional classification systems with new tools for genetic profiling. Management and conservation strategies should also benefit from these tools. The avenue of high-throughput sequencing technologies, combined with the development of bioinformatics tools, have paved the way to study transcriptomes without the need for a reference genome. We took advantage of RNA sequencing and de novo assembly tools to determine the transcriptional landscape of chestnut in response to blight disease. In addition, we have selected a set of candidate genes with high potential for developing resistant varieties via genetic engineering. Our results evidenced a deep transcriptional reprogramming upon fungal infection. The plant hormones ET and JA appear to orchestrate the defensive response. Interestingly, our results also suggest a role for auxins in modulating such response. Many transcription factors were identified in this work that interact with promoters of genes involved in disease resistance. Among these genes, we have conducted a functional characterization of a two major thaumatin-like proteins (TLP) that belongs to the PR5 family. Two genes encoding chestnut cotyledon TLPs have been previously characterized, termed CsTL1 and CsTL2. We substantiate here their protective role against blight disease for the first time, including in silico, in vitro and in vivo evidence. The synergy between TLPs and other antifungal proteins, particularly endo-p-1,3-glucanases, bolsters their interest for future control strategies based on biotechnological approaches.
Resumo:
Gibberellins (GAs) are plant hormones that affect plant growth and regulate gene expression differentially across tissues. To study the molecular mechanisms underlying GA signaling in Arabidopsis thaliana, we focused on a GDSL lipase gene (LIP1) induced by GA and repressed by DELLA proteins. LIP1 contains an L1 box promoter sequence, conserved in the promoters of epidermis-specific genes, that is bound by ATML1, an HD-ZIP transcription factor required for epidermis specification. In this study, we demonstrate that LIP1 is specifically expressed in the epidermis and that its L1 box sequence mediates GA-induced transcription. We show that this sequence is overrepresented in the upstream regulatory regions of GA-induced and DELLA-repressed transcriptomes and that blocking GA signaling in the epidermis represses the expression of L1 box–containing genes and negatively affects seed germination. We show that DELLA proteins interact directly with ATML1 and its paralogue PDF2 and that silencing of both HD-ZIP transcription factors inhibits epidermal gene expression and delays germination. Our results indicate that, upon seed imbibition, increased GA levels reduce DELLA protein abundance and release ATML1/PDF2 to activate L1 box gene expression, thus enhancing germination potential.
Resumo:
Aldosterone and vasopressin are responsible for the final adjustment of sodium and water reabsorption in the kidney. In principal cells of the kidney cortical collecting duct (CCD), the integral response to aldosterone and the long-term functional effects of vasopressin depend on transcription. In this study, we analyzed the transcriptome of a highly differentiated mouse clonal CCD principal cell line (mpkCCDcl4) and the changes in the transcriptome induced by aldosterone and vasopressin. Serial analysis of gene expression (SAGE) was performed on untreated cells and on cells treated with either aldosterone or vasopressin for 4 h. The transcriptomes in these three experimental conditions were determined by sequencing 169,721 transcript tags from the corresponding SAGE libraries. Limiting the analysis to tags that occurred twice or more in the data set, 14,654 different transcripts were identified, 3,642 of which do not match known mouse sequences. Statistical comparison (at P < 0.05 level) of the three SAGE libraries revealed 34 AITs (aldosterone-induced transcripts), 29 ARTs (aldosterone-repressed transcripts), 48 VITs (vasopressin-induced transcripts) and 11 VRTs (vasopressin-repressed transcripts). A selection of the differentially-expressed, hormone-specific transcripts (5 VITs, 2 AITs and 1 ART) has been validated in the mpkCCDcl4 cell line either by Northern blot hybridization or reverse transcription–PCR. The hepatocyte nuclear transcription factor HNF-3-α (VIT39), the receptor activity modifying protein RAMP3 (VIT48), and the glucocorticoid-induced leucine zipper protein (GILZ) (AIT28) are candidate proteins playing a role in physiological responses of this cell line to vasopressin and aldosterone.
Resumo:
We used 2D protein gel electrophoresis and DNA microarray technologies to systematically analyze genes under glucose repression in Bacillus subtilis. In particular, we focused on genes expressed after the shift from glycolytic to gluconeogenic at the middle logarithmic phase of growth in a nutrient sporulation medium, which remained repressed by the addition of glucose. We also examined whether or not glucose repression of these genes was mediated by CcpA, the catabolite control protein of this bacterium. The wild-type and ccpA1 cells were grown with and without glucose, and their proteomes and transcriptomes were compared. 2D gel electrophoresis allowed us to identify 11 proteins, the synthesis of which was under glucose repression. Of these proteins, the synthesis of four (IolA, I, S and PckA) was under CcpA-independent control. Microarray analysis enabled us to detect 66 glucose-repressive genes, 22 of which (glmS, acoA, C, yisS, speD, gapB, pckA, yvdR, yxeF, iolA, B, C, D, E, F, G, H, I, J, R, S and yxbF ) were at least partially under CcpA-independent control. Furthermore, we found that CcpA and IolR, a repressor of the iol divergon, were involved in the glucose repression of the synthesis of inositol dehydrogenase encoded by iolG included in the above list. The CcpA-independent glucose repression of the iol genes appeared to be explained by inducer exclusion.
Resumo:
Background: Chitosan oligosaccharide (COS), a deacetylated derivative of chitin, is an abundant, and renewable natural polymer. COS has higher antimicrobial properties than chitosan and is presumed to act by disrupting/permeabilizing the cell membranes of bacteria, yeast and fungi. COS is relatively non-toxic to mammals. By identifying the molecular and genetic targets of COS, we hope to gain a better understanding of the antifungal mode of action of COS. Results: Three different chemogenomic fitness assays, haploinsufficiency (HIP), homozygous deletion (HOP), and multicopy suppression (MSP) profiling were combined with a transcriptomic analysis to gain insight in to the mode of action and mechanisms of resistance to chitosan oligosaccharides. The fitness assays identified 39 yeast deletion strains sensitive to COS and 21 suppressors of COS sensitivity. The genes identified are involved in processes such as RNA biology (transcription, translation and regulatory mechanisms), membrane functions (e.g. signalling, transport and targeting), membrane structural components, cell division, and proteasome processes. The transcriptomes of control wild type and 5 suppressor strains overexpressing ARL1, BCK2, ERG24, MSG5, or RBA50, were analyzed in the presence and absence of COS. Some of the up-regulated transcripts in the suppressor overexpressing strains exposed to COS included genes involved in transcription, cell cycle, stress response and the Ras signal transduction pathway. Down-regulated transcripts included those encoding protein folding components and respiratory chain proteins. The COS-induced transcriptional response is distinct from previously described environmental stress responses (i.e. thermal, salt, osmotic and oxidative stress) and pre-treatment with these well characterized environmental stressors provided little or any resistance to COS. Conclusions: Overexpression of the ARL1 gene, a member of the Ras superfamily that regulates membrane trafficking, provides protection against COS-induced cell membrane permeability and damage. We found that the ARL1 COS-resistant over-expression strain was as sensitive to Amphotericin B, Fluconazole and Terbinafine as the wild type cells and that when COS and Fluconazole are used in combination they act in a synergistic fashion. The gene targets of COS identified in this study indicate that COS’s mechanism of action is different from other commonly studied fungicides that target membranes, suggesting that COS may be an effective fungicide for drug-resistant fungal pathogens.
Resumo:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Resumo:
Despite the identification of SRY as the testis-determining gene in mammals, the genetic interactions controlling the earliest steps of male sex determination remain poorly understood. In particular, the molecular lesions underlying a high proportion of human XY gonadal dysgenesis, XX maleness and XX true hermaphroditism remain undiscovered. A number of screens have identified candidate genes whose expression is modulated during testis or ovary differentiation in mice, but these screens have used whole gonads, consisting of multiple cell types, or stages of gonadal development well beyond the time of sex determination. We describe here a novel reporter mouse line that expresses enhanced green fluorescent protein under the control of an Sf1 promoter fragment, marking Sertoli and granulosa cell precursors during the critical period of sex determination. These cells were purified from gonads of male and female transgenic embryos at 10.5 dpc (shortly after Sry transcription is activated) and 11.5 dpc (when Sox9 transcription begins), and their transcriptomes analysed using Affymetrix genome arrays. We identified 266 genes, including Dhh, Fgf9 and Ptgds, that were upregulated and 50 genes that were downregulated in 11.5 dpc male somatic gonad cells only, and 242 genes, including Fst, that were upregulated in 11.5 dpc female somatic gonad cells only. The majority of these genes are novel genes that lack identifiable homology, and several human orthologues were found to map to chromosomal loci implicated in disorders of sexual development. These genes represent an important resource with which to piece together the earliest steps of sex determination and gonad development, and provide new candidates for mutation searching in human sexual dysgenesis syndromes.
Resumo:
Using the two largest collections of Mus musculus and Homo sapiens transcription start sites ( TSSs) determined based on CAGE tags, ditags, full- length cDNAs, and other transcript data, we describe the compositional landscape surrounding TSSs with the aim of gaining better insight into the properties of mammalian promoters. We classified TSSs into four types based on compositional properties of regions immediately surrounding them. These properties highlighted distinctive features in the extended core promoters that helped us delineate boundaries of the transcription initiation domain space for both species. The TSS types were analyzed for associations with initiating dinucleotides, CpG islands, TATA boxes, and an extensive collection of statistically significant cis- elements in mouse and human. We found that different TSS types show preferences for different sets of initiating dinucleotides and ciselements. Through Gene Ontology and eVOC categories and tissue expression libraries we linked TSS characteristics to expression. Moreover, we show a link of TSS characteristics to very specific genomic organization in an example of immune- response- related genes ( GO: 0006955). Our results shed light on the global properties of the two transcriptomes not revealed before and therefore provide the framework for better understanding of the transcriptional mechanisms in the two species, as well as a framework for development of new and more efficient promoter- and gene- finding tools.
Resumo:
T he international FANTOM consortium aims to produce a comprehensive picture of the mammalian transcriptome, based upon an extensive cDNA collection and functional annotation of full-length enriched cDNAs. The previous dataset, FANTOM(2), comprised 60,770 full- length enriched cDNAs. Functional annotation revealed that this cDNA dataset contained only about half of the estimated number of mouse protein- coding genes, indicating that a number of cDNAs still remained to be collected and identified. To pursue the complete gene catalog that covers all predicted mouse genes, cloning and sequencing of full- length enriched cDNAs has been continued since FANTOM2. In FANTOM3, 42,031 newly isolated cDNAs were subjected to functional annotation, and the annotation of 4,347 FANTOM2 cDNAs was updated. To accomplish accurate functional annotation, we improved our automated annotation pipeline by introducing new coding sequence prediction programs and developed a Web- based annotation interface for simplifying the annotation procedures to reduce manual annotation errors. Automated coding sequence and function prediction was followed with manual curation and review by expert curators. A total of 102,801 full- length enriched mouse cDNAs were annotated. Out of 102,801 transcripts, 56,722 were functionally annotated as protein coding ( including partial or truncated transcripts), providing to our knowledge the greatest current coverage of the mouse proteome by full- length cDNAs. The total number of distinct non- protein- coding transcripts increased to 34,030. The FANTOM3 annotation system, consisting of automated computational prediction, manual curation, and. nal expert curation, facilitated the comprehensive characterization of the mouse transcriptome, and could be applied to the transcriptomes of other species.
Resumo:
Of the ~1.7 million SINE elements in the human genome, only a tiny number are estimated to be active in transcription by RNA polymerase (Pol) III. Tracing the individual loci from which SINE transcripts originate is complicated by their highly repetitive nature. By exploiting RNA-Seq datasets and unique SINE DNA sequences, we devised a bioinformatic pipeline allowing us to identify Pol III-dependent transcripts of individual SINE elements. When applied to ENCODE transcriptomes of seven human cell lines, this search strategy identified ~1300 Alu loci and ~1100 MIR loci corresponding to detectable transcripts, with ~120 and ~60 respectively Alu and MIR loci expressed in at least three cell lines. In vitro transcription of selected SINEs did not reflect their in vivo expression properties, and required the native 5’-flanking region in addition to internal promoter. We also identified a cluster of expressed AluYa5-derived transcription units, juxtaposed to snaR genes on chromosome 19, formed by a promoter-containing left monomer fused to an Alu-unrelated downstream moiety. Autonomous Pol III transcription was also revealed for SINEs nested within Pol II-transcribed genes raising the possibility of an underlying mechanism for Pol II gene regulation by SINE transcriptional units. Moreover the application of our bioinformatic pipeline to both RNA-seq data of cells subjected to an in vitro pro-oncogenic stimulus and of in vivo matched tumor and non-tumor samples allowed us to detect increased Alu RNA expression as well as the source loci of such deregulation. The ability to investigate SINE transcriptomes at single-locus resolution will facilitate both the identification of novel biologically relevant SINE RNAs and the assessment of SINE expression alteration under pathological conditions.
Resumo:
Olfactory sensory neurons (OSNs), which detect a myriad of odorants, are known to express one allele of one olfactory receptor (OR) gene (Olfr) from the largest gene family in the mammalian genome. The OSNs expressing the same OR project their axons to the main olfactory bulb where they converge to form glomeruli. This “One neuron-one receptor rule” makes the olfactory epithelium (OE), which consists of a vast number of OSNs expressing unique ORs, one of the most heterogeneous cell populations. However, the mechanism of how the single OR allele is chosen remains unclear along with the question of whether one OSN only expresses a single OR gene, a hypothesis that has not been rigorously verified while we performed the experiments. Moreover, failure of axonal targeting to single glomerulus was observed in MeCP2 deficient OSNs where delayed development was proposed as an explanation for the phenotype. How Mecp2 mutation caused this aberrant targeting is not entirely understood.
In this dissertation, we explored the transcriptomes of single and mature OSNs by single-cell RNA-Seq to reveal their heterogeneity and further studied the OR gene expression from these isolated OSNs. The singularity of sequenced OSNs was ensured by the observation of monoallelic expression of X-linked genes from the hybrid samples from crosses between mice of different strains where strain-specific polymorphisms could be used to track the allelic origins of SNP-containing reads. The clustering of expression profiles from triplicates that originated from the same cell assured that the transcriptomic identities of OSNs were maintained through the experimental process. The average gene expression profiles of sequenced OSNs correlated well to the conventional transcriptome data of FACS-sorted Omp-positive cells, and the top-ranked expression of OR was conceded in the single-OSN transcriptomes. While exploring cellular diversity, in addition to OR genes, we revealed nearly 200 differentially expressed genes among the sequenced OSNs in this study. Among the 36 sequenced OSNs, eight cells (22.2%) showed multiple OR gene expression and the presences of additional ORs were not restricted to the neighbor loci that shared the transcriptional effect of the primary OR expression, suggesting that the “One neuron-one receptor rule” might not be strictly true at the transcription level. All of the inferable ORs, including additional co-expressed ORs, were shown to be monoallelic. Our sequencing of 21 Mecp2308 mutant OSNs, of which 62% expressed more than one OR genes, and the expression levels of the additional ORs were significantly higher than those in the wild-type, suggested that MeCP2 plays a role in the regulation of singular OR gene expression. Dual label in situ hybridization along with the sequence data revealed that dorsal and ventral ORs were co-expressed in the same Mecp2 mutant OSN, further implying that MeCP2 might be involved in regulation of OR territories in the OE. Our results suggested a new role of MeCP2 in OR gene choice and ratified that this multiple-OR expression caused by Mecp2 mutation did not accompany delayed OSN development that has been observed in the previous studies on the Mecp2 mutants.