993 resultados para GENOME CHARACTERIZATION
Resumo:
Rapid evolution and high intrahost sequence diversity are hallmarks of human and simian immunodeficiency virus (HIV/SIV) infection. Minor viral variants have important implications for drug resistance, receptor tropism, and immune evasion. Here, we used ultradeep pyrosequencing to sequence complete HIV/SIV genomes, detecting variants present at a frequency as low as 1%. This approach provides a more complete characterization of the viral population than is possible with conventional methods, revealing low-level drug resistance and detecting previously hidden changes in the viral population. While this work applies pyrosequencing to immunodeficiency viruses, this approach could be applied to virtually any viral pathogen.
Resumo:
The xeroderma pigmentosum complementation group B (XPB) protein is involved in both DNA repair and transcription in human cells. It is a component of the transcription factor IIH (TFIIH) and is responsible for DNA helicase activity during nucleotide (nt) excision repair (NER). Its high evolutionary conservation has allowed identification of homologous proteins in different organisms, including plants. In contrast to other organisms, Arabidopsis thaliana harbors a duplication of the XPB orthologue (AtXPB1 and AtXPB2), and the proteins encoded by the duplicated genes are very similar (95% amino acid identity). Complementation assays in yeast rad25 mutant strains suggest the involvement of AtXPB2 in DNA repair, as already shown for AtXPB1, indicating that these proteins may be functionally redundant in the removal of DNA lesions in A. thaliana. Although both genes are expressed in a constitutive manner during the plant life cycle, Northern blot analyses suggest that light modulates the expression level of both XPB copies, and transcript levels increase during early stages of development. Considering the high similarity between AtXPB1 and AtXPB2 and that both of predicted proteins may act in DNA repair, it is possible that this duplication may confer more flexibility and resistance to DNA damaging agents in thale cress. (C) 2004 Elsevier B.V. All rights reserved.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The objective of this work is to characterize the genome of the chromosome 1 of A.thaliana, a small flowering plants used as a model organism in studies of biology and genetics, on the basis of a recent mathematical model of the genetic code. I analyze and compare different portions of the genome: genes, exons, coding sequences (CDS), introns, long introns, intergenes, untranslated regions (UTR) and regulatory sequences. In order to accomplish the task, I transformed nucleotide sequences into binary sequences based on the definition of the three different dichotomic classes. The descriptive analysis of binary strings indicate the presence of regularities in each portion of the genome considered. In particular, there are remarkable differences between coding sequences (CDS and exons) and non-coding sequences, suggesting that the frame is important only for coding sequences and that dichotomic classes can be useful to recognize them. Then, I assessed the existence of short-range dependence between binary sequences computed on the basis of the different dichotomic classes. I used three different measures of dependence: the well-known chi-squared test and two indices derived from the concept of entropy i.e. Mutual Information (MI) and Sρ, a normalized version of the “Bhattacharya Hellinger Matusita distance”. The results show that there is a significant short-range dependence structure only for the coding sequences whose existence is a clue of an underlying error detection and correction mechanism. No doubt, further studies are needed in order to assess how the information carried by dichotomic classes could discriminate between coding and noncoding sequence and, therefore, contribute to unveil the role of the mathematical structure in error detection and correction mechanisms. Still, I have shown the potential of the approach presented for understanding the management of genetic information.
Resumo:
Around 14 distinct virus species-complexes have been detected in honeybees, each with one or more strains or sub-species. Here we present the initial characterization of an entirely new virus species-complex discovered in honeybee (Apis mellifera L.) and varroa mite (Varroa destructor) samples from Europe and the USA. The virus has a naturally poly-adenylated RNA genome of about 6500 nucleotides with a genome organization and sequence similar to the Tymoviridae (Tymovirales; Tymoviridae), a predominantly plant-infecting virus family. Literature and laboratory analyses indicated that the virus had not previously been described. The virus is very common in French apiaries, mirroring the results from an extensive Belgian survey, but could not be detected in equally-extensive Swedish and Norwegian bee disease surveys. The virus appears to be closely linked to varroa, with the highest prevalence found in varroa samples and a clear seasonal distribution peaking in autumn, coinciding with the natural varroa population development. Sub-genomic RNA analyses show that bees are definite hosts, while varroa is a possible host and likely vector. The tentative name of Bee Macula-like virus (BeeMLV) is therefore proposed. A second, distantly related Tymoviridae-like virus was also discovered in varroa transcriptomes, tentatively named Varroa Tymo-like virus (VTLV).
Resumo:
The xeroderma pigmentosum complementation group B (XPB) protein is involved in both DNA repair and transcription in human cells. It is a component of the transcription factor IIH (TFIIH) and is responsible for DNA helicase activity during nucleotide (nt) excision repair (NER). Its high evolutionary conservation has allowed identification of homologous proteins in different organisms, including plants. In contrast to other organisms, Arabidopsis thaliana harbors a duplication of the XPB orthologue (AtXPB1 and AtXPB2), and the proteins encoded by the duplicated genes are very similar (95% amino acid identity). Complementation assays in yeast rad25 mutant strains suggest the involvement of AtXPB2 in DNA repair, as already shown for AtXPB1, indicating that these proteins may be functionally redundant in the removal of DNA lesions in A. thaliana. Although both genes are expressed in a constitutive manner during the plant life cycle, Northern blot analyses suggest that light modulates the expression level of both XPB copies, and transcript levels increase during early stages of development. Considering the high similarity between AtXPB1 and AtXPB2 and that both of predicted proteins may act in DNA repair, it is possible that this duplication may confer more flexibility and resistance to DNA damaging agents in thale cress. (C) 2004 Elsevier B.V. All rights reserved.
Resumo:
Pseudocercospora griseola (Sacc.) Crous &. Braun is a widespread fungal phytopathogen that is responsible for angular leaf spot in the common bean (Phaseolus vulgaris L.). A number of fungal phytopathogens have been shown to harbour mycoviruses, and this possibility was investigated in populations of Pseudocercospora griseola. The total nucleic acid extracts of 61 fungal isolates were subjected to agarose gel electrophoresis. Small fragments (800-4800 bp) could be identified in 42 of the samples. The presence of dsRNA in isolate Ig838 was confirmed by treatment of total nucleic acid with DNase, RNase A, and nuclease S I. Transmission electron microscopy revealed the presence of viral-like particles 40 nm in diameter in the mycelia of 2 fungal isolates, namely 29-3 and Ig838. The transmission of dsRNA by means of conidia was 100% for isolate 29-3, but there was loss of 1-6 fragments of dsRNA in monosporic colonies of isolate Ig848. Cycloheximide treatment failed to inhibit the mycovirus in isolate 29-3, but proved efficient in the elimination of the 2.2, 2.0, 1.8, 1.2 and 1.0 kb fragments in 2 colonies of isolate Ig848. The occurrence of a mycovirus in Pseudocercospora griseola was demonstrated for the first time in the present study.
Resumo:
Les facteurs de transcription sont des protéines spécialisées qui jouent un rôle important dans différents processus biologiques tel que la différenciation, le cycle cellulaire et la tumorigenèse. Ils régulent la transcription des gènes en se fixant sur des séquences d’ADN spécifiques (éléments cis-régulateurs). L’identification de ces éléments est une étape cruciale dans la compréhension des réseaux de régulation des gènes. Avec l’avènement des technologies de séquençage à haut débit, l’identification de tout les éléments fonctionnels dans les génomes, incluant gènes et éléments cis-régulateurs a connu une avancée considérable. Alors qu’on est arrivé à estimer le nombre de gènes chez différentes espèces, l’information sur les éléments qui contrôlent et orchestrent la régulation de ces gènes est encore mal définie. Grace aux techniques de ChIP-chip et de ChIP-séquençage il est possible d’identifier toutes les régions du génome qui sont liées par un facteur de transcription d’intérêt. Plusieurs approches computationnelles ont été développées pour prédire les sites fixés par les facteurs de transcription. Ces approches sont classées en deux catégories principales: les algorithmes énumératifs et probabilistes. Toutefois, plusieurs études ont montré que ces approches génèrent des taux élevés de faux négatifs et de faux positifs ce qui rend difficile l’interprétation des résultats et par conséquent leur validation expérimentale. Dans cette thèse, nous avons ciblé deux objectifs. Le premier objectif a été de développer une nouvelle approche pour la découverte des sites de fixation des facteurs de transcription à l’ADN (SAMD-ChIP) adaptée aux données de ChIP-chip et de ChIP-séquençage. Notre approche implémente un algorithme hybride qui combine les deux stratégies énumérative et probabiliste, afin d’exploiter les performances de chacune d’entre elles. Notre approche a montré ses performances, comparée aux outils de découvertes de motifs existants sur des jeux de données simulées et des jeux de données de ChIP-chip et de ChIP-séquençage. SAMD-ChIP présente aussi l’avantage d’exploiter les propriétés de distributions des sites liés par les facteurs de transcription autour du centre des régions liées afin de limiter la prédiction aux motifs qui sont enrichis dans une fenêtre de longueur fixe autour du centre de ces régions. Les facteurs de transcription agissent rarement seuls. Ils forment souvent des complexes pour interagir avec l’ADN pour réguler leurs gènes cibles. Ces interactions impliquent des facteurs de transcription dont les sites de fixation à l’ADN sont localisés proches les uns des autres ou bien médier par des boucles de chromatine. Notre deuxième objectif a été d’exploiter la proximité spatiale des sites liés par les facteurs de transcription dans les régions de ChIP-chip et de ChIP-séquençage pour développer une approche pour la prédiction des motifs composites (motifs composés par deux sites et séparés par un espacement de taille fixe). Nous avons testé ce module pour prédire la co-localisation entre les deux demi-sites ERE qui forment le site ERE, lié par le récepteur des œstrogènes ERα. Ce module a été incorporé à notre outil de découverte de motifs SAMD-ChIP.
Resumo:
In this work, we used sugarcane as a model due to its importance for sugar and ethanol production. Unlike the current plant models, sugarcane presents a complex genetics and an enormous allelic variation. Here, we report the analysis of SAGE libraries produced using the shoot apical meristem from contrasted genotypes by flowering induction (non-flowering vs. early-flowering varieties) grown under São Paulo state conditions. The expression pattern was analyzed using samples from São Paulo (SP) and Rio Grande do Norte (RN) states. These results showed that cDNAs identified by SAGE libraries had differential expression only in São Paulo state samples. Furthermore, the cDNA identified CYP (Citocrome P450) was chosen for in silico and genome characterization because it was found in SAGE libraries and subtractive libraries from samples from RN. Phylogenetic trees showed the relationship for these sequences. Furthermore, the qRT-PCR for CYP showed a potential role as flowering indutor for RN samples considering different isophorms. Considering the results present here, it can be consider that CYP gene may be used as molecular marker
Resumo:
In this work, we used sugarcane as a model due to its importance for sugar and ethanol production. Unlike the current plant models, sugarcane presents a complex genetics and an enormous allelic variation. Here, we report the analysis of SAGE libraries produced using the shoot apical meristem from contrasted genotypes by flowering induction (non-flowering vs. early-flowering varieties) grown under São Paulo state conditions. The expression pattern was analyzed using samples from São Paulo (SP) and Rio Grande do Norte (RN) states. These results showed that cDNAs identified by SAGE libraries had differential expression only in São Paulo state samples. Furthermore, the cDNA identified CYP (Citocrome P450) was chosen for in silico and genome characterization because it was found in SAGE libraries and subtractive libraries from samples from RN. Phylogenetic trees showed the relationship for these sequences. Furthermore, the qRT-PCR for CYP showed a potential role as flowering indutor for RN samples considering different isophorms. Considering the results present here, it can be consider that CYP gene may be used as molecular marker
Resumo:
A rapid and simple technique for the purification of Toxoplasma gondii tachyzoites was developed. Highly purified parasites were obtained from the peritoneal exudates of infected mice by means of two consecutive discontinous sucrose gradients run at low speed (10,000xg, 30 min). Parasites obtained by this method conserved its biological activity. Hybridizations tudies with DNA from healthy mice and from purified tachyzoites preparations demonstrated that Toxoplasma gondii tachyzoites DNA could be obtained with better than 90 per cents purity. Preliminary studies with DNA endonucleases showed the presence in the tachyzoites genome of highly repetitives sequences.
Resumo:
To develop a comprehensive overview of copy number aberrations (CNAs) in stage-II/III colorectal cancer (CRC), we characterized 302 tumors from the PETACC-3 clinical trial. Microsatellite-stable (MSS) samples (n = 269) had 66 minimal common CNA regions, with frequent gains on 20 q (72.5%), 7 (41.8%), 8 q (33.1%) and 13 q (51.0%) and losses on 18 (58.6%), 4 q (26%) and 21 q (21.6%). MSS tumors have significantly more CNAs than microsatellite-instable (MSI) tumors: within the MSI tumors a novel deletion of the tumor suppressor WWOX at 16 q23.1 was identified (p<0.01). Focal aberrations identified by the GISTIC method confirmed amplifications of oncogenes including EGFR, ERBB2, CCND1, MET, and MYC, and deletions of tumor suppressors including TP53, APC, and SMAD4, and gene expression was highly concordant with copy number aberration for these genes. Novel amplicons included putative oncogenes such as WNK1 and HNF4A, which also showed high concordance between copy number and expression. Survival analysis associated a specific patient segment featured by chromosome 20 q gains to an improved overall survival, which might be due to higher expression of genes such as EEF1B2 and PTK6. The CNA clustering also grouped tumors characterized by a poor prognosis BRAF-mutant-like signature derived from mRNA data from this cohort. We further revealed non-random correlation between CNAs among unlinked loci, including positive correlation between 20 q gain and 8 q gain, and 20 q gain and chromosome 18 loss, consistent with co-selection of these CNAs. These results reinforce the non-random nature of somatic CNAs in stage-II/III CRC and highlight loci and genes that may play an important role in driving the development and outcome of this disease.
Resumo:
In mammals, post-testicular sperm maturation taking place in the epididymis is required for the spermatozoa to acquire the abilities required to fertilize the egg in vivo. The epididymal epithelial cells secrete proteins and other small molecules into the lumen, where they interact with the spermatozoa and enable necessary maturational changes. In this study different in silico, in vitro and in vivo approaches were utilized in order to find novel genes responsible for the function of the epididymis and post-testicular sperm maturation in the mouse. Available online genomic databases were analyzed to identify genes potentially expressed in the epididymis, gene expression profiling was performed by studying their expression in different mouse tissues, and significance of certain genes to fertility was assessed by generating genetically modified mouse models. A recently discovered Pate (prostate and testis expression) gene family was found to be predominantly expressed in the epididymis. It represents one of the largest known gene families expressed in the epididymis, and the members code for proteins potentially involved in defense against microorganisms. Through genetically modified mouse models CRISP4 (cysteine-rich secretory protein 4) was identified to regulate sperm acrosome reaction, and BMYC to inhibit the expression of the Myc proto-oncogene in the developing testis. A mouse line expressing iCre recombinase specifically in the epididymis was also generated. This model can be used to generate conditional, epididymis-specific knock-out models, and will be a valuable tool in fertility studies.