923 resultados para Genomic organization
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Trypanosoma cruzi, the agent of Chagas disease, is a complex of genetically diverse isolates highly phylogenetically related to T. cruzi-like species, Trypanosoma cruzi marinkellei and Trypanosoma dionisii, all sharing morphology of blood and culture forms and development within cells. However, they differ in hosts, vectors and pathogenicity: T. cruzi is a human pathogen infective to virtually all mammals whilst the other two species are non-pathogenic and bat restricted. Previous studies suggest that variations in expression levels and genetic diversity of cruzipain, the major isoform of cathepsin L-like (CATL) enzymes of T. cruzi, correlate with levels of cellular invasion, differentiation, virulence and pathogenicity of distinct strains. In this study, we compared 80 sequences of genes encoding cruzipain from 25 T. cruzi isolates representative of all discrete typing units (DTUs TcI-TcVI) and the new genotype Tcbat and 10 sequences of homologous genes from other species. The catalytic domain repertoires diverged according to DTUs and trypanosome species. Relatively homogeneous sequences are found within and among isolates of the same DTU except TcV and TcVI, which displayed sequences unique or identical to those of TcII and TcIII, supporting their origin from the hybridization between these two DTUs. In network genealogies, sequences from T. cruzi clustered tightly together and closer to T. c. marinkellei than to T. dionisii and largely differed from homologues of T. rangeli and T. b. brucei. Here, analysis of isolates representative of the overall biological and genetic diversity of T. cruzi and closest T. cruzi-like species evidenced DTU- and species-specific polymorphisms corroborating phylogenetic relationships inferred with other genes. Comparison of both phylogenetically close and distant trypanosomes is valuable to understand host-parasite interactions, virulence and pathogenicity. Our findings corroborate cruzipain as valuable target for drugs, vaccine, diagnostic and genotyping approaches.
Resumo:
Tangier disease is characterized by low serum high density lipoproteins and a biochemical defect in the cellular efflux of lipids to high density lipoproteins. ABC1, a member of the ATP-binding cassette family, recently has been identified as the defective gene in Tangier disease. We report here the organization of the human ABC1 gene and the identification of a mutation in the ABC1 gene from the original Tangier disease kindred. The organization of the human ABC1 gene is similar to that of the mouse ABC1 gene and other related ABC genes. The ABC1 gene contains 49 exons that range in size from 33 to 249 bp and is over 70 kb in length. Sequence analysis of the ABC1 gene revealed that the proband for Tangier disease was homozygous for a deletion of nucleotides 3283 and 3284 (TC) in exon 22. The deletion results in a frameshift mutation and a premature stop codon starting at nucleotide 3375. The product is predicted to encode a nonfunctional protein of 1,084 aa, which is approximately half the size of the full-length ABC1 protein. The loss of a Mnl1 restriction site, which results from the deletion, was used to establish the genotype of the rest of the kindred. In summary, we report on the genomic organization of the human ABC1 gene and identify a frameshift mutation in the ABC1 gene of the index case of Tangier disease. These results will be useful in the future characterization of the structure and function of the ABC1 gene and the analysis of additional ABC1 mutations in patients with Tangier disease.
Resumo:
The structures of the genes encoding the α1 and β1 subunits of murine soluble guanylyl cyclase (sGC) were determined. Full-length cDNAs isolated from mouse lungs encoding the α1 (2.5 kb) and β1 (3.3 kb) subunits are presented in this report. The α1 sGC gene is approximately 26.4 kb and contains nine exons, whereas the β1 sGC gene spans 22 kb and consists of 14 exons. The positions of exon/intron boundaries and the sizes of introns for both genes are described. Comparison of mouse genomic organization with the Human Genome Database predicted the exon/intron boundaries of the human genes and revealed that human and mouse α1 and β1 sGC genes have similar structures. Both mouse genes are localized on the third chromosome, band 3E3-F1, and are separated by a fragment that is 2% of the chromosomal length. The 5′ untranscribed regions of α1 and β1 subunit genes were subcloned into luciferase reporter constructs, and the functional analysis of promoter activity was performed in murine neuroblastoma N1E-115 cells. Our results indicate that the 5′ untranscribed regions for both genes possess independent promoter activities and, together with the data on chromosomal localization, suggest independent regulation of both genes.
Resumo:
Microsatellites, tandem arrays of short (2-5 bp) nucleotide motifs, are present in high numbers in most eukaryotic genomes. We have characterized the physical distribution of microsatellites on chromosomes of sugar beet (Beta vulgaris L.). Each microsatellite sequence shows a characteristic genomic distribution and motif-dependent dispersion, with site-specific amplification on one to seven pairs of centromeres or intercalary chromosomal regions and weaker, dispersed hybridization along chromosomes. Exclusion of some microsatellites from 18S-5.8S-25S rRNA gene sites, centromeres, and intercalary sites was observed. In-gel and in situ hybridization patterns are correlated, with highly repeated restriction fragments indicating major centromeric sites of microsatellite arrays. The results have implications for genome evolution and the suitability of particular microsatellite markers for genetic mapping and genome analysis.
Resumo:
In humans, a polymorphic gene encodes the drug-metabolizing enzyme NATI (arylamine N-acetyltransferase Type 1), which is widely expressed throughout the body. While the protein-coding region of NATI is contained within a single exon, examination of the human EST (expressed sequence tag) database at the NCBI revealed the presence of nine separate exons, eight of which were located in the 5'non-coding region of NATI. Differential splicing produced at least eight unique mRNA isoforms that could be grouped according to the location of the first exon, which suggested that NATI expression occurs from three alternative promoters. Using RT (reverse transcriptase)-PCR, we identified one major transcript in various epithelial cells derived from different tissues. In contrast, multiple transcripts were observed in blood-derived cell lines (CEM, THP-1 and Jurkat), with a novel variant, not identified in the EST database, found in CEM cells only. The major splice variant increased gene expression 9-11-fold in a luciferase reporter assay, while the other isoforrns were similar or slightly greater than the control. We examined the upstream region of the most active splice variant in a promoter-reporter assay, and isolated a 257 bp sequence that produced maximal promoter activity. This sequence lacked a TATA box, but contained a consensus Sp1 site and a CAAT box, as well as several other putative transcription-factor-binding sites. Cell-specific expression of the different NATI transcripts may contribute to the variation in NATI activity in vivo.
Resumo:
Sulfate plays an essential role in human growth and development. Here, we characterized the functional properties of the human Na+-sulfate cotransporter (hNaS2), determined its tissue distribution, and identified its gene (SLC13A4) structure. Expression of hNaS2 protein in Xenopus oocytes led to a Na+-dependent transport of sulfate that was inhibited by thiosulfate, phosphate, molybdate. selenate and tungstate, but not by oxalate, citrate, succinate, phenol red or DIDS. Transport kinetics of hNaS2 determined a K, for sulfate of 0.38 mM, suggestive of a high affinity sulfate transporter. Na+ kinetics determined a Hill coefficient of 1.6 +/- 0.6, suggesting a Na: SO42- stoichiometry of 2:1. hNaS2 mRNA was highly expressed in placenta and testis, with intermediate levels in brain and lower levels found in the heart, thymus, and liver. The SLC13A4 gene contains 16 exons, spanning over 47 kb in length. Its 5'-flanking region contains CAAT- and GC-box motifs, and a number of putative transcription factor binding sites, including GATA-1, AP-1, and AP-2 consensus sequences. This is the first study to characterize hNaS2 transport kinetics, define its tissue distribution, and resolve its gene (SLC13A4) structure and 5' flanking region. (C) 2004 Elsevier Inc. All rights reserved.
Dinoflagellate Genomic Organization and Phylogenetic Marker Discovery Utilizing Deep Sequencing Data
Resumo:
Dinoflagellates possess large genomes in which most genes are present in many copies. This has made studies of their genomic organization and phylogenetics challenging. Recent advances in sequencing technology have made deep sequencing of dinoflagellate transcriptomes feasible. This dissertation investigates the genomic organization of dinoflagellates to better understand the challenges of assembling dinoflagellate transcriptomic and genomic data from short read sequencing methods, and develops new techniques that utilize deep sequencing data to identify orthologous genes across a diverse set of taxa. To better understand the genomic organization of dinoflagellates, a genomic cosmid clone of the tandemly repeated gene Alchohol Dehydrogenase (AHD) was sequenced and analyzed. The organization of this clone was found to be counter to prevailing hypotheses of genomic organization in dinoflagellates. Further, a new non-canonical splicing motif was described that could greatly improve the automated modeling and annotation of genomic data. A custom phylogenetic marker discovery pipeline, incorporating methods that leverage the statistical power of large data sets was written. A case study on Stramenopiles was undertaken to test the utility in resolving relationships between known groups as well as the phylogenetic affinity of seven unknown taxa. The pipeline generated a set of 373 genes useful as phylogenetic markers that successfully resolved relationships among the major groups of Stramenopiles, and placed all unknown taxa on the tree with strong bootstrap support. This pipeline was then used to discover 668 genes useful as phylogenetic markers in dinoflagellates. Phylogenetic analysis of 58 dinoflagellates, using this set of markers, produced a phylogeny with good support of all branches. The Suessiales were found to be sister to the Peridinales. The Prorocentrales formed a monophyletic group with the Dinophysiales that was sister to the Gonyaulacales. The Gymnodinales was found to be paraphyletic, forming three monophyletic groups. While this pipeline was used to find phylogenetic markers, it will likely also be useful for finding orthologs of interest for other purposes, for the discovery of horizontally transferred genes, and for the separation of sequences in metagenomic data sets.
Resumo:
A DNA sequence, TPE1, representing the internal domain of a Ty1-copia retroelement, was isolated from genomic DNA of Pinus elliottii Engelm. var. elliottii (slash pine). Genomic Southern analysis showed that this sequence, carrying partial reverse transcriptase and integrase gene sequences, is highly amplified within the genome of slash pine and part of a dispersed element >4.8 kbp. Fluorescent in situ hybridization to metaphase chromosomes shows that the element is relatively uniformly dispersed over all 12 chromosome pairs and is highly abundant in the genome. It is largely excluded from centromeric regions and intercalary chromosomal sites representing the 18S-5.8S-25S rRNA genes. Southern hybridization with specific DNA probes for the reverse transcriptase gene shows that TPE1 represents a large subgroup of heterogeneous Ty1-copia retrotransposons in Pinus species. Because no TPE1 transcription could be detected, it is most likely an inactive element--at least in needle tissue. Further evidence for inactivity was found in recombinant reverse transcriptase and integrase sequences. The distribution of TPE1 within different gymnosperms that contain Ty1-copia group retrotransposons, as shown by a PCR assay, was investigated by Southern hybridization. The TPE1 family is highly amplified and conserved in all Pinus species analyzed, showing a similar genomic organization in the three- and five-needle pine species investigated. It is also present in spruce, bald cypress (swamp cypress), and in gingko but in fewer copies and a different genomic organization.
Resumo:
MicroRNAs (miRNA) are recognized posttranscriptional gene repressors involved in the control of almost every biological process. Allelic variants in these regions may be an important source of phenotypic diversity and contribute to disease susceptibility. We analyzed the genomic organization of 325 human miRNAs (release 7.1, miRBase) to construct a panel of 768 single-nucleotide polymorphisms (SNPs) covering approximately 1 Mb of genomic DNA, including 131 isolated miRNAs (40%) and 194 miRNAs arranged in 48 miRNA clusters, as well as their 5-kb flanking regions. Of these miRNAs, 37% were inside known protein-coding genes, which were significantly associated with biological functions regarding neurological, psychological or nutritional disorders. SNP coverage analysis revealed a lower SNP density in miRNAs compared with the average of the genome, with only 24 SNPs located in the 325 miRNAs studied. Further genotyping of 340 unrelated Spanish individuals showed that more than half of the SNPs in miRNAs were either rare or monomorphic, in agreement with the reported selective constraint on human miRNAs. A comparison of the minor allele frequencies between Spanish and HapMap population samples confirmed the applicability of this SNP panel to the study of complex disorders among the Spanish population, and revealed two miRNA regions, hsa-mir-26a-2 in the CTDSP2 gene and hsa-mir-128-1 in the R3HDM1 gene, showing geographical allelic frequency variation among the four HapMap populations, probably because of differences in natural selection. The designed miRNA SNP panel could help to identify still hidden links between miRNAs and human disease.
Resumo:
Animal olfactory systems have a critical role for the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. Currently, the biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to plasma membrane. Moreover, the conservation degree of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighbor genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)