998 resultados para Human genomics
Resumo:
Background. The anaerobic spirochaete Brachyspira pilosicoli causes enteric disease in avian, porcine and human hosts, amongst others. To date, the only available genome sequence of B. pilosicoli is that of strain 95/1000, a porcine isolate. In the first intra-species genome comparison within the Brachyspira genus, we report the whole genome sequence of B. pilosicoli B2904, an avian isolate, the incomplete genome sequence of B. pilosicoli WesB, a human isolate, and the comparisons with B. pilosicoli 95/1000. We also draw on incomplete genome sequences from three other Brachyspira species. Finally we report the first application of the high-throughput Biolog phenotype screening tool on the B. pilosicoli strains for detailed comparisons between genotype and phenotype. Results. Feature and sequence genome comparisons revealed a high degree of similarity between the three B. pilosicoli strains, although the genomes of B2904 and WesB were larger than that of 95/1000 (~2,765, 2.890 and 2.596 Mb, respectively). Genome rearrangements were observed which correlated largely with the positions of mobile genetic elements. Through comparison of the B2904 and WesB genomes with the 95/1000 genome, features that we propose are non-essential due to their absence from 95/1000 include a peptidase, glycine reductase complex components and transposases. Novel bacteriophages were detected in the newly-sequenced genomes, which appeared to have involvement in intra- and inter-species horizontal gene transfer. Phenotypic differences predicted from genome analysis, such as the lack of genes for glucuronate catabolism in 95/1000, were confirmed by phenotyping. Conclusions. The availability of multiple B. pilosicoli genome sequences has allowed us to demonstrate the substantial genomic variation that exists between these strains, and provides an insight into genetic events that are shaping the species. In addition, phenotype screening allowed determination of how genotypic differences translated to phenotype. Further application of such comparisons will improve understanding of the metabolic capabilities of Brachyspira species.
Resumo:
It has been postulated that noncoding RNAs (ncRNAs) are involved in the posttranscriptional control of gene expression, and may have contributed to the emergence of the complex attributes observed in mammalians. We show here that the complement of ncRNAs expressed from intronic regions of the human and mouse genomes comprises at least 78,147 and 39,660 transcriptional units, respectively. To identify conserved intronic sequences expressed in both humans and mice, we used custom-designed human cDNA microarrays to separately interrogate RNA from mouse and human liver, kidney, and prostate tissues. An overlapping tissue expression signature was detected for both species, comprising 198 transcripts; among these, 22 RNAs map to intronic regions with evidence of evolutionary conservation in humans and mice. Transcription of selected human-mouse intronic ncRNAs was confirmed using strand-specific RT-PCR. Altogether, these results support an evolutionarily conserved role of intronic ncRNAs in human and mouse, which are likely to be involved in the fine tuning of gene expression regulation in different mammalian tissues. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
Chicken is one of the most important sources of animal protein for human consumption, and breeding programmes have been responsible for constant improvements in production efficiency and product quality. Furthermore, chicken has largely contributed to fundamental discoveries in biology for the last 100 years. In this article we review recent developments in poultry genomics and their contribution to adding functional information to the already existing structural genomics, including the availability of the complete genome sequence, a comprehensive collection of mRNA sequences ( ESTs), microarray platforms, and their use to complement QTL mapping strategies in the identification of genes that underlie complex traits. Efforts of the Brazilian Poultry Genomics Programme in this area resulted in generation of a resource population, which was used for identification of Quantitative Trait Loci ( QTL) regions, generation of ESTs and candidate gene studies that contributed to furthering our understanding of the complex biological processes involved in growth and muscular development in chicken.
Resumo:
Background: The genome-wide identification of both morbid genes, i.e., those genes whose mutations cause hereditary human diseases, and druggable genes, i.e., genes coding for proteins whose modulation by small molecules elicits phenotypic effects, requires experimental approaches that are time-consuming and laborious. Thus, a computational approach which could accurately predict such genes on a genome-wide scale would be invaluable for accelerating the pace of discovery of causal relationships between genes and diseases as well as the determination of druggability of gene products.Results: In this paper we propose a machine learning-based computational approach to predict morbid and druggable genes on a genome-wide scale. For this purpose, we constructed a decision tree-based meta-classifier and trained it on datasets containing, for each morbid and druggable gene, network topological features, tissue expression profile and subcellular localization data as learning attributes. This meta-classifier correctly recovered 65% of known morbid genes with a precision of 66% and correctly recovered 78% of known druggable genes with a precision of 75%. It was than used to assign morbidity and druggability scores to genes not known to be morbid and druggable and we showed a good match between these scores and literature data. Finally, we generated decision trees by training the J48 algorithm on the morbidity and druggability datasets to discover cellular rules for morbidity and druggability and, among the rules, we found that the number of regulating transcription factors and plasma membrane localization are the most important factors to morbidity and druggability, respectively.Conclusions: We were able to demonstrate that network topological features along with tissue expression profile and subcellular localization can reliably predict human morbid and druggable genes on a genome-wide scale. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing morbidity and druggability.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The correct identification of all human genes, and their derived transcripts, has not yet been achieved, and it remains one of the major aims of the worldwide genomics community. Computational programs suggest the existence of 30,000 to 40,000 human genes. However, definitive gene identification can only be achieved by experimental approaches. We used two distinct methodologies, one based on the alignment of mouse orthologous sequences to the human genome, and another based on the construction of a high-quality human testis cDNA library, in an attempt to identify new human transcripts within the human genome sequence. We generated 47 complete human transcript sequences, comprising 27 unannotated and 20 annotated sequences. Eight of these transcripts are variants of previously known genes. These transcripts were characterized according to size, number of exons, and chromosomal localization, and a search for protein domains was undertaken based on their putative open reading frames. In silico expression analysis suggests that some of these transcripts are expressed at low levels and in a restricted set of tissues.
Resumo:
Breast cancer is the most common type of cancer among women worldwide. Research using breast cancer cell lines derived from primary tumors may provide valuable additional knowledge regarding this type of cancer. Therefore, the aim of this study was to investigate the phenotypic profiles of MACL-1 and MGSO-3, the only Brazilian breast cancer cell lines available for comparative studies. We evaluated the presence of hormone receptors, proliferation, differentiation and stem cell markers, using immunohistochemical staining of the primary tumor, cultured cells and xenografts implanted in immunodeficient mice. We also investigated the ability of the cell lines to form colonies and copy number alterations by array comparative genomic hybridization. Histopathological analysis showed that the invasive primary tumor from which the MACL-1 cell line was derived, was a luminal A subtype carcinoma, while the ductal carcinoma in situ (DCIS) that gave rise to the MGSO-3 cell line was a HER2 subtype tumor, both showing different proliferation levels. The cell lines and the tumor xenografts in mice preserved their high proliferative potential, but did not maintain the expression of the other markers assessed. This shift in expression may be due to the selection of an 'establishment' phenotype in vitro. Whole-genome DNA evaluation showed a large amount of copy number alterations (CNAs) in the two cell lines. These findings render MACL-1 and MGSO-3 the first characterized Brazilian breast cancer cell lines to be potentially used for comparative research. © 2013 Spandidos Publications Ltd. All rights reserved.
Resumo:
The publication of the human genome sequence in 2001 was a major step forward in knowledge necessary to understand the variations between individuals. For farmed species, genomic sequence information will facilitate the selection of animals optimised to live, and be productive, in particular environments. The availability of cattle genome sequence has allowed the breeding industry to take the first steps towards predicting phenotypes from genotypes by estimating a genomic breeding value (gEBV) for bulls using genome-wide DNA markers. The sequencing of the buffalo genome and creation of a panel of DNA markers has created the opportunity to apply molecular selection approaches for this species.The genomes of several buffalo of different breeds were sequenced and aligned with the bovine genome, which facilitated the identification of millions of sequence variants in the buffalo genomes. Based on frequencies of variants within and among buffalo breeds, and their distribution across the genome compared with the bovine genome, 90,000 putative single nucleotide polymorphisms (SNP) were selected to create an Axiom (R) Buffalo Genotyping Array 90K. This SNP Chip was tested in buffalo populations from Italy and Brazil and found to have at least 75% high quality and polymorphic markers in these populations. The 90K SNP chip was then used to investigate the structure of buffalo populations, and to localise the variations having a major effect on milk production.
Resumo:
Brucella species are Gram-negative bacteria that infect mammals. Recently, two unusual strains (Brucella inopinata BO1(T) and B. inopinata-like BO2) have been isolated from human patients, and their similarity to some atypical brucellae isolated from Australian native rodent species was noted. Here we present a phylogenomic analysis of the draft genome sequences of BO1(T) and BO2 and of the Australian rodent strains 83-13 and NF2653 that shows that they form two groups well separated from the other sequenced Brucella spp. Several important differences were noted. Both BO1(T) and BO2 did not agglutinate significantly when live or inactivated cells were exposed to monospecific A and Mantisera against O-side chain sugars composed of N-formyl-perosamine. While BO1(T) maintained the genes required to synthesize a typical Brucella O-antigen, BO2 lacked many of these genes but still produced a smooth LPS (lipopolysaccharide). Most missing genes were found in the wbk region involved in O-antigen synthesis in classic smooth Brucella spp. In their place, BO2 carries four genes that other bacteria use for making a rhamnose-based O-antigen. Electrophoretic, immunoblot, and chemical analyses showed that BO2 carries an antigenically different O-antigen made of repeating hexose-rich oligosaccharide units that made the LPS water-soluble, which contrasts with the homopolymeric O-antigen of other smooth brucellae that have a phenol-soluble LPS. The results demonstrate the existence of a group of early-diverging brucellae with traits that depart significantly from those of the Brucella species described thus far. IMPORTANCE This report examines differences between genomes from four new Brucella strains and those from the classic Brucella spp. Our results show that the four new strains are outliers with respect to the previously known Brucella strains and yet are part of the genus, forming two new clades. The analysis revealed important information about the evolution and survival mechanisms of Brucella species, helping reshape our knowledge of this important zoonotic pathogen. One discovery of special importance is that one of the strains, BO2, produces an O-antigen distinct from any that has been seen in any other Brucella isolates to date.
Resumo:
Background: Proteinaceous toxins are observed across all levels of inter-organismal and intra-genomic conflicts. These include recently discovered prokaryotic polymorphic toxin systems implicated in intra-specific conflicts. They are characterized by a remarkable diversity of C-terminal toxin domains generated by recombination with standalone toxin-coding cassettes. Prior analysis revealed a striking diversity of nuclease and deaminase domains among the toxin modules. We systematically investigated polymorphic toxin systems using comparative genomics, sequence and structure analysis. Results: Polymorphic toxin systems are distributed across all major bacterial lineages and are delivered by at least eight distinct secretory systems. In addition to type-II, these include type-V, VI, VII (ESX), and the poorly characterized "Photorhabdus virulence cassettes (PVC)", PrsW-dependent and MuF phage-capsid-like systems. We present evidence that trafficking of these toxins is often accompanied by autoproteolytic processing catalyzed by HINT, ZU5, PrsW, caspase-like, papain-like, and a novel metallopeptidase associated with the PVC system. We identified over 150 distinct toxin domains in these systems. These span an extraordinary catalytic spectrum to include 23 distinct clades of peptidases, numerous previously unrecognized versions of nucleases and deaminases, ADP-ribosyltransferases, ADP ribosyl cyclases, RelA/SpoT-like nucleotidyltransferases, glycosyltranferases and other enzymes predicted to modify lipids and carbohydrates, and a pore-forming toxin domain. Several of these toxin domains are shared with host-directed effectors of pathogenic bacteria. Over 90 families of immunity proteins might neutralize anywhere between a single to at least 27 distinct types of toxin domains. In some organisms multiple tandem immunity genes or immunity protein domains are organized into polyimmunity loci or polyimmunity proteins. Gene-neighborhood-analysis of polymorphic toxin systems predicts the presence of novel trafficking-related components, and also the organizational logic that allows toxin diversification through recombination. Domain architecture and protein-length analysis revealed that these toxins might be deployed as secreted factors, through directed injection, or via inter-cellular contact facilitated by filamentous structures formed by RHS/YD, filamentous hemagglutinin and other repeats. Phyletic pattern and life-style analysis indicate that polymorphic toxins and polyimmunity loci participate in cooperative behavior and facultative 'cheating' in several ecosystems such as the human oral cavity and soil. Multiple domains from these systems have also been repeatedly transferred to eukaryotes and their viruses, such as the nucleo-cytoplasmic large DNA viruses. Conclusions: Along with a comprehensive inventory of toxins and immunity proteins, we present several testable predictions regarding active sites and catalytic mechanisms of toxins, their processing and trafficking and their role in intra-specific and inter-specific interactions between bacteria. These systems provide insights regarding the emergence of key systems at different points in eukaryotic evolution, such as ADP ribosylation, interaction of myosin VI with cargo proteins, mediation of apoptosis, hyphal heteroincompatibility, hedgehog signaling, arthropod toxins, cell-cell interaction molecules like teneurins and different signaling messengers.
Resumo:
Abstract Background The thymus is a central lymphoid organ, in which bone marrow-derived T cell precursors undergo a complex process of maturation. Developing thymocytes interact with thymic microenvironment in a defined spatial order. A component of thymic microenvironment, the thymic epithelial cells, is crucial for the maturation of T-lymphocytes through cell-cell contact, cell matrix interactions and secretory of cytokines/chemokines. There is evidence that extracellular matrix molecules play a fundamental role in guiding differentiating thymocytes in both cortical and medullary regions of the thymic lobules. The interaction between the integrin α5β1 (CD49e/CD29; VLA-5) and fibronectin is relevant for thymocyte adhesion and migration within the thymic tissue. Our previous results have shown that adhesion of thymocytes to cultured TEC line is enhanced in the presence of fibronectin, and can be blocked with anti-VLA-5 antibody. Results Herein, we studied the role of CD49e expressed by the human thymic epithelium. For this purpose we knocked down the CD49e by means of RNA interference. This procedure resulted in the modulation of more than 100 genes, some of them coding for other proteins also involved in adhesion of thymocytes; others related to signaling pathways triggered after integrin activation, or even involved in the control of F-actin stress fiber formation. Functionally, we demonstrated that disruption of VLA-5 in human TEC by CD49e-siRNA-induced gene knockdown decreased the ability of TEC to promote thymocyte adhesion. Such a decrease comprised all CD4/CD8-defined thymocyte subsets. Conclusion Conceptually, our findings unravel the complexity of gene regulation, as regards key genes involved in the heterocellular cell adhesion between developing thymocytes and the major component of the thymic microenvironment, an interaction that is a mandatory event for proper intrathymic T cell differentiation.
Resumo:
Abstract Background One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results TTo understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.
Resumo:
Abstract Background Recent medical and biological technology advances have stimulated the development of new testing systems that have been providing huge, varied amounts of molecular and clinical data. Growing data volumes pose significant challenges for information processing systems in research centers. Additionally, the routines of genomics laboratory are typically characterized by high parallelism in testing and constant procedure changes. Results This paper describes a formal approach to address this challenge through the implementation of a genetic testing management system applied to human genome laboratory. We introduced the Human Genome Research Center Information System (CEGH) in Brazil, a system that is able to support constant changes in human genome testing and can provide patients updated results based on the most recent and validated genetic knowledge. Our approach uses a common repository for process planning to ensure reusability, specification, instantiation, monitoring, and execution of processes, which are defined using a relational database and rigorous control flow specifications based on process algebra (ACP). The main difference between our approach and related works is that we were able to join two important aspects: 1) process scalability achieved through relational database implementation, and 2) correctness of processes using process algebra. Furthermore, the software allows end users to define genetic testing without requiring any knowledge about business process notation or process algebra. Conclusions This paper presents the CEGH information system that is a Laboratory Information Management System (LIMS) based on a formal framework to support genetic testing management for Mendelian disorder studies. We have proved the feasibility and showed usability benefits of a rigorous approach that is able to specify, validate, and perform genetic testing using easy end user interfaces.
Resumo:
CYP3A verstoffwechselt mehr als 50% aller gegenwärtig in der Therapie eingesetzten Wirkstoffe, die häufig an klinisch relevanten Arzneimitttel-Wechselwirkungen beteiligt sind. Das Verständnis über die Bedeutung und die Regulation von einzelnen CYP3A Genen in der Pharmakologie und Physiologie ist unvollständig. Wir untersuchten die Evolution des CYP3 Genlokus über einen Zeitraum von 450 Millionen Jahre mittels genomischer Sequenzen von 16 Tierarten. Neue CYP3 Unterfamilien (CYP3B, C und D) entstanden über eine beschleunigte Evolution aus CYP3A Vorstufen von Clupeocephala Spezies. Ausgeprägte funktionelle Unterschiede traten zwischen CYP3A in Säugern und Clupeocephala CYP3 auf. Alle amnioten CYP3A Gene entwickelten sich aus zwei CYP3A Urgenen. Aufgrund der Entstehung von Säugern mit Plazenta ging eines von ihnen verloren während das andere eine neue genomische Umgebung infolge einer Translokation erlangte. In Primaten unterzog sich CYP3A mit mehreren Genduplikationen, Deletionen, Pseudogenisierung und Genkonversionen einer raschen evolutionären Veränderung. Die Entwicklung von CYP3A in Schmalnasenaffen (Alte Welt Affen, große Menschenaffen und Menschen) unterschieden sich wesentlich von Neue Welt Primaten (z.B. gewöhnlichen Krallenaffen) und Feuchtnasenaffen (z.B. Galago). Stellvertretend für die CYP3A Protein-codierende Sequenz entdeckten wir zwei frühe Episoden von besonders starker positiver Selektion: (1) auf CYP3A7 in der frühen hominoiden Evolution, welche im fetalen Zeitraum von einer Einschränkung der hepatischen Expression begleitet war, und (2) auf humanes CYP3A4 im Anschluss an die Teilung der Abstammungslinie in Schimpansen und Mensch. In Übereinstimmung mit diesen Befunden beeinflussen drei von vier positiv ausgewählten Aminosäuren, die in früheren biochemischen CYP3A Studien untersucht wurden, die Aktivität und Regioselektivität. Es ist somit naheliegend, dass CYP3A7 und CYP3A4 katalytische Funktionen erworben haben können, die besonders wichtig waren für die Evolution von Hominoiden und Menschen. Die Charakterisierung von CYP3A Promotoren in Primaten zeigte eine Anreicherung von ER6 Elementen in CYP3A Promotoren von Primaten und einen Trend in Richtung Erhöhung der ER6 Enstehung entlang den Abstammungslinien, die zu humanen und Schimpansen CYP3A4 führten. Die steigende Anzahl an ER6 Elementen kann durch die ausgeprägte CYP3A4 Induzierbarkeit und Expressionsvariabilität im Menschen verursacht sein.
Resumo:
The human cytochrome P450 3A4 (CYP3A4), the predominant but variably expressed cytochrome P450 in adult liver and small intestine is involved in the metabolism of over 50% of currently used drugs. Its paralog CYP3A5 plays a crucial role in the disposition of several drugs with low therapeutic index, including tacrolimus. Limited information is available for the CYP3A5 transcriptional regulation and its induction by xenobiotics remains controversial. In the first part of this study, we analysed the CYP3A5 transcriptional regulation and its induction by xenobiotics in vivo using transgenic mice. To this end, two transgenic strains were established by pronuclear injection of a plasmid, expressing firefly luciferase driven by a 6.2 kb of the human CYP3A5 promoter. A detailed analysis of both strains shows a tissue distribution largely reflecting that of CYP3A5 transcripts in humans. Thus, the highest luciferase activity was detected in the small intestine, followed by oesophagus, testis, lung, adrenal gland, ovary, prostate and kidney. However, no activity was observed in the liver. CYP3A5-luc transgenic mice were similarly induced in both sexes with either PCN or TCPOBOP in small intestine in a dose-dependent manner. Thus, the 6.2 kb upstream promoter of CYP3A5 mediates the broad tissue activity in transgenic mice. CYP3A5 promoter is inducible in the small intestine in vivo, which may contribute to the variable expression of CYP3A in this organ. rnThe hepato-intestinal level of the detoxifying oxidases CYP3A4 and CYP3A5 is adjusted to the xenobiotic exposure mainly via the xenosensor and transcriptional factor PXR. CYP3A5 is additionally expressed in several other organs lacking PXR, including kidney. In the second part of this study, we investigated the mechanism of the differential expression of CYP3A5 and CYP3A4 and its evolutionary origin using renal and intestinal cells, and comparative genomics. For this examination, we established a two-cell line models reflecting the expression relationships of CYP3A4 and CYP3A5 in the kidney and small intestine in vivo. Our data demonstrate that the CYP3A5 expression in renal cells was enabled by the loss of a suppressing Yin Yang 1 (YY1)-binding site from the CYP3A5 promoter. This allowed for a renal CYP3A5 expression in a PXR-independent manner. The YY1 element is retained in the CYP3A4 gene, leading to its suppression, perhaps via interference with the NF1 activity in renal cells. In intestinal cells, the inhibition of CYP3A4 expression by YY1 is abrogated by a combined activating effect of PXR and NF1 acting on their respective response elements located adjacent to the YY1-binding site on CYP3A4 proximal promoter. CYP3A4 expression is further facilitated by a point mutation attenuating the suppressing effect of YY1 binding site. The differential expression of CYP3A4 and CYP3A5 in these organs results from the loss of the YY1 binding element from the CYP3A5 promoter, acting in concert with the differential organ expression of PXR, and with the higher accumulation of PXR response elements in CYP3A4. rn