950 results for Human Genes


Relevance:

100.00% 100.00%

Publisher:

Abstract:

The origin of new genes through gene duplication is fundamental to the evolution of lineage- or species-specific phenotypic traits. In this report, we estimate the number of functional retrogenes on the lineage leading to humans generated by the high rate of retroposition (retroduplication) in primates. Extensive comparative sequencing and expression studies coupled with evolutionary analyses and simulations suggest that a significant proportion of recent retrocopies represent bona fide human genes. We estimate that at least one new retrogene per million years emerged on the human lineage during the past approximately 63 million years of primate evolution. Detailed analysis of a subset of the data shows that the majority of retrogenes are specifically expressed in testis, whereas their parental genes show broad expression patterns. Consistently, most retrogenes evolved functional roles in spermatogenesis. Proteins encoded by X chromosome-derived retrogenes were strongly preserved by purifying selection following the duplication event, supporting the view that they may act as functional autosomal substitutes during X-inactivation of late spermatogenesis genes. Also, some retrogenes acquired a new or more adapted function driven by positive selection. We conclude that retroduplication significantly contributed to the formation of recent human genes and that most new retrogenes were progressively recruited during primate evolution by natural and/or sexual selection to enhance male germline function.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The recent availability of the chicken genome sequence poses the question of whether there are human protein-coding genes conserved in chicken that are currently not included in the human gene catalog. Here, we show, using comparative gene finding followed by experimental verification of exon pairs by RT–PCR, that the addition to the multi-exonic subset of this catalog could be as little as 0.2%, suggesting that we may be closing in on the human gene set. Our protocol, however, has two shortcomings: (i) the bioinformatic screening of the predicted genes, applied to filter out false positives, cannot handle intronless genes; and (ii) the experimental verification could fail to identify expression at a specific developmental time. This highlights the importance of developing methods that could provide a reliable estimate of the number of these two types of genes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Humans differ substantially with respect to susceptibility to human immunodeficiency virus type 1 (HIV-1). We evaluated variants of nine host genes participating in the viral life cycle for their role in modulating HIV-1 infection. Alleles were assessed ex vivo for their impact on viral replication in purified CD4 T cells from healthy blood donors (n = 128). Thereafter, candidate alleles were assessed in vivo in a cohort of HIV-1-infected individuals (n = 851) not receiving potent antiretroviral therapy. As a benchmark, we tested 12 previously reported host genetic variants influencing HIV-1 infection as well as single nucleotide polymorphisms in the nine candidate genes. This led to the proposition of three alleles of PML, TSG101, and PPIA as potentially associated with differences in progression of HIV-1 disease. In a model considering the combined effects of new and previously reported gene variants, we estimated that their effect might be responsible for lengthening or shortening by up to 2.8 years the period from 500 CD4 T cells/μl to <200 CD4 T cells/μl.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: To provide a cost-effective tool for analysing pharmacogenetic markers in malaria treatment, DNA microarray technology was compared with sequencing of polymerase chain reaction (PCR) fragments for detecting single nucleotide polymorphisms (SNPs) in a larger number of samples. Methods: The microarray was developed to affordably generate SNP data for genes encoding the human cytochrome P450 enzyme family (CYP) and N-acetyltransferase-2 (NAT2), which are involved in antimalarial drug metabolism and have known polymorphisms, i.e. CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP3A4, CYP3A5, and NAT2. Results: For some SNPs, i.e. CYP2A6*2, CYP2B6*5, CYP2C8*3, CYP2C9*3/*5, CYP2C19*3, CYP2D6*4 and NAT2*6/*7/*14, agreement between both techniques ranged from substantial to almost perfect (kappa index between 0.61 and 1.00), whilst for other SNPs a large variability from slight to substantial agreement (kappa index between 0.39 and 1.00) was found, e.g. CYP2D6*17 (2850C>T), CYP3A4*1B and CYP3A5*3. Conclusion: The major limitation of the microarray technology for this purpose was a lack of robustness, with a large number of missing data or incorrect specificity.
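
The kappa index reported above measures agreement between two genotyping methods beyond what chance alone would produce. A minimal sketch of Cohen's kappa follows, assuming each method yields one genotype call per sample; the function name and the example calls are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(calls_a, calls_b):
    """Cohen's kappa for two equally long lists of categorical calls."""
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("both methods need the same, non-zero number of calls")
    n = len(calls_a)
    # Observed agreement: fraction of samples where both methods agree.
    observed = sum(a == b for a, b in zip(calls_a, calls_b)) / n
    # Chance agreement: product of the marginal frequencies, summed per genotype.
    freq_a = Counter(calls_a)
    freq_b = Counter(calls_b)
    expected = sum(freq_a[g] * freq_b[g] for g in freq_a) / (n * n)
    if expected == 1.0:
        # Both methods always return the same single genotype.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical genotype calls for six samples (illustration only).
microarray = ["CC", "CC", "CT", "TT", "CT", "CC"]
sequencing = ["CC", "CC", "CT", "TT", "CC", "CC"]
kappa = cohen_kappa(microarray, sequencing)
```

On the commonly used Landis and Koch scale, values from 0.61 to 0.80 are read as substantial agreement and values above 0.80 as almost perfect, which matches the wording of the abstract.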

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: The genome-wide identification of both morbid genes, i.e., those genes whose mutations cause hereditary human diseases, and druggable genes, i.e., genes coding for proteins whose modulation by small molecules elicits phenotypic effects, requires experimental approaches that are time-consuming and laborious. Thus, a computational approach which could accurately predict such genes on a genome-wide scale would be invaluable for accelerating the pace of discovery of causal relationships between genes and diseases as well as the determination of druggability of gene products. Results: In this paper we propose a machine learning-based computational approach to predict morbid and druggable genes on a genome-wide scale. For this purpose, we constructed a decision tree-based meta-classifier and trained it on datasets containing, for each morbid and druggable gene, network topological features, tissue expression profile and subcellular localization data as learning attributes. This meta-classifier correctly recovered 65% of known morbid genes with a precision of 66% and correctly recovered 78% of known druggable genes with a precision of 75%. It was then used to assign morbidity and druggability scores to genes not known to be morbid or druggable, and we showed a good match between these scores and literature data. Finally, we generated decision trees by training the J48 algorithm on the morbidity and druggability datasets to discover cellular rules for morbidity and druggability and, among the rules, we found that the number of regulating transcription factors and plasma membrane localization are the most important factors for morbidity and druggability, respectively. Conclusions: We were able to demonstrate that network topological features along with tissue expression profile and subcellular localization can reliably predict human morbid and druggable genes on a genome-wide scale. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing morbidity and druggability.
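
The two figures quoted for each classifier (fraction of known genes recovered, and precision) correspond to recall and precision. A minimal sketch of how they relate is shown below, assuming predictions and known genes are available as sets of identifiers; the function name and the gene identifiers are hypothetical placeholders, not data from the paper.

```python
def precision_recall(predicted, known):
    """Return (precision, recall) for a set of predicted gene identifiers."""
    predicted, known = set(predicted), set(known)
    true_positives = len(predicted & known)
    # Precision: share of predictions that are known positives.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of known positives that were recovered.
    recall = true_positives / len(known) if known else 0.0
    return precision, recall


# Placeholder identifiers (illustration only).
known_genes = {"g1", "g2", "g3", "g4"}
predicted_genes = {"g1", "g2", "g5"}
precision, recall = precision_recall(predicted_genes, known_genes)
```

Read this way, "recovered 65% of known morbid genes with a precision of 66%" means a recall of 0.65 and a precision of 0.66.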

Relevance:

100.00% 100.00%

Publisher:

Abstract:

From the late 1980s, the automation of sequencing techniques and the spread of computers gave rise to a flourishing number of new molecular structures and sequences and to a proliferation of new databases in which to store them. Three computational approaches are presented here that analyse the massive amount of publicly available data in order to answer important biological questions. The first strategy studies the incorrect assignment of the first AUG codon in a messenger RNA (mRNA) due to the incomplete determination of its 5' end sequence. An extension of the mRNA 5' coding region was identified in 477 human loci, out of all known human mRNAs analysed, using an automated expressed sequence tag (EST)-based approach. Proof-of-concept confirmation was obtained by in vitro cloning and sequencing for GNB2L1, QARS and TDP2, and the consequences for functional studies are discussed. The second approach analyses codon bias, the phenomenon in which distinct synonymous codons are used with different frequencies, and, following integration with a gene expression profile, estimates the total number of codons present across all the expressed mRNAs (named here the "codonome value") in a given biological condition. Systematic analyses across different pathological and normal human tissues and multiple species show a surprisingly tight correlation between the codon bias and the codonome bias. The third approach is used to study the expression of genes implicated in human autism spectrum disorder (ASD). ASD-implicated genes sharing microRNA response elements (MREs) for the same microRNA are co-expressed in brain samples from healthy and ASD-affected individuals. The differential expression of a recently identified long non-coding RNA that has four MREs for the same microRNA could disrupt the equilibrium in this network, but further analyses and experiments are needed.
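
One way to read the "codonome value" described above is as a codon count weighted by expression: each codon in a coding sequence is counted once per expressed copy of its mRNA. The sketch below follows that reading only; the exact definition used by the author is not given in the abstract, and the function names, toy sequences and expression levels are all hypothetical.

```python
from collections import Counter


def codon_counts(cds):
    """Count in-frame codons of a coding sequence, ignoring a trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


def codonome(cds_by_gene, expression_by_gene):
    """Expression-weighted codon totals across all genes (assumed definition)."""
    totals = Counter()
    for gene, cds in cds_by_gene.items():
        # Genes without an expression value contribute nothing.
        level = expression_by_gene.get(gene, 0)
        for codon, count in codon_counts(cds).items():
            totals[codon] += count * level
    return totals


# Toy coding sequences and expression levels (illustration only).
sequences = {"gene_1": "ATGAAATAA", "gene_2": "ATGTAA"}
expression = {"gene_1": 2, "gene_2": 3}
weighted = codonome(sequences, expression)
```

Comparing `weighted` with the unweighted sum of `codon_counts` over all genes is one simple way to contrast codon bias with the expression-weighted bias.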

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Somatic-cell hybrids have been shown to maintain the correct epigenetic chromatin states to study developmental globin gene expression as well as gene expression on the active and inactive X chromosomes. This suggests the potential use of somatic-cell hybrids containing either a maternal or a paternal human chromosome as a model system to study known imprinted genes and to identify as-yet-unknown imprinted genes. Testing gene expression by using reverse transcription followed by PCR, we show that functional imprints are maintained at four previously characterized 15q11–q13 loci in hybrids containing a single human chromosome 15 and at two chromosome 11p15 loci in hybrids containing a single chromosome 11. In contrast, three γ-aminobutyric acid type A receptor subunit genes in 15q12–q13 are nonimprinted. Furthermore, we have found that differential DNA methylation imprints at the SNRPN promoter and at a CpG island in 11p15 are also maintained in somatic-cell hybrids. Somatic-cell hybrids therefore are a valid and powerful system for studying known imprinted genes as well as for rapidly identifying new imprinted genes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The sequencing of the human genome has led to the identification of many genes whose functions remain to be determined. Because of conservation of genetic function, microbial systems have often been used for identification and characterization of human genes. We have investigated the use of the Escherichia coli SOS induction assay as a screen for yeast and human genes that might play a role in DNA metabolism and/or in genome stability. The SOS system has previously been used to analyze bacterial and viral genes that directly modify DNA. An initial screen of meiotically expressed yeast genes revealed several genes associated with chromosome metabolism (e.g., RAD51 and HHT1 as well as others). The SOS induction assay was then extended to the isolation of human genes. Several known human genes involved in DNA metabolism, such as the Ku70 end-binding protein and DNA ligase IV, were identified, as well as a large number of previously unknown genes. Thus, the SOS assay can be used to identify and characterize human genes, many of which may participate in chromosome metabolism.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Of the rules used by the splicing machinery to precisely determine intron–exon boundaries only a fraction is known. Recent evidence suggests that specific short sequences within exons help in defining these boundaries. Such sequences are known as exonic splicing enhancers (ESE). A possible bioinformatical approach to studying ESE sequences is to compare genes that harbor introns with genes that do not. For this purpose two non-redundant samples of 719 intron-containing and 63 intron-lacking human genes were created. We performed a statistical analysis on these datasets of intron-containing and intron-lacking human coding sequences and found a statistically significant difference (P = 0.01) between these samples in terms of 5–6mer oligonucleotide distributions. The difference is not created by a few strong signals present in the majority of exons, but rather by the accumulation of multiple weak signals through small variations in codon frequencies, codon biases and context-dependent codon biases between the samples. A list of putative novel human splicing regulation sequences has been elucidated by our analysis.
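
The comparison of 5–6mer oligonucleotide distributions between the two gene samples can be illustrated with a small k-mer frequency table. This is a minimal sketch under stated assumptions: it only counts overlapping k-mers and reports per-k-mer frequency differences, and it does not reproduce the statistical test used in the study. Function names and the toy sequences are hypothetical.

```python
from collections import Counter


def kmer_frequencies(sequences, k):
    """Relative frequency of every overlapping k-mer across a list of sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


def frequency_differences(freq_a, freq_b):
    """Per-k-mer difference in frequency (sample A minus sample B)."""
    kmers = set(freq_a) | set(freq_b)
    return {kmer: freq_a.get(kmer, 0.0) - freq_b.get(kmer, 0.0) for kmer in kmers}


# Toy sequences standing in for the two gene samples (illustration only).
with_introns = kmer_frequencies(["ATGATG"], 3)
without_introns = kmer_frequencies(["AAAA"], 3)
differences = frequency_differences(with_introns, without_introns)
```

For the analysis described in the abstract, `k` would be 5 or 6 and the inputs would be the coding sequences of the intron-containing and intron-lacking samples.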

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The base composition pattern (BCP) in the putative promoter regions (PPRs), up to 5 kb in length, of 682 human genes on Chromosome 22 (Chr22) was examined. Two-dimensional (2D) and three-dimensional (3D) functions were designed to delineate the DNA base composition, with four major patterns identified. It was found that 17.6% of genes include a TATA box, 28.0% a GC box, 18.9% a CAAT box and 38.4% CpG islands, and approximately 10% of genes have one of four putative initiator (Inr) motifs. The occurrence of the promoter elements is tightly associated with the base composition features in the promoter regions, and these associations mediate tissue-wide expression of the genes in humans. The occurrence of two or more promoter elements in the promoter regions is required for the medium- and wide-range expression profiles of the human genes on Chr22. Thus, the reported data shed light on the characteristics of the PPRs of the human genes on Chr22, which may improve our understanding of the regulatory roles of the PPRs and their promoter elements in gene expression.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Background: One of the main goals of cancer genetics is to identify the causative elements at the molecular level leading to cancer. Results: We have conducted an analysis of a set of genes known to be involved in cancer in order to unveil unique features that can assist in the identification of new candidate cancer genes. Conclusion: We have detected key patterns in this group of genes in terms of the molecular function or the biological process in which they are involved as well as sequence properties. Based on these features we have developed an accurate Bayesian classification model with which human genes have been scored for their likelihood of involvement in cancer.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Different signatures of natural selection persist over varying time scales in our genome, revealing possible episodes of adaptive evolution during human history. Here, we identify genes showing signatures of ancestral positive selection in the human lineage and investigate whether some of those genes have been evolving adaptively in extant human populations. Specifically, we compared more than 11,000 human genes with their orthologs in chimpanzee, mouse, rat and dog and applied a branch-site likelihood method to test for positive selection on the human lineage. Among the significant cases, a robust set of 11 genes was then further explored for signatures of recent positive selection using SNP data. We genotyped 223 SNPs in 39 worldwide populations from the HGDP Diversity panel and supplemented this information with available genotypes for up to 4,814 SNPs distributed along 2 Mb centered on each gene. After exploring the allele frequency spectrum, population differentiation and the maintenance of long unbroken haplotypes, we found signals of recent adaptive phenomena in only one of the 11 candidate gene regions. However, the signal of recent selection in this region may come from a different, neighbouring gene (CD5) rather than from the candidate gene itself (VPS37C). For this set of positively selected genes in the human lineage, we find no indication that these genes maintained their rapid evolutionary pace among human populations. Based on these data, it therefore appears that adaptation for human-specific and for population-specific traits may have involved different genes.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Background: RNAs transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression. However, the complement of human genes in which introns are transcribed, and the number of intronic transcriptional units and their tissue expression patterns are not known. Results: A survey of mRNA and EST public databases revealed more than 55,000 totally intronic noncoding (TIN) RNAs transcribed from the introns of 74% of all unique RefSeq genes. Guided by this information, we designed an oligoarray platform containing sense and antisense probes for each of 7,135 randomly selected TIN transcripts plus the corresponding protein-coding genes. We identified exonic and intronic tissue-specific expression signatures for human liver, prostate and kidney. The most highly expressed antisense TIN RNAs were transcribed from introns of protein-coding genes significantly enriched (p = 0.002 to 0.022) in the 'Regulation of transcription' Gene Ontology category. RNA polymerase II inhibition resulted in increased expression of a fraction of intronic RNAs in cell cultures, suggesting that other RNA polymerases may be involved in their biosynthesis. Members of a subset of intronic and protein-coding signatures transcribed from the same genomic loci have correlated expression patterns, suggesting that intronic RNAs regulate the abundance or the pattern of exon usage in protein-coding messages. Conclusion: We have identified diverse intronic RNA expression patterns, pointing to distinct regulatory roles. This gene-oriented approach, using a combined intron-exon oligoarray, should permit further comparative analysis of intronic transcription under various physiological and pathological conditions, thus advancing current knowledge about the biological functions of these noncoding RNAs.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Human x rodent somatic cell hybrids have played an important role in human genetics research. They have been especially useful for assigning genes to chromosomes and isolating DNA markers from specific regions of the human genome. By employing a combination of somatic cell genetic, recombinant DNA, and cytogenetic techniques, the human DNA excision repair gene ERCC4 was mapped regionally to human 16p13.13-13.2, even though the gene has not been cloned. Human x Chinese hamster ovary (CHO) cell hybrids selected for human ERCC4 activity and containing 16p13.1-p13.3 as the only human genetic material were identified. These hybrids were used to order DNA markers located in 16p13.1-p13.3. New DNA markers physically close to ERCC4 were isolated from such hybrids. Using amplified human DNA from the hybrids as a probe in fluorescent in situ hybridization, the short arm breakpoint in the chromosome 16 inversion associated with acute myelomonocytic leukemia (AMML) was found to be physically close to the ERCC4 gene. The physical mapping and, eventually, the cloning of the ERCC4 gene will benefit the understanding of the DNA repair system and the study of other important biomedical problems such as tumorigenesis. To facilitate the cloning of the ERCC4 gene and, in general, the cloning of genes from any defined region of the human genome, a method was developed for the direct isolation of human transcribed genes from somatic cell hybrids. cDNA was prepared from human x rodent hybrids by using consensus 5′ splice site sequences as primers. These primers were designed to select immature, unspliced messenger RNA (still retaining species-specific repeat sequences) as templates. Screening of a derived cDNA library for human repeat sequences resulted in the isolation of human clones at the anticipated frequency with characteristics expected of exons of transcribed human genes. The usefulness of the splice site-specific primers was analyzed and the cDNA synthesis conditions with these primers were optimized. The procedure was shown to be sensitive enough to clone weakly expressed genes. Studying the expression of the represented genes with the isolated clones was shown to be feasible. Such region-specific human gene fragments will be very valuable for many human genetic studies such as the search for inherited disease genes and the construction of a cDNA map of the human genome.