978 resultados para License agreements
Differences in the evolutionary history of disease genes affected by dominant or recessive mutations
Resumo:
Background: Global analyses of human disease genes by computational methods have yielded important advances in the understanding of human diseases. Generally these studies have treated the group of disease genes uniformly, thus ignoring the type of disease-causing mutations (dominant or recessive). In this report we present a comprehensive study of the evolutionary history of autosomal disease genes separated by mode of inheritance.Results: We examine differences in protein and coding sequence conservation between dominant and recessive human disease genes. Our analysis shows that disease genes affected by dominant mutations are more conserved than those affected by recessive mutations. This could be a consequence of the fact that recessive mutations remain hidden from selection while heterozygous. Furthermore, we employ functional annotation analysis and investigations into disease severity to support this hypothesis. Conclusion: This study elucidates important differences between dominantly- and recessively-acting disease genes in terms of protein and DNA sequence conservation, paralogy and essentiality. We propose that the division of disease genes by mode of inheritance will enhance both understanding of the disease process and prediction of candidate disease genes in the future.
Resumo:
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Resumo:
Drug-resistance and therapy failure due to drug-drug interactions are the main challenges in current treatment against Human Immunodeficiency Virus (HIV) infection. As such, there is a continuous need for the development of new and more potent anti-HIV drugs. Here we established a high-throughput screen based on the highly permissive TZM-bl cell line to identify novel HIV inhibitors. The assay allows discriminating compounds acting on early and/or late steps of the HIV replication cycle. The platform was used to screen a unique library of secondary metabolites derived from myxobacteria. Several hits with good anti-HIV profiles were identified. Five of the initial hits were tested for their antiviral potency. Four myxobacterial compounds, sulfangolid C, soraphen F, epothilon D and spirangien B, showed EC50 values in the nM range with SI > 15. Interestingly, we found a high amount of overlapping hits compared with a previous screen for Hepatitis C Virus (HCV) using the same library. The unique structures and mode-of-actions of these natural compounds make myxobacteria an attractive source of chemicals for the development of broad-spectrum antivirals. Further biological and structural studies of our initial hits might help recognize smaller drug-like derivatives that in turn could be synthesized and further optimized.
Resumo:
Cells are subjected to dramatic changes of gene expression upon environmental changes. Stresscauses a general down-regulation of gene expression together with the induction of a set of stress-responsivegenes. The p38-related stress-activated protein kinase Hog1 is an important regulator of transcription uponosmostress in yeast. Genome-wide localization studies of RNA polymerase II (RNA Pol II) and Hog1 showed that stress induced major changes in RNA Pol II localization, with a shift toward stress-responsive genes relative to housekeeping genes. RNA Pol II relocalization required Hog1, which was also localized to stress-responsive loci. In addition to RNA Pol II-bound genes, Hog1 also localized to RNA polymerase III-bound genes, pointing to a wider role for Hog1 in transcriptional control than initially expected. Interestingly, an increasing association of Hog1 with stressresponsive genes was strongly correlated with chromatin remodeling and increased gene expression. Remarkably, MNase-Seq analysis showed that although chromatin structure was not significantly altered at a genome-wide level in response to stress, there was pronounced chromatin remodeling for those genes that displayed Hog1 association. Hog1 serves to bypass the general down-regulation of gene expression that occurs in response to osmostress, and does so both by targeting RNA Pol II machinery and by inducing chromatin remodeling at stressresponsive loci.
Resumo:
Low-complexity regions (LCRs) in proteins are tracts that are highly enriched in one or a few aminoacids. Given their high abundance, and their capacity to expand in relatively short periods of time through replication slippage, they can greatly contribute to increase protein sequence space and generate novel protein functions. However, little is known about the global impact of LCRs on protein evolution. We have traced back the evolutionary history of 2,802 LCRs from a large set of homologous protein families from H.sapiens, M.musculus, G.gallus, D.rerio and C.intestinalis. Transcriptional factors and other regulatory functions are overrepresented in proteins containing LCRs. We have found that the gain of novel LCRs is frequently associated with repeat expansion whereas the loss of LCRs is more often due to accumulation of amino acid substitutions as opposed to deletions. This dichotomy results in net protein sequence gain over time. We have detected a significant increase in the rate of accumulation of novel LCRs in the ancestral Amniota and mammalian branches, and a reduction in the chicken branch. Alanine and/or glycine-rich LCRs are overrepresented in recently emerged LCR sets from all branches, suggesting that their expansion is better tolerated than for other LCR types. LCRs enriched in positively charged amino acids show the contrary pattern, indicating an important effect of purifying selection in their maintenance. We have performed the first large-scale study on the evolutionary dynamics of LCRs in protein families. The study has shown that the composition of an LCR is an important determinant of its evolutionary pattern.
Resumo:
The repair process of damaged tissue involves the coordinated activities of several cell types in response to local and systemic signals. Following acute tissue injury, infiltrating inflammatory cells and resident stem cells orchestrate their activities to restore tissue homeostasis. However, during chronic tissue damage, such as in muscular dystrophies, the inflammatory-cell infiltration and fibroblast activation persists, while the reparative capacity of stem cells (satellite cells) is attenuated. Abnormal dystrophic muscle repair and its end stage, fibrosis, represent the final common pathway of virtually all chronic neurodegenerative muscular diseases. As our understanding of the pathogenesis of muscle fibrosis has progressed, it has become evident that the muscle provides a useful model for the regulation of tissue repair by the local microenvironment, showing interplay among muscle-specific stem cells, inflammatory cells, fibroblasts and extracellular matrix components of the mammalian wound-healing response. This article reviews the emerging findings of the mechanisms that underlie normal versus aberrant muscle-tissue repair.
Resumo:
Drug safety issues pose serious health threats to the population and constitute a major cause of mortality worldwide. Due to the prominent implications to both public health and the pharmaceutical industry, it is of great importance to unravel the molecular mechanisms by which an adverse drug reaction can be potentially elicited. These mechanisms can be investigated by placing the pharmaco-epidemiologically detected adverse drug reaction in an information-rich context and by exploiting all currently available biomedical knowledge to substantiate it. We present a computational framework for the biological annotation of potential adverse drug reactions. First, the proposed framework investigates previous evidences on the drug-event association in the context of biomedical literature (signal filtering). Then, it seeks to provide a biological explanation (signal substantiation) by exploring mechanistic connections that might explain why a drug produces a specific adverse reaction. The mechanistic connections include the activity of the drug, related compounds and drug metabolites on protein targets, the association of protein targets to clinical events, and the annotation of proteins (both protein targets and proteins associated with clinical events) to biological pathways. Hence, the workflows for signal filtering and substantiation integrate modules for literature and database mining, in silico drug-target profiling, and analyses based on gene-disease networks and biological pathways. Application examples of these workflows carried out on selected cases of drug safety signals are discussed. The methodology and workflows presented offer a novel approach to explore the molecular mechanisms underlying adverse drug reactions
Resumo:
AbstractBACKGROUND: Scientists have been trying to understand the molecular mechanisms of diseases to design preventive and therapeutic strategies for a long time. For some diseases, it has become evident that it is not enough to obtain a catalogue of the disease-related genes but to uncover how disruptions of molecular networks in the cell give rise to disease phenotypes. Moreover, with the unprecedented wealth of information available, even obtaining such catalogue is extremely difficult.PRINCIPAL FINDINGS: We developed a comprehensive gene-disease association database by integrating associations from several sources that cover different biomedical aspects of diseases. In particular, we focus on the current knowledge of human genetic diseases including mendelian, complex and environmental diseases. To assess the concept of modularity of human diseases, we performed a systematic study of the emergent properties of human gene-disease networks by means of network topology and functional annotation analysis. The results indicate a highly shared genetic origin of human diseases and show that for most diseases, including mendelian, complex and environmental diseases, functional modules exist. Moreover, a core set of biological pathways is found to be associated with most human diseases. We obtained similar results when studying clusters of diseases, suggesting that related diseases might arise due to dysfunction of common biological processes in the cell.CONCLUSIONS: For the first time, we include mendelian, complex and environmental diseases in an integrated gene-disease association database and show that the concept of modularity applies for all of them. We furthermore provide a functional analysis of disease-related modules providing important new biological insights, which might not be discovered when considering each of the gene-disease association repositories independently. Hence, we present a suitable framework for the study of how genetic and environmental factors, such as drugs, contribute to diseases.AVAILABILITY: The gene-disease networks used in this study and part of the analysis are available at http://ibi.imim.es/DisGeNET/DisGeNETweb.html#Download
Resumo:
Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes.
Resumo:
The gibbon genome exhibits extensive karyotypic diversity with an increased rate of chromosomal rearrangements during evolution. In an effort to understand the mechanistic origin and implications of these rearrangement events, we sequenced 24 synteny breakpoint regions in the white-cheeked gibbon (Nomascus leucogenys, NLE) in the form of high-quality BAC insert sequences (4.2 Mbp). While there is a significant deficit of breakpoints in genes, we identified seven human gene structures involved in signaling pathways (DEPDC4, GNG10), phospholipid metabolism (ENPP5, PLSCR2), beta-oxidation (ECH1), cellular structure and transport (HEATR4), and transcription (ZNF461), that have been disrupted in the NLE gibbon lineage. Notably, only three of these genes show the expected evolutionary signatures of pseudogenization. Sequence analysis of the breakpoints suggested both nonclassical nonhomologous end-joining (NHEJ) and replication-based mechanisms of rearrangement. A substantial number (11/24) of human-NLE gibbon breakpoints showed new insertions of gibbon-specific repeats and mosaic structures formed from disparate sequences including segmental duplications, LINE, SINE, and LTR elements. Analysis of these sites provides a model for a replication-dependent repair mechanism for double-strand breaks (DSBs) at rearrangement sites and insights into the structure and formation of primate segmental duplications at sites of genomic rearrangements during evolution.
Resumo:
A raga is a collective melodic expression consisting of motifs. A raga can be identified using motifs which areunique to it. Motifs can be thought of as signature prosodic phrases. Different ragas may be composed of the same setof notes, or even phrases, but the prosody may be completely different. In this paper, an attempt is made to determinethe characteristic motifs that enable identification of a raga and distinguish between them. To determine this, motifs are first manually marked for a set of five popular raga by a professional musician. The motifs are then normalisedwith respect to the tonic. HMMs are trained for each motif using 80% of the data and about 20% are used for testing. The results do indicate that about 80% of the motifs are identified as belonging to a specific raga accurately.
Resumo:
In this paper, we describe several techniques for detecting tonic pitch value in Indian classical music. In Indian music, the raga is the basic melodic framework and it is built on the tonic. Tonic detection is therefore fundamental for any melodic analysis in Indian classical music. This workexplores detection of tonic by processing the pitch histograms of Indian classic music. Processing of pitch histograms using group delay functions and its ability to amplify certain traits of Indian music in the pitch histogram, is discussed. Three different strategies to detect tonic, namely, the concert method, the template matching and segmented histogram method are proposed. The concert method exploits the fact that the tonic is constant over a piece/concert.templatematchingmethod and segmented histogrammethodsuse the properties: (i) the tonic is always present in the background, (ii) some notes are less inflected and dominant, to detect the tonic of individual pieces. All the three methods yield good results for Carnatic music (90−100% accuracy), while for Hindustanimusic, the templatemethod works best, provided the v¯adi samv¯adi notes for a given piece are known (85%).
Resumo:
In this paper we investigate how note onsets in Turkish Makam music compositions are distributed, and in how far this distribution supports or contradicts the metrical structure of the pieces, the usul. We use MIDI data to derive the distributions in the form of onset histograms, and comparethem with metrical weights that are applied to describe the usul in theory. We compute correlation and syncopation values to estimate the degrees of support and contradiction, respectively. While the concept of syncopation is rarelymentioned in the context of this music, we can gain interesting insight into the structure of a piece using such a measure.We show that metrical contradiction is systematically applied in some metrical structures. We will compare thedifferences between Western music and Turkish Makam music regarding metrical support and contradiction. Such a study can help avoiding pitfalls in later attempts to perform audio processing tasks such as beat tracking or rhythmic similarity measurements.
Resumo:
We introduce a width parameter that bounds the complexity of classical planning problems and domains, along with a simple but effective blind-search procedure that runs in time that is exponential in the problem width. We show that many benchmark domains have a bounded and small width provided thatgoals are restricted to single atoms, and hence that such problems are provably solvable in low polynomial time. We then focus on the practical value of these ideas over the existing benchmarks which feature conjunctive goals. We show that the blind-search procedure can be used for both serializing the goal into subgoals and for solving the resulting problems, resulting in a ‘blind’ planner that competes well with a best-first search planner guided by state-of-the-art heuristics. In addition, ideas like helpful actions and landmarks can be integrated as well, producing a planner with state-of-the-art performance.
Resumo:
The rules and regulations for operating a motored vehicles in Iowa.