986 resultados para SEQUENCE TYPES
Resumo:
The recent availability of the chicken genome sequence poses the question of whether there are human protein-coding genes conserved in chicken that are currently not included in the human gene catalog. Here, we show, using comparative gene finding followed by experimental verification of exon pairs by RT–PCR, that the addition to the multi-exonic subset of this catalog could be as little as 0.2%, suggesting that we may be closing in on the human gene set. Our protocol, however, has two shortcomings: (i) the bioinformatic screening of the predicted genes, applied to filter out false positives, cannot handle intronless genes; and (ii) the experimental verification could fail to identify expression at a specific developmental time. This highlights the importance of developing methods that could provide a reliable estimate of the number of these two types of genes.
Resumo:
Background: Single nucleotide polymorphisms (SNPs) are the most frequent type of sequence variation between individuals, and represent a promising tool for finding genetic determinants of complex diseases and understanding the differences in drug response. In this regard, it is of particular interest to study the effect of non-synonymous SNPs in the context of biological networks such as cell signalling pathways. UniProt provides curated information about the functional and phenotypic effects of sequence variation, including SNPs, as well as on mutations of protein sequences. However, no strategy has been developed to integrate this information with biological networks, with the ultimate goal of studying the impact of the functional effect of SNPs in the structure and dynamics of biological networks. Results: First, we identified the different challenges posed by the integration of the phenotypic effect of sequence variants and mutations with biological networks. Second, we developed a strategy for the combination of data extracted from public resources, such as UniProt, NCBI dbSNP, Reactome and BioModels. We generated attribute files containing phenotypic and genotypic annotations to the nodes of biological networks, which can be imported into network visualization tools such as Cytoscape. These resources allow the mapping and visualization of mutations and natural variations of human proteins and their phenotypic effect on biological networks (e.g. signalling pathways, protein-protein interaction networks, dynamic models). Finally, an example on the use of the sequence variation data in the dynamics of a network model is presented. Conclusion: In this paper we present a general strategy for the integration of pathway and sequence variation data for visualization, analysis and modelling purposes, including the study of the functional impact of protein sequence variations on the dynamics of signalling pathways. This is of particular interest when the SNP or mutation is known to be associated to disease. We expect that this approach will help in the study of the functional impact of disease-associated SNPs on the behaviour of cell signalling pathways, which ultimately will lead to a better understanding of the mechanisms underlying complex diseases.
Resumo:
Background: A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. Results: The method has been tested against the PSI-BLAST program using a set of 3,890 protein sequences from which interaction data was available. For protein sequences that align with at least 40% sequence identity to a known enzyme, the specificity of our method in predicting the first three EC digits increased from 80% to 90% at 80% coverage when compared to PSI-BLAST. Conclusion: Our method can also be used in proteins for which homologous sequences with known interacting partners can be detected. Thus, our method could increase 10% the specificity of genome-wide enzyme predictions based on sequence matching by PSI-BLAST alone.
Resumo:
Background: Single Nucleotide Polymorphisms, among other type of sequence variants, constitute key elements in genetic epidemiology and pharmacogenomics. While sequence data about genetic variation is found at databases such as dbSNP, clues about the functional and phenotypic consequences of the variations are generally found in biomedical literature. The identification of the relevant documents and the extraction of the information from them are hampered by the large size of literature databases and the lack of widely accepted standard notation for biomedical entities. Thus, automatic systems for the identification of citations of allelic variants of genes in biomedical texts are required. Results: Our group has previously reported the development of OSIRIS, a system aimed at the retrieval of literature about allelic variants of genes http://ibi.imim.es/osirisform.html. Here we describe the development of a new version of OSIRIS (OSIRISv1.2, http://ibi.imim.es/OSIRISv1.2.html webcite) which incorporates a new entity recognition module and is built on top of a local mirror of the MEDLINE collection and HgenetInfoDB: a database that collects data on human gene sequence variations. The new entity recognition module is based on a pattern-based search algorithm for the identification of variation terms in the texts and their mapping to dbSNP identifiers. The performance of OSIRISv1.2 was evaluated on a manually annotated corpus, resulting in 99% precision, 82% recall, and an F-score of 0.89. As an example, the application of the system for collecting literature citations for the allelic variants of genes related to the diseases intracranial aneurysm and breast cancer is presented. Conclusion: OSIRISv1.2 can be used to link literature references to dbSNP database entries with high accuracy, and therefore is suitable for collecting current knowledge on gene sequence variations and supporting the functional annotation of variation databases. The application of OSIRISv1.2 in combination with controlled vocabularies like MeSH provides a way to identify associations of biomedical interest, such as those that relate SNPs with diseases.
Resumo:
High Resolution Magic Angle Spinning (HR-MAS) NMR allows metabolic characterization of biopsies. HR-MAS spectra from tissues of most organs show strong lipid contributions that are overlapping metabolite regions, which hamper metabolite estimation. Metabolite quantification and analysis would benefit from a separation of lipids and small metabolites. Generally, a relaxation filter is used to reduce lipid contributions. However, the strong relaxation filter required to eliminate most of the lipids also reduces the signals for small metabolites. The aim of our study was therefore to investigate different diffusion editing techniques in order to employ diffusion differences for separating lipid and small metabolite contributions in the spectra from different organs for unbiased metabonomic analysis. Thus, 1D and 2D diffusion measurements were performed, and pure lipid spectra that were obtained at strong diffusion weighting (DW) were subtracted from those obtained at low DW, which include both small metabolites and lipids. This subtraction yielded almost lipid free small metabolite spectra from muscle tissue. Further improved separation was obtained by combining a 1D diffusion sequence with a T2-filter, with the subtraction method eliminating residual lipids from the spectra. Similar results obtained for biopsies of different organs suggest that this method is applicable in various tissue types. The elimination of lipids from HR-MAS spectra and the resulting less biased assessment of small metabolites have potential to remove ambiguities in the interpretation of metabonomic results. This is demonstrated in a reproducibility study on biopsies from human muscle.
Resumo:
Shrews of the genus Sorex are characterized by a Holarctic distribution, and relationships among extant taxa have never been fully resolved. Phylogenies have been proposed based on morphological, karyological, and biochemical comparisons, but these analyses often produced controversial and contradictory results. Phylogenetic analyses of partial mitochondrial cytochrome b gene sequences (1011 bp) were used to examine the relationships among 27 Sorex species. The molecular data suggest that Sorex comprises two major monophyletic lineages, one restricted mostly to the New World and one with a primarily Palearctic distribution. Furthermore, several sister-species relationships are revealed by the analysis. Based on the split between the Soricinae and Crocidurinae subfamilies, we used a 95% confidence interval for both the calibration of a molecular clock and the subsequent calculation of major diversification events within the genus Sorex. Our analysis does not support an unambiguous acceleration of the molecular clock in shrews, the estimated rate being similar to other estimates of mammalian mitochondrial clocks. In addition, the data presented here indicate that estimates from the fossil record greatly underestimate divergence dates among Sorex taxa.
Resumo:
Donateur : Yakchitch, Vladimir (18..-19..? ; Statisticien)
Resumo:
The intensity of parental investments in child care time is expected to vary across families with different norms and time-constraints. Additionally, it should also differ across countries, since the abilities of parents to harmonize family and work vary by national context. In our opinion, however, this question remains inconclusive for two main reasons: 1) only some countries have been studied from a comparative approach; 2) previous studies have not paid enough attention to the analysis of how the conditional effects of education and employment affect parental investments.In this paper we used nationally representative time-use data from Denmark, Flanders, Spain and the United Kingdom (N=4,031) to explore how employment and education predict variations in child care time. IN Britain and Spain employment has a strong negative effect on fathers’ child care, but a weaker one in Flanders and particularly in Denmark. In contrast, maternal employment has a strong negative impact in all four countries. Education increases child care time significantly only among Spanish mothers and fathers, as well as British mothers. Nonetheless, we find that college-educated mothers under similar time-constraints increase substantially their expected child care time in Britain, Flanders and Spain; for fathers we find a more mixed picture. Routine child care activities are more sensitive to both maternal and paternal employment than interactive child care activities. Finally, we observe that working a public sector job generally increases a total time allocated to parental care, controlling for several demographic and socioeconomic variables.
Resumo:
The bacterial insertion sequence IS21 shares with many insertion sequences a two-step, reactive junction transposition pathway, for which a model is presented in this review: a reactive junction with abutted inverted repeats is first formed and subsequently integrated into the target DNA. The reactive junction occurs in IS21-IS21 tandems and IS21 minicircles. In addition, IS21 shows a unique specialization of transposition functions. By alternative translation initiation, the transposase gene codes for two products: the transposase, capable of promoting both steps of the reactive junction pathway, and the cointegrase, which only promotes the integration of reactive junctions but with higher efficiency. This review also includes a survey of the IS21 family and speculates on the possibility that other members present a similar transpositional specialization.
Resumo:
Recent population genetic studies suggest that staphylococcal cassette chromosome mec (SCCmec) was acquired much more frequently than previously thought. In the present study, we aimed to investigate the diversity of SCCmec elements in a local methicillin-resistant Staphylococcus aureus (MRSA) population. Each MRSA isolate (one per patient) recovered in the Vaud canton of Switzerland from January 2005 to December 2008 was analyzed by the double-locus sequence typing (DLST) method and SCCmec typing. DLST analysis indicated that 1,884/2,036 isolates (92.5%) belong to four predominant clones. As expected from the local spread of a clone, most isolates within clones harbored an identical SCCmec type. However, three to seven SCCmec types have been recovered in every predominant DLST clone, suggesting that some of these elements might have been acquired locally. This pattern could also be explained by distinct importations of related isolates into the study region. The addition of a third highly variable locus to further increase the discriminatory power of typing as well as epidemiological data suggested that most ambiguous situations were explained by the second hypothesis. In conclusion, our study showed that even if the acquisition of new SCCmec elements at a local level likely occurs, it does not explain all the diversity observed in the study region.
Resumo:
Epidemiological processes leave a fingerprint in the pattern of genetic structure of virus populations. Here, we provide a new method to infer epidemiological parameters directly from viral sequence data. The method is based on phylogenetic analysis using a birth-death model (BDM) rather than the commonly used coalescent as the model for the epidemiological transmission of the pathogen. Using the BDM has the advantage that transmission and death rates are estimated independently and therefore enables for the first time the estimation of the basic reproductive number of the pathogen using only sequence data, without further assumptions like the average duration of infection. We apply the method to genetic data of the HIV-1 epidemic in Switzerland.
Resumo:
In contrast with mammals and birds, most poikilothermic vertebrates feature structurally undifferentiated sex chromosomes, which may result either from frequent turnovers, or from occasional events of XY recombination. The latter mechanism was recently suggested to be responsible for sex-chromosome homomorphy in European tree frogs (Hyla arborea). However, no single case of male recombination has been identified in large-scale laboratory crosses, and populations from NW Europe consistently display sex-specific allelic frequencies with male-diagnostic alleles, suggesting the absence of recombination in their recent history. To address this apparent paradox, we extended the phylogeographic scope of investigations, by analyzing the sequences of three sex-linked markers throughout the whole species distribution. Refugial populations (southern Balkans and Adriatic coast) show a mix of X and Y alleles in haplotypic networks, and no more within-individual pairwise nucleotide differences in males than in females, testifying to recurrent XY recombination. In contrast, populations of NW Europe, which originated from a recent postglacial expansion, show a clear pattern of XY differentiation; the X and Y gametologs of the sex-linked gene Med15 present different alleles, likely fixed by drift on the front wave of expansions, and kept differentiated since. Our results support the view that sex-chromosome homomorphy in H. arborea is maintained by occasional or historical events of recombination; whether the frequency of these events indeed differs between populations remains to be clarified.