65 results for sequence identity
Abstract:
A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a 'coding statistic' is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data, and the method is applied to the yeast chromosome III sequence and to C. elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.
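The decomposition step described in this abstract can be illustrated with a small sketch. It fits a two-component normal mixture by expectation-maximisation, which is a standard technique and not necessarily the fitting procedure the authors used. The function names, the synthetic data, and the assumption that coding windows score higher on the statistic are all hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_normals(values, iterations=200):
    """EM fit of a two-component normal mixture.

    Returns two (weight, mean, sd) tuples sorted by mean. Assumption: the
    component with the larger mean corresponds to coding windows.
    """
    n = len(values)
    ordered = sorted(values)
    half = n // 2
    # Initialise the means from the lower and upper halves of the data.
    means = [sum(ordered[:half]) / half, sum(ordered[half:]) / (n - half)]
    grand = sum(values) / n
    spread = math.sqrt(sum((v - grand) ** 2 for v in values) / n) or 1.0
    sds = [spread, spread]
    weights = [0.5, 0.5]
    for _ in range(iterations):
        # E-step: probability that each window belongs to component 1.
        resp = []
        for v in values:
            p0 = weights[0] * normal_pdf(v, means[0], sds[0])
            p1 = weights[1] * normal_pdf(v, means[1], sds[1])
            total = p0 + p1
            resp.append(p1 / total if total > 0.0 else 0.5)
        n1 = sum(resp)
        n0 = n - n1
        if n0 < 1e-9 or n1 < 1e-9:
            break  # one component has vanished; keep the last estimate
        # M-step: re-estimate means, standard deviations and weights.
        mean0 = sum((1.0 - r) * v for r, v in zip(resp, values)) / n0
        mean1 = sum(r * v for r, v in zip(resp, values)) / n1
        var0 = sum((1.0 - r) * (v - mean0) ** 2 for r, v in zip(resp, values)) / n0
        var1 = sum(r * (v - mean1) ** 2 for r, v in zip(resp, values)) / n1
        means = [mean0, mean1]
        sds = [max(math.sqrt(var0), 1e-6), max(math.sqrt(var1), 1e-6)]
        weights = [n0 / n, n1 / n]
    return sorted(zip(weights, means, sds), key=lambda c: c[1])


def simulate_windows(n_noncoding, n_coding, seed=0):
    """Synthetic coding-statistic values (hypothetical, for illustration only)."""
    rng = random.Random(seed)
    noncoding = [rng.gauss(0.0, 1.0) for _ in range(n_noncoding)]
    coding = [rng.gauss(5.0, 1.0) for _ in range(n_coding)]
    return noncoding + coding


if __name__ == "__main__":
    noncoding_fit, coding_fit = fit_two_normals(simulate_windows(700, 300))
    print("estimated coding fraction of windows:", round(coding_fit[0], 3))
```

The weight of the higher-mean component serves as the estimate of the fraction of coding windows. On real data the two distributions would overlap more than in this synthetic case, so the estimate would carry a larger error.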
Abstract:
Background: Single nucleotide polymorphisms (SNPs) are the most frequent type of sequence variation between individuals, and represent a promising tool for finding genetic determinants of complex diseases and understanding the differences in drug response. In this regard, it is of particular interest to study the effect of non-synonymous SNPs in the context of biological networks such as cell signalling pathways. UniProt provides curated information about the functional and phenotypic effects of sequence variation, including SNPs, as well as on mutations of protein sequences. However, no strategy has been developed to integrate this information with biological networks, with the ultimate goal of studying the impact of the functional effect of SNPs in the structure and dynamics of biological networks. Results: First, we identified the different challenges posed by the integration of the phenotypic effect of sequence variants and mutations with biological networks. Second, we developed a strategy for the combination of data extracted from public resources, such as UniProt, NCBI dbSNP, Reactome and BioModels. We generated attribute files containing phenotypic and genotypic annotations to the nodes of biological networks, which can be imported into network visualization tools such as Cytoscape. These resources allow the mapping and visualization of mutations and natural variations of human proteins and their phenotypic effect on biological networks (e.g. signalling pathways, protein-protein interaction networks, dynamic models). Finally, an example on the use of the sequence variation data in the dynamics of a network model is presented. Conclusion: In this paper we present a general strategy for the integration of pathway and sequence variation data for visualization, analysis and modelling purposes, including the study of the functional impact of protein sequence variations on the dynamics of signalling pathways. 
This is of particular interest when the SNP or mutation is known to be associated with disease. We expect that this approach will help in the study of the functional impact of disease-associated SNPs on the behaviour of cell signalling pathways, which ultimately will lead to a better understanding of the mechanisms underlying complex diseases.
Abstract:
Background: Single Nucleotide Polymorphisms, among other types of sequence variants, constitute key elements in genetic epidemiology and pharmacogenomics. While sequence data about genetic variation are found in databases such as dbSNP, clues about the functional and phenotypic consequences of the variations are generally found in the biomedical literature. The identification of the relevant documents and the extraction of the information from them are hampered by the large size of literature databases and the lack of a widely accepted standard notation for biomedical entities. Thus, automatic systems for the identification of citations of allelic variants of genes in biomedical texts are required. Results: Our group has previously reported the development of OSIRIS, a system aimed at the retrieval of literature about allelic variants of genes (http://ibi.imim.es/osirisform.html). Here we describe the development of a new version of OSIRIS (OSIRISv1.2, http://ibi.imim.es/OSIRISv1.2.html) which incorporates a new entity recognition module and is built on top of a local mirror of the MEDLINE collection and HgenetInfoDB, a database that collects data on human gene sequence variations. The new entity recognition module is based on a pattern-based search algorithm for the identification of variation terms in the texts and their mapping to dbSNP identifiers. The performance of OSIRISv1.2 was evaluated on a manually annotated corpus, resulting in 99% precision, 82% recall, and an F-score of 0.89. As an example, the application of the system for collecting literature citations for the allelic variants of genes related to the diseases intracranial aneurysm and breast cancer is presented. Conclusion: OSIRISv1.2 can be used to link literature references to dbSNP database entries with high accuracy, and therefore is suitable for collecting current knowledge on gene sequence variations and supporting the functional annotation of variation databases. 
The application of OSIRISv1.2 in combination with controlled vocabularies like MeSH provides a way to identify associations of biomedical interest, such as those that relate SNPs with diseases.
Abstract:
The core objective of this research was to design an operational tool for place brand analysis. By modelling the emotional significance and the deeper-lying symbols associated with a specific place identity, I expected to create a semiotic tool that could be applied, mutatis mutandis, to other similar place brands. As a field case study for developing the instrument, I chose the city of Barcelona, capital of the Autonomous Community of Catalonia, Spain. Barcelona's brand identity was approached along the lines of the Chicago School of urban anthropology. The research methods followed the prescriptions of urban anthropology, namely qualitative methods: in-depth interviews and discourse analysis. The final research outcome was a model summarizing a range of specific emotional values that help a place brand to position itself in the collective mindset and to assume a positively valued status and identity in the world order.
Abstract:
This paper lays down a theoretical framework for further research on how the identity of young Slovenian and Catalan users is formed within the social networking website Facebook. The author pursues this interest by observing how communication, and thus interaction, between users is changing and how this is reflected in everyday practices. In so doing he tries to identify the connections between the individual, society and technology, as these are more and more interwoven, and we cannot think of one without the other in the contemporary globalised world.
Abstract:
In today's competitive markets, the importance of good scheduling strategies in manufacturing companies leads to the need to develop efficient methods for solving complex scheduling problems. In this paper, we studied two production scheduling problems with sequence-dependent setup times. Setup times are one of the most common complications in scheduling problems, and are usually associated with cleaning operations and with changing tools and shapes in machines. The first problem considered is a single-machine scheduling problem with release dates, sequence-dependent setup times and delivery times. The performance measure is the maximum lateness. The second problem is a job-shop scheduling problem with sequence-dependent setup times where the objective is to minimize the makespan. We present several priority dispatching rules for both problems, followed by a study of their performance. Finally, conclusions and directions for future research are presented.
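As an illustration of what a priority dispatching rule for the first problem might look like, the sketch below schedules a single machine greedily. The rule shown (pick the released job with the largest delivery time, breaking ties by smallest setup) is a hypothetical example in the spirit of Schrage-type heuristics, not one of the rules studied in the paper. It also assumes that a setup starts only once the job has been released, and that the objective is the maximum of completion time plus delivery time. The `Job` class and the `setup` dictionary are illustrative names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    name: str
    release: int     # earliest time the job may start
    processing: int  # time on the machine
    delivery: int    # time needed after processing to deliver the job


def dispatch(jobs, setup):
    """Greedy single-machine schedule with sequence-dependent setups.

    setup[(prev_name, next_name)] is the setup time when next_name follows
    prev_name; prev_name is None for the first job. Missing entries count as 0.
    Returns (order of job names, max over jobs of completion + delivery).
    """
    remaining = list(jobs)
    clock = 0
    prev = None
    order = []
    objective = 0
    while remaining:
        ready = [j for j in remaining if j.release <= clock]
        if not ready:
            # Machine idles until the next job is released.
            clock = min(j.release for j in remaining)
            continue
        # Priority: largest delivery time first, then smallest setup time.
        chosen = max(
            ready,
            key=lambda j: (j.delivery, -setup.get((prev, j.name), 0)),
        )
        clock += setup.get((prev, chosen.name), 0) + chosen.processing
        objective = max(objective, clock + chosen.delivery)
        order.append(chosen.name)
        remaining.remove(chosen)
        prev = chosen.name
    return order, objective


if __name__ == "__main__":
    jobs = [Job("A", 0, 3, 2), Job("B", 0, 2, 5), Job("C", 4, 1, 6)]
    setup = {("A", "B"): 1, ("B", "A"): 2, ("A", "C"): 1,
             ("C", "A"): 1, ("B", "C"): 3, ("C", "B"): 1}
    print(dispatch(jobs, setup))
```

A dispatching rule of this kind makes one pass over the jobs and gives no optimality guarantee, which is why such rules are usually compared empirically.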
Abstract:
In 1952 F. Riesz and Sz.-Nágy published an example of a monotonic continuous function whose derivative is zero almost everywhere, that is to say, a singular function. Moreover, the function was strictly increasing. Their example was built as the limit of a sequence of deformations of the identity function. As an easy consequence of the definition, the derivative, when it existed and was finite, was found to be zero. In this paper we revisit the Riesz-Nágy family of functions and relate it to a system for real number representation which we call (t, t-1) expansions. With the help of these real number expansions we generalize the family. The singularity of the functions is proved through some metrical properties of the expansions used in their definition, which also allow us to determine more precisely when the derivative is 0 or infinity.
Abstract:
Description of the stratigraphic sequence and the palaeoenvironmental records of the Holocene sediments of Sant Julià de Boada
Abstract:
Globalization calls into question the existence of a mimetic relationship between citizenship and the nation-state. Homogeneous identities, ideologically grounded in notions such as "national language", pose problems in societies where linguistic and identity diversity has grown dramatically. Catalonia is a territory in which part of the population asserts a Catalan identity distinct from the Spanish one, and vice versa. Moreover, it has been theorized that Catalan identity and the Catalan language go hand in hand. For that reason, some voices defend the presence of Catalan in school education as a source of Catalan national identity, while others defend its presence simply as a good way to learn Catalan when it cannot be learned in the social and family environment. In recent years, Catalonia has received almost one million foreigners, who have markedly changed its sociolinguistic situation. The latest surveys show that 6.3% of the population habitually uses a language other than Catalan or Spanish. Within this framework, we present the identity constructions of a group of adolescents of foreign origin who are in the second cycle of ESO (compulsory secondary education). The data were collected through two discussion groups of six to seven students who differ in origin, own language and length of residence in Catalonia. The results show the importance of place of origin in the construction of identity. In addition, the participants who report Catalan or Spanish feelings relate them not to language but to the social exchanges they have established with their peers of national origin. The contributions also show the difficulties of promoting, from the school context, multiple identities that avoid racist and xenophobic attitudes and serve to promote collective projects for the future in which people can live with a certain degree of difference.
Abstract:
We present Strömgren uvby and Hβ photometry for a set of 575 northern main sequence A type stars, most of them belonging to the Hipparcos Input Catalogue, with V from 5 mag to 10 mag and with known radial velocities. These observations enlarge the catalogue we began to compile some years ago to more than 1500 stars. Our catalogue includes kinematic and astrophysical data for each star. Our future goal is to perform an accurate analysis of the kinematical behaviour of these stars in the solar neighbourhood.
Abstract:
Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.
Abstract:
We argue that preferences for secession are the expression of a common unobserved mechanism determining national identity. This paper examines the hypothesis of independence between preferences for secession (an independent Euskadi) and Basque national identity in the light of Akerlof and Kranton (2000). We deal with the psychological determinants of individuals' national identity formation as well as those that influence the propensity of individuals to support the secession of their perceived 'imagined community' or nation. We undertake an econometric survey analysis for the Basque Country using a bivariate probit model and publicly available data from the Spanish Centre for Sociological Research. Our results provide robust evidence of a common determination of national identity and political preferences for the secession of the Basque Country, consistent with the Akerlof and Kranton model.
Abstract:
Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we highlight findings from whole genome analysis that may be related to these unusual biological features. These findings include discovery of extensive gene duplication in more than 2000 gene families as well as loss of evolutionarily conserved genes. Gene family expansions relative to other published genomes include genes involved in chromatin modification, miRNA synthesis, and sugar transport. Gene losses include genes central to the IMD immune pathway, selenoprotein utilization, purine salvage, and the entire urea cycle. The pea aphid genome reveals that only a limited number of genes have been acquired from bacteria; thus the reduced gene count of Buchnera does not reflect gene transfer to the host genome. The inventory of metabolic genes in the pea aphid genome suggests that there is extensive metabolite exchange between the aphid and Buchnera, including sharing of amino acid biosynthesis between the aphid and Buchnera. The pea aphid genome provides a foundation for post-genomic studies of fundamental biological questions and applied agricultural problems.
Abstract:
Adenoviruses of primates include human (HAdV) and simian (SAdV) isolates classified into 8 species (Human Adenovirus A to G, and Simian Adenovirus A). In this study, a novel adenovirus was isolated from a colony of cynomolgus macaques (Macaca fascicularis) and subcultured in VERO cells. Its complete genome was purified and a region encompassing the hexon gene, the protease gene, the DNA binding protein (DBP) and the 100 kDa protein was amplified by PCR and sequenced by primer walking. Sequence analysis of these four genes showed that the new isolate had 80% identity to other primate adenoviruses and lacked recombination events. The study of the evolutionary relationships of this new monkey AdV based on the combined sequences of the four genes supported a close relationship to SAdV-3 and SAdV-6, lineages isolated from rhesus monkeys. The clade formed by these three types is separated from the remaining clades and establishes a novel branch that is related to species HAdV-A, F and G. However, the genetic distance of the newly isolated monkey AdV from these species is large enough for it to belong to a new, not yet established species. The results presented here widen our knowledge of SAdV and represent an important contribution to the understanding of the evolutionary history of primate adenoviruses.
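A figure such as "80% identity" is computed from aligned sequences. The sketch below shows one common way to do so, assuming the two sequences are already aligned to equal length and that columns containing a gap are excluded from the denominator. Other conventions exist, and the abstract does not say which one was used. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # assumption: gapped columns are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Two mismatches in ten aligned positions.
    print(percent_identity("ACGTACGTAC", "ACGTTCGTAA"))
```

Counting gapped columns in the denominator instead would lower the reported identity, so the convention should be stated alongside any figure.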
Abstract:
Despite data favouring a role of dietary fat in colonic carcinogenesis, no study has focused on tissue n3 and n6 fatty acid (FA) status in the human colon adenoma-carcinoma sequence. Thus, the FA profile was measured in plasma phospholipids of patients with colorectal cancer (n = 22), sporadic adenoma (n = 27), and normal colon (n = 12) (control group). Additionally, mucosal FAs were assessed in both diseased and normal mucosa of cancer (n = 15) and adenoma (n = 21) patients, and in normal mucosa of controls (n = 8). There were no differences in the FA profile of either plasma phospholipids or normal mucosa between adenoma and control patients. There were considerable differences, however, in FAs between diseased and paired normal mucosa of adenoma patients, with increases of linoleic (p = 0.02), dihomogammalinolenic (p = 0.014), and eicosapentaenoic (p = 0.012) acids, and decreases of alpha linolenic (p = 0.001) and arachidonic (p = 0.02) acids in diseased mucosa. A stepwise reduction of eicosapentaenoic acid concentrations in diseased mucosa from benign adenoma to the most advanced colon cancer was seen (p = 0.009). Cancer patients showed lower alpha linolenate (p = 0.002) and higher dihomogammalinolenate (p = 0.003) in diseased than in paired normal mucosa. In conclusion, changes in tissue n3 and n6 FA status might participate in the early phases of human colorectal carcinogenesis.