921 results for Processing wikipedia data
Abstract:
In recent years, growing demand from companies for technologies that can monitor and analyze large volumes of data in real time has driven the rise of Complex Event Processing (CEP). Its use has increased markedly in sectors such as business process management and automation, finance, network and application monitoring, and smart sensor networks, the case study on which this work focuses. CEP relies on an Event Processing Language (EPL), which can be quite difficult for inexperienced users. This complexity is a handicap and therefore an obstacle to wider adoption. This final-year degree project (Proyecto Fin de Grado, PFG) aims to address the problem by bringing CEP technology closer to users through abstraction and modeling techniques. To that end, the project defines a simple, intuitive domain-specific modeling language for inexperienced users, supported by a graphical modeling tool (CEP Modeler), a web tool in which CEP queries can be modeled graphically in a simpler and more accessible way.
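To make the idea of a CEP query concrete, the following is a minimal Python sketch of a time-window aggregation over a sensor event stream. It is not EPL and not part of CEP Modeler; the names `Event`, `SlidingWindowAverage` and `run_query`, and the window and threshold values, are hypothetical and chosen only for illustration.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Event:
    sensor_id: str
    timestamp: float  # seconds
    value: float


class SlidingWindowAverage:
    """Report the windowed average whenever it exceeds a threshold."""

    def __init__(self, window_s: float, threshold: float):
        self.window_s = window_s
        self.threshold = threshold
        self.events = deque()

    def on_event(self, event: Event):
        self.events.append(event)
        # Drop events older than the time window, measured from the newest event.
        while event.timestamp - self.events[0].timestamp > self.window_s:
            self.events.popleft()
        avg = sum(e.value for e in self.events) / len(self.events)
        return avg if avg > self.threshold else None


def run_query(events, window_s=10.0, threshold=30.0):
    """Feed events in timestamp order and collect (timestamp, average) alerts."""
    query = SlidingWindowAverage(window_s, threshold)
    alerts = []
    for ev in events:
        avg = query.on_event(ev)
        if avg is not None:
            alerts.append((ev.timestamp, avg))
    return alerts
```

A graphical modeling language would let the user pick the window length, the aggregate and the condition without writing this logic by hand.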
Abstract:
The large amount of data recorded daily in organizations' database systems has created the need to analyze it. However, organizations face the complexity of processing huge volumes of data with traditional analysis methods. Moreover, in a globalized and competitive context, organizations constantly seek to improve their processes, for which they need tools that help them make better decisions. This means being better informed and knowing their digital history in order to describe their processes and anticipate (predict) unforeseen events. These new data analysis requirements have driven the growing development of data mining projects. The data mining process seeks to obtain, from a massive data set, models that describe the data or predict new instances in the set. It involves data preparation and partially or fully automated processing to identify models in the data, producing patterns, relationships, or rules as output. This output must represent new knowledge for the organization that is useful and understandable to end users and that can be integrated into its processes to support decision-making. However, the greatest difficulty is precisely enabling the data analyst, who is involved throughout this process, to identify models, a complex task that often requires the experience not only of the data analyst but also of an expert in the problem domain. One way to support the analysis of data, models, and patterns is through visual representation, which exploits human visual perception and its ability to detect patterns more easily. Under this approach, visualization has been used in data mining mostly for descriptive (exploratory) analysis of the data (input) and for presenting patterns (output), while its use for analyzing models has remained limited. This document describes the development of the doctoral thesis entitled "New Visualizations Schemes to Improve Understandability of Data-Mining Models" (original title: “Nuevos Esquemas de Visualizaciones para Mejorar la Comprensibilidad de Modelos de Data Mining”). The research contributes a visualization approach to support the understanding of data mining models, for which it proposes the metaphor of visually augmented models.
Abstract:
This project builds on an earlier project that constructed a model for representing higher education information through a network of ontologies, providing a common definition of important concepts. The present project develops a tool capable of generating educational data from that ontology network, following the Linked Data paradigm [1]. The tool extracts data from different educational sources and transforms them into linked data. This work was carried out with GATE Developer [2], a development environment that provides a comprehensive set of interactive graphical tools for creating, measuring, and maintaining software components for human language processing.
Abstract:
A challenge for subunit vaccines whose goal is to elicit CD8+ cytotoxic T lymphocytes (CTLs) is to deliver the antigen to the cytosol of the living cell, where it can be processed for presentation by major histocompatibility complex (MHC) class I molecules. Several bacterial toxins have evolved to efficiently deliver catalytic protein moieties to the cytosol of eukaryotic cells. Anthrax lethal toxin consists of two distinct proteins that combine to form the active toxin. Protective antigen (PA) binds to cells and is instrumental in delivering lethal factor (LF) to the cell cytosol. To test whether the lethal factor protein could be exploited for delivery of exogenous proteins to the MHC class I processing pathway, we constructed a genetic fusion between the amino-terminal 254 aa of LF and the gp120 portion of the HIV-1 envelope protein. Cells treated with this fusion protein (LF254-gp120) in the presence of PA effectively processed gp120 and presented an epitope recognized by HIV-1 gp120 V3-specific CTL. In contrast, when cells were treated with the LF254-gp120 fusion protein and a mutant PA protein defective for translocation, the cells were not able to present the epitope and were not lysed by the specific CTL. The entry into the cytosol and dependence on the classical cytosolic MHC class I pathway were confirmed by showing that antigen presentation by PA + LF254-gp120 was blocked by the proteasome inhibitor lactacystin. These data demonstrate the ability of the LF amino-terminal fragment to deliver antigens to the MHC class I pathway and provide the basis for the development of novel T cell vaccines.
Abstract:
Mouse CD1 (mCD1) molecules have been reported to present two types of antigens: peptides or proteins and the glycolipid α-galactosylceramide. Here, we demonstrate that a protein antigen, chicken ovalbumin (Ova), must be processed to generate peptides presented by mCD1 to CD8+ T cells. The processing and mCD1-mediated presentation of chicken Ova depend on endosomal localization because inhibitors of endosomal acidification and endosomal recycling pathways block T cell reactivity. Furthermore, a cytoplasmic tail mutant of mCD1, which disrupts endosomal localization, has a greatly reduced capacity to present Ova to mCD1-restricted cells. Newly synthesized mCD1 molecules, however, are not required for Ova presentation, suggesting that molecules recycling from the cell surface are needed. Because of these data showing that mCD1 traffics to endosomes, where it can bind peptides derived from exogenous proteins, we conclude that peptide antigen presentation by mCD1 is likely to be a naturally occurring phenomenon. In competition assays, α-galactosylceramide did not inhibit Ova presentation, and presentation of the glycolipid was not inhibited by excess Ova or the peptide epitope derived from it. This suggests that, although both lipid and peptide presentation may occur naturally, mCD1 may interact differently with these two types of antigens.
Abstract:
Three small nucleolar RNAs (snoRNAs), E1, E2 and E3, have been described that have unique sequences and interact directly with unique segments of pre-rRNA in vivo. In this report, injection of antisense oligodeoxynucleotides into Xenopus laevis oocytes was used to target the specific degradation of these snoRNAs. Specific disruptions of pre-rRNA processing were then observed, which were reversed by injection of the corresponding in vitro-synthesized snoRNA. Degradation of each of these three snoRNAs produced a unique rRNA maturation phenotype. E1 RNA depletion shut down 18S rRNA formation, without overaccumulation of 20S pre-rRNA. After E2 RNA degradation, production of 18S rRNA and 36S pre-rRNA stopped, and 38S pre-rRNA accumulated, without overaccumulation of 20S pre-rRNA. E3 RNA depletion induced the accumulation of 36S pre-rRNA. This suggests that each of these snoRNAs plays a different role in pre-rRNA processing and indicates that E1 and E2 RNAs are essential for 18S rRNA formation. The available data support the proposal that these snoRNAs are at least involved in pre-rRNA processing at the following pre-rRNA cleavage sites: E1 at the 5′ end and E2 at the 3′ end of 18S rRNA, and E3 at or near the 5′ end of 5.8S rRNA.
Abstract:
Hereditary hemochromatosis (HH) is the most common autosomal recessive disorder known in humans. A candidate gene for HH called HFE has recently been cloned that encodes a novel member of the major histocompatibility complex class I family. Most HH patients are homozygous for a Cys-282→Tyr (C282Y) mutation in the HFE gene, which has been shown to disrupt interaction with β2-microglobulin; a second mutation, His-63→Asp (H63D), is enriched in HH patients who are heterozygous for the C282Y mutation. The aims of this study were to determine the effects of the C282Y and H63D mutations on the cellular trafficking and degradation of the HFE protein in transfected COS-7 cells. The results indicate that, while the wild-type and H63D HFE proteins associate with β2-microglobulin and are expressed on the cell surface of COS-7 cells, these capabilities are lost by the C282Y HFE protein. We present biochemical and immunofluorescence data that indicate that the C282Y mutant protein: (i) is retained in the endoplasmic reticulum and middle Golgi compartment, (ii) fails to undergo late Golgi processing, and (iii) is subject to accelerated degradation. The block in intracellular transport, accelerated turnover, and failure of the C282Y protein to be presented normally on the cell surface provide a possible basis for impaired function of this mutant protein in HH.
Abstract:
We have examined the distribution of RNA transcription and processing factors in the amphibian oocyte nucleus or germinal vesicle. RNA polymerase I (pol I), pol II, and pol III occur in the Cajal bodies (coiled bodies) along with various components required for transcription and processing of the three classes of nuclear transcripts: mRNA, rRNA, and pol III transcripts. Among these components are transcription factor IIF (TFIIF), TFIIS, splicing factors, the U7 small nuclear ribonucleoprotein particle, the stem–loop binding protein, SR proteins, cleavage and polyadenylation factors, small nucleolar RNAs, nucleolar proteins that are probably involved in pre-rRNA processing, and TFIIIA. Earlier studies and data presented here show that several of these components are first targeted to Cajal bodies when injected into the oocyte and only subsequently appear in the chromosomes or nucleoli, where transcription itself occurs. We suggest that pol I, pol II, and pol III transcription and processing components are preassembled in Cajal bodies before transport to the chromosomes and nucleoli. Most components of the pol II transcription and processing pathway that occur in Cajal bodies are also found in the many hundreds of B-snurposomes in the germinal vesicle. Electron microscopic images show that B-snurposomes consist primarily, if not exclusively, of 20- to 30-nm particles, which closely resemble the interchromatin granules described from sections of somatic nuclei. We suggest the name pol II transcriptosome for these particles to emphasize their content of factors involved in synthesis and processing of mRNA transcripts. We present a model in which pol I, pol II, and pol III transcriptosomes are assembled in the Cajal bodies before export to the nucleolus (pol I), to the B-snurposomes and eventually to the chromosomes (pol II), and directly to the chromosomes (pol III). 
The key feature of this model is the preassembly of the transcription and processing machinery into unitary particles. An analogy can be made between ribosomes and transcriptosomes, ribosomes being unitary particles involved in translation and transcriptosomes being unitary particles for transcription and processing of RNA.
Abstract:
The patterns of cortico-cortical and cortico-thalamic connections of auditory cortical areas in the rhesus monkey have led to the hypothesis that acoustic information is processed in series and in parallel in the primate auditory cortex. Recent physiological experiments in the behaving monkey indicate that the response properties of neurons in different cortical areas are both functionally distinct from each other, which is indicative of parallel processing, and functionally similar to each other, which is indicative of serial processing. Thus, auditory cortical processing may be similar to the serial and parallel “what” and “where” processing by the primate visual cortex. If “where” information is serially processed in the primate auditory cortex, neurons in cortical areas along this pathway should have progressively better spatial tuning properties. This prediction is supported by recent experiments that have shown that neurons in the caudomedial field have better spatial tuning properties than neurons in the primary auditory cortex. Neurons in the caudomedial field are also better than primary auditory cortex neurons at predicting the sound localization ability across different stimulus frequencies and bandwidths in both azimuth and elevation. These data support the hypothesis that the primate auditory cortex processes acoustic information in a serial and parallel manner and suggest that this may be a general cortical mechanism for sensory perception.
Self-organized phase transitions in neural networks as a neural mechanism of information processing.
Abstract:
Transitions between dynamically stable activity patterns imposed on an associative neural network are shown to be induced by self-organized infinitesimal changes in synaptic connection strength and to be a kind of phase transition. A key event for the neural process of information processing in a population coding scheme is the transition between the activity patterns encoding usual entities. We propose that the infinitesimal and short-term synaptic changes based on the Hebbian learning rule are the driving force for the transition. The phase transition between the following two dynamically stable states is studied in detail: the state where the firing pattern changes over time so as to itinerate among several patterns, and the state where the firing pattern is fixed to one of several patterns. The phase transition from the pattern-itinerant state to a pattern-fixed state may be induced by the Hebbian learning process under a weak input relevant to the fixed pattern. The reverse transition may be induced by the Hebbian unlearning process without input. The former transition is considered as recognition of the input stimulus, while the latter is considered as clearing of the used input data to get ready for new input. To ensure that information processing based on the phase transition can be carried out by the infinitesimal and short-term synaptic changes, it is absolutely necessary that the network always stays near the critical state corresponding to the phase transition point.
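As a rough illustration of Hebbian learning and unlearning in an associative network, the sketch below uses a textbook Hopfield-style network with ±1 units and synchronous updates. It is an assumption-laden stand-in, not the authors' model: it has no itinerant dynamics or critical state, and the function names and the `eta` step size are hypothetical.

```python
def hebbian_weights(patterns):
    """Build a symmetric weight matrix from +/-1 patterns (no self-connections)."""
    n = len(patterns[0])
    w = [[0.0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    w[i][j] += p[i] * p[j] / n
    return w


def hebbian_update(w, state, eta):
    """Small synaptic change: eta > 0 is learning, eta < 0 is unlearning."""
    n = len(state)
    for i in range(n):
        for j in range(n):
            if i != j:
                w[i][j] += eta * state[i] * state[j]


def stability(w, pattern):
    """Smallest aligned local field; larger means a more strongly fixed pattern."""
    n = len(pattern)
    return min(
        pattern[i] * sum(w[i][j] * pattern[j] for j in range(n))
        for i in range(n)
    )


def recall(w, state, max_steps=10):
    """Synchronous sign updates until the state stops changing."""
    n = len(state)
    state = list(state)
    for _ in range(max_steps):
        new = [
            1 if sum(w[i][j] * state[j] for j in range(n)) >= 0 else -1
            for i in range(n)
        ]
        if new == state:
            break
        state = new
    return state
```

In this toy setting, a small positive `eta` applied while the network sits in a pattern deepens that pattern's stability, and the same step with negative `eta` undoes it, loosely mirroring the learning and unlearning roles described in the abstract.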
Abstract:
We previously characterized a methionine aminopeptidase (EC 3.4.11.18; Met-AP1; also called peptidase M) in Saccharomyces cerevisiae, which differs from its prokaryotic homologues in that it (i) contains an N-terminal zinc-finger domain and (ii) does not produce lethality when disrupted, although it does slow growth dramatically; it is encoded by a gene called MAP1. Here we describe a second methionine aminopeptidase (Met-AP2) in S. cerevisiae, encoded by MAP2, which was cloned as a suppressor of the slow-growth phenotype of the map1 null strain. The DNA sequence of MAP2 encodes a protein of 421 amino acids that shows 22% identity with the sequence of yeast Met-AP1. Surprisingly, comparison with sequences in the GenBank database showed that the product of MAP2 has even greater homology (55% identity) with rat p67, which was characterized as an initiation factor 2-associated protein but not yet shown to have Met-AP activity. Transformants of map1 null cells expressing MAP2 in a high-copy-number plasmid contained 3- to 12-fold increases in Met-AP activity on different peptide substrates. The epitope-tagged suppressor gene product was purified by immunoaffinity chromatography and shown to contain Met-AP activity. To evaluate the physiological significance of Met-AP2, the MAP2 gene was deleted from wild-type and map1 null yeast strains. The map2 null strain, like the map1 null strain, is viable but with a slower growth rate. The map1, map2 double-null strains are nonviable. Thus, removal of N-terminal methionine is an essential function in yeast, as in prokaryotes, but yeast require two methionine aminopeptidases to provide the essential function, which can only be partially provided by Met-AP1 or Met-AP2 alone.
Abstract:
Stimulation of muscarinic m1 or m3 receptors can, by generating diacylglycerol and activating protein kinase C, accelerate the breakdown of the amyloid precursor protein (APP) to form soluble, nonamyloidogenic derivatives (APPs), as previously shown. This relationship has been demonstrated in human glioma and neuroblastoma cells, as well as in transfected human embryonic kidney 293 cells and PC-12 cells. We now provide evidence that stimulation of metabotropic glutamate receptors (mGluRs), which also are coupled to phosphatidylinositol 4,5-bisphosphate hydrolysis, similarly accelerates processing of APP into nonamyloidogenic APPs. This process is demonstrated both in hippocampal neurons derived from fetal rats and in human embryonic kidney 293 cells transfected with cDNA expression constructs encoding the mGluR 1 alpha subtype. In hippocampal neurons, both an mGluR antagonist, L-(+)-2-amino-3-phosphonopropionic acid, and an inhibitor of protein kinase C, GF 109203X, blocked the APPs release evoked by glutamate receptor stimulation. Ionotropic glutamate agonists, N-methyl-D-aspartate or S(-)-5-fluorowillardiine, failed to affect APPs release. These data show that selective mGluR agonists that initiate signal-transduction events can regulate APP processing in bona fide primary neurons and transfected cells. As glutamatergic neurons in the cortex and hippocampus are damaged in Alzheimer disease, amyloid production in these regions may be enhanced by deficits in glutamatergic neurotransmission.
Abstract:
The maturation of 5S RNA in Escherichia coli is poorly understood. Although it is known that large precursors of 5S RNA accumulate in mutant cells lacking the endoribonuclease RNase E, almost nothing is known about how the mature 5' and 3' termini of these molecules are generated. We have examined 5S RNA maturation in wild-type and single- or multiple-exoribonuclease-deficient cells by Northern blot and primer-extension analysis. Our results indicate that no mature 5S RNA is made in RNase T-deficient strains. Rather, 5S RNA precursors containing predominantly 2 extra nucleotides at the 3' end accumulate. Apparently, these 5S RNAs are functional inasmuch as mutant cells are viable, growing only slightly slower than wild type. Purified RNase T can remove the extra 3' residues, showing that it is directly involved in the trimming reaction. In contrast, mutations affecting other 3' exoribonucleases have no effect on 5S RNA maturation. Approximately 90% of the 5S RNAs in both wild-type and RNase T⁻ cells contain mature 5' termini, indicating that 5' processing is independent of RNase T action. These data identify the enzyme responsible for generating the mature 3' terminus of 5S RNA molecules and also demonstrate that a completely processed 5S RNA molecule is not essential for cell survival.
Abstract:
The Internet of Things is a new communication paradigm that extends the virtual world (the Internet) into the real world through interfaces and interaction between objects. It will comprise a large number of heterogeneous interconnected devices, which are expected to generate a large volume of data. One of the major challenges for its development is storing and processing this volume of data within acceptable time intervals. This research addresses that challenge by introducing analysis and pattern recognition services in the lower layers of the model for the Internet of Things, with the aim of reducing processing in the upper layers. The research analyzed reference models for the Internet of Things and platforms for developing applications in this context. The new architecture extends the LinkSmart Middleware with a pattern recognition module that implements algorithms for value estimation, outlier detection, and cluster discovery on raw data coming from data sources. The new module was integrated with the Hadoop Big Data platform and uses the algorithm implementations of the Mahout framework. This work highlights the importance of cross-layer communication integrated into the new architecture. The experiments used real data sets from the Smart Santander project to validate the new IoT architecture integrated with the analysis and pattern recognition services and cross-layer communication.
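As a small illustration of outlier detection on raw sensor readings, the sketch below flags values using the modified z-score based on the median absolute deviation (MAD). This is a generic technique swapped in for illustration; the abstract does not say which outlier algorithm was used, and the function name `mad_outliers` and the 3.5 cutoff are assumptions.

```python
from statistics import median


def mad_outliers(values, threshold=3.5):
    """Return indices of readings whose modified z-score exceeds the threshold."""
    med = median(values)
    abs_dev = [abs(v - med) for v in values]
    mad = median(abs_dev)
    if mad == 0:
        # More than half the readings are identical; the score is undefined.
        return []
    # 0.6745 rescales the MAD so the score is comparable to a z-score
    # when the data are roughly normally distributed.
    return [
        i for i, dev in enumerate(abs_dev)
        if 0.6745 * dev / mad > threshold
    ]
```

Running a filter like this close to the data source is one way to reduce how much raw data the upper layers need to process.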
Abstract:
Subsidence is a natural hazard that affects wide areas of the world and causes significant economic costs every year. This phenomenon has occurred in the metropolitan area of Murcia City (SE Spain) as a result of groundwater overexploitation. In this work, aquifer system subsidence is investigated using an advanced differential SAR interferometry remote sensing technique (A-DInSAR) called Stable Point Network (SPN). The SPN-derived displacement results, mainly the displacement velocity maps and the displacement time series, reveal that in the period 2004–2008 the rate of subsidence in the Murcia metropolitan area doubled with respect to the previous period from 1995 to 2005. The acceleration of the deformation phenomenon is explained by the drought period that started in 2006. The comparison of the temporal evolution of the displacements measured with the extensometers and the SPN technique shows an average absolute error of 3.9±3.8 mm. Finally, results from a finite element model developed to simulate the recorded subsidence time history from known water table height changes compare well with the SPN displacement time series estimates. This result demonstrates the potential of A-DInSAR techniques to validate subsidence prediction models as an alternative to using instrumental ground-based techniques for validation.
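A figure such as the average absolute error quoted above can be computed from two paired displacement series as sketched below. The abstract does not state what the ± term represents; this sketch assumes it is the sample standard deviation of the absolute errors, and the function name and sample values are hypothetical.

```python
from statistics import mean, stdev


def absolute_error_summary(reference_mm, estimate_mm):
    """Mean and sample standard deviation of |reference - estimate|, in mm."""
    if len(reference_mm) != len(estimate_mm):
        raise ValueError("series must have the same length")
    errors = [abs(r - e) for r, e in zip(reference_mm, estimate_mm)]
    return mean(errors), stdev(errors)
```

Here `reference_mm` would hold extensometer readings and `estimate_mm` the remote sensing estimates sampled at the same dates.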