962 resultados para multiple data sources


Relevância:

90.00% 90.00%

Publicador:

Resumo:

We present a nonparametric Bayesian method for disease subtype discovery in multi-dimensional cancer data. Our method can simultaneously analyse a wide range of data types, allowing for both agreement and disagreement between their underlying clustering structure. It includes feature selection and infers the most likely number of disease subtypes, given the data. We apply the method to 277 glioblastoma samples from The Cancer Genome Atlas, for which there are gene expression, copy number variation, methylation and microRNA data. We identify 8 distinct consensus subtypes and study their prognostic value for death, new tumour events, progression and recurrence. The consensus subtypes are prognostic of tumour recurrence (log-rank p-value of $3.6 \times 10^{-4}$ after correction for multiple hypothesis tests). This is driven principally by the methylation data (log-rank p-value of $2.0 \times 10^{-3}$) but the effect is strengthened by the other 3 data types, demonstrating the value of integrating multiple data types. Of particular note is a subtype of 47 patients characterised by very low levels of methylation. This subtype has very low rates of tumour recurrence and no new events in 10 years of follow up. We also identify a small gene expression subtype of 6 patients that shows particularly poor survival outcomes. Additionally, we note a consensus subtype that showly a highly distinctive data signature and suggest that it is therefore a biologically distinct subtype of glioblastoma. The code is available from https://sites.google.com/site/multipledatafusion/

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Usually, firms that produce innovative global products are discussed within the context of developed countries. New ventures in developing countries are typically viewed as low-cost product providers that generate technologically similar products to those produced by developed economies. However, this paper argues that some Chinese university spin-outs (USOs), although rare, have adopted a novel 'catch-up' strategy to build global products on the basis of indigenous platform technologies. This paper attempts to develop a conceptual framework to address the question: how do these specific Chinese USOs develop their innovation capabilities to build global products? In order to explore the idiosyncrasies of the specific USOs, this paper uses the multiple case studies method. The primary data sources are accessed through semi-structured interviews. In addition, archival data and other materials are used as secondary sources. The study analyses the configuration of capabilities that are needed for idiosyncratic growth, and maps them to the globalisation processes. This paper provides a strategic 'roadmap' as an explanatory guide to entrepreneurs, policy makers and investors to better understand the phenomena. © 2014 Inderscience Enterprises Ltd.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We investigate adaptive buffer management techniques for approximate evaluation of sliding window joins over multiple data streams. In many applications, data stream processing systems have limited memory or have to deal with very high speed data streams. In both cases, computing the exact results of joins between these streams may not be feasible, mainly because the buffers used to compute the joins contain much smaller number of tuples than the tuples contained in the sliding windows. Therefore, a stream buffer management policy is needed in that case. We show that the buffer replacement policy is an important determinant of the quality of the produced results. To that end, we propose GreedyDual-Join (GDJ) an adaptive and locality-aware buffering technique for managing these buffers. GDJ exploits the temporal correlations (at both long and short time scales), which we found to be prevalent in many real data streams. We note that our algorithm is readily applicable to multiple data streams and multiple joins and requires almost no additional system resources. We report results of an experimental study using both synthetic and real-world data sets. Our results demonstrate the superiority and flexibility of our approach when contrasted to other recently proposed techniques.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Multiple sound sources often contain harmonics that overlap and may be degraded by environmental noise. The auditory system is capable of teasing apart these sources into distinct mental objects, or streams. Such an "auditory scene analysis" enables the brain to solve the cocktail party problem. A neural network model of auditory scene analysis, called the AIRSTREAM model, is presented to propose how the brain accomplishes this feat. The model clarifies how the frequency components that correspond to a give acoustic source may be coherently grouped together into distinct streams based on pitch and spatial cues. The model also clarifies how multiple streams may be distinguishes and seperated by the brain. Streams are formed as spectral-pitch resonances that emerge through feedback interactions between frequency-specific spectral representaion of a sound source and its pitch. First, the model transforms a sound into a spatial pattern of frequency-specific activation across a spectral stream layer. The sound has multiple parallel representations at this layer. A sound's spectral representation activates a bottom-up filter that is sensitive to harmonics of the sound's pitch. The filter activates a pitch category which, in turn, activate a top-down expectation that allows one voice or instrument to be tracked through a noisy multiple source environment. Spectral components are suppressed if they do not match harmonics of the top-down expectation that is read-out by the selected pitch, thereby allowing another stream to capture these components, as in the "old-plus-new-heuristic" of Bregman. Multiple simultaneously occuring spectral-pitch resonances can hereby emerge. These resonance and matching mechanisms are specialized versions of Adaptive Resonance Theory, or ART, which clarifies how pitch representations can self-organize durin learning of harmonic bottom-up filters and top-down expectations. The model also clarifies how spatial location cues can help to disambiguate two sources with similar spectral cures. Data are simulated from psychophysical grouping experiments, such as how a tone sweeping upwards in frequency creates a bounce percept by grouping with a downward sweeping tone due to proximity in frequency, even if noise replaces the tones at their interection point. Illusory auditory percepts are also simulated, such as the auditory continuity illusion of a tone continuing through a noise burst even if the tone is not present during the noise, and the scale illusion of Deutsch whereby downward and upward scales presented alternately to the two ears are regrouped based on frequency proximity, leading to a bounce percept. Since related sorts of resonances have been used to quantitatively simulate psychophysical data about speech perception, the model strengthens the hypothesis the ART-like mechanisms are used at multiple levels of the auditory system. Proposals for developing the model to explain more complex streaming data are also provided.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This article describes a neural network model capable of generating a spatial representation of the pitch of an acoustic source. Pitch is one of several auditory percepts used by humans to separate multiple sound sources in the environment from each other. The model provides a neural instantiation of a type of "harmonic sieve". It is capable of quantitatively simulating a large body of psychoacoustical data, including new data on octave shift perception.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

A model of pitch perception, called the Spatial Pitch Network or SPINET model, is developed and analyzed. The model neurally instantiates ideas front the spectral pitch modeling literature and joins them to basic neural network signal processing designs to simulate a broader range of perceptual pitch data than previous spectral models. The components of the model arc interpreted as peripheral mechanical and neural processing stages, which arc capable of being incorporated into a larger network architecture for separating multiple sound sources in the environment. The core of the new model transforms a spectral representation of an acoustic source into a spatial distribution of pitch strengths. The SPINET model uses a weighted "harmonic sieve" whereby the strength of activation of a given pitch depends upon a weighted sum of narrow regions around the harmonics of the nominal pitch value, and higher harmonics contribute less to a pitch than lower ones. Suitably chosen harmonic weighting functions enable computer simulations of pitch perception data involving mistuned components, shifted harmonics, and various types of continuous spectra including rippled noise. It is shown how the weighting functions produce the dominance region, how they lead to octave shifts of pitch in response to ambiguous stimuli, and how they lead to a pitch region in response to the octave-spaced Shepard tone complexes and Deutsch tritones without the use of attentional mechanisms to limit pitch choices. An on-center off-surround network in the model helps to produce noise suppression, partial masking and edge pitch. Finally, it is shown how peripheral filtering and short term energy measurements produce a model pitch estimate that is sensitive to certain component phase relationships.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.

We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.

We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.

Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.

This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

BACKGROUND: The National Comprehensive Cancer Network and the American Society of Clinical Oncology have established guidelines for the treatment and surveillance of colorectal cancer (CRC), respectively. Considering these guidelines, an accurate and efficient method is needed to measure receipt of care. METHODS: The accuracy and completeness of Veterans Health Administration (VA) administrative data were assessed by comparing them with data manually abstracted during the Colorectal Cancer Care Collaborative (C4) quality improvement initiative for 618 patients with stage I-III CRC. RESULTS: The VA administrative data contained gender, marital, and birth information for all patients but race information was missing for 62.1% of patients. The percent agreement for demographic variables ranged from 98.1-100%. The kappa statistic for receipt of treatments ranged from 0.21 to 0.60 and there was a 96.9% agreement for the date of surgical resection. The percentage of post-diagnosis surveillance events in C4 also in VA administrative data were 76.0% for colonoscopy, 84.6% for physician visit, and 26.3% for carcinoembryonic antigen (CEA) test. CONCLUSIONS: VA administrative data are accurate and complete for non-race demographic variables, receipt of CRC treatment, colonoscopy, and physician visits; but alternative data sources may be necessary to capture patient race and receipt of CEA tests.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Query processing over the Internet involving autonomous data sources is a major task in data integration. It requires the estimated costs of possible queries in order to select the best one that has the minimum cost. In this context, the cost of a query is affected by three factors: network congestion, server contention state, and complexity of the query. In this paper, we study the effects of both the network congestion and server contention state on the cost of a query. We refer to these two factors together as system contention states. We present a new approach to determining the system contention states by clustering the costs of a sample query. For each system contention state, we construct two cost formulas for unary and join queries respectively using the multiple regression process. When a new query is submitted, its system contention state is estimated first using either the time slides method or the statistical method. The cost of the query is then calculated using the corresponding cost formulas. The estimated cost of the query is further adjusted to improve its accuracy. Our experiments show that our methods can produce quite accurate cost estimates of the submitted queries to remote data sources over the Internet.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Objective: Several surveillance definitions of influenza-like illness (ILI) have been proposed, based on the presence of symptoms. Symptom data can be obtained from patients, medical records, or both. Past research has found that agreements between health record data and self-report are variable depending on the specific symptom. Therefore, we aimed to explore the implications of using data on influenza symptoms extracted from medical records, similar data collected prospectively from outpatients, and the combined data from both sources as predictors of laboratory-confirmed influenza. Methods: Using data from the Hutterite Influenza Prevention Study, we calculated: 1) the sensitivity, specificity and predictive values of individual symptoms within surveillance definitions; 2) how frequently surveillance definitions correlated to laboratory-confirmed influenza; and 3) the predictive value of surveillance definitions. Results: Of the 176 participants with reports from participants and medical records, 142 (81%) were tested for influenza and 37 (26%) were PCR positive for influenza. Fever (alone) and fever combined with cough and/or sore throat were highly correlated with being PCR positive for influenza for all data sources. ILI surveillance definitions, based on symptom data from medical records only or from both medical records and self-report, were better predictors of laboratory-confirmed influenza with higher odds ratios and positive predictive values. Discussion: The choice of data source to determine ILI will depend on the patient population, outcome of interest, availability of data source, and use for clinical decision making, research, or surveillance. © Canadian Public Health Association, 2012.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This paper presents the background rationale and key findings for a model-based study of supercritical waste heat recovery organic Rankine cycles. The paper’s objective is to cover the necessary groundwork to facilitate the future operation of a thermodynamic organic Rankine cycle model under realistic thermodynamic boundary conditions for performance optimisation of organic Rankine cycles. This involves determining the type of power cycle for organic Rankine cycles, the circuit configuration and suitable boundary conditions. The study focuses on multiple heat sources from vehicles but the findings are generally applicable, with careful consideration, to any waste heat recovery system. This paper introduces waste heat recovery and discusses the general merits of organic fluids versus water and supercritical operation versus subcritical operation from a theoretical perspective and, where possible, from a practical perspective. The benefits of regeneration are investigated from an efficiency perspective for selected subcritical and supercritical conditions. A simulation model is described with an introduction to some general Rankine cycle boundary conditions. The paper describes the analysis of real hybrid vehicle data from several driving cycles and its manipulation to represent the thermal inertia for model heat input boundary conditions. Basic theory suggests that selecting the operating pressures and temperatures to maximise the Rankine cycle performance is relatively straightforward. However, it was found that this may not be the case for an organic Rankine cycle operating in a vehicle. When operating in a driving cycle, the available heat and its quality can vary with the power output and between heat sources. For example, the available coolant heat does not vary much with the load, whereas the quantity and quality of the exhaust heat varies considerably. The key objective for operation in the vehicle is optimum utilisation of the available heat by delivering the maximum work out. The fluid selection process and the presentation and analysis of the final results of the simulation work on organic Rankine cycles are the subjects of two future publications.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

To provide in-time reactions to a large volume of surveil- lance data, uncertainty-enabled event reasoning frameworks for CCTV and sensor based intelligent surveillance system have been integrated to model and infer events of interest. However, most of the existing works do not consider decision making under uncertainty which is important for surveillance operators. In this paper, we extend an event reasoning framework for decision support, which enables our framework to predict, rank and alarm threats from multiple heterogeneous sources.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Pain management for older adults in long-term care (LTC) has been recognized as a problem internationally. The purpose of this study was to explore the role of a clinical nurse specialist (CNS) and nurse practitioner (NP) as change champions during the implementation of an evidence-based pain protocol in LTC. In this exploratory, multiple-case design study, we collected data from two LTC homes in Ontario, Canada. Three data sources were used: participant observation of an NP and a CNS for 18 hours each over a 3-week period; CNS and NP diaries recording strategies, barriers, and facilitators to the implementation process; and interviews with members of the interdisciplinary team to explore perceptions about the NP and CNS role in implementing the pain protocol. Data were analyzed using thematic content analysis. The NP and CNS used a variety of effective strategies to promote pain management changes in practice including educational outreach with team members, reminders to nursing staff to highlight the pain protocol and educate about practice changes, chart audits and feedback to the nursing staff, interdisciplinary working group meetings, ad hoc meetings with nursing staff, and resident assessment using advanced skills. The CNS and NP are ideal champions to implement pain management protocols and likely other quality improvement initiatives.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

OBJECTIVE/BACKGROUND: Many associations between abdominal aortic aneurysm (AAA) and genetic polymorphisms have been reported. It is unclear which are genuine and which may be caused by type 1 errors, biases, and flexible study design. The objectives of the study were to identify associations supported by current evidence and to investigate the effect of study design on reporting associations.

METHODS: Data sources were MEDLINE, Embase, and Web of Science. Reports were dual-reviewed for relevance and inclusion against predefined criteria (studies of genetic polymorphisms and AAA risk). Study characteristics and data were extracted using an agreed tool and reports assessed for quality. Heterogeneity was assessed using I(2) and fixed- and random-effects meta-analyses were conducted for variants that were reported at least twice, if any had reported an association. Strength of evidence was assessed using a standard guideline.

RESULTS: Searches identified 467 unique articles, of which 97 were included. Of 97 studies, 63 reported at least one association. Of 92 studies that conducted multiple tests, only 27% corrected their analyses. In total, 263 genes were investigated, and associations were reported in polymorphisms in 87 genes. Associations in CDKN2BAS, SORT1, LRP1, IL6R, MMP3, AGTR1, ACE, and APOA1 were supported by meta-analyses.

CONCLUSION: Uncorrected multiple testing and flexible study design (particularly testing many inheritance models and subgroups, and failure to check for Hardy-Weinberg equilibrium) contributed to apparently false associations being reported. Heterogeneity, possibly due to the case mix, geographical, temporal, and environmental variation between different studies, was evident. Polymorphisms in nine genes had strong or moderate support on the basis of the literature at this time. Suggestions are made for improving AAA genetics study design and conduct.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Este trabalho põe em evidência o valor formativo da prática profissional supervisionada e da escrita reflexiva com feedback co-construtivo sobre a praxis enquanto eixos estruturantes da construção de competências profissionais na formação inicial de professores do 1.º ciclo do ensino básico. Integrada no paradigma da complexidade e assumindo, do ponto de vista teórico-epistemológico, o diálogo entre o paradigma construtivista da complexidade (Lerbet 1986, 2004; Morin, 1994, s.d.; Le Moigne, 2002, 2003a), o paradigma da complexificação e da epistemologia da escuta/controvérsia (Correia, 2001), o experiencialismo crítico (Alarcão, 2001b) e o construtivismo/socioconstrutivismo (Piaget, 1975; Perret-Clermont, 1978; Morgado, 1988), a investigação que desenvolvemos teve como objectivo central saber como e em que condições, num contexto de formação reflexiva (Schön, 1983, 1992; Zeichner, 1993; Alarcão, 1996b, 2001c; Marcelo, 1999; Sá-Chaves, 2002; Alarcão e Tavares, 2003; Perrenoud, 2004) e, simultaneamente, de investigação (Moreira e Alarcão, 1997; Elliot, 1997; Alarcão, 2001a, 2001b; Moreira, 2001; Esteves, 2002; Estrela, 2003), se opera a construção da profissionalidade e da identidade social docente (Perrenoud, 1995, 2001b; Le Boterf, 1999; DeSeCo, 2002), ou seja, compreender a forma como se estabelecem e evoluem as dimensões que caracterizam o conhecimento profissional e os factores (activadores e inibidores) de desenvolvimento nele envolvidos. Para atingir este objectivo, propusemo-nos desenhar e realizar uma investigação centrada numa metodologia de formação – investigação-acção (Bataille, 1981; Pourtois, 1981; Morin, 1985; Moreira, 2001), de orientação reflexiva, focada no desenvolvimento pessoal e profissional dos alunos do 4.º ano do curso de formação de professores do 1.º CEB, da Escola Superior de Educação de Coimbra. Adoptou-se, por isso, uma dupla modelização para a investigação: o estudo de caso (para a investigação) e a investigação-acção (para a formação). O pólo técnico da investigação (Bruyne, Herman e Schoutheete, 1991) configurou, assim, como modo de investigação, o estudo de caso (multicaso) e as entrevistas, os interrogatórios clínicos (realizados no âmbito da pós-observação da componente de formação Estágio), a escrita regular de narrativas autobiográficas centrada nas trajectórias de formação (de processo e de síntese), a observação de aulas, entre outros, como instrumentos de recolha de dados que pareceram adequados à metodologia essencialmente qualitativa que elegemos. Os mesmos instrumentos, articulada e conjuntamente com outros adoptados no âmbito do desenvolvimento da unidade curricular Observação e Intervenção Educativa IV - Seminário de Análise e Reflexão Práticas, assumiram também funções formativas, ou seja, constituíram, pela análise dos dados que possibilitaram, ferramentas importantes de auto, hetero e co-formação e, concomitantemente, de investigação. A triangulação dos dados provenientes destas múltiplas fontes de informação assegurou o contraditório na gestão dos dados garantindo, deste modo, a validade das conclusões da investigação. Os dados da observação/supervisão das práticas pedagógicas e respectiva análise permitiu-nos: 1) identificar o estabelecimento e a evolução de configurações de relação entre a aprendizagem de competências básicas para o desempenho docente no 1.º CEB e certos aspectos explícitos do contexto de formação inicial tais como a iniciação à prática profissional supervisionada e a escrita reflexiva com feedback co-construtivo; 2) perspectivar, no contexto da iniciação à prática profissional supervisionada, a existência de um espaço de intervenção comum co-concebido, co-planificado, co-desenvolvido e coavaliado pelas instituições formadora e cooperantes em torno de um projecto de formação onde o diálogo prática-teoria-prática emerge como central na construção da complexa rede de competências profissionais que hoje se reclamam na formação inicial de professores; 3) conceber o professor como um profissional crítico-reflexivo e a reflexão e a investigação partilhada como dispositivos centrais de auto-avaliação e auto-regulação do desempenho profissional e do desenvolvimento ao longo da vida; 4) percepcionar a formação inicial do professor de 1.º CEB como o início do processo de vinculação/socialização à profissão. Estas conclusões podem, a nosso ver, contribuir, no quadro de uma colaboração interinstitucional co-formadora, que do ponto de vista das políticas de formação de professores se impõe redefinir, para o reconhecimento e valorização da importância dos contextos de prática supervisionada e da escrita autobiográfica com feedback co-construtivo no desenvolvimento profissional e identitário na formação inicial de educadores/professores e, simultaneamente, sustentar, ancorada numa nova ética de investigação, a teoria da formação na perspectiva da epistemologia do sujeito aprendente.