981 results for sequential data


Relevance: 40.00%

Abstract:

When estimating the effect of treatment on HIV using data from observational studies, standard methods may produce biased estimates due to the presence of time-dependent confounders. Such confounding can be present when a covariate that is affected by past exposure predicts both future exposure and the outcome. One example is the CD4 cell count, which is a marker for disease progression in HIV patients but also a marker for treatment initiation, and which is itself influenced by treatment. Fitting a marginal structural model (MSM) using inverse probability weights is one way to adjust appropriately for this type of confounding. In this paper we study a simple and intuitive approach to estimating similar treatment effects, using observational data to mimic several randomized controlled trials. Each 'trial' is constructed from the individuals who start treatment in a certain time interval. An overall effect estimate for all such trials is found using composite likelihood inference. The method offers an alternative to inverse probability of treatment weights, which are unstable in certain situations. The estimated parameter is not identical to that of an MSM, because it is conditioned on covariate values at the start of each mimicked trial. This allows the study of questions that are not easily addressed by fitting an MSM. The analysis can be performed as a stratified weighted Cox analysis on the joint data set of all the constructed trials, where each trial is one stratum. The model is applied to data from the Swiss HIV Cohort Study.
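The trial-construction step described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the record layout (`treatment_start`, `event_time`), the function name `build_trials`, and the interval boundaries are hypothetical, and the covariate conditioning, the weights, and the stratified Cox fit itself are all left out.

```python
from typing import NamedTuple, Optional

class Subject(NamedTuple):
    sid: str
    treatment_start: Optional[float]  # time treatment began, None if never treated
    event_time: float                 # time of event or censoring

def build_trials(subjects, boundaries):
    """Stack one mimicked trial per interval [boundaries[k], boundaries[k+1])."""
    rows = []
    for k in range(len(boundaries) - 1):
        lo, hi = boundaries[k], boundaries[k + 1]
        for s in subjects:
            # Eligible: still under follow-up and untreated at the trial start.
            if s.event_time <= lo:
                continue
            if s.treatment_start is not None and s.treatment_start < lo:
                continue
            # Treated arm: treatment starts inside this trial's interval.
            treated = s.treatment_start is not None and lo <= s.treatment_start < hi
            # Simplification: follow-up is measured from the trial start and
            # controls are NOT censored if they start treatment later.
            rows.append({"trial": k, "sid": s.sid, "treated": treated,
                         "followup": s.event_time - lo})
    return rows

# Hypothetical toy data, not taken from the study.
subjects = [
    Subject("a", 0.5, 4.0),
    Subject("b", 1.5, 3.0),
    Subject("c", None, 2.5),
]
stacked = build_trials(subjects, [0.0, 1.0, 2.0])
```

In the stacked rows, `trial` would play the role of the stratum in a stratified analysis. The same subject can appear in several trials, which is presumably why the abstract relies on composite likelihood inference rather than an ordinary likelihood.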

Relevance: 40.00%

Abstract:

Most statistical analysis, in both theory and practice, is concerned with static models: models with a proposed set of parameters whose values are fixed across observational units. Static models implicitly assume that the quantified relationships remain the same across the design space of the data. While this is reasonable under many circumstances, it can be a dangerous assumption when dealing with sequentially ordered data. The mere passage of time always brings fresh considerations, and the interrelationships among parameters, or subsets of parameters, may need to be continually revised. When data are gathered sequentially, dynamic interim monitoring may be useful as new subject-specific parameters are introduced with each new observational unit. Sequential imputation via dynamic hierarchical models is an efficient strategy for handling missing data and analyzing longitudinal studies. Dynamic conditional independence models offer a flexible framework that exploits the Bayesian updating scheme for capturing the evolution of both the population and individual effects over time. While static models often describe aggregate information well, they often do not reflect conflicts in the information at the individual level. Dynamic models prove advantageous over static models in capturing both individual and aggregate trends. Computations for such models can be carried out via the Gibbs sampler. An application to small-sample, repeated-measures, normally distributed growth curve data is presented.
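To make the idea of sequential Bayesian updating concrete, here is a minimal sketch of one step of a local-level dynamic model with known variances. It is an assumption-laden toy, not the dynamic hierarchical model of the abstract above: there is a single scalar state, no hierarchy, and no Gibbs sampler, and the name `dlm_step` and all numbers are hypothetical.

```python
def dlm_step(mean, var, y, obs_var, evol_var):
    """Update a normal belief about a drifting level with one observation y."""
    # Evolution: the level may drift between observations, so uncertainty grows.
    prior_var = var + evol_var
    # Conjugate normal update written in gain form.
    gain = prior_var / (prior_var + obs_var)
    post_mean = mean + gain * (y - mean)
    post_var = (1.0 - gain) * prior_var
    return post_mean, post_var

# Hypothetical observations arriving one at a time.
mean, var = 0.0, 1.0
for y in [2.0, 2.2, 1.9]:
    mean, var = dlm_step(mean, var, y, obs_var=1.0, evol_var=0.5)
```

Setting `evol_var` to zero recovers the static case, in which the parameter is assumed fixed and each observation only narrows the posterior.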

Relevance: 40.00%

Abstract:

The principled statistical application of Gaussian random field models used in geostatistics has historically been limited to data sets of a small size. This limitation is imposed by the requirement to store and invert the covariance matrix of all the samples to obtain a predictive distribution at unsampled locations, or to use likelihood-based covariance estimation. Various ad hoc approaches to solve this problem have been adopted, such as selecting a neighborhood region and/or a small number of observations to use in the kriging process, but these have no sound theoretical basis and it is unclear what information is being lost. In this article, we present a Bayesian method for estimating the posterior mean and covariance structures of a Gaussian random field using a sequential estimation algorithm. By imposing sparsity in a well-defined framework, the algorithm retains a subset of “basis vectors” that best represent the “true” posterior Gaussian random field model in the relative entropy sense. This allows a principled treatment of Gaussian random field models on very large data sets. The method is particularly appropriate when the Gaussian random field model is regarded as a latent variable model, which may be nonlinearly related to the observations. We show the application of the sequential, sparse Bayesian estimation in Gaussian random field models and discuss its merits and drawbacks.

Relevance: 40.00%

Abstract:

Recently within the machine learning and spatial statistics communities many papers have explored the potential of reduced rank representations of the covariance matrix, often referred to as projected or fixed rank approaches. In such methods the covariance function of the posterior process is represented by a reduced rank approximation which is chosen such that there is minimal information loss. In this paper a sequential framework for inference in such projected processes is presented, where the observations are considered one at a time. We introduce a C++ library for carrying out such projected, sequential estimation which adds several novel features. In particular we have incorporated the ability to use a generic observation operator, or sensor model, to permit data fusion. We can also cope with a range of observation error characteristics, including non-Gaussian observation errors. Inference for the variogram parameters is based on maximum likelihood estimation. We illustrate the projected sequential method in application to synthetic and real data sets. We discuss the software implementation and suggest possible future extensions.
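The one-observation-at-a-time flavour of such inference can be sketched with a fixed set of basis functions and a rank-one (Kalman-style) update of the weight posterior. This is a simplified stand-in under stated assumptions, not the C++ library described above: the observation operator is linear, the noise is Gaussian, the basis centres are fixed rather than selected, and every name (`rbf_features`, `sequential_update`, `predict`) is hypothetical.

```python
import math

def rbf_features(x, centers, length_scale):
    """Evaluate Gaussian basis functions centred at `centers` at location x."""
    return [math.exp(-0.5 * ((x - c) / length_scale) ** 2) for c in centers]

def sequential_update(mean, cov, h, y, noise_var):
    """Rank-one update of the weight mean and covariance for one observation."""
    m = len(mean)
    cov_h = [sum(cov[i][j] * h[j] for j in range(m)) for i in range(m)]
    s = sum(h[i] * cov_h[i] for i in range(m)) + noise_var  # predictive variance
    resid = y - sum(h[i] * mean[i] for i in range(m))       # innovation
    new_mean = [mean[i] + cov_h[i] * resid / s for i in range(m)]
    new_cov = [[cov[i][j] - cov_h[i] * cov_h[j] / s for j in range(m)]
               for i in range(m)]
    return new_mean, new_cov

# Three basis functions stand in for the reduced-rank representation.
centers = [0.0, 1.0, 2.0]
mean = [0.0] * 3
cov = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

# Hypothetical observations, processed one at a time.
for x, y in [(0.0, 1.0), (1.0, 0.5), (2.0, -0.3)]:
    h = rbf_features(x, centers, length_scale=0.5)
    mean, cov = sequential_update(mean, cov, h, y, noise_var=0.01)

def predict(x):
    """Posterior mean of the field at x under the fitted weights."""
    h = rbf_features(x, centers, 0.5)
    return sum(hi * mi for hi, mi in zip(h, mean))
```

The cost of each update depends on the number of basis functions rather than on the number of observations seen so far, which is the practical appeal of a fixed-rank representation.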

Relevance: 30.00%

Abstract:

We examined the sequence, order or steps of hygienic behavior (HB) from pin-killed pupae until the removal of them by the bees. We conducted our study with four colonies of Apis mellifera carnica in Germany and made four repetitions. The pin-killing method was used for evaluation of the HB of bees. The data were collected every 2 h after perforation, totaling 13 observations. Additionally, for one hygienic colony and another non-hygienic colony, individual analyses of each dead pupa were made at every observation, including all details, steps or sequences of HB. The bees recognize the cells containing dead pupae within 2 h after perforation, initially making a hole in the capping, which is the beginning of HB. Uncapping of the dead brood cell reached maximum values from 4 to 6 h after perforation; after 24 h, practically all cells were already uncapped. Another variable, called brood partially removed, was analyzed 4 h after perforation, after the cells had been perforated, which involved uncapping, followed by partial or total removal of the brood. Maximum values of brood partially removed were found 10 h after perforation, though such cells could be found up to 48 h after perforation. The most frequent sequence of events in both colonies was: capped cell -> punctured cell -> brood partially removed -> empty cell. A new model of three pairs of recessive genes (uncapping u1, u2 and remover r) was proposed in order to explain the genetic control of the HB in Apis mellifera. We recommend evaluating HB 24 h after perforation and using a correction factor to compensate for control removal levels. We found a series of details of HB, which allow a study of how various factors may affect the sequence of the activities involved in HB and investigation of the genetics that controls this process.

Relevance: 30.00%

Abstract:

Background: Without intensive selection, the majority of bovine oocytes submitted to in vitro embryo production (IVP) fail to develop to the blastocyst stage. This is attributed partly to their maturation status and competences. Using the Affymetrix GeneChip Bovine Genome Array, global mRNA expression analysis of immature (GV) and in vitro matured (IVM) bovine oocytes was carried out to characterize the transcriptome of bovine oocytes, and a variety of approaches were then used to determine whether the observed transcriptional changes during IVM were real or an artifact of the techniques used during analysis. Results: 8489 transcripts were detected across the two oocyte groups, of which ~25.0% (2117 transcripts) were differentially expressed (p < 0.001), corresponding to 589 over-expressed and 1528 under-expressed transcripts in the IVM oocytes compared to their immature counterparts. Over-expression of transcripts by IVM oocytes is particularly interesting; therefore, a variety of approaches were employed to determine whether the observed transcriptional changes during IVM were real or an artifact of the techniques used during analysis, including the analysis of transcript abundance in oocytes matured in vitro in the presence of α-amanitin. Subsets of the differentially expressed genes were also validated by quantitative real-time PCR (qPCR), and the gene expression data were classified according to gene ontology and pathway enrichment. Numerous cell cycle linked (CDC2, CDK5, CDK8, HSPA2, MAPK14, TXNL4B), molecular transport (STX5, STX17, SEC22A, SEC22B), and differentiation (NACA) related genes were found to be among the several over-expressed transcripts in GV oocytes compared to the matured counterparts, while ANXA1, PLAU, STC1 and LUM were among the over-expressed genes after oocyte maturation. Conclusion: Using sequential experiments, we have shown and confirmed transcriptional changes during oocyte maturation. This dataset provides a unique reference resource for studies concerned with the molecular mechanisms controlling oocyte meiotic maturation in cattle, addresses the existing conflicting issue of transcription during meiotic maturation and contributes to the global goal of improving assisted reproductive technology.

Relevance: 30.00%

Abstract:

A novel flow-based strategy for implementing simultaneous determinations of different chemical species reacting with the same reagent(s) at different rates is proposed and applied to the spectrophotometric catalytic determination of iron and vanadium in Fe-V alloys. The method relies on the influence of Fe(II) and V(IV) on the rate of the iodide oxidation by Cr(VI) under acidic conditions; the Jones reducing agent is then needed. Three different plugs of the sample are sequentially inserted into an acidic KI reagent carrier stream, and a confluent Cr(VI) solution is added downstream. Overlap between the inserted plugs leads to a complex sample zone with several regions of maximal and minimal absorbance values. Measurements performed on these regions reveal the different degrees of reaction development and tend to be more precise. Data are treated by multivariate calibration involving the PLS algorithm. The proposed system is very simple and rugged. Two latent variables carried ca. 95% of the analytical information, and the results are in agreement with ICP-OES. © 2010 Elsevier B.V. All rights reserved.

Relevance: 30.00%

Abstract:

Artificial neural networks have been used to analyze a number of engineering problems, including settlement caused by different tunneling methods in various types of ground mass. This paper focuses on settlement over shotcrete-supported tunnels on Sao Paulo subway line 2 (West Extension) that were excavated in Tertiary sediments using the sequential excavation method. The adjusted network is a good tool for predicting settlement above new tunnels to be excavated in similar conditions. The influence of network training parameters on the quality of results is also discussed. © 2007 Elsevier Ltd. All rights reserved.

Relevance: 30.00%

Abstract:

Data mining is the process of identifying valid, implicit, previously unknown, potentially useful and understandable information in large databases. It is an important step in the process of knowledge discovery in databases (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, semi-structured, or unstructured. Data can be text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal with data that are large in volume, distributed, time-variant, noisy, and high-dimensional. A large number of data mining algorithms have been developed for different applications. For example, association rule mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in data mining, particularly for data mining applications in engineering fields. Together with regression, classification is used mainly for predictive modelling. So far, a number of classification algorithms have been put into practice. According to Sebastiani (2002), the main classification algorithms can be categorized as: decision tree and rule-based approaches such as C4.5 (Quinlan, 1996); probability methods such as the Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten, 2001); neural network methods (Rumelhart, Hinton & Williams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973); and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al., 1998) and Ensemble Classification (Tumer, 1996).

Relevance: 30.00%

Abstract:

Establishment of long-term potentiation (LTP) at perforant path synapses is highly correlated with increased expression of Egr and AP-1 transcription factors in rat dentate gyrus granule cells. We have investigated whether increased transcription factor levels are reflected in increased transcription factor activity by assessing Egr and AP-1 DNA binding activity using gel shift assays. LTP produced an increase in binding to the Egr element, which was NMDA receptor-dependent and correlated closely with our previously reported increase in Egr-1 (zif/268) protein levels. Supershift analysis confirmed involvement of Egr-1, but not Egr-2, in the DNA binding activity. AP-1 DNA binding was also rapidly elevated in parallel with protein levels; however, the peak increase in activity was delayed until 4 h, a time point when we have previously shown that only jun-D protein was elevated. These data indicate that binding of Egr-1 and AP-1 to their response elements is increased in two phases. This may result in activation of distinct banks of target genes which contribute to the establishment of persistent LTP. © 2000 Elsevier Science B.V. All rights reserved.

Relevance: 30.00%

Abstract:

Background: Significant hemodynamic changes, including preload and afterload modifications, occur during the transition from the fetal to the neonatal environment. The ductus arteriosus closes, pulmonary vascular resistance decreases, and pulmonary blood flow increases. Strain rate (SR) and strain (ε) have been proposed as ultrasound indices for quantifying regional wall deformation. This study was designed to determine if these indices can detect variations in regional deformation between early and late neonatal periods. Methods: Data were obtained from 30 healthy neonates (15 male). The initial study was performed at a mean age of 20.1 ± 14 hours (exam 1) and the second at 31.9 ± 2.9 days (exam 2). Apical and parasternal views were used to quantify regional left ventricular (LV) and right ventricular (RV) longitudinal and radial SR and ε, and systolic, early, and late diastolic values were calculated from these curves. A paired-samples t test was performed comparing the two groups. Results: Compared with exam 1, LV radial deformation showed significant reductions in peak systolic ε in the basal and mid segments (51 ± 15% vs 46 ± 9%, P < .01). LV longitudinal deformation behaved similarly, showing significant peak systolic ε reductions in all measured segments. Systolic SR showed reductions only in the basal and apical segments of the lateral wall and in the mid portion of the inferior wall (-1.9 ± 0.5 vs -1.7 ± 0.3 s⁻¹ and -1.9 ± 0.4 vs -1.7 ± 0.2 s⁻¹, respectively, P = .03). RV longitudinal free and inferior wall systolic SR and ε values were significantly higher in exam 2. Conclusions: LV peak systolic ε decreases in exam 2 were possibly due to afterload increase and preload decrease. The lower RV initial deformation indices could be attributed to increased afterload caused by physiologic pulmonary hypertension or immature RV contractile properties. SR seemed to be a more robust index than ε and less influenced by preload and afterload hemodynamic alteration. (J Am Soc Echocardiogr 2010;23:294-300.)
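For readers unfamiliar with the paired-samples t test named in the Methods above, a minimal sketch of the statistic follows. The function name `paired_t` and the toy strain values are hypothetical and are not data from the study; a p-value would additionally require the t distribution with the returned degrees of freedom.

```python
import math
from statistics import mean, stdev

def paired_t(first, second):
    """Return (t statistic, degrees of freedom) for paired samples."""
    if len(first) != len(second):
        raise ValueError("paired samples must have equal length")
    # The test works on within-subject differences, not on the two groups.
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / math.sqrt(n))
    return t, n - 1

# Hypothetical peak systolic strain (%) for three neonates at two exams.
exam1 = [50.0, 48.0, 55.0]
exam2 = [49.0, 46.0, 52.0]
t_stat, dof = paired_t(exam1, exam2)
```

Because each neonate serves as its own control, between-subject variability drops out of the comparison, which is why the paired form suits a repeated-exam design.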

Relevance: 30.00%

Abstract:

The effect of number of samples and selection of data for analysis on the calculation of surface motor unit potential (SMUP) size in the statistical method of motor unit number estimates (MUNE) was determined in 10 normal subjects and 10 with amyotrophic lateral sclerosis (ALS). We recorded 500 sequential compound muscle action potentials (CMAPs) at three different stable stimulus intensities (10–50% of maximal CMAP). Estimated mean SMUP sizes were calculated using Poisson statistical assumptions from the variance of 500 sequential CMAP obtained at each stimulus intensity. The results with the 500 data points were compared with smaller subsets from the same data set. The results using a range of 50–80% of the 500 data points were compared with the full 500. The effect of restricting analysis to data between 5–20% of the CMAP and to standard deviation limits was also assessed. No differences in mean SMUP size were found with stimulus intensity or use of different ranges of data. Consistency was improved with a greater sample number. Data within 5% of CMAP size gave both increased consistency and reduced mean SMUP size in many subjects, but excluded valid responses present at that stimulus intensity. These changes were more prominent in ALS patients in whom the presence of isolated SMUP responses was a striking difference from normal subjects. Noise, spurious data, and large SMUP limited the Poisson assumptions. When these factors are considered, consistent statistical MUNE can be calculated from a continuous sequence of data points. A 2 to 2.5 SD or 10% window are reasonable methods of limiting data for analysis. Muscle Nerve 27: 320–331, 2003
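The Poisson reasoning behind the SMUP estimate can be written out in a few lines. If the number of motor units responding at a fixed stimulus intensity is Poisson distributed and each unit contributes the same size a, then the response has mean a·λ and variance a²·λ, so a is the variance divided by the mean. The sketch below assumes exactly that idealisation; the function names, the optional baseline subtraction, the use of the population variance, and the toy numbers are all assumptions rather than the authors' procedure.

```python
from statistics import mean, pvariance

def smup_size(responses, baseline=0.0):
    """Estimate mean SMUP size as variance / mean of the CMAP responses."""
    excess = [r - baseline for r in responses]
    return pvariance(excess) / mean(excess)

def mune(max_cmap, responses, baseline=0.0):
    """Motor unit number estimate: maximal CMAP divided by mean SMUP size."""
    return max_cmap / smup_size(responses, baseline)

# Hypothetical responses (mV) at one stable stimulus intensity.
responses = [0.0, 1.0, 1.0, 2.0]
size = smup_size(responses)      # variance 0.5 over mean 1.0
estimate = mune(10.0, responses)
```

The estimate is sensitive to anything that inflates the variance, which is consistent with the abstract's remark that noise, spurious data, and large SMUPs limit the Poisson assumptions.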

Relevance: 30.00%

Abstract:

Desilication and a combination of alkaline followed by acid treatment were applied to MCM-22 zeolite using two different base concentrations. The samples were characterised by powder X-ray diffraction, 27Al and 29Si MAS-NMR spectroscopy, SEM, TEM and low-temperature N2 adsorption. The acidity of the samples was studied through pyridine adsorption followed by FTIR spectroscopy and by analysis of the hydroxyl region. The catalytic behaviour, anticipated from the effect of the post-synthesis treatments on the acidity and on the space available inside the two internal pore systems, was evaluated using the model reaction of m-xylene transformation. The generation of mesoporosity was achieved upon alkaline treatment with 0.05 M NaOH solution, and practically no additional gain was obtained when the more concentrated solution, 0.1 M, was used. Instead, Al extraction takes place along with Si, as shown by 29Si and 27Al MAS-NMR data, followed by Al deposition as extraframework species. Samples submitted to alkaline plus acid treatments present distinct behaviour. When the lowest NaOH concentration was used, no relevant effect was observed on the textural characteristics. In contrast, when the acid treatment was performed on an MCM-22 structure already weakened by previous desilication with 0.1 M NaOH solution, the extraction of Al from both internal pore systems promotes their interconnection, evolving from a 2-D to a 3-D porous structure. This transformation has a marked effect on the catalytic behaviour, allowing an increase in m-xylene conversion as a consequence of easier and faster molecular traffic in the 3-D structure. On the other hand, the continuous deposition of extraframework Al species inside the pores leads to a shape-selective effect that favours the formation of the more valuable isomer, p-xylene.

Relevance: 30.00%

Abstract:

Dissertation submitted for the degree of Master in Computer Engineering (Engenharia Informática)

Relevance: 30.00%

Abstract:

The goal of this thesis is to study a tool that can help analysts find sequential patterns, with a focus on financial markets. A study will be made of how new and relevant knowledge can be mined from real-life information, potentially giving investors, market analysts, and economists a new basis for making informed decisions. The Ramex Forum algorithm will be used as the basis for the tool, due to its ability to find sequential patterns in financial data. To adapt it further to the needs of the thesis, relevant improvements to the algorithm will be studied. Another important aspect of this algorithm is the way it displays the patterns found: even with good results, it is difficult to find relevant patterns among all the studied samples without a proper result visualization component. As such, different combinations of parameterizations and ways of visualizing data will be evaluated, and their influence on the analysis of those patterns will be discussed. To properly evaluate the utility of this tool, case studies will be performed as a final test. Real information will be used to produce results, which will be evaluated with regard to their accuracy, interest, and relevance.