968 resultados para text vector space model
Resumo:
In the context of expensive numerical experiments, a promising solution for alleviating the computational costs consists of using partially converged simulations instead of exact solutions. The gain in computational time is at the price of precision in the response. This work addresses the issue of fitting a Gaussian process model to partially converged simulation data for further use in prediction. The main challenge consists of the adequate approximation of the error due to partial convergence, which is correlated in both design variables and time directions. Here, we propose fitting a Gaussian process in the joint space of design parameters and computational time. The model is constructed by building a nonstationary covariance kernel that reflects accurately the actual structure of the error. Practical solutions are proposed for solving parameter estimation issues associated with the proposed model. The method is applied to a computational fluid dynamics test case and shows significant improvement in prediction compared to a classical kriging model.
Resumo:
It is system dynamics that determines the function of cells, tissues and organisms. To develop mathematical models and estimate their parameters are an essential issue for studying dynamic behaviors of biological systems which include metabolic networks, genetic regulatory networks and signal transduction pathways, under perturbation of external stimuli. In general, biological dynamic systems are partially observed. Therefore, a natural way to model dynamic biological systems is to employ nonlinear state-space equations. Although statistical methods for parameter estimation of linear models in biological dynamic systems have been developed intensively in the recent years, the estimation of both states and parameters of nonlinear dynamic systems remains a challenging task. In this report, we apply extended Kalman Filter (EKF) to the estimation of both states and parameters of nonlinear state-space models. To evaluate the performance of the EKF for parameter estimation, we apply the EKF to a simulation dataset and two real datasets: JAK-STAT signal transduction pathway and Ras/Raf/MEK/ERK signaling transduction pathways datasets. The preliminary results show that EKF can accurately estimate the parameters and predict states in nonlinear state-space equations for modeling dynamic biochemical networks.
Resumo:
Computer vision-based food recognition could be used to estimate a meal's carbohydrate content for diabetic patients. This study proposes a methodology for automatic food recognition, based on the Bag of Features (BoF) model. An extensive technical investigation was conducted for the identification and optimization of the best performing components involved in the BoF architecture, as well as the estimation of the corresponding parameters. For the design and evaluation of the prototype system, a visual dataset with nearly 5,000 food images was created and organized into 11 classes. The optimized system computes dense local features, using the scale-invariant feature transform on the HSV color space, builds a visual dictionary of 10,000 visual words by using the hierarchical k-means clustering and finally classifies the food images with a linear support vector machine classifier. The system achieved classification accuracy of the order of 78%, thus proving the feasibility of the proposed approach in a very challenging image dataset.
Resumo:
Surgical robots have been proposed ex vivo to drill precise holes in the temporal bone for minimally invasive cochlear implantation. The main risk of the procedure is damage of the facial nerve due to mechanical interaction or due to temperature elevation during the drilling process. To evaluate the thermal risk of the drilling process, a simplified model is proposed which aims to enable an assessment of risk posed to the facial nerve for a given set of constant process parameters for different mastoid bone densities. The model uses the bone density distribution along the drilling trajectory in the mastoid bone to calculate a time dependent heat production function at the tip of the drill bit. Using a time dependent moving point source Green's function, the heat equation can be solved at a certain point in space so that the resulting temperatures can be calculated over time. The model was calibrated and initially verified with in vivo temperature data. The data was collected in minimally invasive robotic drilling of 12 holes in four different sheep. The sheep were anesthetized and the temperature elevations were measured with a thermocouple which was inserted in a previously drilled hole next to the planned drilling trajectory. Bone density distributions were extracted from pre-operative CT data by averaging Hounsfield values over the drill bit diameter. Post-operative [Formula: see text]CT data was used to verify the drilling accuracy of the trajectories. The comparison of measured and calculated temperatures shows a very good match for both heating and cooling phases. The average prediction error of the maximum temperature was less than 0.7 °C and the average root mean square error was approximately 0.5 °C. To analyze potential thermal damage, the model was used to calculate temperature profiles and cumulative equivalent minutes at 43 °C at a minimal distance to the facial nerve. For the selected drilling parameters, temperature elevation profiles and cumulative equivalent minutes suggest that thermal elevation of this minimally invasive cochlear implantation surgery may pose a risk to the facial nerve, especially in sclerotic or high density mastoid bones. Optimized drilling parameters need to be evaluated and the model could be used for future risk evaluation.
Resumo:
We analyzed observations of interstellar neutral helium (ISN He) obtained from the Interstellar Boundary Explorer (IBEX) satellite during its first six years of operation. We used a refined version of the ISN He simulation model, presented in the companion paper by Sokol et al. (2015b), along with a sophisticated data correlation and uncertainty system and parameter fitting method, described in the companion paper by Swaczyna et al. We analyzed the entire data set together and the yearly subsets, and found the temperature and velocity vector of ISN He in front of the heliosphere. As seen in the previous studies, the allowable parameters are highly correlated and form a four-dimensional tube in the parameter space. The inflow longitudes obtained from the yearly data subsets show a spread of similar to 6 degrees, with the other parameters varying accordingly along the parameter tube, and the minimum chi(2) value is larger than expected. We found, however, that the Mach number of the ISN He flow shows very little scatter and is thus very tightly constrained. It is in excellent agreement with the original analysis of ISN He observations from IBEX and recent reanalyses of observations from Ulysses. We identify a possible inaccuracy in the Warm Breeze parameters as the likely cause of the scatter in the ISN He parameters obtained from the yearly subsets, and we suppose that another component may exist in the signal or a process that is not accounted for in the current physical model of ISN He in front of the heliosphere. From our analysis, the inflow velocity vector, temperature, and Mach number of the flow are equal to lambda(ISNHe) = 255 degrees.8 +/- 0 degrees.5, beta(ISNHe) = 5 degrees.16 +/- 0 degrees.10, T-ISNHe = 7440 +/- 260 K, nu(SNHe) = 25.8 +/- 0.4 km s(-1), and M-ISNHe = 5.079 +/- 0.028, with uncertainties strongly correlated along the parameter tube.
Resumo:
Ocean biogeochemical and ecosystem processes are linked by net primary production (NPP) in the ocean's surface layer, where inorganic carbon is fixed by photosynthetic processes. Determinations of NPP are necessarily a function of phytoplankton biomass and its physiological status, but the estimation of these two terms from space has remained an elusive target. Here we present new satellite ocean color observations of phytoplankton carbon (C) and chlorophyll (Chl) biomass and show that derived Chl:C ratios closely follow anticipated physiological dependencies on light, nutrients, and temperature. With this new information, global estimates of phytoplankton growth rates (mu) and carbon-based NPP are made for the first time. Compared to an earlier chlorophyll-based approach, our carbon-based values are considerably higher in tropical oceans, show greater seasonality at middle and high latitudes, and illustrate important differences in the formation and demise of regional algal blooms. This fusion of emerging concepts from the phycological and remote sensing disciplines has the potential to fundamentally change how we model and observe carbon cycling in the global oceans.
Resumo:
Background. The purpose of this study was to describe the risk factors and demographics of persons with salmonellosis and shigellosis and to investigate both seasonal and spatial variations in the occurrence of these infections in Texas from 2000 to 2004, utilizing time series analyses and the geographic information system digital mapping methods. ^ Methods. Spatial Analysis: MapInfo software was used to map the distribution of age-adjusted rates of reported shigellosis and salmonellosis in Texas from 2000–2004 by zip codes. Census data on above or below poverty level, household income, highest level of educational attainment, race, ethnicity, and urban/rural community status was obtained from the 2000 Decennial Census for each zip code. The zip codes with the upper 10% and lower 10% were compared using t-tests and logistic regression to determine whether there were any potential risk factors. ^ Temporal analysis. Seasonal patterns in the prevalence of infections in Texas from 2000 to 2003 were determined by performing time-series analysis on the numbers of cases of salmonellosis and shigellosis. A linear regression was also performed to assess for trends in the incidence of each disease, along with auto-correlation and multi-component cosinor analysis. ^ Results. Spatial analysis: Analysis by general linear model showed a significant association between infection rates and age, with young children aged less than 5 and those aged 5–9 years having increased risk of infection for both disease conditions. The data demonstrated that those populations with high percentages of people who attained a higher than high school education were less likely to be represented in zip codes with high rates of shigellosis. However, for salmonellosis, logistic regression models indicated that when compared to populations with high percentages of non-high school graduates, having a high school diploma or equivalent increased the odds of having a high rate of infection. ^ Temporal analysis. For shigellosis, multi-component cosinor analyses were used to determine the approximated cosine curve which represented a statistically significant representation of the time series data for all age groups by sex. The shigellosis results show 2 peaks, with a major peak occurring in June and a secondary peak appearing around October. Salmonellosis results showed a single peak and trough in all age groups with the peak occurring in August and the trough occurring in February. ^ Conclusion. The results from this study can be used by public health agencies to determine the timing of public health awareness programs and interventions in order to prevent salmonellosis and shigellosis from occurring. Because young children depend on adults for their meals, it is important to increase the awareness of day-care workers and new parents about modes of transmission and hygienic methods of food preparation and storage. ^
Resumo:
A Bayesian approach to estimation of the regression coefficients of a multinominal logit model with ordinal scale response categories is presented. A Monte Carlo method is used to construct the posterior distribution of the link function. The link function is treated as an arbitrary scalar function. Then the Gauss-Markov theorem is used to determine a function of the link which produces a random vector of coefficients. The posterior distribution of the random vector of coefficients is used to estimate the regression coefficients. The method described is referred to as a Bayesian generalized least square (BGLS) analysis. Two cases involving multinominal logit models are described. Case I involves a cumulative logit model and Case II involves a proportional-odds model. All inferences about the coefficients for both cases are described in terms of the posterior distribution of the regression coefficients. The results from the BGLS method are compared to maximum likelihood estimates of the regression coefficients. The BGLS method avoids the nonlinear problems encountered when estimating the regression coefficients of a generalized linear model. The method is not complex or computationally intensive. The BGLS method offers several advantages over Bayesian approaches. ^
Resumo:
A discussion of nonlinear dynamics, demonstrated by the familiar automobile, is followed by the development of a systematic method of analysis of a possibly nonlinear time series using difference equations in the general state-space format. This format allows recursive state-dependent parameter estimation after each observation thereby revealing the dynamics inherent in the system in combination with random external perturbations.^ The one-step ahead prediction errors at each time period, transformed to have constant variance, and the estimated parametric sequences provide the information to (1) formally test whether time series observations y(,t) are some linear function of random errors (ELEM)(,s), for some t and s, or whether the series would more appropriately be described by a nonlinear model such as bilinear, exponential, threshold, etc., (2) formally test whether a statistically significant change has occurred in structure/level either historically or as it occurs, (3) forecast nonlinear system with a new and innovative (but very old numerical) technique utilizing rational functions to extrapolate individual parameters as smooth functions of time which are then combined to obtain the forecast of y and (4) suggest a measure of resilience, i.e. how much perturbation a structure/level can tolerate, whether internal or external to the system, and remain statistically unchanged. Although similar to one-step control, this provides a less rigid way to think about changes affecting social systems.^ Applications consisting of the analysis of some familiar and some simulated series demonstrate the procedure. Empirical results suggest that this state-space or modified augmented Kalman filter may provide interesting ways to identify particular kinds of nonlinearities as they occur in structural change via the state trajectory.^ A computational flow-chart detailing computations and software input and output is provided in the body of the text. IBM Advanced BASIC program listings to accomplish most of the analysis are provided in the appendix. ^
Resumo:
Inflammatory breast cancer (IBC) is a rare but very aggressive form of locally advanced breast cancer (1-6% of total breast cancer patients in United States), with a 5-year overall survival rate of only 40.5%, compared with 85% of the non-IBC patients. So far, a unique molecular signature for IBC able to explain the dramatic differences in the tumor biology between IBC and non-IBC has not been identified. As immune cells in the tumor microenvironment plays an important role in regulating tumor progression, we hypothesized that tumor-associated dendritic cells (TADC) may be responsible for regulating the development of the aggressive characteristics of IBC. MiRNAs can be released into the extracellular space and mediate the intercellular communication by regulating target gene expression beyond their cells of origin. We hypothesized that miRNAs released by IBC cells can induce an increased activation status, secretion of pro-inflammatory cytokines and migration ability of TADC. In an in vitro model of IBC tumor microenvironment, we found that the co-cultured of the IBC cell line SUM-149 with immature dendritic cells (iDCSUM-149) induced a higher degree of activation and maturation of iDCSUM-149 upon stimulation with lipopolysaccharide (LPS) compared with iDCs co-cultured with the non-IBC cell line SUM-159 (iDCSUM-159), resulting in: increased expression of the costimulatory and activation markers; higher production of pro-inflammatory cytokines (TNF-a, IL-6); and 3) higher migratory ability. These differences were due to the exosome-mediated transfer of miR-19a and miR-146a from SUM-149 and SUM-159, respectively, to iDCs, causing the downregulation of the miR-19a target genes PTEN, SOCS-1 and the miR-146a target genes IRAK1, TRAF6. PTEN, SOCS-1 and IRAK1, TRAF6 are important negative and positive regulator of cytokine- and TLR-mediated activation/maturation signaling pathway in DCs. Increased levels of IL-6 induced the upregulation of miR-19a synthesis in SUM-149 cells that was associated with the induction of CD44+CD24-ALDH1+ cancer stem cells (CSCs) with epithelial-to-mesenchymal transition (EMT) characteristics. In conclusion, in IBC tumor microenvironment IL-6/miR-19a axis can represent a self-sustaining loop able to maintain a pro-inflammatory status of DCs, leading to the development of tumor cells with high metastatic potential (EMT CSCs) responsible of the poor prognosis in IBC patients.
Resumo:
This data set provides a high-resolution digital elevation model (DEM) of a thermokarst depression (~7 km²) on ice-complex deposits in the Arctic Lena Delta, Siberia. The DEM based on a geodetic field survey and was used for quantitative land surface analyses and detailed description of the thermokarst depression morphology. Detailed morphometrical analyses, volume calculations, and solar radiation modeling were performed and statistically analyzed by Ulrich et al. (2010) to investigate the asymmetrical thermokarst depression development and directed lake migration previously proposed by Morgenstern et al. (2008). Furthermore, the high-resolution DEM in combination with satellite data allowed detailed analyses of spatial and temporal landscape changes due to thermokarst development (Günther, 2009).