914 resultados para Data selection


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Due to the health impacts caused by exposures to air pollutants in urban areas, monitoring and forecasting of air quality parameters have become popular as an important topic in atmospheric and environmental research today. The knowledge on the dynamics and complexity of air pollutants behavior has made artificial intelligence models as a useful tool for a more accurate pollutant concentration prediction. This paper focuses on an innovative method of daily air pollution prediction using combination of Support Vector Machine (SVM) as predictor and Partial Least Square (PLS) as a data selection tool based on the measured values of CO concentrations. The CO concentrations of Rey monitoring station in the south of Tehran, from Jan. 2007 to Feb. 2011, have been used to test the effectiveness of this method. The hourly CO concentrations have been predicted using the SVM and the hybrid PLS–SVM models. Similarly, daily CO concentrations have been predicted based on the aforementioned four years measured data. Results demonstrated that both models have good prediction ability; however the hybrid PLS–SVM has better accuracy. In the analysis presented in this paper, statistic estimators including relative mean errors, root mean squared errors and the mean absolute relative error have been employed to compare performances of the models. It has been concluded that the errors decrease after size reduction and coefficients of determination increase from 56 to 81% for SVM model to 65–85% for hybrid PLS–SVM model respectively. Also it was found that the hybrid PLS–SVM model required lower computational time than SVM model as expected, hence supporting the more accurate and faster prediction ability of hybrid PLS–SVM model.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The Load/Unload Response Ratio (LURR) method is proposed for short-to-intermediate-term earthquake prediction [Yin, X.C., Chen, X.Z., Song, Z.P., Yin, C., 1995. A New Approach to Earthquake Prediction — The Load/Unload Response Ratio (LURR) Theory, Pure Appl. Geophys., 145, 701–715]. This method is based on measuring the ratio between Benioff strains released during the time periods of loading and unloading, corresponding to the Coulomb Failure Stress change induced by Earth tides on optimally oriented faults. According to the method, the LURR time series usually climb to an anomalously high peak prior to occurrence of a large earthquake. Previous studies have indicated that the size of critical seismogenic region selected for LURR measurements has great influence on the evaluation of LURR. In this study, we replace the circular region usually adopted in LURR practice with an area within which the tectonic stress change would mostly affect the Coulomb stress on a potential seismogenic fault of a future event. The Coulomb stress change before a hypothetical earthquake is calculated based on a simple back-slip dislocation model of the event. This new algorithm, by combining the LURR method with our choice of identified area with increased Coulomb stress, is devised to improve the sensitivity of LURR to measure criticality of stress accumulation before a large earthquake. Retrospective tests of this algorithm on four large earthquakes occurred in California over the last two decades show remarkable enhancement of the LURR precursory anomalies. For some strong events of lesser magnitudes occurred in the same neighborhoods and during the same time periods, significant anomalies are found if circular areas are used, and are not found if increased Coulomb stress areas are used for LURR data selection. The unique feature of this algorithm may provide stronger constraints on forecasts of the size and location of future large events.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A significant cost in obtaining acoustic training data is the generation of accurate transcriptions. For some sources close-caption data is available. This allows the use of lightly-supervised training techniques. However, for some sources and languages close-caption is not available. In these cases unsupervised training techniques must be used. This paper examines the use of unsupervised techniques for discriminative training. In unsupervised training automatic transcriptions from a recognition system are used for training. As these transcriptions may be errorful data selection may be useful. Two forms of selection are described, one to remove non-target language shows, the other to remove segments with low confidence. Experiments were carried out on a Mandarin transcriptions task. Two types of test data were considered, Broadcast News (BN) and Broadcast Conversations (BC). Results show that the gains from unsupervised discriminative training are highly dependent on the accuracy of the automatic transcriptions. © 2007 IEEE.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This thesis addresses the fine structure, both radial and lateral, of compressional wave velocity and attenuation of the Earth's core and the lowermost mantle using waveforms, differential travel times and amplitudes of PKP waves, which penetrate the Earth's core.

The structure near the inner core boundary (ICB) is studied by analyzing waveforms of a regional sample. The waveform modeling approach is demonstrated to be an effective tool for constrainning the ICB structure. The best model features a sharp velocity jump of 0.78km/s at the ICB and a low velocity gradient at the lowermost outer core (indicating possible inhomogeneity) and high attenuation at the top of the inner core.

A spherically symmetric P-wave model of the core, is proposed from PKP differential times, waveforms and amplitudes. The ICB remains sharp with a velocity jump of 0. 78km/ s. A very low velocity gradient at the base of the fluid core is demonstrated to be a robust feature, indicating inhomogeneity is practically inevitable. The model also indicates that the attenuation in the inner core decreases with depth. The velocity at D" is smaller than PREM.

The inner core is confirmed to be very anisotropic, possessing a cylindrical symmetry around the Earth spin axis with the N-S direction 3% faster than the E-W direction. All of the N-S rays through the inner core were found to be faster than the E-W rays by 1.5 to 3.5s. Exhaustive data selection and efforts in insolating contributions from the region above ensure that this is an inner core feature.

The anisotropy at the very top of the inner core is found to be distinctly different from the deeper part. The top 60km of the inner core is not anisotropic. From 60km to 150km, there appears to be a transition from isotropy to anisotropy.

PKP differential travel times are used to study the P velocity structure in D". Systematic regional variations of up to 2s in AB-DF times were observed, attributed primarily to heterogeneities in the lower 500km of the mantle. However, direct comparisons with tomographic models are not successful.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Esta tese de doutorado tem como foco as limitações que se colocam aos gestores do nível local para produzir serviços de qualidade em sistemas de saúde. A definição tradicional do conceito de gestão de recursos humanos foi ampliada por abarcar questões relacionadas à configuração federalista do país, considerando diversidades estruturais e culturais. Utilizou como fonte primária, dados selecionados a partir de entrevistas com gestores de RH com reconhecida experiência na área. Como fonte secundária, utilizou-se as informações pré-selecionadas de pesquisas disponíveis sobre o campo de gestão do trabalho da saúde. A análise do autor confronta as características estruturais do federalismo brasileiro e os desafios decorrentes da base da concepção do sistema nacional de saúde. Os resultados deste estudo iluminam possíveis caminhos alternativos para superar as limitações presentes na gestão de recursos humanos no nível local.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Este estudo abordou o contexto de uma engrenagem macropolítica, na qual o poder dominante (representado pela indústria farmacêutica) se articula à indústria do conhecimento, condicionando construção e divulgação de conhecimento biomédico à mesma lógica da produção e distribuição capitalista de mercadorias. A indústria farmacêutica tem investido em projetos de pesquisas clínicas, com o objetivo de legitimar, cientificamente, seus produtos. A investigação deste papel financiador de ensaios clínicos baseou-se nos depoimentos de quatro médicos, professores de Medicina, coparticipantes de projetos da indústria farmacêutica. Os resultados evidenciaram que os protocolos das pesquisas são elaborados pela própria indústria, sem participação do médico colaborador; o acesso à íntegra dos dados coletados é de exclusividade dos coordenadores centrais da pesquisa; e, os resultados dos ensaios, apresentados através de resumos, são previamente submetidos a critérios de seleção de dados. Com o objetivo de articular saber médico a expectativas de mercado, o poder dominante se exerce através de técnicas de poder (estratégias de marketing), atribuindo aos médicos o papel de técnicos de poder, a serviço de seus interesses. Essa relativização de valores clama por uma crítica revisão ética.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Three experiments investigated the effect of rarity on people's selection and interpretation of data in a variant of the pseudodiagnosticity task. For familiar (Experiment 1) but not for arbitrary (Experiment 3) materials, participants were more likely to select evidence so as to complete a likelihood ratio when the initial evidence they received was a single likelihood concerning a rare feature. This rarity effect with familiar materials was replicated in Experiment 2 where it was shown that participants were relatively insensitive to explicit manipulations of the likely diagnosticity of rare evidence. In contrast to the effects for data selection, there was an effect of rarity on confidence ratings after receipt of a single likelihood for arbitrary but not for familiar materials. It is suggested that selecting diagnostic evidence necessitates explicit consideration of the alternative hypothesis and that consideration of the possible consequences of the evidence for the alternative weakens the rarity effect in confidence ratings. Paradoxically, although rarity effects in evidence selection and confidence ratings are in the spirit of Bayesian reasoning, the effect on confidence ratings appears to rely on participants thinking less about the alternative hypothesis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Objective To investigate the effects of weaning protocols on the total duration of mechanical ventilation, mortality, adverse events, quality of life, weaning duration, and length of stay in the intensive care unit and hospital.

Design Systematic review.

Data sources Cochrane Central Register of Controlled Trials, Medline, Embase, CINAHL, LILACS, ISI Web of Science, ISI Conference Proceedings, Cambridge Scientific Abstracts, and reference lists of articles. We did not apply language restrictions.

Review methods We included randomised and quasi-randomised controlled trials of weaning from mechanical ventilation with and without protocols in critically ill adults.

Data selection Three authors independently assessed trial quality and extracted data. A priori subgroup and sensitivity analyses were performed. We contacted study authors for additional information.

Results Eleven trials that included 1971 patients met the inclusion criteria. Compared with usual care, the geometric mean duration of mechanical ventilation in the weaning protocol group was reduced by 25% (95% confidence interval 9% to 39%, P=0.006; 10 trials); the duration of weaning was reduced by 78% (31% to 93%, P=0.009; six trials); and stay in the intensive care unit length by 10% (2% to 19%, P=0.02; eight trials). There was significant heterogeneity among studies for total duration of mechanical ventilation (I(2)=76%, P

Conclusion There is evidence of a reduction in the duration of mechanical ventilation, weaning, and stay in the intensive care unit when standardised weaning protocols are used, but there is significant heterogeneity among studies and an insufficient number of studies to investigate the source of this heterogeneity. Some studies suggest that organisational context could influence outcomes, but this could not be evaluated as it was outside the scope of this review.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Introduction: HIV testing is a cornerstone of efforts to combat the HIV epidemic, and testing conducted as part of surveillance provides invaluable data on the spread of infection and the effectiveness of campaigns to reduce the transmission of HIV. However, participation in HIV testing can be low, and if respondents systematically select not to be tested because they know or suspect they are HIV positive (and fear disclosure), standard approaches to deal with missing data will fail to remove selection bias. We implemented Heckman-type selection models, which can be used to adjust for missing data that are not missing at random, and established the extent of selection bias in a population-based HIV survey in an HIV hyperendemic community in rural South Africa.

Methods: We used data from a population-based HIV survey carried out in 2009 in rural KwaZulu-Natal, South Africa. In this survey, 5565 women (35%) and 2567 men (27%) provided blood for an HIV test. We accounted for missing data using interviewer identity as a selection variable which predicted consent to HIV testing but was unlikely to be independently associated with HIV status. Our approach involved using this selection variable to examine the HIV status of residents who would ordinarily refuse to test, except that they were allocated a persuasive interviewer. Our copula model allows for flexibility when modelling the dependence structure between HIV survey participation and HIV status.

Results: For women, our selection model generated an HIV prevalence estimate of 33% (95% CI 27–40) for all people eligible to consent to HIV testing in the survey. This estimate is higher than the estimate of 24% generated when only information from respondents who participated in testing is used in the analysis, and the estimate of 27% when imputation analysis is used to predict missing data on HIV status. For men, we found an HIV prevalence of 25% (95% CI 15–35) using the selection model, compared to 16% among those who participated in testing, and 18% estimated with imputation. We provide new confidence intervals that correct for the fact that the relationship between testing and HIV status is unknown and requires estimation.

Conclusions: We confirm the feasibility and value of adopting selection models to account for missing data in population-based HIV surveys and surveillance systems. Elements of survey design, such as interviewer identity, present the opportunity to adopt this approach in routine applications. Where non-participation is high, true confidence intervals are much wider than those generated by standard approaches to dealing with missing data suggest.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We present an improved, biologically inspired and multiscale keypoint operator. Models of single- and double-stopped hypercomplex cells in area V1 of the mammalian visual cortex are used to detect stable points of high complexity at multiple scales. Keypoints represent line and edge crossings, junctions and terminations at fine scales, and blobs at coarse scales. They are detected by applying first and second derivatives to responses of complex cells in combination with two inhibition schemes to suppress responses along lines and edges. A number of optimisations make our new algorithm much faster than previous biologically inspired models, achieving real-time performance on modern GPUs and competitive speeds on CPUs. In this paper we show that the keypoints exhibit state-of-the-art repeatability in standardised benchmarks, often yielding best-in-class performance. This makes them interesting both in biological models and as a useful detector in practice. We also show that keypoints can be used as a data selection step, significantly reducing the complexity in state-of-the-art object categorisation. (C) 2014 Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Ever since Adam Smith, economists have argued that share contracts do not provide proper incentives. This paper uses tenancy data from India to assess the existence of missing incentives in this classical example of moral hazard. Sharecroppers are found to be less productive than owners, but as productive as fixed-rent tenants. Also, the productivity gap between owners and both types of tenants is driven by sample-selection issues. An endogenous selection rule matches tenancy contracts with less-skilled farmers and lower-quality lands. Due to complementarity, such a matching affects tenants’ input choices. Controlling for that, the contract form has no effect on the expected output. Next, I explicitly model farmer’s optimal decisions to test the existence of non-contractible inputs being misused. No evidence of missing incentives is found.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The objective of this study was to estimate (co)variance components using random regression on B-spline functions to weight records obtained from birth to adulthood. A total of 82 064 weight records of 8145 females obtained from the data bank of the Nellore Breeding Program (PMGRN/Nellore Brazil) which started in 1987, were used. The models included direct additive and maternal genetic effects and animal and maternal permanent environmental effects as random. Contemporary group and dam age at calving (linear and quadratic effect) were included as fixed effects, and orthogonal Legendre polynomials of age (cubic regression) were considered as random covariate. The random effects were modeled using B-spline functions considering linear, quadratic and cubic polynomials for each individual segment. Residual variances were grouped in five age classes. Direct additive genetic and animal permanent environmental effects were modeled using up to seven knots (six segments). A single segment with two knots at the end points of the curve was used for the estimation of maternal genetic and maternal permanent environmental effects. A total of 15 models were studied, with the number of parameters ranging from 17 to 81. The models that used B-splines were compared with multi-trait analyses with nine weight traits and to a random regression model that used orthogonal Legendre polynomials. A model fitting quadratic B-splines, with four knots or three segments for direct additive genetic effect and animal permanent environmental effect and two knots for maternal additive genetic effect and maternal permanent environmental effect, was the most appropriate and parsimonious model to describe the covariance structure of the data. Selection for higher weight, such as at young ages, should be performed taking into account an increase in mature cow weight. Particularly, this is important in most of Nellore beef cattle production systems, where the cow herd is maintained on range conditions. There is limited modification of the growth curve of Nellore cattle with respect to the aim of selecting them for rapid growth at young ages while maintaining constant adult weight.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This study investigated, by descriptive documental research, the intervention models in Sexual Education in the school with the participation of the family and use of new technologies. The methodological stages were: collects in bases of data, selection and reading of the studies and elaboration of thematic groupings, by the content analysis. Four thematic group were elaborated: 1) intervention models with professionals; 2) intervention models with adolescents; 3) intervention models with the direct participation of the family and 4) Appreciation of the family’s participation in sexual education. The analised articles prioritize the adolescent population, when descrive the intervention. Few studies consider the family as important to sexual education and the techniques of the computer science are not very explored as pedagogic resources.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The objective of this study was to estimate (co)variance components using random regression on B-spline functions to weight records obtained from birth to adulthood. A total of 82 064 weight records of 8145 females obtained from the data bank of the Nellore Breeding Program (PMGRN/Nellore Brazil) which started in 1987, were used. The models included direct additive and maternal genetic effects and animal and maternal permanent environmental effects as random. Contemporary group and dam age at calving (linear and quadratic effect) were included as fixed effects, and orthogonal Legendre polynomials of age (cubic regression) were considered as random covariate. The random effects were modeled using B-spline functions considering linear, quadratic and cubic polynomials for each individual segment. Residual variances were grouped in five age classes. Direct additive genetic and animal permanent environmental effects were modeled using up to seven knots (six segments). A single segment with two knots at the end points of the curve was used for the estimation of maternal genetic and maternal permanent environmental effects. A total of 15 models were studied, with the number of parameters ranging from 17 to 81. The models that used B-splines were compared with multi-trait analyses with nine weight traits and to a random regression model that used orthogonal Legendre polynomials. A model fitting quadratic B-splines, with four knots or three segments for direct additive genetic effect and animal permanent environmental effect and two knots for maternal additive genetic effect and maternal permanent environmental effect, was the most appropriate and parsimonious model to describe the covariance structure of the data. Selection for higher weight, such as at young ages, should be performed taking into account an increase in mature cow weight. Particularly, this is important in most of Nellore beef cattle production systems, where the cow herd is maintained on range conditions. There is limited modification of the growth curve of Nellore cattle with respect to the aim of selecting them for rapid growth at young ages while maintaining constant adult weight.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This thesis describes the developments of new models and toolkits for the orbit determination codes to support and improve the precise radio tracking experiments of the Cassini-Huygens mission, an interplanetary mission to study the Saturn system. The core of the orbit determination process is the comparison between observed observables and computed observables. Disturbances in either the observed or computed observables degrades the orbit determination process. Chapter 2 describes a detailed study of the numerical errors in the Doppler observables computed by NASA's ODP and MONTE, and ESA's AMFIN. A mathematical model of the numerical noise was developed and successfully validated analyzing against the Doppler observables computed by the ODP and MONTE, with typical relative errors smaller than 10%. The numerical noise proved to be, in general, an important source of noise in the orbit determination process and, in some conditions, it may becomes the dominant noise source. Three different approaches to reduce the numerical noise were proposed. Chapter 3 describes the development of the multiarc library, which allows to perform a multi-arc orbit determination with MONTE. The library was developed during the analysis of the Cassini radio science gravity experiments of the Saturn's satellite Rhea. Chapter 4 presents the estimation of the Rhea's gravity field obtained from a joint multi-arc analysis of Cassini R1 and R4 fly-bys, describing in details the spacecraft dynamical model used, the data selection and calibration procedure, and the analysis method followed. In particular, the approach of estimating the full unconstrained quadrupole gravity field was followed, obtaining a solution statistically not compatible with the condition of hydrostatic equilibrium. The solution proved to be stable and reliable. The normalized moment of inertia is in the range 0.37-0.4 indicating that Rhea's may be almost homogeneous, or at least characterized by a small degree of differentiation.