218 results for stratified random sampling
Abstract:
Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of parameters in these models is therefore an important problem, and becomes a key factor when learning from very large data sets. This paper describes exponentiated gradient (EG) algorithms for training such models, where EG updates are applied to the convex dual of either the log-linear or max-margin objective function; the dual in both the log-linear and max-margin cases corresponds to minimizing a convex function with simplex constraints. We study both batch and online variants of the algorithm, and provide rates of convergence for both cases. In the max-margin case, O(1/ε) EG updates are required to reach a given accuracy ε in the dual; in contrast, for log-linear models only O(log(1/ε)) updates are required. For both the max-margin and log-linear cases, our bounds suggest that the online EG algorithm requires a factor of n less computation to reach a desired accuracy than the batch EG algorithm, where n is the number of training examples. Our experiments confirm that the online algorithms are much faster than the batch algorithms in practice. We describe how the EG updates factor in a convenient way for structured prediction problems, allowing the algorithms to be efficiently applied to problems such as sequence learning or natural language parsing. We perform extensive evaluation of the algorithms, comparing them to L-BFGS and stochastic gradient descent for log-linear models, and to SVM-Struct for max-margin models. The algorithms are applied to a multi-class problem as well as to a more complex large-scale parsing task. In all these settings, the EG algorithms presented here outperform the other methods.
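To illustrate the kind of update the abstract refers to, here is a minimal sketch of batch exponentiated gradient steps on the probability simplex. It is not the paper's algorithm for the log-linear or max-margin dual: the toy quadratic objective, the step size `eta`, and the helper names `eg_step`, `minimize_on_simplex`, and `toy_grad` are all assumptions made for illustration.

```python
import math


def eg_step(u, grad, eta):
    """One exponentiated gradient (EG) update for a point u on the simplex."""
    # Multiplicative update: scale each coordinate by exp(-eta * gradient).
    weights = [u_i * math.exp(-eta * g_i) for u_i, g_i in zip(u, grad)]
    # Renormalize so the iterate stays on the probability simplex.
    total = sum(weights)
    return [w / total for w in weights]


def minimize_on_simplex(grad_fn, dim, eta=0.5, steps=300):
    """Run batch EG starting from the uniform distribution (hypothetical helper)."""
    u = [1.0 / dim] * dim
    for _ in range(steps):
        u = eg_step(u, grad_fn(u), eta)
    return u


# Toy convex objective (an assumption, not the paper's dual objective):
# f(u) = 0.5 * ||u - target||^2, whose gradient is u - target.
target = [0.7, 0.2, 0.1]


def toy_grad(u):
    return [u_i - t_i for u_i, t_i in zip(u, target)]


u_star = minimize_on_simplex(toy_grad, dim=3)
print([round(x, 4) for x in u_star])
```

Because the update is multiplicative and followed by normalization, every iterate remains strictly positive and sums to one, so no explicit projection onto the simplex is needed.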
Abstract:
Ocean processes are dynamic, complex, and occur on multiple spatial and temporal scales. To obtain a synoptic view of such processes, ocean scientists collect data over long time periods. Historically, measurements were continually provided by fixed sensors, e.g., moorings, or gathered from ships. Recently, an increase in the utilization of autonomous underwater vehicles has enabled a more dynamic data acquisition approach. However, we still do not utilize the full capabilities of these vehicles. Here we present algorithms that produce persistent monitoring missions for underwater vehicles by balancing path following accuracy and sampling resolution for a given region of interest, which addresses a pressing need among ocean scientists to efficiently and effectively collect high-value data. More specifically, this paper proposes a path planning algorithm and a speed control algorithm for underwater gliders, which together give informative trajectories for the glider to persistently monitor a patch of ocean. We optimize a cost function that blends two competing factors: maximize the information value along the path, while minimizing deviation from the planned path due to ocean currents. Speed is controlled along the planned path by adjusting the pitch angle of the underwater glider, so that higher resolution samples are collected in areas of higher information value. The resulting paths are closed circuits that can be repeatedly traversed to collect long-term ocean data in dynamic environments. The algorithms were tested during sea trials on an underwater glider operating off the coast of southern California, as well as in Monterey Bay, California. The experimental results show significant improvements in data resolution and path reliability compared to previously executed sampling paths used in the respective regions.
Abstract:
In the present study we investigate the effect of viscous dissipation on natural convection from a vertical plate placed in a thermally stratified environment. The reduced equations are integrated using the implicit finite difference scheme of the Keller box method to obtain the effect of heat due to viscous dissipation on the local skin friction and local Nusselt number at various stratification levels, for fluids having Prandtl numbers of 10, 50, and 100. Solutions are also obtained using the perturbation technique for small values of the viscous dissipation parameter $\xi$ and compared to the finite difference solutions for $0 \le \xi \le 1$. The effects of viscous dissipation and temperature stratification on the velocity and temperature distributions in the boundary layer region are also shown.
Abstract:
We present here a numerical study of laminar doubly diffusive free convection flows adjacent to a vertical surface in a stable thermally stratified medium. The governing equations of mass, momentum, energy and species are non-dimensionalized. These equations have been solved using an implicit finite difference method and a local non-similarity method. The results, presented in both tabular and graphical form, show many interesting aspects of the complex interaction of the two buoyant mechanisms.
Abstract:
Background: Anemia due to iron deficiency is recognized as one of the major nutritional deficiencies in women and children in developing countries. Daily iron supplementation for pregnant women is recommended in many countries although there are few reports of these programs working efficiently or effectively. Weekly iron-folic acid supplementation (WIFS) and regular deworming treatment is recommended for non-pregnant women living in areas with high rates of anemia. Following a baseline survey to assess the prevalence of anemia, iron deficiency and soil transmitted helminth infections, we implemented a program to make WIFS and regular deworming treatment freely and universally available for all women of reproductive age in two districts of a province in northern Vietnam over a 12 month period. The impact of the program at the population level was assessed in terms of: i) change in mean hemoglobin and iron status indicators, and ii) change in the prevalence of anemia, iron deficiency and hookworm infections. Method: Distribution of WIFS and deworming were integrated with routine health services and made available to 52,000 women. Demographic data and blood and stool samples were collected at baseline and in three- and 12-month post-implementation surveys using a population-based, stratified multi-stage cluster sampling design. Results: The mean Hb increased by 9.6 g/L (95% CI 5.7, 13.5; p < 0.001) during the study period. Anemia (Hb < 120 g/L) was present in 131/349 (37.5%, 95% CI 31.3, 44.8) subjects at baseline, and in 70/363 (19.3%, 95% CI 14.0, 24.6) after twelve months. Iron deficiency fell from 75/329 (22.8%, 95% CI 16.9, 28.6) to 33/353 (9.3%, 95% CI 5.7, 13.0) by the 12-month survey, and hookworm infection from 279/366 (76.2%, 95% CI 68.6, 83.8) to 66/287 (23.0%, 95% CI 17.5, 28.5) over the same period. Conclusion: A free, universal WIFS program with regular deworming was associated with reduced prevalence and severity of anemia, iron deficiency and ho
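The prevalence figures quoted in the Results can be checked directly from the reported counts. The sketch below recomputes each percentage and, for comparison only, a simple Wald interval that assumes simple random sampling. That assumption is ours: the study used a stratified multi-stage cluster design, so the published intervals are not expected to match this calculation. The helper names `prevalence` and `wald_interval` are hypothetical.

```python
import math


def prevalence(cases, n):
    """Proportion of sampled subjects with the condition."""
    return cases / n


def wald_interval(cases, n, z=1.96):
    """Approximate 95% interval under simple random sampling (an assumption)."""
    p = cases / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return max(0.0, p - half_width), min(1.0, p + half_width)


# Counts as reported in the abstract: (baseline, 12-month survey).
surveys = {
    "anemia": ((131, 349), (70, 363)),
    "iron deficiency": ((75, 329), (33, 353)),
    "hookworm infection": ((279, 366), (66, 287)),
}

for name, (baseline, followup) in surveys.items():
    p0 = prevalence(*baseline)
    p1 = prevalence(*followup)
    print(f"{name}: {100 * p0:.1f}% -> {100 * p1:.1f}%")

# Simple-random-sampling interval for baseline anemia, for comparison only.
srs_low, srs_high = wald_interval(131, 349)
print(f"baseline anemia, SRS interval: {100 * srs_low:.1f}% to {100 * srs_high:.1f}%")
```

The simple-random-sampling interval for baseline anemia is narrower than the reported one (31.3, 44.8), which would be consistent with the published interval accounting for the clustered design, although the abstract does not state how its intervals were computed.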
Abstract:
Analytical expressions are derived for the mean and variance of estimates of the bispectrum of a real-time series assuming a cosinusoidal model. The effects of spectral leakage, inherent in discrete Fourier transform operation when the modes present in the signal have a nonintegral number of wavelengths in the record, are included in the analysis. A single phase-coupled triad of modes can cause the bispectrum to have a nonzero mean value over the entire region of computation owing to leakage. The variance of bispectral estimates in the presence of leakage has contributions from individual modes and from triads of phase-coupled modes. Time-domain windowing reduces the leakage. The theoretical expressions for the mean and variance of bispectral estimates are derived in terms of a function dependent on an arbitrary symmetric time-domain window applied to the record, the number of data, and the statistics of the phase coupling among triads of modes. The theoretical results are verified by numerical simulations for simple test cases and applied to laboratory data to examine phase coupling in a hypothesis testing framework.
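As a rough illustration of a windowed bispectral estimate, the sketch below averages the triple product X(k1) X(k2) X*(k1+k2) over many records of a three-mode cosine signal, once with a phase-coupled triad and once with independent phases. It is not the paper's analysis: the record length, the mode numbers, the periodic Hann window, and all function names are assumptions, and the modes are placed on integer bins so the leakage effects studied in the paper do not arise here.

```python
import cmath
import math
import random


def hann(n_samples):
    """Periodic Hann window (one possible symmetric-in-time taper)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * n / n_samples) for n in range(n_samples)]


def dft_bin(x, k):
    """Discrete Fourier transform of x evaluated at a single bin k."""
    n_samples = len(x)
    return sum(v * cmath.exp(-2j * math.pi * k * n / n_samples) for n, v in enumerate(x))


def bispectrum_estimate(records, k1, k2):
    """Average X(k1) X(k2) conj(X(k1 + k2)) over windowed records."""
    window = hann(len(records[0]))
    total = 0j
    for record in records:
        xw = [w * v for w, v in zip(window, record)]
        total += dft_bin(xw, k1) * dft_bin(xw, k2) * dft_bin(xw, k1 + k2).conjugate()
    return total / len(records)


def make_records(n_records, n_samples, k1, k2, coupled, rng):
    """Three unit-amplitude cosines; the third phase is p1 + p2 when coupled."""
    records = []
    for _ in range(n_records):
        p1 = rng.uniform(0, 2 * math.pi)
        p2 = rng.uniform(0, 2 * math.pi)
        p3 = p1 + p2 if coupled else rng.uniform(0, 2 * math.pi)
        records.append([
            math.cos(2 * math.pi * k1 * n / n_samples + p1)
            + math.cos(2 * math.pi * k2 * n / n_samples + p2)
            + math.cos(2 * math.pi * (k1 + k2) * n / n_samples + p3)
            for n in range(n_samples)
        ])
    return records


rng = random.Random(0)
coupled = abs(bispectrum_estimate(make_records(200, 64, 5, 8, True, rng), 5, 8))
uncoupled = abs(bispectrum_estimate(make_records(200, 64, 5, 8, False, rng), 5, 8))
print(coupled, uncoupled)
```

With coupled phases the triple products add coherently, whereas with independent phases they tend to cancel in the average, so the magnitude of the estimate separates the two cases. Comparing that magnitude against its sampling variability is the kind of question a hypothesis testing framework would address.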
Abstract:
The CDKN2 gene, encoding the cyclin-dependent kinase inhibitor p16, is a tumour suppressor gene that maps to chromosome band 9p21-p22. The most common mechanism of inactivation of this gene in human cancers is through homozygous deletion; however, in a smaller proportion of tumours and tumour cell lines intragenic mutations occur. In this study we have compiled a database of over 120 published point mutations in the CDKN2 gene from a wide variety of tumour types. A further 50 deletions, insertions, and splice mutations in CDKN2 have also been compiled. Furthermore, we have standardised the numbering of all mutations according to the full-length 156 amino acid form of p16. From this study we are able to define several hot spots, some of which occur at conserved residues within the ankyrin domains of p16. While many of the hotspots are shared by a number of cancers, the relative importance of each position varies, possibly reflecting the role of different carcinogens in the development of certain tumours. As reported previously, the mutational spectrum of CDKN2 in melanomas differs from that of internal malignancies and supports the involvement of UV in melanoma tumorigenesis. Notably, 52% of all substitutions in melanoma-derived samples occurred at just six nucleotide positions. Nonsense mutations comprise a comparatively high proportion of mutations present in the CDKN2 gene, and possible explanations for this are discussed.
Abstract:
This paper reports the feasibility and methodological considerations of using the Short Message System Experience Sampling (SMS-ES) Method, which is an experience sampling research method developed to assist researchers to collect repeat measures of consumers’ affective experiences. The method combines SMS with web-based technology in a simple yet effective way. It is described using a practical implementation study that collected consumers’ emotions in response to using mobile phones in everyday situations. The method is further evaluated in terms of the quality of data collected in the study, as well as against the methodological considerations for experience sampling studies. These two evaluations suggest that the SMS-ES Method is both a valid and reliable approach for collecting consumers’ affective experiences. Moreover, the method can be applied across a range of for-profit and not-for-profit contexts where researchers want to capture repeated measures of consumers’ affective experiences occurring over a period of time. The benefits of the method are discussed to assist researchers who wish to apply the SMS-ES Method in their own research designs.
Abstract:
Natural convection flow from an isothermal vertical plate with a uniform heat source embedded in a stratified medium is discussed in this paper. The resulting momentum and energy equations of the boundary layer approximation are made non-similar by introducing the usual non-similarity transformations. Numerical solutions of these equations are obtained by an implicit finite difference method for a wide range of the stratification parameter, X. The solutions are also obtained for different values of the pertinent parameters, namely the Prandtl number, Pr, and the heat generation or absorption parameter, λ, and are expressed in terms of the local skin friction and local heat transfer, which are shown graphically. The effects of heat generation or absorption on the streamlines and isotherms are also shown graphically for different values of λ.