953 resultados para Estimated data sequences


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Knowledge discovery in databases is the non-trivial process of identifying valid, novel potentially useful and ultimately understandable patterns from data. The term Data mining refers to the process which does the exploratory analysis on the data and builds some model on the data. To infer patterns from data, data mining involves different approaches like association rule mining, classification techniques or clustering techniques. Among the many data mining techniques, clustering plays a major role, since it helps to group the related data for assessing properties and drawing conclusions. Most of the clustering algorithms act on a dataset with uniform format, since the similarity or dissimilarity between the data points is a significant factor in finding out the clusters. If a dataset consists of mixed attributes, i.e. a combination of numerical and categorical variables, a preferred approach is to convert different formats into a uniform format. The research study explores the various techniques to convert the mixed data sets to a numerical equivalent, so as to make it equipped for applying the statistical and similar algorithms. The results of clustering mixed category data after conversion to numeric data type have been demonstrated using a crime data set. The thesis also proposes an extension to the well known algorithm for handling mixed data types, to deal with data sets having only categorical data. The proposed conversion has been validated on a data set corresponding to breast cancer. Moreover, another issue with the clustering process is the visualization of output. Different geometric techniques like scatter plot, or projection plots are available, but none of the techniques display the result projecting the whole database but rather demonstrate attribute-pair wise analysis

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, two probabilistic adaptive algorithmsfor jointly detecting active users in a DS-CDMA system arereported. The first one, which is based on the theory of hiddenMarkov models (HMM’s) and the Baum–Wech (BW) algorithm,is proposed within the CDMA scenario and compared withthe second one, which is a previously developed Viterbi-basedalgorithm. Both techniques are completely blind in the sense thatno knowledge of the signatures, channel state information, ortraining sequences is required for any user. Once convergencehas been achieved, an estimate of the signature of each userconvolved with its physical channel response (CR) and estimateddata sequences are provided. This CR estimate can be used toswitch to any decision-directed (DD) adaptation scheme. Performanceof the algorithms is verified via simulations as well as onexperimental data obtained in an underwater acoustics (UWA)environment. In both cases, performance is found to be highlysatisfactory, showing the near–far resistance of the analyzed algorithms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

More than 70 years ago it was recognised that ionospheric F2-layer critical frequencies [foF2] had a strong relationship to sunspot number. Using historic datasets from the Slough and Washington ionosondes, we evaluate the best statistical fits of foF2 to sunspot numbers (at each Universal Time [UT] separately) in order to search for drifts and abrupt changes in the fit residuals over Solar Cycles 17-21. This test is carried out for the original composite of the Wolf/Zürich/International sunspot number [R], the new “backbone” group sunspot number [RBB] and the proposed “corrected sunspot number” [RC]. Polynomial fits are made both with and without allowance for the white-light facular area, which has been reported as being associated with cycle-to-cycle changes in the sunspot number - foF2 relationship. Over the interval studied here, R, RBB, and RC largely differ in their allowance for the “Waldmeier discontinuity” around 1945 (the correction factor for which for R, RBB and RC is, respectively, zero, effectively over 20 %, and explicitly 11.6 %). It is shown that for Solar Cycles 18-21, all three sunspot data sequences perform well, but that the fit residuals are lowest and most uniform for RBB. We here use foF2 for those UTs for which R, RBB, and RC all give correlations exceeding 0.99 for intervals both before and after the Waldmeier discontinuity. The error introduced by the Waldmeier discontinuity causes R to underestimate the fitted values based on the foF2 data for 1932-1945 but RBB overestimates them by almost the same factor, implying that the correction for the Waldmeier discontinuity inherent in RBB is too large by a factor of two. Fit residuals are smallest and most uniform for RC and the ionospheric data support the optimum discontinuity multiplicative correction factor derived from the independent Royal Greenwich Observatory (RGO) sunspot group data for the same interval.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this paper, we develop a data-driven methodology to characterize the likelihood of orographic precipitation enhancement using sequences of weather radar images and a digital elevation model (DEM). Geographical locations with topographic characteristics favorable to enforce repeatable and persistent orographic precipitation such as stationary cells, upslope rainfall enhancement, and repeated convective initiation are detected by analyzing the spatial distribution of a set of precipitation cells extracted from radar imagery. Topographic features such as terrain convexity and gradients computed from the DEM at multiple spatial scales as well as velocity fields estimated from sequences of weather radar images are used as explanatory factors to describe the occurrence of localized precipitation enhancement. The latter is represented as a binary process by defining a threshold on the number of cell occurrences at particular locations. Both two-class and one-class support vector machine classifiers are tested to separate the presumed orographic cells from the nonorographic ones in the space of contributing topographic and flow features. Site-based validation is carried out to estimate realistic generalization skills of the obtained spatial prediction models. Due to the high class separability, the decision function of the classifiers can be interpreted as a likelihood or susceptibility of orographic precipitation enhancement. The developed approach can serve as a basis for refining radar-based quantitative precipitation estimates and short-term forecasts or for generating stochastic precipitation ensembles conditioned on the local topography.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This correspondence proposes a new algorithm for the OFDM joint data detection and phase noise (PHN) cancellation for constant modulus modulations. We highlight that it is important to address the overfitting problem since this is a major detrimental factor impairing the joint detection process. In order to attack the overfitting problem we propose an iterative approach based on minimum mean square prediction error (MMSPE) subject to the constraint that the estimated data symbols have constant power. The proposed constrained MMSPE algorithm (C-MMSPE) significantly improves the performance of existing approaches with little extra complexity being imposed. Simulation results are also given to verify the proposed algorithm.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The phylogeny of the Australian legume genus Daviesia was estimated using sequences of the internal transcribed spacers of nuclear ribosomal DNA. Partial congruence was found with previous analyses using morphology, including strong support for monophyly of the genus and for a sister group relationship between the clade D. pachyloma and the rest of the genus. A previously unplaced bird-pollinated species, anceps + D. D. epiphyllum, was well supported as sister to the only other bird-pollinated species in the genus, D. speciosa, indicating a single origin of bird pollination in their common ancestor. Other morphological groups within Daviesia were not supported and require reassessment. A strong and previously unreported sister clade of Daviesia consists of the two monotypic genera Erichsenia and Viminaria. These share phyllode-like leaves and indehiscent fruits. The evolutionary history of cord roots, which have anomalous secondary thickening, was explored using parsimony. Cord roots are limited to three separate clades but have a complex history involving a small number of gains (most likely 0-3) and losses (0-5). The anomalous structure of cord roots ( adventitious vascular strands embedded in a parenchymatous matrix) may facilitate nutrient storage, and the roots may be contractile. Both functions may be related to a postfire resprouting adaptation. Alternatively, cord roots may be an adaptation to the low-nutrient lateritic soils of Western Australia. However, tests for association between root type, soil type, and growth habit were equivocal, depending on whether the variables were treated as phylogenetically dependent (insignificant) or independent ( significant).

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this study, the concentration probability distributions of 82 pharmaceutical compounds detected in the effluents of 179 European wastewater treatment plants were computed and inserted into a multimedia fate model. The comparative ecotoxicological impact of the direct emission of these compounds from wastewater treatment plants on freshwater ecosystems, based on a potentially affected fraction (PAF) of species approach, was assessed to rank compounds based on priority. As many pharmaceuticals are acids or bases, the multimedia fate model accounts for regressions to estimate pH-dependent fate parameters. An uncertainty analysis was performed by means of Monte Carlo analysis, which included the uncertainty of fate and ecotoxicity model input variables, as well as the spatial variability of landscape characteristics on the European continental scale. Several pharmaceutical compounds were identified as being of greatest concern, including 7 analgesics/anti-inflammatories, 3 β-blockers, 3 psychiatric drugs, and 1 each of 6 other therapeutic classes. The fate and impact modelling relied extensively on estimated data, given that most of these compounds have little or no experimental fate or ecotoxicity data available, as well as a limited reported occurrence in effluents. The contribution of estimated model input variables to the variance of freshwater ecotoxicity impact, as well as the lack of experimental abiotic degradation data for most compounds, helped in establishing priorities for further testing. Generally, the effluent concentration and the ecotoxicity effect factor were the model input variables with the most significant effect on the uncertainty of output results.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The genus Prunus L. is large and economically important. However, phylogenetic relationships within Prunus at low taxonomic level, particularly in the subgenus Amygdalus L. s.l., remain poorly investigated. This paper attempts to document the evolutionary history of Amygdalus s.l. and establishes a temporal framework, by assembling molecular data from conservative and variable molecular markers. The nuclear s6pdh gene in combination with the plastid trnSG spacer are analyzed with bayesian and maximum likelihood methods. Since previous phylogenetic analysis with these markers lacked resolution, we additionally analyzed 13 nuclear SSR loci with the δµ2 distance, followed by an unweighted pair group method using arithmetic averages algorithm. Our phylogenetic analysis with both sequence and SSR loci confirms the split between sections Amygdalus and Persica, comprising almonds and peaches, respectively. This result is in agreement with biogeographic data showing that each of the two sections is naturally distributed on each side of the Central Asian Massif chain. Using coalescent based estimations, divergence times between the two sections strongly varied when considering sequence data only or combined with SSR. The sequence-only based estimate (5 million years ago) was congruent with the Central Asian Massif orogeny and subsequent climate change. Given the low level of differentiation within the two sections using both marker types, the utility of combining microsatellites and data sequences to address phylogenetic relationships at low taxonomic level within Amygdalus is discussed. The recent evolutionary histories of almond and peach are discussed in view of the domestication processes that arose in these two phenotypically-diverging gene pools: almonds and peaches were domesticated from the Amygdalus s.s. and Persica sections, respectively. Such economically important crops may serve as good model to study divergent domestication process in close genetic pool.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Tämä opinnäytetyö on tehty yhteistyössä Lappeenrannan teknillisen yliopiston ja Enviroc Oy:n kanssa. Työn tarkoituksena on ollut kehittää suomalaisiin olosuhteisiin soveltuva laskentamalli purku- ja korjausrakentamiskohteiden hiilijalanjäljelle. Kehitettyä mallia voi käyttää yrityksessä lainsäädännön vaatimusten noudattamisen todentamiseen sekä purku- ja korjausrakentamiskohteiden toimintatapojen vertailuun. Työssä käsitellään purku- ja korjausrakentamiskohteiden hiilijalanjälkeen vaikuttavia asioita, joita ovat työmaan energiankulutus sekä syntyvien jätteiden lajittelu, kuljetukset, käsittely ja hyödyntäminen tai loppusijoitus. Laskentamalli on kehitetty laskemalla esimerkkikohteille hiilijalanjäljet elinkaarimallintamisen avulla. Työssä on tarkasteltu myös vaihtoehtoisia jäteskenaarioita sekä laskentamallin luotettavuutta. Työn lopputuloksena on saatu kolmen eri kokoluokan esimerkkikohteen hiilijalanjäljet ja laskentamallin periaatekaaviot. Jätteiden toimituspisteiden ja jätejakeiden kulkureittien vaihtelevuuden sekä eri kohteista muodostuvien erityyppisten jätejakeiden johdosta yhden kokonaisvaltaisen laskentamallin kehittäminen on haasteellista. Myös tietojen hankinta kohteista ja jatkokäsittelyistä ja etenkin primääritietojen saaminen on ongelmallista. Tämänhetkinen laskentamalli perustuu enimmäkseen sekundääritietoihin ja arvioihin, joten mallin luotettavuuden lisäämiseksi olisi panostettava primääritiedon määrän lisäämiseen. Laskennan perusteella jäteskenaariovaihtoehdoista lajitteleva toimintamalli osoittautui hiilijalanjäljen kannalta suotuisimmaksi pienemmissä kohteissa ja käsittelylaitospainotteinen malli suuressa kohteessa. Merkittäviä tekijöitä kohteiden hiilijalanjälkien muodostumiselle olivat metallien käsittely, jätteiden poltto sekä neitsytraaka-aineista valmistetun teräksen ja fossiilisten polttoaineiden vältetyt päästöt. Merkittävimmiksi kasvihuonekaasuiksi purku- ja korjausrakentamiskohteiden laskennassa osoittautuivat hiilidioksidin lisäksi halogenoidut hiilivedyt ja metaani.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A detailed analysis is presented of solar UV spectral irradiance for the period between May 2003 and August 2005, when data are available from both the Solar Ultraviolet pectral Irradiance Monitor (SUSIM) instrument (on board the pper Atmosphere Research Satellite (UARS) spacecraft) and the Solar Stellar Irradiance Comparison Experiment (SOLSTICE) instrument (on board the Solar Radiation and Climate Experiment (SORCE) satellite). The ultimate aim is to develop a data composite that can be used to accurately determine any differences between the “exceptional” solar minimum at the end of solar cycle 23 and the previous minimum at the end of solar cycle 22 without having to rely on proxy data to set the long‐term change. SUSIM data are studied because they are the only data available in the “SOLSTICE gap” between the end of available UARS SOLSTICE data and the start of the SORCE data. At any one wavelength the two data sets are considered too dissimilar to be combined into a meaningful composite if any one of three correlations does not exceed a threshold of 0.8. This criterion removes all wavelengths except those in a small range between 156 nm and 208 nm, the longer wavelengths of which influence ozone production and heating in the lower stratosphere. Eight different methods are employed to intercalibrate the two data sequences. All methods give smaller changes between the minima than are seen when the data are not adjusted; however, correcting the SUSIM data to allow for an exponentially decaying offset drift gives a composite that is largely consistent with the unadjusted data from the SOLSTICE instruments on both UARS and SORCE and in which the recent minimum is consistently lower in the wave band studied.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Svalgaard (2014) has recently pointed out that the calibration of the Helsinki magnetic observatory’s H component variometer was probably in error in published data for the years 1866–1874.5 and that this makes the interdiurnal variation index based on daily means, IDV(1d), (Lockwood et al., 2013a), and the interplanetary magnetic field strength derived from it (Lockwood et al., 2013b), too low around the peak of solar cycle 11. We use data from the modern Nurmijarvi station, relatively close to the site of the original Helsinki Observatory, to confirm a 30% underestimation in this interval and hence our results are fully consistent with the correction derived by Svalgaard. We show that the best method for recalibration uses the Helsinki Ak(H) and aa indices and is accurate to ±10 %. This makes it preferable to recalibration using either the sunspot number or the diurnal range of geomagnetic activity which we find to be accurate to ±20 %. In the case of Helsinki data during cycle 11, the two recalibration methods produce very similar corrections which are here confirmed using newly digitised data from the nearby St Petersburg observatory and also using declination data from Helsinki. However, we show that the IDV index is, compared to later years, too similar to sunspot number before 1872, revealing independence of the two data series has been lost; either because the geomagnetic data used to compile IDV has been corrected using sunspot numbers, or vice versa, or both. We present corrected data sequences for both the IDV(1d) index and the reconstructed IMF (interplanetary magnetic field).We also analyse the relationship between the derived near-Earth IMF and the sunspot number and point out the relevance of the prior history of solar activity, in addition to the contemporaneous value, to estimating any “floor” value of the near-Earth interplanetary field.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

O presente trabalho teve por objetivo a apresentação de um modelo iterativo para utilização em microcomputadores capaz de estimar valores de evapotranspiracão de referencia em diversos níveis de probabilidade, a partir de uma série de dados. O procedimento matemático envolvido na técnica iterativa empregada relaciona-se à utilização das funções de densidade gama incompleta e beta. Para tanto, foram utilizados dados quinzenais de evapotranspiracão do mês de março, a partir de uma série de 30 anos para a região de Piracicaba-SP. Através do teste de Kolmogorov-Smirnov, verificou-se que dados estimados através do modelo apresentaram alto grau de ajuste com dados relatados em literatura, justificando assim sua utilização.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

With the objective of identifying predictors of fertility in bulls, the correlation coefficients of the midpiece length of bovine spermatozoa with semen traits and fertility were estimated. Data were obtained from semen samples of 50 crossbred bulls submitted to a progeny test. The midpiece length was determined in 40 midpieces of each bull through a image analysis system (Videoplan). Besides the physical traits before and after semen freezing, the oxygen consumption rate and the cytochemical activity index were observed. The correlation coefficients were low and non-significant (P> 0.05), indicating that the midpiece length is not a good predictor of fertility.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Genetic correlations of selection indices and the traits considered in these indices with mature weight (MW) of Nelore females and correlated responses were estimated to determine whether current selection practices will result in an undesired correlated response in MW. Genetic trends for weaning and yearling indices and MW were also estimated. Data from 612,244 Nelore animals born between 1984 and 2010, belonging to different beef cattle evaluation programs from Brazil and Paraguay, were used. The following traits were studied: weaning conformation (WC), weaning precocity (WP), weaning muscling (WM), yearling conformation (YC), yearling precocity (YP), yearling muscling (YM), weaning and yearling indices, BW gain from birth to weaning (BWG), postweaning BW gain (PWG), scrotal circumference (SC), and MW. The variance and covariance components were estimated by Bayesian inference in a multitrait analysis, including all traits in the same analysis, using a nonlinear (threshold) animal model for visual scores and a linear animal model for the other traits. The mean direct heritabilities were 0.21 ± 0.007 (WC), 0.22 ± 0.007 (WP), 0.20 ± 0.007 (WM), 0.43 ± 0.005 (YC), 0.40 ± 0.005 (YP), 0.40 ± 0.005 (YM), 0.17 ± 0.003 (BWG), 0.21 ± 0.004 (PWG), 0.32 ± 0.001 (SC), and 0.44 ± 0.018 (MW). The genetic correlations between MW and weaning and yearling indices were positive and of medium magnitude (0.30 ± 0.01 and 0.31 ± 0.01, respectively). The genetic changes in weaning index, yearling index, and MW, expressed as units of genetic SD per year, were 0.26, 0.27, and 0.01, respectively. The genetic trend for MW was nonsignificant, suggesting no negative correlated response. The selection practice based on the use of sires with high final index giving preference for those better ranked for yearling precocity and muscling than for conformation generates only a minimal correlated response in MW. © 2013 American Society of Animal Science. All rights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents a new method to estimate hole diameters and surface roughness in precision drilling processes, using coupons taken from a sandwich plate composed of a titanium alloy plate (Ti6Al4V) glued onto an aluminum alloy plate (AA 2024T3). The proposed method uses signals acquired during the cutting process by a multisensor system installed on the machine tool. These signals are mathematically treated and then used as input for an artificial neural network. After training, the neural network system is qualified to estimate the surface roughness and hole diameter based on the signals and cutting process parameters. To evaluate the system, the estimated data were compared with experimental measurements and the errors were calculated. The results proved the efficiency of the proposed method, which yielded very low or even negligible errors of the tolerances used in most industrial drilling processes. This pioneering method opens up a new field of research, showing a promising potential for development and application as an alternative monitoring method for drilling processes. © 2012 Springer-Verlag London Limited.