842 resultados para data movement problem
Resumo:
Knowledge of the local and migratory movements of humpback whales (Megaptera novaeangliae) from New Caledonia is very limited. To investigate this topic, we attached satellite-monitored tags to 12 whales off southern New Caledonia. Tag longevity ranged from 1 to 52 days (X = 22.5 days). Tagged whales generally moved to the south or southeast, with several spending time in a previously unknown seamount habitat named Antigonia before resuming movement, generally toward Norfolk Island or New Zealand. However, 1 female with a calf traveled the entire length of the western coast of New Caledonia (~450 km) and then west in the direction of the Chesterfield Reefs, a 19th century American (''Yankee'') whaling ground. None of the New Caledonia whales traveled to or toward eastern Australia, which is broadly consistent with the low rate of interchange observed from photo-identification comparisons between these 2 areas. The connections between New Caledonia and New Zealand, together with the relatively low numbers of whales seen in these places generally, support the idea that whales from these 2 areas constitute a single population that remains small and unrecovered.
Resumo:
The Southern Westerly Winds (SWW) exert a crucial influence over the world ocean and climate. Nevertheless, a comprehensive understanding of the Holocene temporal and spatial evolution of the SWW remains a significant challenge due to the sparsity of high-resolution marine archives and appropriate SWW proxies. Here, we present a north-south transect of high-resolution planktonic foraminiferal oxygen isotope records from the western South Atlantic. Our proxy records reveal Holocene migrations of the Brazil- Malvinas Confluence (BMC), a highly sensitive feature for changes in the position and strength of the northern portion of the SWW. Through the tight coupling of the BMC position to the large-scale wind field, the records allow a quantitative reconstruction of Holocene latitudinal displacements of the SWW across the South Atlantic. Our data reveal a gradual poleward movement of the SWW by about 1-1.5° from the early to the mid-Holocene. Afterwards variability in the SWW is dominated by millennial-scale displacements in the order of 1° in latitude with no recognizable longer-term trend. These findings are confronted with results from a state-of-the-art transient Holocene climate simulation using a comprehensive coupled atmosphere-ocean general circulation model. Proxy-inferred and modeled SWW shifts compare qualitatively, but the model underestimates both orbitally forced multi-millennial and internal millennial SWW variability by almost an order of magnitude. The underestimated natural variability implies a substantial uncertainty in model projections of future SWW shifts.
Dimensions and determinants of upward mobility : a study based on longitudinal data from Delhi slums
Resumo:
This study based on two primary surveys of the same households in two different years (2007/08 and 2012) assesses the extent of inter-temporal change in income of the individual workers and makes an attempt to identify the factors which explain upward mobility in alternate econometric framework, envisaging endogeneity problem. It also encompasses a host of indicators of wellbeing and constructs the transition matrix to capture the extent of change over time at the household level. The findings are indicative of a rise in the income of workers across a sizeable percentage of households though many of them remained below the poverty line notwithstanding this increase. In fact, there is a wide spread deterioration in the wellbeing index constructed at the household level. Among several determinants of income rise two important policy prescriptions can be elicited. Inadequate education reduces the probability of upward mobility while education above a threshold level raises it. Savings are crucial for upward mobility impinging on the importance of asset creation. Views that entail neighbourhood spill-over effects also received validation. Besides, investment in housing and basic amenities turns out to be crucial for improvement in wellbeing levels.
Resumo:
Accurate control over the spent nuclear fuel content is essential for its safe and optimized transportation, storage and management. Consequently, the reactivity of spent fuel and its isotopic content must be accurately determined. Nowadays, to predict isotopic evolution throughout irradiation and decay periods is not a problem thanks to the development of powerful codes and methodologies. In order to have a realistic confidence level in the prediction of spent fuel isotopic content, it is desirable to determine how uncertainties in the basic nuclear data affect isotopic prediction calculations by quantifying their associated uncertainties
Resumo:
The fuzzy min–max neural network classifier is a supervised learning method. This classifier takes the hybrid neural networks and fuzzy systems approach. All input variables in the network are required to correspond to continuously valued variables, and this can be a significant constraint in many real-world situations where there are not only quantitative but also categorical data. The usual way of dealing with this type of variables is to replace the categorical by numerical values and treat them as if they were continuously valued. But this method, implicitly defines a possibly unsuitable metric for the categories. A number of different procedures have been proposed to tackle the problem. In this article, we present a new method. The procedure extends the fuzzy min–max neural network input to categorical variables by introducing new fuzzy sets, a new operation, and a new architecture. This provides for greater flexibility and wider application. The proposed method is then applied to missing data imputation in voting intention polls. The micro data—the set of the respondents’ individual answers to the questions—of this type of poll are especially suited for evaluating the method since they include a large number of numerical and categorical attributes.
Resumo:
In this work, we present a novel method to compensate the movement in images acquired during free breathing using first-pass gadolinium enhanced, myocardial perfusion magnetic resonance imaging (MRI). First, we use independent component analysis (ICA) to identify the optimal number of independent components (ICs) that separate the breathing motion from the intensity change induced by the contrast agent. Then, synthetic images are created by recombining the ICs, but other then in previously published work (Milles et al. 2008), we omit the component related to motion, and therefore, the resulting reference image series is free of motion. Motion compensation is then achieved by using a multi-pass non-rigid image registration scheme. We tested our method on 15 distinct image series (5 patients) consisting of 58 images each and we validated our method by comparing manually tracked intensity profiles of the myocardial sections to automatically generated ones before and after registration. The average correlation to the manually obtained curves before registration 0:89 0:11 was increased to 0:98 0:02
Resumo:
We present a methodology for reducing a straight line fitting regression problem to a Least Squares minimization one. This is accomplished through the definition of a measure on the data space that takes into account directional dependences of errors, and the use of polar descriptors for straight lines. This strategy improves the robustness by avoiding singularities and non-describable lines. The methodology is powerful enough to deal with non-normal bivariate heteroscedastic data error models, but can also supersede classical regression methods by making some particular assumptions. An implementation of the methodology for the normal bivariate case is developed and evaluated.
Resumo:
An important competence of human data analysts is to interpret and explain the meaning of the results of data analysis to end-users. However, existing automatic solutions for intelligent data analysis provide limited help to interpret and communicate information to non-expert users. In this paper we present a general approach to generating explanatory descriptions about the meaning of quantitative sensor data. We propose a type of web application: a virtual newspaper with automatically generated news stories that describe the meaning of sensor data. This solution integrates a variety of techniques from intelligent data analysis into a web-based multimedia presentation system. We validated our approach in a real world problem and demonstrate its generality using data sets from several domains. Our experience shows that this solution can facilitate the use of sensor data by general users and, therefore, can increase the utility of sensor network infrastructures.
Resumo:
There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural networks where the input variables for learning and classification are just numerical. The proposed method extends the input to categorical variables by introducing new fuzzy sets, a new operation and a new architecture. The procedure is tested and compared with others using opinion poll data.
Resumo:
Mass spectrometry (MS) data provide a promising strategy for biomarker discovery. For this purpose, the detection of relevant peakbins in MS data is currently under intense research. Data from mass spectrometry are challenging to analyze because of their high dimensionality and the generally low number of samples available. To tackle this problem, the scientific community is becoming increasingly interested in applying feature subset selection techniques based on specialized machine learning algorithms. In this paper, we present a performance comparison of some metaheuristics: best first (BF), genetic algorithm (GA), scatter search (SS) and variable neighborhood search (VNS). Up to now, all the algorithms, except for GA, have been first applied to detect relevant peakbins in MS data. All these metaheuristic searches are embedded in two different filter and wrapper schemes coupled with Naive Bayes and SVM classifiers.
Resumo:
In this article we describe a method for automatically generating text summaries of data corresponding to traces of spatial movement in geographical areas. The method can help humans to understand large data streams, such as the amounts of GPS data recorded by a variety of sensors in mobile phones, cars, etc. We describe the knowledge representations we designed for our method and the main components of our method for generating the summaries: a discourse planner, an abstraction module and a text generator. We also present evaluation results that show the ability of our method to generate certain types of geospatial and temporal descriptions.
Resumo:
Recently, the Semantic Web has experienced signi�cant advancements in standards and techniques, as well as in the amount of semantic information available online. Even so, mechanisms are still needed to automatically reconcile semantic information when it is expressed in di�erent natural languages, so that access to Web information across language barriers can be improved. That requires developing techniques for discovering and representing cross-lingual links on the Web of Data. In this paper we explore the different dimensions of such a problem and reflect on possible avenues of research on that topic.
Resumo:
Abstract Due to recent scientific and technological advances in information sys¬tems, it is now possible to perform almost every application on a mobile device. The need to make sense of such devices more intelligent opens an opportunity to design data mining algorithm that are able to autonomous execute in local devices to provide the device with knowledge. The problem behind autonomous mining deals with the proper configuration of the algorithm to produce the most appropriate results. Contextual information together with resource information of the device have a strong impact on both the feasibility of a particu¬lar execution and on the production of the proper patterns. On the other hand, performance of the algorithm expressed in terms of efficacy and efficiency highly depends on the features of the dataset to be analyzed together with values of the parameters of a particular implementation of an algorithm. However, few existing approaches deal with autonomous configuration of data mining algorithms and in any case they do not deal with contextual or resources information. Both issues are of particular significance, in particular for social net¬works application. In fact, the widespread use of social networks and consequently the amount of information shared have made the need of modeling context in social application a priority. Also the resource consumption has a crucial role in such platforms as the users are using social networks mainly on their mobile devices. This PhD thesis addresses the aforementioned open issues, focusing on i) Analyzing the behavior of algorithms, ii) mapping contextual and resources information to find the most appropriate configuration iii) applying the model for the case of a social recommender. Four main contributions are presented: - The EE-Model: is able to predict the behavior of a data mining algorithm in terms of resource consumed and accuracy of the mining model it will obtain. - The SC-Mapper: maps a situation defined by the context and resource state to a data mining configuration. - SOMAR: is a social activity (event and informal ongoings) recommender for mobile devices. - D-SOMAR: is an evolution of SOMAR which incorporates the configurator in order to provide updated recommendations. Finally, the experimental validation of the proposed contributions using synthetic and real datasets allows us to achieve the objectives and answer the research questions proposed for this dissertation.