983 resultados para Multiple datasets
Resumo:
Spatial independent component analysis (sICA) of functional magnetic resonance imaging (fMRI) time series can generate meaningful activation maps and associated descriptive signals, which are useful to evaluate datasets of the entire brain or selected portions of it. Besides computational implications, variations in the input dataset combined with the multivariate nature of ICA may lead to different spatial or temporal readouts of brain activation phenomena. By reducing and increasing a volume of interest (VOI), we applied sICA to different datasets from real activation experiments with multislice acquisition and single or multiple sensory-motor task-induced blood oxygenation level-dependent (BOLD) signal sources with different spatial and temporal structure. Using receiver operating characteristics (ROC) methodology for accuracy evaluation and multiple regression analysis as benchmark, we compared sICA decompositions of reduced and increased VOI fMRI time-series containing auditory, motor and hemifield visual activation occurring separately or simultaneously in time. Both approaches yielded valid results; however, the results of the increased VOI approach were spatially more accurate compared to the results of the decreased VOI approach. This is consistent with the capability of sICA to take advantage of extended samples of statistical observations and suggests that sICA is more powerful with extended rather than reduced VOI datasets to delineate brain activity.
Resumo:
We present a 3000-yr rainfall reconstruction from the Galápagos Islands that is based on paired biomarker records from the sediment of El Junco Lake. Located in the eastern equatorial Pacific, the climate of the Galápagos Islands is governed by movements of the Intertropical Convergence Zone (ITCZ) and the El Niño-Southern Oscillation (ENSO). We use a novel method for reconstructing past ENSO- and ITCZ-related rainfall changes through analysis of molecular and isotopic biomarker records representing several types of plants and algae that grow under differing climatic conditions. We propose that ?D values of dinosterol, a sterol produced by dinoflagellates, record changes in mean rainfall in El Junco Lake, while dD values of C34 botryococcene, a hydrocarbon unique to the green alga Botryococcus braunii, record changes in rainfall associated with moderate-to-strong El Niño events. We use these proxies to infer changes in mean rainfall and El Niño-related rainfall over the past 3000 yr. During periods in which the inferred change in El Niño-related rainfall opposed the change in mean rainfall, we infer changes in the amount of ITCZ-related rainfall. Simulations with an idealized isotope hydrology model of El Junco Lake help illustrate the interpretation of these proxy reconstructions. Opposing changes in El Niño- and ITCZ-related rainfall appear to account for several of the largest inferred hydrologic changes in El Junco Lake. We propose that these reconstructions can be used to infer changes in frequency and/or intensity of El Niño events and changes in the position of the ITCZ in the eastern equatorial Pacific over the past 3000 yr. Comparison with El Junco Lake sediment grain size records indicates general agreement of inferred rainfall changes over the late Holocene.
Resumo:
The analysis of time-dependent data is an important problem in many application domains, and interactive visualization of time-series data can help in understanding patterns in large time series data. Many effective approaches already exist for visual analysis of univariate time series supporting tasks such as assessment of data quality, detection of outliers, or identification of periodically or frequently occurring patterns. However, much fewer approaches exist which support multivariate time series. The existence of multiple values per time stamp makes the analysis task per se harder, and existing visualization techniques often do not scale well. We introduce an approach for visual analysis of large multivariate time-dependent data, based on the idea of projecting multivariate measurements to a 2D display, visualizing the time dimension by trajectories. We use visual data aggregation metaphors based on grouping of similar data elements to scale with multivariate time series. Aggregation procedures can either be based on statistical properties of the data or on data clustering routines. Appropriately defined user controls allow to navigate and explore the data and interactively steer the parameters of the data aggregation to enhance data analysis. We present an implementation of our approach and apply it on a comprehensive data set from the field of earth bservation, demonstrating the applicability and usefulness of our approach.
Resumo:
As the Antarctic Circumpolar Current crosses the South-West Indian Ocean Ridge, it creates an extensive eddy field characterised by high sea level anomaly variability. We investigated the diving behaviour of female southern elephant seals from Marion Island during their post-moult migrations in relation to this eddy field in order to determine its role in the animals' at-sea dispersal. Most seals dived within the region significantly more often than predicted by chance, and these dives were generally shallower and shorter than dives outside the eddy field. Mixed effects models estimated reductions of 44.33 ± 3.00 m (maximum depth) and 6.37 ± 0.10 min (dive duration) as a result of diving within the region, along with low between-seal variability (maximum depth: 5.5 % and dive duration: 8.4 %). U-shaped dives increased in frequency inside the eddy field, whereas W-shaped dives with multiple vertical movements decreased. Results suggest that Marion Island's adult female elephant seals' dives are characterised by lowered cost-of-transport when they encounter the eddy field during the start and end of their post-moult migrations. This might result from changes in buoyancy associated with varying body condition upon leaving and returning to the island. Our results do not suggest that the eddy field is a vital foraging ground for Marion Island's southern elephant seals. However, because seals preferentially travel through this area and likely forage opportunistically while minimising transport costs, we hypothesise that climate-mediated changes in the nature or position of this region may alter the seals' at-sea dispersal patterns.
Resumo:
The software PanGet is a special tool for the download of multiple data sets from PANGAEA. It uses the PANGAEA data set ID which is unique and part of the DOI. In a first step a list of ID's of those data sets to be downloaded must be created. There are two choices to define this individual collection of sets. Based on the ID list, the tool will download the data sets. Failed downloads are written to the file *_failed.txt. The functionality of PanGet is also part of the program Pan2Applic (choose File > Download PANGAEA datasets...) and PanTool2 (choose Basic tools > Download PANGAEA datasets...).
Resumo:
Summarizing topological relations is fundamental to many spatial applications including spatial query optimization. In this paper, we present several novel techniques to eectively construct cell density based spatial histograms for range (window) summarizations restricted to the four most important topological relations: contains, contained, overlap, and disjoint. We rst present a novel framework to construct a multiscale histogram composed of multiple Euler histograms with the guarantee of the exact summarization results for aligned windows in constant time. Then we present an approximate algorithm, with the approximate ratio 19/12, to minimize the storage spaces of such multiscale Euler histograms, although the problem is generally NP-hard. To conform to a limited storage space where only k Euler histograms are allowed, an effective algorithm is presented to construct multiscale histograms to achieve high accuracy. Finally, we present a new approximate algorithm to query an Euler histogram that cannot guarantee the exact answers; it runs in constant time. Our extensive experiments against both synthetic and real world datasets demonstrated that the approximate mul- tiscale histogram techniques may improve the accuracy of the existing techniques by several orders of magnitude while retaining the cost effciency, and the exact multiscale histogram technique requires only a storage space linearly proportional to the number of cells for the real datasets.
Resumo:
Ant Colony Optimisation algorithms mimic the way ants use pheromones for marking paths to important locations. Pheromone traces are followed and reinforced by other ants, but also evaporate over time. As a consequence, optimal paths attract more pheromone, whilst the less useful paths fade away. In the Multiple Pheromone Ant Clustering Algorithm (MPACA), ants detect features of objects represented as nodes within graph space. Each node has one or more ants assigned to each feature. Ants attempt to locate nodes with matching feature values, depositing pheromone traces on the way. This use of multiple pheromone values is a key innovation. Ants record other ant encounters, keeping a record of the features and colony membership of ants. The recorded values determine when ants should combine their features to look for conjunctions and whether they should merge into colonies. This ability to detect and deposit pheromone representative of feature combinations, and the resulting colony formation, renders the algorithm a powerful clustering tool. The MPACA operates as follows: (i) initially each node has ants assigned to each feature; (ii) ants roam the graph space searching for nodes with matching features; (iii) when departing matching nodes, ants deposit pheromones to inform other ants that the path goes to a node with the associated feature values; (iv) ant feature encounters are counted each time an ant arrives at a node; (v) if the feature encounters exceed a threshold value, feature combination occurs; (vi) a similar mechanism is used for colony merging. The model varies from traditional ACO in that: (i) a modified pheromone-driven movement mechanism is used; (ii) ants learn feature combinations and deposit multiple pheromone scents accordingly; (iii) ants merge into colonies, the basis of cluster formation. The MPACA is evaluated over synthetic and real-world datasets and its performance compares favourably with alternative approaches.
Resumo:
Popular dimension reduction and visualisation algorithms rely on the assumption that input dissimilarities are typically Euclidean, for instance Metric Multidimensional Scaling, t-distributed Stochastic Neighbour Embedding and the Gaussian Process Latent Variable Model. It is well known that this assumption does not hold for most datasets and often high-dimensional data sits upon a manifold of unknown global geometry. We present a method for improving the manifold charting process, coupled with Elastic MDS, such that we no longer assume that the manifold is Euclidean, or of any particular structure. We draw on the benefits of different dissimilarity measures allowing for the relative responsibilities, under a linear combination, to drive the visualisation process.
Resumo:
The MAREDAT atlas covers 11 types of plankton, ranging in size from bacteria to jellyfish. Together, these plankton groups determine the health and productivity of the global ocean and play a vital role in the global carbon cycle. Working within a uniform and consistent spatial and depth grid (map) of the global ocean, the researchers compiled thousands and tens of thousands of data points to identify regions of plankton abundance and scarcity as well as areas of data abundance and scarcity. At many of the grid points, the MAREDAT team accomplished the difficult conversion from abundance (numbers of organisms) to biomass (carbon mass of organisms). The MAREDAT atlas provides an unprecedented global data set for ecological and biochemical analysis and modeling as well as a clear mandate for compiling additional existing data and for focusing future data gathering efforts on key groups in key areas of the ocean. The present data set presents depth integrated values of diazotrophs abundance and biomass, computed from a collection of source data sets.
Resumo:
Biogenic reefs are important for habitat provision and coastal protection. Long-term datasets on the distribution and abundance of Sabellaria alveolata (L.) are available from Britain. The aim of this study was to combine historical records and contemporary data to (1) describe spatiotemporal variation in winter temperatures, (2) document short-term and long-term changes in the distribution and abundance of S. alveolata and discuss these changes in relation to extreme weather events and recent warming, and (3) assess the potential for artificial coastal defense structures to function as habitat for S. alveolata. A semi-quantitative abundance scale (ACFOR) was used to compare broadscale, long-term and interannual abundance of S. alveolata near its range edge in NW Britain. S. alveolata disappeared from the North Wales and Wirral coastlines where it had been abundant prior to the cold winter of 1962/1963. Population declines were also observed following the recent cold winters of 2009/2010 and 2010/2011. Extensive surveys in 2004 and 2012 revealed that S. alveolata had recolonized locations from which it had previously disappeared. Furthermore, it had increased in abundance at many locations, possibly in response to recent warming. S. alveolata was recorded on the majority of artificial coastal defense structures surveyed, suggesting that the proliferation of artificial coastal defense structures along this stretch of coastline may have enabled S. alveolata to spread across stretches of unsuitable natural habitat. Long-term and broadscale contextual monitoring is essential for monitoring responses of organisms to climate change. Historical data and gray literature can be invaluable sources of information. Our results support the theory that Lusitanian species are responding positively to climate warming but also that short-term extreme weather events can have potentially devastating widespread and lasting effects on organisms. Furthermore, the proliferation of coastal defense structures has implications for phylogeography, population genetics, and connectivity of coastal populations.
Resumo:
Biogenic reefs are important for habitat provision and coastal protection. Long-term datasets on the distribution and abundance of Sabellaria alveolata (L.) are available from Britain. The aim of this study was to combine historical records and contemporary data to (1) describe spatiotemporal variation in winter temperatures, (2) document short-term and long-term changes in the distribution and abundance of S. alveolata and discuss these changes in relation to extreme weather events and recent warming, and (3) assess the potential for artificial coastal defense structures to function as habitat for S. alveolata. A semi-quantitative abundance scale (ACFOR) was used to compare broadscale, long-term and interannual abundance of S. alveolata near its range edge in NW Britain. S. alveolata disappeared from the North Wales and Wirral coastlines where it had been abundant prior to the cold winter of 1962/1963. Population declines were also observed following the recent cold winters of 2009/2010 and 2010/2011. Extensive surveys in 2004 and 2012 revealed that S. alveolata had recolonized locations from which it had previously disappeared. Furthermore, it had increased in abundance at many locations, possibly in response to recent warming. S. alveolata was recorded on the majority of artificial coastal defense structures surveyed, suggesting that the proliferation of artificial coastal defense structures along this stretch of coastline may have enabled S. alveolata to spread across stretches of unsuitable natural habitat. Long-term and broadscale contextual monitoring is essential for monitoring responses of organisms to climate change. Historical data and gray literature can be invaluable sources of information. Our results support the theory that Lusitanian species are responding positively to climate warming but also that short-term extreme weather events can have potentially devastating widespread and lasting effects on organisms. Furthermore, the proliferation of coastal defense structures has implications for phylogeography, population genetics, and connectivity of coastal populations.
Resumo:
Rigid adherence to pre-specified thresholds and static graphical representations can lead to incorrect decisions on merging of clusters. As an alternative to existing automated or semi-automated methods, we developed a visual analytics approach for performing hierarchical clustering analysis of short time-series gene expression data. Dynamic sliders control parameters such as the similarity threshold at which clusters are merged and the level of relative intra-cluster distinctiveness, which can be used to identify "weak-edges" within clusters. An expert user can drill down to further explore the dendrogram and detect nested clusters and outliers. This is done by using the sliders and by pointing and clicking on the representation to cut the branches of the tree in multiple-heights. A prototype of this tool has been developed in collaboration with a small group of biologists for analysing their own datasets. Initial feedback on the tool has been positive.
Resumo:
ABSTRACT Researchers frequently have to analyze scales in which some participants have failed to respond to some items. In this paper we focus on the exploratory factor analysis of multidimensional scales (i.e., scales that consist of a number of subscales) where each subscale is made up of a number of Likert-type items, and the aim of the analysis is to estimate participants' scores on the corresponding latent traits. We propose a new approach to deal with missing responses in such a situation that is based on (1) multiple imputation of non-responses and (2) simultaneous rotation of the imputed datasets. We applied the approach in a real dataset where missing responses were artificially introduced following a real pattern of non-responses, and a simulation study based on artificial datasets. The results show that our approach (specifically, Hot-Deck multiple imputation followed of Consensus Promin rotation) was able to successfully compute factor score estimates even for participants that have missing data.
Resumo:
Credible spatial information characterizing the structure and site quality of forests is critical to sustainable forest management and planning, especially given the increasing demands and threats to forest products and services. Forest managers and planners are required to evaluate forest conditions over a broad range of scales, contingent on operational or reporting requirements. Traditionally, forest inventory estimates are generated via a design-based approach that involves generalizing sample plot measurements to characterize an unknown population across a larger area of interest. However, field plot measurements are costly and as a consequence spatial coverage is limited. Remote sensing technologies have shown remarkable success in augmenting limited sample plot data to generate stand- and landscape-level spatial predictions of forest inventory attributes. Further enhancement of forest inventory approaches that couple field measurements with cutting edge remotely sensed and geospatial datasets are essential to sustainable forest management. We evaluated a novel Random Forest based k Nearest Neighbors (RF-kNN) imputation approach to couple remote sensing and geospatial data with field inventory collected by different sampling methods to generate forest inventory information across large spatial extents. The forest inventory data collected by the FIA program of US Forest Service was integrated with optical remote sensing and other geospatial datasets to produce biomass distribution maps for a part of the Lake States and species-specific site index maps for the entire Lake State. Targeting small-area application of the state-of-art remote sensing, LiDAR (light detection and ranging) data was integrated with the field data collected by an inexpensive method, called variable plot sampling, in the Ford Forest of Michigan Tech to derive standing volume map in a cost-effective way. The outputs of the RF-kNN imputation were compared with independent validation datasets and extant map products based on different sampling and modeling strategies. The RF-kNN modeling approach was found to be very effective, especially for large-area estimation, and produced results statistically equivalent to the field observations or the estimates derived from secondary data sources. The models are useful to resource managers for operational and strategic purposes.