8 resultados para Multiple scale
em Digital Commons at Florida International University
Resumo:
Background: Biologists often need to assess whether unfamiliar datasets warrant the time investment required for more detailed exploration. Basing such assessments on brief descriptions provided by data publishers is unwieldy for large datasets that contain insights dependent on specific scientific questions. Alternatively, using complex software systems for a preliminary analysis may be deemed as too time consuming in itself, especially for unfamiliar data types and formats. This may lead to wasted analysis time and discarding of potentially useful data. Results: We present an exploration of design opportunities that the Google Maps interface offers to biomedical data visualization. In particular, we focus on synergies between visualization techniques and Google Maps that facilitate the development of biological visualizations which have both low-overhead and sufficient expressivity to support the exploration of data at multiple scales. The methods we explore rely on displaying pre-rendered visualizations of biological data in browsers, with sparse yet powerful interactions, by using the Google Maps API. We structure our discussion around five visualizations: a gene co-regulation visualization, a heatmap viewer, a genome browser, a protein interaction network, and a planar visualization of white matter in the brain. Feedback from collaborative work with domain experts suggests that our Google Maps visualizations offer multiple, scale-dependent perspectives and can be particularly helpful for unfamiliar datasets due to their accessibility. We also find that users, particularly those less experienced with computer use, are attracted by the familiarity of the Google Maps API. Our five implementations introduce design elements that can benefit visualization developers. Conclusions: We describe a low-overhead approach that lets biologists access readily analyzed views of unfamiliar scientific datasets. We rely on pre-computed visualizations prepared by data experts, accompanied by sparse and intuitive interactions, and distributed via the familiar Google Maps framework. Our contributions are an evaluation demonstrating the validity and opportunities of this approach, a set of design guidelines benefiting those wanting to create such visualizations, and five concrete example visualizations.
Resumo:
The primary aim of this dissertation is to develop data mining tools for knowledge discovery in biomedical data when multiple (homogeneous or heterogeneous) sources of data are available. The central hypothesis is that, when information from multiple sources of data are used appropriately and effectively, knowledge discovery can be better achieved than what is possible from only a single source. ^ Recent advances in high-throughput technology have enabled biomedical researchers to generate large volumes of diverse types of data on a genome-wide scale. These data include DNA sequences, gene expression measurements, and much more; they provide the motivation for building analysis tools to elucidate the modular organization of the cell. The challenges include efficiently and accurately extracting information from the multiple data sources; representing the information effectively, developing analytical tools, and interpreting the results in the context of the domain. ^ The first part considers the application of feature-level integration to design classifiers that discriminate between soil types. The machine learning tools, SVM and KNN, were used to successfully distinguish between several soil samples. ^ The second part considers clustering using multiple heterogeneous data sources. The resulting Multi-Source Clustering (MSC) algorithm was shown to have a better performance than clustering methods that use only a single data source or a simple feature-level integration of heterogeneous data sources. ^ The third part proposes a new approach to effectively incorporate incomplete data into clustering analysis. Adapted from K-means algorithm, the Generalized Constrained Clustering (GCC) algorithm makes use of incomplete data in the form of constraints to perform exploratory analysis. Novel approaches for extracting constraints were proposed. For sufficiently large constraint sets, the GCC algorithm outperformed the MSC algorithm. ^ The last part considers the problem of providing a theme-specific environment for mining multi-source biomedical data. The database called PlasmoTFBM, focusing on gene regulation of Plasmodium falciparum, contains diverse information and has a simple interface to allow biologists to explore the data. It provided a framework for comparing different analytical tools for predicting regulatory elements and for designing useful data mining tools. ^ The conclusion is that the experiments reported in this dissertation strongly support the central hypothesis.^
Resumo:
The Deccan Trap basalts are the remnants of a massive series of lava flows that erupted at the K/T boundary and covered 1-2 million km2 of west-central India. This eruptive event is of global interest because of its possible link to the major mass extinction event, and there is much debate about the duration of this massive volcanic event. In contrast to isotopic or paleomagnetic dating methods, I explore an alternative approach to determine the lifecycle of the magma chambers that supplied the lavas, and extend the concept to obtain a tighter constraint on Deccan’s duration. My method relies on extracting time information from elemental and isotopic diffusion across zone boundaries in individual crystals. I determined elemental and Sr-isotopic variations across abnormally large (2-5 cm) plagioclase crystals from the Thalghat and Kashele “Giant Plagioclase Basalts” from the lowermost Jawhar and Igatpuri Formations respectively in the thickest Western Ghats section near Mumbai. I also obtained bulk rock major, trace and rare earth element chemistry of each lava flow from the two formations. Thalghat flows contain only 12% zoned crystals, with 87 Sr/86Sr ratios of 0.7096 in the core and 0.7106 in the rim, separated by a sharp boundary. In contrast, all Kashele crystals have a wider range of 87Sr/86Sr values, with multiple zones. Geochemical modeling of the data suggests that the two types of crystals grew in distinct magmatic environments. Modeling intracrystalline diffusive equilibration between the core and rim of Thalghat crystals led me to obtain a crystal growth rate of 2.03x10-10 cm/s and a residence time of 780 years for the crystals in the magma chamber(s). Employing some assumptions based on field and geochronologic evidence, I extrapolated this residence time to the entire Western Ghats and obtained an estimate of 25,000–35,000 years for the duration of Western Ghats volcanism. This gave an eruptive rate of 30–40 km3/yr, which is much higher than any presently erupting volcano. This result will remain speculative until a similarly detailed analytical-modeling study is performed for the rest of the Western Ghats formations.
Resumo:
Standard economic theory suggests that capital should flow from rich countries to poor countries. However, capital has predominantly flowed to rich countries. The three essays in this dissertation attempt to explain this phenomenon. The first two essays suggest theoretical explanations for why capital has not flowed to the poor countries. The third essay empirically tests the theoretical explanations.^ The first essay examines the effects of increasing returns to scale on international lending and borrowing with moral hazard. Introducing increasing returns in a two-country general equilibrium model yields possible multiple equilibria and helps explain the possibility of capital flows from a poor to a rich country. I find that a borrowing country may need to borrow sufficient amounts internationally to reach a minimum investment threshold in order to invest domestically.^ The second essay examines how a poor country may invest in sectors with low productivity because of sovereign risk, and how collateral differences across sectors may exacerbate the problem. I model sovereign borrowing with a two-sector economy: one sector with increasing returns to scale (IRS) and one sector with diminishing returns to scale (DRS). Countries with incomes below a threshold will only invest in the DRS sector, and countries with incomes above a threshold will invest mostly in the IRS sector. The results help explain the existence of a bimodal world income distribution.^ The third essay empirically tests the explanations for why capital has not flowed from the rich to the poor countries, with a focus on institutions and initial capital. I find that institutional variables are a very important factor, but in contrast to other studies, I show that institutions do not account for the Lucas Paradox. Evidence of increasing returns still exists, even when controlling for institutions and other variables. In addition, I find that the determinants of capital flows may depend on whether a country is rich or poor.^
Resumo:
The freshwater Everglades is a complex system containing thousands of tree islands embedded within a marsh-grassland matrix. The tree island-marsh mosaic is shaped and maintained by hydrologic, edaphic and biological mechanisms that interact across multiple scales. Preserving tree islands requires a more integrated understanding of how scale-dependent phenomena interact in the larger freshwater system. The hierarchical patch dynamics paradigm provides a conceptual framework for exploring multi-scale interactions within complex systems. We used a three-tiered approach to examine the spatial variability and patterning of nutrients in relation to site parameters within and between two hydrologically defined Everglades landscapes: the freshwater Marl Prairie and the Ridge and Slough. Results were scale-dependent and complexly interrelated. Total carbon and nitrogen patterning were correlated with organic matter accumulation, driven by hydrologic conditions at the system scale. Total and bioavailable phosphorus were most strongly related to woody plant patterning within landscapes, and were found to be 3 to 11 times more concentrated in tree island soils compared to surrounding marshes. Below canopy resource islands in the slough were elongated in a downstream direction, indicating soil resource directional drift. Combined multi-scale results suggest that hydrology plays a significant role in landscape patterning and also the development and maintenance of tree islands. Once developed, tree islands appear to exert influence over the spatial distribution of nutrients, which can reciprocally affect other ecological processes.
Resumo:
Understanding habitat selection and movement remains a key question in behavioral ecology. Yet, obtaining a sufficiently high spatiotemporal resolution of the movement paths of organisms remains a major challenge, despite recent technological advances. Observing fine-scale movement and habitat choice decisions in the field can prove to be difficult and expensive, particularly in expansive habitats such as wetlands. We describe the application of passive integrated transponder (PIT) systems to field enclosures for tracking detailed fish behaviors in an experimental setting. PIT systems have been applied to habitats with clear passageways, at fixed locations or in controlled laboratory and mesocosm settings, but their use in unconfined habitats and field-based experimental setups remains limited. In an Everglades enclosure, we continuously tracked the movement and habitat use of PIT-tagged centrarchids across three habitats of varying depth and complexity using multiple flatbed antennas for 14 days. Fish used all three habitats, with marked species-specific diel movement patterns across habitats, and short-lived movements that would be likely missed by other tracking techniques. Findings suggest that the application of PIT systems to field enclosures can be an insightful approach for gaining continuous, undisturbed and detailed movement data in unconfined habitats, and for experimentally manipulating both internal and external drivers of these behaviors.
Resumo:
Simarouba glauca, a non-edible oilseed crop native to South Florida, is gaining popularity as a feedstock for the production of biodiesel. The University of Agriculture Sciences in Bangalore, India has developed a biodiesel production model based on the principles of decentralization, small scales, and multiple fuel sources. Success of such a program depends on conversion efficiencies at multiple stages. The conversion efficiency of the field-level, decentralized production model was compared with the in-laboratory conversion efficiency benchmark. The study indicated that the field-level model conversion efficiency was less than that of the lab-scale set up. The fuel qualities and characteristics of the Simarouba glauca biodiesel were tested and found to be the standards required for fuel designation. However, this research suggests that for Simarouba glauca to be widely accepted as a biodiesel feedstock further investigation is still required.
Resumo:
The Deccan Trap basalts are the remnants of a massive series of lava flows that erupted at the K/T boundary and covered 1-2 million km2 of west-central India. This eruptive event is of global interest because of its possible link to the major mass extinction event, and there is much debate about the duration of this massive volcanic event. In contrast to isotopic or paleomagnetic dating methods, I explore an alternative approach to determine the lifecycle of the magma chambers that supplied the lavas, and extend the concept to obtain a tighter constraint on Deccan’s duration. My method relies on extracting time information from elemental and isotopic diffusion across zone boundary in an individual crystal. I determined elemental and Sr-isotopic variations across abnormally large (2-5 cm) plagioclase crystals from the Thalghat and Kashele “Giant Plagioclase Basalts” from the lowermost Jawhar and Igatpuri Formations respectively in the thickest Western Ghats section near Mumbai. I also obtained bulk rock major, trace and rare earth element chemistry of each lava flow from the two formations. Thalghat flows contain only 12% zoned crystals, with 87Sr/86Sr ratios of 0.7096 in the core and 0.7106 in the rim, separated by a sharp boundary. In contrast, all Kashele crystals have a wider range of 87Sr/86Sr values, with multiple zones. Geochemical modeling of the data suggests that the two types of crystals grew in distinct magmatic environments. Modeling intracrystalline diffusive equilibration between the core and rim of Thalghat crystals led me to obtain a crystal growth rate of 2.03x10-10 cm/s and a residence time of 780 years for the crystals in the magma chamber(s). Employing some assumptions based on field and geochronologic evidence, I extrapolated this residence time to the entire Western Ghats and obtained an estimate of 25,000 – 35,000 years for the duration of Western Ghats volcanism. This gave an eruptive rate of 30 – 40 km3/yr, which is much higher than any presently erupting volcano. This result will remain speculative until a similarly detailed analytical-modeling study is performed for the rest of the Western Ghats formations.