6 resultados para data transformation
em Digital Commons at Florida International University
Resumo:
There is growing popularity in the use of composite indices and rankings for cross-organizational benchmarking. However, little attention has been paid to alternative methods and procedures for the computation of these indices and how the use of such methods may impact the resulting indices and rankings. This dissertation developed an approach for assessing composite indices and rankings based on the integration of a number of methods for aggregation, data transformation and attribute weighting involved in their computation. The integrated model developed is based on the simulation of composite indices using methods and procedures proposed in the area of multi-criteria decision making (MCDM) and knowledge discovery in databases (KDD). The approach developed in this dissertation was automated through an IT artifact that was designed, developed and evaluated based on the framework and guidelines of the design science paradigm of information systems research. This artifact dynamically generates multiple versions of indices and rankings by considering different methodological scenarios according to user specified parameters. The computerized implementation was done in Visual Basic for Excel 2007. Using different performance measures, the artifact produces a number of excel outputs for the comparison and assessment of the indices and rankings. In order to evaluate the efficacy of the artifact and its underlying approach, a full empirical analysis was conducted using the World Bank's Doing Business database for the year 2010, which includes ten sub-indices (each corresponding to different areas of the business environment and regulation) for 183 countries. The output results, which were obtained using 115 methodological scenarios for the assessment of this index and its ten sub-indices, indicated that the variability of the component indicators considered in each case influenced the sensitivity of the rankings to the methodological choices. Overall, the results of our multi-method assessment were consistent with the World Bank rankings except in cases where the indices involved cost indicators measured in per capita income which yielded more sensitive results. Low income level countries exhibited more sensitivity in their rankings and less agreement between the benchmark rankings and our multi-method based rankings than higher income country groups.
Resumo:
This dissertation develops a new mathematical approach that overcomes the effect of a data processing phenomenon known as “histogram binning” inherent to flow cytometry data. A real-time procedure is introduced to prove the effectiveness and fast implementation of such an approach on real-world data. The histogram binning effect is a dilemma posed by two seemingly antagonistic developments: (1) flow cytometry data in its histogram form is extended in its dynamic range to improve its analysis and interpretation, and (2) the inevitable dynamic range extension introduces an unwelcome side effect, the binning effect, which skews the statistics of the data, undermining as a consequence the accuracy of the analysis and the eventual interpretation of the data. ^ Researchers in the field contended with such a dilemma for many years, resorting either to hardware approaches that are rather costly with inherent calibration and noise effects; or have developed software techniques based on filtering the binning effect but without successfully preserving the statistical content of the original data. ^ The mathematical approach introduced in this dissertation is so appealing that a patent application has been filed. The contribution of this dissertation is an incremental scientific innovation based on a mathematical framework that will allow researchers in the field of flow cytometry to improve the interpretation of data knowing that its statistical meaning has been faithfully preserved for its optimized analysis. Furthermore, with the same mathematical foundation, proof of the origin of such an inherent artifact is provided. ^ These results are unique in that new mathematical derivations are established to define and solve the critical problem of the binning effect faced at the experimental assessment level, providing a data platform that preserves its statistical content. ^ In addition, a novel method for accumulating the log-transformed data was developed. This new method uses the properties of the transformation of statistical distributions to accumulate the output histogram in a non-integer and multi-channel fashion. Although the mathematics of this new mapping technique seem intricate, the concise nature of the derivations allow for an implementation procedure that lends itself to a real-time implementation using lookup tables, a task that is also introduced in this dissertation. ^
Resumo:
In communities throughout the developing world, faith-based organizations (FBOs) focus on goals such as eradicating poverty, bolstering local economies, and fostering community development, while premising their activities and interaction with local communities on theological and religious understandings. Due to their pervasive interaction with participants, the religious ideologies of these FBOs impact the religious, economic, and social realities of communities. This study investigates the relationship between the international FBO, World Vision International (WVI), and changes to religious, economic, and social ideologies and practices in Andean indigenous communities in southern Peruvian. This study aims to contribute to the greater knowledge and understanding of (1) institutionalized development strategies, (2) faith-based development, and (3) how institutionalized development interacts with processes of socio-cultural change. Based on fifteen months of field research, this study involved qualitative and quantitative methods of participant-observation, interviews, surveys, and document analysis. Data were primarily collected from households from a sample of eight communities in the Pitumarca and Combapata districts, department of Canchis, province of Cusco, Peru where two WVI Area Development Programs were operating. Research findings reveal that there is a relationship between WVI’s intervention and some changes to religious, economic, and social structure (values, ideologies, and norms) and practices, demonstrating that structure and practices change when social systems are altered by new social actors. Findings also revealed that the impacts of WVI’s intervention greatly increased over the course of several years, demonstrating that changes in structure and practice occur gradually and need a period of time to take root. Finally, results showed that the impacts of WVI’s intervention were primarily limited to those most closely involved with the organization, revealing that the ability of one social actor to incite changes in the structure and practice of another actor is associated with the intensity of the relationship between the social actors. The findings of this study should be useful in ascertaining deductions and strengthening understandings of how faith-based development organizations impact aspects of religious, economic, and social life in the areas where they work.
Resumo:
Communication has become an essential function in our civilization. With the increasing demand for communication channels, it is now necessary to find ways to optimize the use of their bandwidth. One way to achieve this is by transforming the information before it is transmitted. This transformation can be performed by several techniques. One of the newest of these techniques is the use of wavelets. Wavelet transformation refers to the act of breaking down a signal into components called details and trends by using small waveforms that have a zero average in the time domain. After this transformation the data can be compressed by discarding the details, transmitting the trends. In the receiving end, the trends are used to reconstruct the image. In this work, the wavelet used for the transformation of an image will be selected from a library of available bases. The accuracy of the reconstruction, after the details are discarded, is dependent on the wavelets chosen from the wavelet basis library. The system developed in this thesis takes a 2-D image and decomposes it using a wavelet bank. A digital signal processor is used to achieve near real-time performance in this transformation task. A contribution of this thesis project is the development of DSP-based test bed for the future development of new real-time wavelet transformation algorithms.
Resumo:
This dissertation develops a new mathematical approach that overcomes the effect of a data processing phenomenon known as "histogram binning" inherent to flow cytometry data. A real-time procedure is introduced to prove the effectiveness and fast implementation of such an approach on real-world data. The histogram binning effect is a dilemma posed by two seemingly antagonistic developments: (1) flow cytometry data in its histogram form is extended in its dynamic range to improve its analysis and interpretation, and (2) the inevitable dynamic range extension introduces an unwelcome side effect, the binning effect, which skews the statistics of the data, undermining as a consequence the accuracy of the analysis and the eventual interpretation of the data. Researchers in the field contended with such a dilemma for many years, resorting either to hardware approaches that are rather costly with inherent calibration and noise effects; or have developed software techniques based on filtering the binning effect but without successfully preserving the statistical content of the original data. The mathematical approach introduced in this dissertation is so appealing that a patent application has been filed. The contribution of this dissertation is an incremental scientific innovation based on a mathematical framework that will allow researchers in the field of flow cytometry to improve the interpretation of data knowing that its statistical meaning has been faithfully preserved for its optimized analysis. Furthermore, with the same mathematical foundation, proof of the origin of such an inherent artifact is provided. These results are unique in that new mathematical derivations are established to define and solve the critical problem of the binning effect faced at the experimental assessment level, providing a data platform that preserves its statistical content. In addition, a novel method for accumulating the log-transformed data was developed. This new method uses the properties of the transformation of statistical distributions to accumulate the output histogram in a non-integer and multi-channel fashion. Although the mathematics of this new mapping technique seem intricate, the concise nature of the derivations allow for an implementation procedure that lends itself to a real-time implementation using lookup tables, a task that is also introduced in this dissertation.
Resumo:
Dissolved organic matter (DOM) is a complex mixture of organic compounds and represents the largest reservoirs of carbon (C) on earth. Particulate organic matter (POM) is another important carbon component in C cycling and controls a variety of biogeochemical processes. Estuaries, as important interfaces between land and ocean, play important roles in retaining and transforming such organic matter (OM) and serve as both sources and sinks of DOM and POM. There is a diverse array of both autochthonous and allochthonous OM sources in wetland/estuarine ecosystems. A comprehensive study on the sources, transformation and fate of OM in such ecosystems is essential in advancing our understanding of C cycling and better constraining the global C budget. In this work, DOM characteristics were investigated in different estuaries. Dissolved organic matter source strengths and dynamics were assessed in a seagrass-dominated subtropical estuarine lagoon. DOM dynamics controlled by hydrology and seagrass primary productivity were confirmed, and the primary source of DOM was quantified using the combination of excitation emission matrix fluorescence with parallel factor analysis (EEM-PARAFAC) and stable C isotope analysis. Seagrass can contribute up to 72% of the DOM in the study area. The spatial and temporal variation of DOM dynamics was also studied in a freshwated dominated estuary fringed with extensive salt marshes. The data showed that DOM was primarily derived from freshwater marshes and controlled by hydrology while salt marsh plants play a significant role in structuring the distribution patterns of DOM quality and quantity. The OM dynamics was also investigated in a mangrove-dominate estuary and a comparative study was conducted between the DOM and POM pools. The results revealed both similarity and dissimilarity in DOM and POM composition. The dynamics of both OM pools are largely uncoupled as a result of source differences. Fringe mangrove swamps are suggested to export similar amounts of DOM and POM and should be considered as an important source in coastal C budgets. Lastly, chemical characterizations were conducted on the featured fluorescence component in OM in an attempt to better understand the composition and origins of the specific PARAFAC component. The traditionally defined ‘protein-like’ fluorescence was found to contain both proteinaceous and phenolic compounds, suggesting that the application of this parameter as a proxy for amino acid content and bioavailability may be limited.