12 resultados para L71 - Mining, Extraction, and Refining:

em Digital Commons at Florida International University


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Due to the rapid advances in computing and sensing technologies, enormous amounts of data are being generated everyday in various applications. The integration of data mining and data visualization has been widely used to analyze these massive and complex data sets to discover hidden patterns. For both data mining and visualization to be effective, it is important to include the visualization techniques in the mining process and to generate the discovered patterns for a more comprehensive visual view. In this dissertation, four related problems: dimensionality reduction for visualizing high dimensional datasets, visualization-based clustering evaluation, interactive document mining, and multiple clusterings exploration are studied to explore the integration of data mining and data visualization. In particular, we 1) propose an efficient feature selection method (reliefF + mRMR) for preprocessing high dimensional datasets; 2) present DClusterE to integrate cluster validation with user interaction and provide rich visualization tools for users to examine document clustering results from multiple perspectives; 3) design two interactive document summarization systems to involve users efforts and generate customized summaries from 2D sentence layouts; and 4) propose a new framework which organizes the different input clusterings into a hierarchical tree structure and allows for interactive exploration of multiple clustering solutions.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Peruvian economy depends for its growth on the export of natural resources and investment in the mining and hydrocarbon sectors. Peruvian governments and mining corporations have confronted anti-mining protests in different ways. While the current government has introduced policies of social inclusion to soften the negative effects of the operations of mining capital and policies of dialogue to engage social actors with the essence of governmental policies, mining companies use corporate social responsibility programs as a cover for the devastating effects of their operations on the environment and the livelihoods and habitats of the indigenous and peasant communities. Curiously, in the current context of the declining commodity prices and export volumes the Peruvian government strengthens its extractivist model of development. This article argues that whatever government that follows the rules of capital cannot but favor the corporations. It points out the main adversaries of the indigenous and peasant communities and the problems to transform the locally and/or regionally struggle into a nationwide battle for another development model.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

There is an increasing demand for DNA analysis because of the sensitivity of the method and the ability to uniquely identify and distinguish individuals with a high degree of certainty. But this demand has led to huge backlogs in evidence lockers since the current DNA extraction protocols require long processing time. The DNA analysis procedure becomes more complicated when analyzing sexual assault casework samples where the evidence contains more than one contributor. Additional processing to separate different cell types in order to simplify the final data interpretation further contributes to the existing cumbersome protocols. The goal of the present project is to develop a rapid and efficient extraction method that permits selective digestion of mixtures. ^ Selective recovery of male DNA was achieved with as little as 15 minutes lysis time upon exposure to high pressure under alkaline conditions. Pressure cycling technology (PCT) is carried out in a barocycler that has a small footprint and is semi-automated. Typically less than 10% male DNA is recovered using the standard extraction protocol for rape kits, almost seven times more male DNA was recovered from swabs using this novel method. Various parameters including instrument setting and buffer composition were optimized to achieve selective recovery of sperm DNA. Some developmental validation studies were also done to determine the efficiency of this method in processing samples exposed to various conditions that can affect the quality of the extraction and the final DNA profile. ^ Easy to use interface, minimal manual interference and the ability to achieve high yields with simple reagents in a relatively short time make this an ideal method for potential application in analyzing sexual assault samples.^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

There is an increasing demand for DNA analysis because of the sensitivity of the method and the ability to uniquely identify and distinguish individuals with a high degree of certainty. But this demand has led to huge backlogs in evidence lockers since the current DNA extraction protocols require long processing time. The DNA analysis procedure becomes more complicated when analyzing sexual assault casework samples where the evidence contains more than one contributor. Additional processing to separate different cell types in order to simplify the final data interpretation further contributes to the existing cumbersome protocols. The goal of the present project is to develop a rapid and efficient extraction method that permits selective digestion of mixtures. Selective recovery of male DNA was achieved with as little as 15 minutes lysis time upon exposure to high pressure under alkaline conditions. Pressure cycling technology (PCT) is carried out in a barocycler that has a small footprint and is semi-automated. Typically less than 10% male DNA is recovered using the standard extraction protocol for rape kits, almost seven times more male DNA was recovered from swabs using this novel method. Various parameters including instrument setting and buffer composition were optimized to achieve selective recovery of sperm DNA. Some developmental validation studies were also done to determine the efficiency of this method in processing samples exposed to various conditions that can affect the quality of the extraction and the final DNA profile. Easy to use interface, minimal manual interference and the ability to achieve high yields with simple reagents in a relatively short time make this an ideal method for potential application in analyzing sexual assault samples.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objectives of this research are to analyze and develop a modified Principal Component Analysis (PCA) and to develop a two-dimensional PCA with applications in image processing. PCA is a classical multivariate technique where its mathematical treatment is purely based on the eigensystem of positive-definite symmetric matrices. Its main function is to statistically transform a set of correlated variables to a new set of uncorrelated variables over $\IR\sp{n}$ by retaining most of the variations present in the original variables.^ The variances of the Principal Components (PCs) obtained from the modified PCA form a correlation matrix of the original variables. The decomposition of this correlation matrix into a diagonal matrix produces a set of orthonormal basis that can be used to linearly transform the given PCs. It is this linear transformation that reproduces the original variables. The two-dimensional PCA can be devised as a two successive of one-dimensional PCA. It can be shown that, for an $m\times n$ matrix, the PCs obtained from the two-dimensional PCA are the singular values of that matrix.^ In this research, several applications for image analysis based on PCA are developed, i.e., edge detection, feature extraction, and multi-resolution PCA decomposition and reconstruction. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study focuses on empirical investigations and seeks implications by utilizing three different methodologies to test various aspects of trader behavior. The first methodology utilizes Prospect Theory to determine trader behavior during periods of extreme wealth contracting periods. Secondly, a threshold model to examine the sentiment variable is formulated and thirdly a study is made of the contagion effect and trader behavior. ^ The connection between consumers' sense of financial well-being or sentiment and stock market performance has been studied at length. However, without data on actual versus experimental performance, implications based on this relationship are meaningless. The empirical agenda included examining a proprietary file of daily trader activities over a five-year period. Overall, during periods of extreme wealth altering conditions, traders "satisfice" rather than choose the "best" alternative. A trader's degree of loss aversion depends on his/her prior investment performance. A model that explains the behavior of traders during periods of turmoil is developed. Prospect Theory and the data file influenced the design of the model. ^ Additional research included testing a model that permitted the data to signal the crisis through a threshold model. The third empirical study sought to investigate the existence of contagion caused by declining global wealth effects using evidence from the mining industry in Canada. Contagion, where a financial crisis begins locally and subsequently spreads elsewhere, has been studied in terms of correlations among similar regions. The results provide support for Prospect Theory in two out of the three empirical studies. ^ The dissertation emphasizes the need for specifying precise, testable models of investors' expectations by providing tools to identify paradoxical behavior patterns. True enhancements in this field must include empirical research utilizing reliable data sources to mitigate data mining problems and allow researchers to distinguish between expectations-based and risk-based explanations of behavior. Through this type of research, it may be possible to systematically exploit "irrational" market behavior. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

There is limited scientific knowledge on the composition of human odor from different biological specimens and the effect that physiological and psychological health conditions could have on them. There is currently no direct comparison of the volatile organic compounds (VOCs) emanating from different biological specimens collected from healthy individuals as well as individuals with certain diagnosed medical conditions. Therefore the question of matching VOCs present in human odor across various biological samples and across health statuses remains unanswered. The main purpose of this study was to use analytical instrumental methods to compare the VOCs from different biological specimens from the same individual and to compare the populations evaluated in this project. The goals of this study were to utilize headspace solid-phase microextraction gas chromatography mass spectrometry (HS-SPME-GC/MS) to evaluate its potential for profiling VOCs from specimens collected using standard forensic and medical methods over three different populations: healthy group with no diagnosed medical or psychological condition, one group with diagnosed type 2 diabetes, and one group with diagnosed major depressive disorder. The pre-treatment methods of collection materials developed for the study allowed for the removal of targeted VOCs from the sampling kits prior to sampling, extraction and analysis. Optimized SPME-GC/MS conditions has been demonstrated to be capable of sampling, identifying and differentiating the VOCs present in the five biological specimens collected from different subjects and yielded excellent detection limits for the VOCs from buccal swab, breath, blood, and urine with average limits of detection of 8.3 ng. Visual, Spearman rank correlation, and PCA comparisons of the most abundant and frequent VOCs from each specimen demonstrated that each specimen has characteristic VOCs that allow them to be differentiated for both healthy and diseased individuals. Preliminary comparisons of VOC profiles of healthy individuals, patients with type 2 diabetes, and patients with major depressive disorder revealed compounds that could be used as potential biomarkers to differentiate between healthy and diseased individuals. Finally, a human biological specimen compound database has been created compiling the volatile compounds present in the emanations of human hand odor, oral fluids, breath, blood, and urine.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

I conducted this study to provide insights toward deepening understanding of association between culture and writing by building, assessing, and refining a conceptual model of second language writing. To do this, I examined culture and coherence as well as the relationship between them through a mixed methods research design. Coherence has been an important and complex concept in ESL/EFL writing. I intended to study the concept of coherence in the research context of contrastive rhetoric, comparing the coherence quality in argumentative essays written by undergraduates in Mainland China and their U.S. peers. In order to analyze the complex concept of coherence, I synthesized five linguistic theories of coherence: Halliday and Hasan's cohesion theory, Carroll's theory of coherence, Enkvist's theory of coherence, Topical Structure Analysis, and Toulmin's Model. Based upon the synthesis, 16 variables were generated. Across these 16 variables, Hotelling t-test statistical analysis was conducted to predict differences in argumentative coherence between essays written by two groups of participants. In order to complement the statistical analysis, I conducted 30 interviews of the writers in the studies. Participants' responses were analyzed with open and axial coding. By analyzing the empirical data, I refined the conceptual model by adding more categories and establishing associations among them. The study found that U.S. students made use of more pronominal reference. Chinese students adopted more lexical devices of reiteration and extended paralleling progression. The interview data implied that the difference may be associated with the difference in linguistic features and rhetorical conventions in Chinese and English. As far as Toulmin's Model is concerned, Chinese students scored higher on data than their U.S. peers. According to the interview data, this may be due to the fact that Toulmin's Model, modified as three elements of arguments, have been widely and long taught in Chinese writing instruction while U.S. interview participants said that they were not taught to write essays according to Toulmin's Model. Implications were generated from the process of textual data analysis and the formulation of structural model defining coherence. These implications were aimed at informing writing instruction, assessment, peer-review, and self-revision.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

My paper discusses three different ways in which stray dogs have been intertwined with ideologies of economic and urban development in Romania. I categorize results from archival and ethnographic research under three major time periods: early socialism, late socialism, and post-socialism. During early socialism stray dogs were seen to be damaging the soviet economy by killing species that humans could also hunt, like rabbits. During late socialism, stray dogs appeared as the enemies of the communist city, and the department of urban sanitation was given orders to poison dogs with strychnine. Finally, the increasing number of stray dogs in Bucharest after the collapse of communism was seen as a direct result of former communist demolitions, and was also taken as a sign of the collapsing state. Through such examples my paper discusses how the state and particular population groups have seen dogs as parts of an unwanted and dangerous nature, rather than a species that needs to be protected. I argue that distinctions of nature and culture have served discourses of civilization and the view of Bucharest as a model socialist, and then European city. Throughout my paper I juxtapose the treatment of stray dogs with other, more “valued” urban natures like the protection of parks, the wide-spread hobby of pigeon breeding during socialist years, the most recent debate on saving the rural area of Rosia Montana from non-environmentally friendly methods of gold extraction, and the current trend of healthy eating and living.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Purpose: Metabolic syndrome (MetS) is associated with the development of cardiovascular disease (CVD) and type 2 diabetes. Decreases in circulating adiponectin and ghrelin have been associated with MetS. Our primary aim was to evaluate the relationship of MetS with adiponectin and ghrelin for Cuban Americans with and without type 2 diabetes. Methods: Cross-sectional study of 367 adults, self identified as Cuban extraction and randomly recruited from a mailing list of Broward and Miami-Dade counties. Fasted whole blood for adiponectin (ADPN) was collected using K3EDTA tubes and measured by ELISA. Ghrelin was assayed with fasted blood plasma by Enzyme Immunometric Assay. MetS and 10-year risk for coronary heart disease (CHD) were determined using the ATP III criteria. Results: Adiponectin (F=51.8, R2 =0.21 p<0.001) and ghrelin (F=12.77, R 2 =0.06, p<0.001) differed by diabetes status (ANOVA) not age and gender. In stepwise linear regression models triglyceride levels ≥ 150 mg/dL negatively corresponded (coefficient = -0.23) with ghrelin levels for persons without diabetes (F=7.45, R2 =0.053, p=0.007); abdominal obesity and fasting plasma glucose predicted high sensitivity C-reactive protein (hs-CRP) for persons with and without diabetes (F=16.3, R2 = 0.144, p <0.001). Conclusion: Low ghrelin levels were associated with MetS regardless of diabetes status. High adiponectin levels were related to a low probability for those without diabetes only. There was a positive association of hs-CRP with BMI, MetS and number of MetS components.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study focuses on empirical investigations and seeks implications by utilizing three different methodologies to test various aspects of trader behavior. The first methodology utilizes Prospect Theory to determine trader behavior during periods of extreme wealth contracting periods. Secondly, a threshold model to examine the sentiment variable is formulated and thirdly a study is made of the contagion effect and trader behavior. The connection between consumers' sense of financial well-being or sentiment and stock market performance has been studied at length. However, without data on actual versus experimental performance, implications based on this relationship are meaningless. The empirical agenda included examining a proprietary file of daily trader activities over a five-year period. Overall, during periods of extreme wealth altering conditions, traders "satisfice" rather than choose the "best" alternative. A trader's degree of loss aversion depends on his/her prior investment performance. A model that explains the behavior of traders during periods of turmoil is developed. Prospect Theory and the data file influenced the design of the model. Additional research included testing a model that permitted the data to signal the crisis through a threshold model. The third empirical study sought to investigate the existence of contagion caused by declining global wealth effects using evidence from the mining industry in Canada. Contagion, where a financial crisis begins locally and subsequently spreads elsewhere, has been studied in terms of correlations among similar regions. The results provide support for Prospect Theory in two out of the three empirical studies. The dissertation emphasizes the need for specifying precise, testable models of investors' expectations by providing tools to identify paradoxical behavior patterns. True enhancements in this field must include empirical research utilizing reliable data sources to mitigate data mining problems and allow researchers to distinguish between expectations-based and risk-based explanations of behavior. Through this type of research, it may be possible to systematically exploit "irrational" market behavior.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

As the Web evolves unexpectedly fast, information grows explosively. Useful resources become more and more difficult to find because of their dynamic and unstructured characteristics. A vertical search engine is designed and implemented towards a specific domain. Instead of processing the giant volume of miscellaneous information distributed in the Web, a vertical search engine targets at identifying relevant information in specific domains or topics and eventually provides users with up-to-date information, highly focused insights and actionable knowledge representation. As the mobile device gets more popular, the nature of the search is changing. So, acquiring information on a mobile device poses unique requirements on traditional search engines, which will potentially change every feature they used to have. To summarize, users are strongly expecting search engines that can satisfy their individual information needs, adapt their current situation, and present highly personalized search results. In my research, the next generation vertical search engine means to utilize and enrich existing domain information to close the loop of vertical search engine's system that mutually facilitate knowledge discovering, actionable information extraction, and user interests modeling and recommendation. I investigate three problems in which domain taxonomy plays an important role, including taxonomy generation using a vertical search engine, actionable information extraction based on domain taxonomy, and the use of ensemble taxonomy to catch user's interests. As the fundamental theory, ultra-metric, dendrogram, and hierarchical clustering are intensively discussed. Methods on taxonomy generation using my research on hierarchical clustering are developed. The related vertical search engine techniques are practically used in Disaster Management Domain. Especially, three disaster information management systems are developed and represented as real use cases of my research work.