18 resultados para Support Vector Machines and Naive Bayes Classifier
em Digital Commons at Florida International University
Resumo:
This research is to establish new optimization methods for pattern recognition and classification of different white blood cells in actual patient data to enhance the process of diagnosis. Beckman-Coulter Corporation supplied flow cytometry data of numerous patients that are used as training sets to exploit the different physiological characteristics of the different samples provided. The methods of Support Vector Machines (SVM) and Artificial Neural Networks (ANN) were used as promising pattern classification techniques to identify different white blood cell samples and provide information to medical doctors in the form of diagnostic references for the specific disease states, leukemia. The obtained results prove that when a neural network classifier is well configured and trained with cross-validation, it can perform better than support vector classifiers alone for this type of data. Furthermore, a new unsupervised learning algorithm---Density based Adaptive Window Clustering algorithm (DAWC) was designed to process large volumes of data for finding location of high data cluster in real-time. It reduces the computational load to ∼O(N) number of computations, and thus making the algorithm more attractive and faster than current hierarchical algorithms.
Resumo:
This dissertation established a state-of-the-art programming tool for designing and training artificial neural networks (ANNs) and showed its applicability to brain research. The developed tool, called NeuralStudio, allows users without programming skills to conduct studies based on ANNs in a powerful and very user friendly interface. A series of unique features has been implemented in NeuralStudio, such as ROC analysis, cross-validation, network averaging, topology optimization, and optimization of the activation function’s slopes. It also included a Support Vector Machines module for comparison purposes. Once the tool was fully developed, it was applied to two studies in brain research. In the first study, the goal was to create and train an ANN to detect epileptic seizures from subdural EEG. This analysis involved extracting features from the spectral power in the gamma frequencies. In the second application, a unique method was devised to link EEG recordings to epileptic and nonepileptic subjects. The contribution of this method consisted of developing a descriptor matrix that can be used to represent any EEG file regarding its duration and the number of electrodes. The first study showed that the inter-electrode mean of the spectral power in the gamma frequencies and its duration above a specific threshold performs better than the other frequencies in seizure detection, exhibiting an accuracy of 95.90%, a sensitivity of 92.59%, and a specificity of 96.84%. The second study yielded that Hjorth’s parameter activity is sufficient to accurately relate EEG to epileptic and non-epileptic subjects. After testing, accuracy, sensitivity and specificity of the classifier were all above 0.9667. Statistical tests measured the superiority of activity at over 99.99 % certainty. It was demonstrated that (1) the spectral power in the gamma frequencies is highly effective in locating seizures from EEG and (2) activity can be used to link EEG recordings to epileptic and non-epileptic subjects. These two studies required high computational load and could be addressed thanks to NeuralStudio. From a medical perspective, both methods proved the merits of NeuralStudio in brain research applications. For its outstanding features, NeuralStudio has been recently awarded a patent (US patent No. 7502763).
Resumo:
This dissertation established a state-of-the-art programming tool for designing and training artificial neural networks (ANNs) and showed its applicability to brain research. The developed tool, called NeuralStudio, allows users without programming skills to conduct studies based on ANNs in a powerful and very user friendly interface. A series of unique features has been implemented in NeuralStudio, such as ROC analysis, cross-validation, network averaging, topology optimization, and optimization of the activation function’s slopes. It also included a Support Vector Machines module for comparison purposes. Once the tool was fully developed, it was applied to two studies in brain research. In the first study, the goal was to create and train an ANN to detect epileptic seizures from subdural EEG. This analysis involved extracting features from the spectral power in the gamma frequencies. In the second application, a unique method was devised to link EEG recordings to epileptic and non-epileptic subjects. The contribution of this method consisted of developing a descriptor matrix that can be used to represent any EEG file regarding its duration and the number of electrodes. The first study showed that the inter-electrode mean of the spectral power in the gamma frequencies and its duration above a specific threshold performs better than the other frequencies in seizure detection, exhibiting an accuracy of 95.90%, a sensitivity of 92.59%, and a specificity of 96.84%. The second study yielded that Hjorth’s parameter activity is sufficient to accurately relate EEG to epileptic and non-epileptic subjects. After testing, accuracy, sensitivity and specificity of the classifier were all above 0.9667. Statistical tests measured the superiority of activity at over 99.99 % certainty. It was demonstrated that 1) the spectral power in the gamma frequencies is highly effective in locating seizures from EEG and 2) activity can be used to link EEG recordings to epileptic and non-epileptic subjects. These two studies required high computational load and could be addressed thanks to NeuralStudio. From a medical perspective, both methods proved the merits of NeuralStudio in brain research applications. For its outstanding features, NeuralStudio has been recently awarded a patent (US patent No. 7502763).
Resumo:
This research aims at a study of the hybrid flow shop problem which has parallel batch-processing machines in one stage and discrete-processing machines in other stages to process jobs of arbitrary sizes. The objective is to minimize the makespan for a set of jobs. The problem is denoted as: FF: batch1,sj:Cmax. The problem is formulated as a mixed-integer linear program. The commercial solver, AMPL/CPLEX, is used to solve problem instances to their optimality. Experimental results show that AMPL/CPLEX requires considerable time to find the optimal solution for even a small size problem, i.e., a 6-job instance requires 2 hours in average. A bottleneck-first-decomposition heuristic (BFD) is proposed in this study to overcome the computational (time) problem encountered while using the commercial solver. The proposed BFD heuristic is inspired by the shifting bottleneck heuristic. It decomposes the entire problem into three sub-problems, and schedules the sub-problems one by one. The proposed BFD heuristic consists of four major steps: formulating sub-problems, prioritizing sub-problems, solving sub-problems and re-scheduling. For solving the sub-problems, two heuristic algorithms are proposed; one for scheduling a hybrid flow shop with discrete processing machines, and the other for scheduling parallel batching machines (single stage). Both consider job arrival and delivery times. An experiment design is conducted to evaluate the effectiveness of the proposed BFD, which is further evaluated against a set of common heuristics including a randomized greedy heuristic and five dispatching rules. The results show that the proposed BFD heuristic outperforms all these algorithms. To evaluate the quality of the heuristic solution, a procedure is developed to calculate a lower bound of makespan for the problem under study. The lower bound obtained is tighter than other bounds developed for related problems in literature. A meta-search approach based on the Genetic Algorithm concept is developed to evaluate the significance of further improving the solution obtained from the proposed BFD heuristic. The experiment indicates that it reduces the makespan by 1.93 % in average within a negligible time when problem size is less than 50 jobs.
Resumo:
The eggs of the dengue fever vector Aedes aegypti possess the ability to undergo an extended quiescence period hosting a fully developed first instar larvae within its chorion. As a result of this life history stage, pharate larvae can withstand months of dormancy inside the egg where they depend on stored reserves of maternal origin. This adaptation known as pharate first instar quiescence, allows A. aegypti to cope with fluctuations in water availability. An examination of this fundamental adaptation has shown that there are trade-offs associated with it. ^ Aedes aegypti mosquitoes are frequently associated with urban habitats that may contain metal pollution. My research has demonstrated that the duration of this quiescence and the extent of nutritional depletion associated with it affects the physiology and survival of larvae that hatch in a suboptimal habitat; nutrient reserves decrease during pharate first instar quiescence and alter subsequent larval and adult fitness. The duration of quiescence compromises metal tolerance physiology and is coupled to a decrease in metallothionein mRNA levels. My findings also indicate that even low levels of environmentally relevant larval metal stress alter the parameters that determine vector capacity. ^ My research has also demonstrated that extended pharate first instar quiescence can elicit a plastic response resulting in an adult phenotype distinct from adults reared from short quiescence eggs. Extended pharate first instar quiescence affects the performance and reproductive fitness of the adult female mosquito as well as the nutritional status of its progeny via maternal effects in an adaptive manner, i.e., anticipatory phenotypic plasticity results as a consequence of the duration of pharate first instar quiescence and alternative phenotypes may exist for this mosquito with quiescence serving as a cue possibly signaling the environmental conditions that follow a dry period. M findings may explain, in part, A. aegypti's success as a vector and its geographic distribution and have implications for its vector capacity and control.^
Resumo:
Voice communication systems such as Voice-over IP (VoIP), Public Switched Telephone Networks, and Mobile Telephone Networks, are an integral means of human tele-interaction. These systems pose distinctive challenges due to their unique characteristics such as low volume, burstiness and stringent delay/loss requirements across heterogeneous underlying network technologies. Effective quality evaluation methodologies are important for system development and refinement, particularly by adopting user feedback based measurement. Presently, most of the evaluation models are system-centric (Quality of Service or QoS-based), which questioned us to explore a user-centric (Quality of Experience or QoE-based) approach as a step towards the human-centric paradigm of system design. We research an affect-based QoE evaluation framework which attempts to capture users' perception while they are engaged in voice communication. Our modular approach consists of feature extraction from multiple information sources including various affective cues and different classification procedures such as Support Vector Machines (SVM) and k-Nearest Neighbor (kNN). The experimental study is illustrated in depth with detailed analysis of results. The evidences collected provide the potential feasibility of our approach for QoE evaluation and suggest the consideration of human affective attributes in modeling user experience.
Resumo:
The eggs of the dengue fever vector Aedes aegypti possess the ability to undergo an extended quiescence period hosting a fully developed first instar larvae within its chorion. As a result of this life history stage, pharate larvae can withstand months of dormancy inside the egg where they depend on stored reserves of maternal origin. This adaptation known as pharate first instar quiescence, allows A. aegypti to cope with fluctuations in water availability. An examination of this fundamental adaptation has shown that there are trade-offs associated with it. Aedes aegypti mosquitoes are frequently associated with urban habitats that may contain metal pollution. My research has demonstrated that the duration of this quiescence and the extent of nutritional depletion associated with it affects the physiology and survival of larvae that hatch in a suboptimal habitat; nutrient reserves decrease during pharate first instar quiescence and alter subsequent larval and adult fitness. The duration of quiescence compromises metal tolerance physiology and is coupled to a decrease in metallothionein mRNA levels. My findings also indicate that even low levels of environmentally relevant larval metal stress alter the parameters that determine vector capacity. My research has also demonstrated that extended pharate first instar quiescence can elicit a plastic response resulting in an adult phenotype distinct from adults reared from short quiescence eggs. Extended pharate first instar quiescence affects the performance and reproductive fitness of the adult female mosquito as well as the nutritional status of its progeny via maternal effects in an adaptive manner, i.e., anticipatory phenotypic plasticity results as a consequence of the duration of pharate first instar quiescence and alternative phenotypes may exist for this mosquito with quiescence serving as a cue possibly signaling the environmental conditions that follow a dry period. M findings may explain, in part, A. aegypti’s success as a vector and its geographic distribution and have implications for its vector capacity and control.
Resumo:
This dissertation identifies, examines, and assesses the relative influence of identified empirically and conceptually relevant variables on incarcerated substance abusers' expectations of postrelease adjustment. A purposive sampling procedure was used to recruit 101 male and female substance-abusing offenders participating in prison- and jail-based drug treatment programs in south Florida. A 92-item survey questionnaire was used to collect basic demographic data; measure inmate preincarceration characteristics, social support, and rehabilitation program participation; and record archival data. Regression equations were developed utilizing ten different measures of the participants' expectations of their postrelease adjustment. Two equations yielded statistically significant F ratios; maintaining a stable living and maintaining abstinence. Twenty-two percent of the variance in respondents' expectations of maintaining a stable living was explained by preincarceration characteristics, social support, and rehabilitation program participation (F = 1.89; df = 13,87; p $<$.05). The only significant predictor variable was perception of social support (b = $-$.05; t = $-$3.6; p $<$.001). Twenty-three percent of the variance in respondents' expectations of maintaining abstinence from substances was explained by preincarceration characteristics, social support, and rehabilitation program participation (F = 2; df = 13,87; p $<$.05). Once again, the only significant predictor variable was perception of social support. The results of the analyses indicate that social support was the only important variable for understanding these respondents' efficacy expectations of postrelease abstinence and stable living. The results of this investigation demonstrate the complexity of the social support variable for prisoners, and identify social support as a potential rehabilitative resource for substance-abusing inmates. The results of this investigation underscore the importance of continued, detailed empirical study in order to understand and clarify how social support, efficacy expectations, and actual postrelease performance interrelate for this population of offenders.
Resumo:
The current study was designed to explore the salience of social support, immigrant status, and risk in middle childhood and early adolescence across two time periods as indicated by measures of school adjustment and well-being. Participants included 691 children of public elementary schools in grades 4 and 6 who were interviewed in 1997 (Time 1) and reinterviewed two years later (Time 2); 539 were U.S.-born, and 152 were foreign-born. ^ Repeated measures multivariate analyses of variance (MANOVA's) were conducted to assess the effects of immigrant status and risk on total support, well-being, and school adjustment from Time 1 to Time 2. Follow-up analyses, including Student-Newman-Keuls post hoc tests, were used to test the significance of the differences among the means of support categories (low and high), immigrant status (U.S. born and non-U.S. born), risk (low and high) and time (time 1 and time 2). ^ Results showed that immigrant participants in the high risk group reported significantly lower levels of support than their peers. Further, children of low risk at Time 2 indicated the highest levels of support. Second, immigrant preadolescents, preadolescents who reported low levels of social support, and preadolescents of the high risk reported lower levels of emotional well-being. There was also an interaction of support by risk by time, indicating that children who are at risk and had low levels of social support reported more emotional problems at Time 1. Finally, preadolescents who are at risk and preadolescents who reported lower levels of support were more likely to show school adaptation problems. Findings from this study highlight the importance of a multivariable approach to the study of support, emotional adjustment, and academic adjustment of immigrant preadolescents. ^
Resumo:
The need to provide computers with the ability to distinguish the affective state of their users is a major requirement for the practical implementation of affective computing concepts. This dissertation proposes the application of signal processing methods on physiological signals to extract from them features that can be processed by learning pattern recognition systems to provide cues about a person's affective state. In particular, combining physiological information sensed from a user's left hand in a non-invasive way with the pupil diameter information from an eye-tracking system may provide a computer with an awareness of its user's affective responses in the course of human-computer interactions. In this study an integrated hardware-software setup was developed to achieve automatic assessment of the affective status of a computer user. A computer-based "Paced Stroop Test" was designed as a stimulus to elicit emotional stress in the subject during the experiment. Four signals: the Galvanic Skin Response (GSR), the Blood Volume Pulse (BVP), the Skin Temperature (ST) and the Pupil Diameter (PD), were monitored and analyzed to differentiate affective states in the user. Several signal processing techniques were applied on the collected signals to extract their most relevant features. These features were analyzed with learning classification systems, to accomplish the affective state identification. Three learning algorithms: Naïve Bayes, Decision Tree and Support Vector Machine were applied to this identification process and their levels of classification accuracy were compared. The results achieved indicate that the physiological signals monitored do, in fact, have a strong correlation with the changes in the emotional states of the experimental subjects. These results also revealed that the inclusion of pupil diameter information significantly improved the performance of the emotion recognition system. ^
Resumo:
The purpose of this study was to determine if there was a difference in the self-determined evaluations of work performance and support needs by adults with mental retardation in supported employment and in sheltered workshop environments. The instrument, Job Observation and Behavior Scale: Opportunity for Self-Determination (JOBS: OSD; Brady, Rosenberg, & Frain, 2006), was administered to 38 adults with mental retardation from sheltered workshops and 32 adults with mental retardation from supported employment environments. Cross-tabulations with Chi-square tests and independent samples t-tests were conducted to evaluate differences between the two groups, sheltered workshop and supported work. Two Multivariate Analyses of Variance (MANOVAs) were conducted to determine the effect of work environment on Quality of Performance (QP) and Types of Support (TS) test scores and their subscales. ^ This study found that there were significant differences between the groups on the QP Behavior and Job Duties subscales. The sheltered workshop group perceived themselves as performing significantly better on job duties than the supported work group. Conversely, the supported work group perceived themselves to have better behavior than the sheltered workshop group. However, there were no significant differences between groups in their perception of support needs for the three subscales. ^ The findings imply that work environment affects the self-determined evaluations of work performance by adults with mental retardation. Recommendations for further study include (a) detailing the characteristics of supported work and sheltered workshops that support and/or discourage self-determined behaviors, (b) exploring the behavior of adults with mental retardation in sheltered workshops and supported work environments, and (c) analysis of the support needs for and understanding of them by adults with mental retardation in sheltered workshops and in supported work environments. ^
Resumo:
The purpose of this study is to investigate supervisory support as a moderator of the effects of role conflict and role ambiguity on emotional exhaustion and job satisfaction. This study also examines the moderating role of supervisory support on the relationship between emotional exhaustion and job satisfaction. Data were collected from a sample of frontline hotel employees in Northern Cyprus. The aforementioned relationships were tested based on hierarchical multiple regression analysis. The results demonstrate that supervisory support mitigates the impact of role conflict on emotional exhaustion and further reveal that supervisory support reduces the effect of emotional exhaustion on job satisfaction. There is no empirical support for the rest of the hypothesized relationships. Implications of the empirical results are discussed, and future research directions are offered.
Resumo:
The proliferation of legalized gaming has significantly changed the nature of the hospitality industry. While several aspects of gaming have flourished, none has become more popular, profitable, or technologically advanced as the slot machine. While more than half of all casino gambling, and earnings, is generated by slot machines, little has been written about the technology integral to these devices. The author describes the workings of computer-controlled slot machines and exposes some of the popular operating myths.
Resumo:
The rapid growth of virtualized data centers and cloud hosting services is making the management of physical resources such as CPU, memory, and I/O bandwidth in data center servers increasingly important. Server management now involves dealing with multiple dissimilar applications with varying Service-Level-Agreements (SLAs) and multiple resource dimensions. The multiplicity and diversity of resources and applications are rendering administrative tasks more complex and challenging. This thesis aimed to develop a framework and techniques that would help substantially reduce data center management complexity.^ We specifically addressed two crucial data center operations. First, we precisely estimated capacity requirements of client virtual machines (VMs) while renting server space in cloud environment. Second, we proposed a systematic process to efficiently allocate physical resources to hosted VMs in a data center. To realize these dual objectives, accurately capturing the effects of resource allocations on application performance is vital. The benefits of accurate application performance modeling are multifold. Cloud users can size their VMs appropriately and pay only for the resources that they need; service providers can also offer a new charging model based on the VMs performance instead of their configured sizes. As a result, clients will pay exactly for the performance they are actually experiencing; on the other hand, administrators will be able to maximize their total revenue by utilizing application performance models and SLAs. ^ This thesis made the following contributions. First, we identified resource control parameters crucial for distributing physical resources and characterizing contention for virtualized applications in a shared hosting environment. Second, we explored several modeling techniques and confirmed the suitability of two machine learning tools, Artificial Neural Network and Support Vector Machine, to accurately model the performance of virtualized applications. Moreover, we suggested and evaluated modeling optimizations necessary to improve prediction accuracy when using these modeling tools. Third, we presented an approach to optimal VM sizing by employing the performance models we created. Finally, we proposed a revenue-driven resource allocation algorithm which maximizes the SLA-generated revenue for a data center.^
Resumo:
In this study we have identified key genes that are critical in development of astrocytic tumors. Meta-analysis of microarray studies which compared normal tissue to astrocytoma revealed a set of 646 differentially expressed genes in the majority of astrocytoma. Reverse engineering of these 646 genes using Bayesian network analysis produced a gene network for each grade of astrocytoma (Grade I–IV), and ‘key genes’ within each grade were identified. Genes found to be most influential to development of the highest grade of astrocytoma, Glioblastoma multiforme were: COL4A1, EGFR, BTF3, MPP2, RAB31, CDK4, CD99, ANXA2, TOP2A, and SERBP1. All of these genes were up-regulated, except MPP2 (down regulated). These 10 genes were able to predict tumor status with 96–100% confidence when using logistic regression, cross validation, and the support vector machine analysis. Markov genes interact with NFkβ, ERK, MAPK, VEGF, growth hormone and collagen to produce a network whose top biological functions are cancer, neurological disease, and cellular movement. Three of the 10 genes - EGFR, COL4A1, and CDK4, in particular, seemed to be potential ‘hubs of activity’. Modified expression of these 10 Markov Blanket genes increases lifetime risk of developing glioblastoma compared to the normal population. The glioblastoma risk estimates were dramatically increased with joint effects of 4 or more than 4 Markov Blanket genes. Joint interaction effects of 4, 5, 6, 7, 8, 9 or 10 Markov Blanket genes produced 9, 13, 20.9, 26.7, 52.8, 53.2, 78.1 or 85.9%, respectively, increase in lifetime risk of developing glioblastoma compared to normal population. In summary, it appears that modified expression of several ‘key genes’ may be required for the development of glioblastoma. Further studies are needed to validate these ‘key genes’ as useful tools for early detection and novel therapeutic options for these tumors.