6 resultados para class imbalance problems
em Digital Commons at Florida International University
Resumo:
Thanks to the advanced technologies and social networks that allow the data to be widely shared among the Internet, there is an explosion of pervasive multimedia data, generating high demands of multimedia services and applications in various areas for people to easily access and manage multimedia data. Towards such demands, multimedia big data analysis has become an emerging hot topic in both industry and academia, which ranges from basic infrastructure, management, search, and mining to security, privacy, and applications. Within the scope of this dissertation, a multimedia big data analysis framework is proposed for semantic information management and retrieval with a focus on rare event detection in videos. The proposed framework is able to explore hidden semantic feature groups in multimedia data and incorporate temporal semantics, especially for video event detection. First, a hierarchical semantic data representation is presented to alleviate the semantic gap issue, and the Hidden Coherent Feature Group (HCFG) analysis method is proposed to capture the correlation between features and separate the original feature set into semantic groups, seamlessly integrating multimedia data in multiple modalities. Next, an Importance Factor based Temporal Multiple Correspondence Analysis (i.e., IF-TMCA) approach is presented for effective event detection. Specifically, the HCFG algorithm is integrated with the Hierarchical Information Gain Analysis (HIGA) method to generate the Importance Factor (IF) for producing the initial detection results. Then, the TMCA algorithm is proposed to efficiently incorporate temporal semantics for re-ranking and improving the final performance. At last, a sampling-based ensemble learning mechanism is applied to further accommodate the imbalanced datasets. In addition to the multimedia semantic representation and class imbalance problems, lack of organization is another critical issue for multimedia big data analysis. In this framework, an affinity propagation-based summarization method is also proposed to transform the unorganized data into a better structure with clean and well-organized information. The whole framework has been thoroughly evaluated across multiple domains, such as soccer goal event detection and disaster information management.
Resumo:
An implementation of Sem-ODB—a database management system based on the Semantic Binary Model is presented. A metaschema of Sem-ODB database as well as the top-level architecture of the database engine is defined. A new benchmarking technique is proposed which allows databases built on different database models to compete fairly. This technique is applied to show that Sem-ODB has excellent efficiency comparing to a relational database on a certain class of database applications. A new semantic benchmark is designed which allows evaluation of the performance of the features characteristic of semantic database applications. An application used in the benchmark represents a class of problems requiring databases with sparse data, complex inheritances and many-to-many relations. Such databases can be naturally accommodated by semantic model. A fixed predefined implementation is not enforced allowing the database designer to choose the most efficient structures available in the DBMS tested. The results of the benchmark are analyzed. ^ A new high-level querying model for semantic databases is defined. It is proven adequate to serve as an efficient native semantic database interface, and has several advantages over the existing interfaces. It is optimizable and parallelizable, supports the definition of semantic userviews and the interoperability of semantic databases with other data sources such as World Wide Web, relational, and object-oriented databases. The query is structured as a semantic database schema graph with interlinking conditionals. The query result is a mini-database, accessible in the same way as the original database. The paradigm supports and utilizes the rich semantics and inherent ergonomics of semantic databases. ^ The analysis and high-level design of a system that exploits the superiority of the Semantic Database Model to other data models in expressive power and ease of use to allow uniform access to heterogeneous data sources such as semantic databases, relational databases, web sites, ASCII files, and others via a common query interface is presented. The Sem-ODB engine is used to control all the data sources combined under a unified semantic schema. A particular application of the system to provide an ODBC interface to the WWW as a data source is discussed. ^
Resumo:
Multi-problem youth undergoing treatment for substance use problems are at high behavioral risk for exposure to sexually transmitted infections (STIs), including human immunodeficiency virus (HIV). Specific risk factors include childhood adversities such as maltreatment experiences and subsequent forms of psychopathology. The current study used a person-centered analytical approach to examine how childhood maltreatment experiences were related to patterns of psychiatric symptoms and HIV/STI risk behaviors in a sample of adolescents (N = 408) receiving treatment services. Data were collected in face-to-face interviews at two community-based facilities. Descriptive statistics and Latent Profile Analysis (LPA) were used to (a) classify adolescents into groups based on past year psychiatric symptoms, and (b) examine relations between class membership and forms of childhood maltreatment experiences, as well as past year sexual risk behavior (SRB). ^ LPA results indicated significant heterogeneity in psychiatric symptoms among the participants. The three classes generated via the optimal LPA solution included: (a) a low psychiatric symptoms class, (b) a high alcohol symptoms class and (c) a high internalizing symptoms class. Class membership was associated significantly with adolescents’ self-reported scores for childhood sexual abuse and emotional neglect. ANOVAs documented significant differences in mean scores for multiple indices of SRB indices by class membership, demonstrating differential risk for HIV/STI exposure across classes. The two classes characterized by elevated psychiatric symptom profiles and more severe maltreatment histories were at increased behavioral risk for HIV/STI exposure, compared to the low psychiatric symptoms class. The high internalizing symptoms class reported the highest scores for most of the indices of SRB assessed. The heterogeneity of psychiatric symptom patterns documented in the current study has important implications for HIV/STI prevention programs implemented with multi-problem youth. The results highlight complex relations between childhood maltreatment experiences, psychopathology and multiple forms of health risk behavior among adolescents. The results underscore the importance of further integration between substance abuse treatment and HIV/STI risk reduction efforts to improve morbidity and mortality among vulnerable youth. ^
Resumo:
Dual-class stock structure is characterized by the separation of voting rights and cash flow rights. The departure from a common "one share-one vote" configuration creates ideal conditions for conflicts of interest and agency problems between controlling insiders (the holders of voting rights) and remaining shareholders. The owners of voting rights have the opportunity to extract private benefits and act in their personal interest; as a result, dual-class firms are often perceived to have low transparency and high information asymmetry. This dissertation investigates the quality of information and the information environment of firms with two classes of stock. The first essay examines the quality of information by studying accruals in dual-class firms in comparison to firms with only one class of stock. The results suggest that the quality of accruals is better in dual-class firms than in single-class firms. In addition, the difference in the quality of accruals between firms that abolish their dual-class share structure by unification and singe-class firms disappears in the post-unification period. The second essay investigates the earnings informativeness of dual-class firms by examining the explanatory power of earnings for returns. The results indicate that the earnings informativeness is lower for dual-class firms as compared to single-class firms. Earnings informativeness improves in firms that unify their shares. The third essay compares the level of information asymmetry between dual-class firms and single-class firms. It is documented that the information environment for dual-class firms is worse than for single-class firms. Also, the finding suggests that the difference in information environment between dual-class firms and single-class firms disappears after dual-class stock unification.
Resumo:
Multi-problem youth undergoing treatment for substance use problems are at high behavioral risk for exposure to sexually transmitted infections (STIs), including human immunodeficiency virus (HIV). Specific risk factors include childhood adversities such as maltreatment experiences and subsequent forms of psychopathology. The current study used a person-centered analytical approach to examine how childhood maltreatment experiences were related to patterns of psychiatric symptoms and HIV/STI risk behaviors in a sample of adolescents (N = 408) receiving treatment services. Data were collected in face-to-face interviews at two community-based facilities. Descriptive statistics and Latent Profile Analysis (LPA) were used to (a) classify adolescents into groups based on past year psychiatric symptoms, and (b) examine relations between class membership and forms of childhood maltreatment experiences, as well as past year sexual risk behavior (SRB). LPA results indicated significant heterogeneity in psychiatric symptoms among the participants. The three classes generated via the optimal LPA solution included: (a) a low psychiatric symptoms class, (b) a high alcohol symptoms class and (c) a high internalizing symptoms class. Class membership was associated significantly with adolescents’ self-reported scores for childhood sexual abuse and emotional neglect. ANOVAs documented significant differences in mean scores for multiple indices of SRB indices by class membership, demonstrating differential risk for HIV/STI exposure across classes. The two classes characterized by elevated psychiatric symptom profiles and more severe maltreatment histories were at increased behavioral risk for HIV/STI exposure, compared to the low psychiatric symptoms class. The high internalizing symptoms class reported the highest scores for most of the indices of SRB assessed. The heterogeneity of psychiatric symptom patterns documented in the current study has important implications for HIV/STI prevention programs implemented with multi-problem youth. The results highlight complex relations between childhood maltreatment experiences, psychopathology and multiple forms of health risk behavior among adolescents. The results underscore the importance of further integration between substance abuse treatment and HIV/STI risk reduction efforts to improve morbidity and mortality among vulnerable youth.
Resumo:
Dual-class stock structure is characterized by the separation of voting rights and cash flow rights. The departure from a common “one share-one vote” configuration creates ideal conditions for conflicts of interest and agency problems between controlling insiders (the holders of voting rights) and remaining shareholders. The owners of voting rights have the opportunity to extract private benefits and act in their personal interest; as a result, dual-class firms are often perceived to have low transparency and high information asymmetry. This dissertation investigates the quality of information and the information environment of firms with two classes of stock. The first essay examines the quality of information by studying accruals in dual-class firms in comparison to firms with only one class of stock. The results suggest that the quality of accruals is better in dual-class firms than in single-class firms. In addition, the difference in the quality of accruals between firms that abolish their dual-class share structure by unification and singe-class firms disappears in the post-unification period. The second essay investigates the earnings informativeness of dual-class firms by examining the explanatory power of earnings for returns. The results indicate that the earnings informativeness is lower for dual-class firms as compared to single-class firms. Earnings informativeness improves in firms that unify their shares. The third essay compares the level of information asymmetry between dual-class firms and single-class firms. It is documented that the information environment for dual-class firms is worse than for single-class firms. Also, the finding suggests that the difference in information environment between dual-class firms and single-class firms disappears after dual-class stock unification.