5 resultados para Online knowledge communities
em DRUM (Digital Repository at the University of Maryland)
Resumo:
In the past decade, systems that extract information from millions of Internet documents have become commonplace. Knowledge graphs -- structured knowledge bases that describe entities, their attributes and the relationships between them -- are a powerful tool for understanding and organizing this vast amount of information. However, a significant obstacle to knowledge graph construction is the unreliability of the extracted information, due to noise and ambiguity in the underlying data or errors made by the extraction system and the complexity of reasoning about the dependencies between these noisy extractions. My dissertation addresses these challenges by exploiting the interdependencies between facts to improve the quality of the knowledge graph in a scalable framework. I introduce a new approach called knowledge graph identification (KGI), which resolves the entities, attributes and relationships in the knowledge graph by incorporating uncertain extractions from multiple sources, entity co-references, and ontological constraints. I define a probability distribution over possible knowledge graphs and infer the most probable knowledge graph using a combination of probabilistic and logical reasoning. Such probabilistic models are frequently dismissed due to scalability concerns, but my implementation of KGI maintains tractable performance on large problems through the use of hinge-loss Markov random fields, which have a convex inference objective. This allows the inference of large knowledge graphs using 4M facts and 20M ground constraints in 2 hours. To further scale the solution, I develop a distributed approach to the KGI problem which runs in parallel across multiple machines, reducing inference time by 90%. Finally, I extend my model to the streaming setting, where a knowledge graph is continuously updated by incorporating newly extracted facts. I devise a general approach for approximately updating inference in convex probabilistic models, and quantify the approximation error by defining and bounding inference regret for online models. Together, my work retains the attractive features of probabilistic models while providing the scalability necessary for large-scale knowledge graph construction. These models have been applied on a number of real-world knowledge graph projects, including the NELL project at Carnegie Mellon and the Google Knowledge Graph.
Resumo:
In economics of information theory, credence products are those whose quality is difficult or impossible for consumers to assess, even after they have consumed the product (Darby & Karni, 1973). This dissertation is focused on the content, consumer perception, and power of online reviews for credence services. Economics of information theory has long assumed, without empirical confirmation, that consumers will discount the credibility of claims about credence quality attributes. The same theories predict that because credence services are by definition obscure to the consumer, reviews of credence services are incapable of signaling quality. Our research aims to question these assumptions. In the first essay we examine how the content and structure of online reviews of credence services systematically differ from the content and structure of reviews of experience services and how consumers judge these differences. We have found that online reviews of credence services have either less important or less credible content than reviews of experience services and that consumers do discount the credibility of credence claims. However, while consumers rationally discount the credibility of simple credence claims in a review, more complex argument structure and the inclusion of evidence attenuate this effect. In the second essay we ask, “Can online reviews predict the worst doctors?” We examine the power of online reviews to detect low quality, as measured by state medical board sanctions. We find that online reviews are somewhat predictive of a doctor’s suitability to practice medicine; however, not all the data are useful. Numerical or star ratings provide the strongest quality signal; user-submitted text provides some signal but is subsumed almost completely by ratings. Of the ratings variables in our dataset, we find that punctuality, rather than knowledge, is the strongest predictor of medical board sanctions. These results challenge the definition of credence products, which is a long-standing construct in economics of information theory. Our results also have implications for online review users, review platforms, and for the use of predictive modeling in the context of information systems research.
Resumo:
This document is the Online Supplement to ‘Myopic Allocation Policy with Asymptotically Optimal Sampling Rate,’ to be published in the IEEE Transactions of Automatic Control in 2017.
Resumo:
This research concerns the conceptual and empirical relationship between environmental justice and social-ecological resilience as it relates to climate change vulnerability and adaptation. Two primary questions guided this work. First, what is the level of resilience and adaptive capacity for social-ecological systems that are characterized by environmental injustice in the face of climate change? And second, what is the role of an environmental justice approach in developing adaptation policies that will promote social-ecological resilience? These questions were investigated in three African American communities that are particularly vulnerable to flooding from sea-level rise on the Eastern Shore of the Chesapeake Bay. Using qualitative and quantitative methods, I found that in all three communities, religious faith and the church, rootedness in the landscape, and race relations were highly salient to community experience. The degree to which these common aspects of the communities have imparted adaptive capacity has changed over time. Importantly, a given social-ecological factor does not have the same effect on vulnerability in all communities; however, in all communities political isolation decreases adaptive capacity and increases vulnerability. This political isolation is at least partly due to procedural injustice, which occurs for a number of interrelated reasons. This research further revealed that while all stakeholders (policymakers, environmentalists, and African American community members) generally agree that justice needs to be increased on the Eastern Shore, stakeholder groups disagree about what a justice approach to adaptation would look like. When brought together at a workshop, however, these stakeholders were able to identify numerous challenges and opportunities for increasing justice. Resilience was assessed by the presence of four resilience factors: living with uncertainty, nurturing diversity, combining different types of knowledge, and creating opportunities for self-organization. Overall, these communities seem to have low resilience; however, there is potential for resilience to increase. Finally, I argue that the use of resilience theory for environmental justice communities is limited by the great breadth and depth of knowledge required to evaluate the state of the social-ecological system, the complexities of simultaneously promoting resilience at both the regional and local scale, and the lack of attention to issues of justice.
Resumo:
Microorganisms in the plant rhizosphere, the zone under the influence of roots, and phyllosphere, the aboveground plant habitat, exert a strong influence on plant growth, health, and protection. Tomatoes and cucumbers are important players in produce safety, and the microbial life on their surfaces may contribute to their fitness as hosts for foodborne pathogens such as Salmonella enterica and Listeria monocytogenes. External factors such as agricultural inputs and environmental conditions likely also play a major role. However, the relative contributions of the various factors at play concerning the plant surface microbiome remain obscure, although this knowledge could be applied to crop protection from plant and human pathogens. Recent advances in genomic technology have made investigations into the diversity and structure of microbial communities possible in many systems and at multiple scales. Using Illumina sequencing to profile particular regions of the 16S rRNA gene, this study investigates the influences of climate and crop management practices on the field-grown tomato and cucumber microbiome. The first research chapter (Chapter 3) involved application of 4 different soil amendments to a tomato field and profiling of harvest-time phyllosphere and rhizosphere microbial communities. Factors such as water activity, soil texture, and field location influenced microbial community structure more than soil amendment use, indicating that field conditions may exert more influence on the tomato microbiome than certain agricultural inputs. In Chapter 4, the impact of rain on tomato and cucumber-associated microbial community structures was evaluated. Shifts in bacterial community composition and structure were recorded immediately following rain events, an effect which was partially reversed after 4 days and was strongest on cucumber fruit surfaces. Chapter 5 focused on the contribution of insect visitors to the tomato microbiota, finding that insects introduced diverse bacterial taxa to the blossom and green tomato fruit microbiome. This study advances our understanding of the factors that influence the microbiomes of tomato and cucumber. Farms are complex environments, and untangling the interactions between farming practices, the environment, and microbial diversity will help us develop a comprehensive understanding of how microbial life, including foodborne pathogens, may be influenced by agricultural conditions.