3 results for Baseball
in Queensland University of Technology - ePrints Archive
Abstract:
Big Data presents many challenges related to volume, whether one is interested in studying past datasets or, even more problematically, attempting to work with live streams of data. The most obvious challenge, in a ‘noisy’ environment such as contemporary social media, is to collect the pertinent information, be that information for a specific study, tweets which can inform emergency services or other responders to an ongoing crisis, or information that gives an advantage to those involved in prediction markets. Often, such a process is iterative, with keywords and hashtags changing with the passage of time, and both collection and analytic methodologies need to be continually adapted to respond to this changing information. While many of the data sets collected and analyzed are preformed, that is, they are built around a particular keyword, hashtag, or set of authors, they still contain a large volume of information, much of which is unnecessary for the current purpose and/or potentially useful for future projects. Accordingly, this panel considers methods for separating and combining data to optimize big data research and report findings to stakeholders. The first paper considers possible coding mechanisms for incoming tweets during a crisis, taking a large stream of incoming tweets and selecting which of those need to be immediately placed in front of responders, for manual filtering and possible action. The paper suggests two solutions for this: content analysis and user profiling. In the former case, aspects of the tweet are assigned a score to assess its likely relationship to the topic at hand and the urgency of the information, whilst the latter attempts to identify those users who are either serving as amplifiers of information or are known as an authoritative source.
Through these techniques, the information contained in a large dataset could be filtered down to match the expected capacity of emergency responders, and knowledge as to the core keywords or hashtags relating to the current event is constantly refined for future data collection. The second paper is also concerned with identifying significant tweets, but in this case tweets relevant to a particular prediction market: tennis betting. As increasing numbers of professional sportsmen and women create Twitter accounts to communicate with their fans, information is being shared regarding injuries, form and emotions which have the potential to impact on future results. As has already been demonstrated with leading US sports, such information is extremely valuable. Tennis, as with American Football (NFL) and Baseball (MLB), has paid subscription services which manually filter incoming news sources, including tweets, for information valuable to gamblers, gambling operators, and fantasy sports players. However, whilst such services are still niche operations, much of the value of information is lost by the time it reaches one of these services. The paper thus considers how information could be filtered from Twitter user lists and hashtag or keyword monitoring, assessing the value of the source, information, and the prediction markets to which it may relate. The third paper examines methods for collecting Twitter data and following changes in an ongoing, dynamic social movement, such as the Occupy Wall Street movement. It involves the development of technical infrastructure to collect and make the tweets available for exploration and analysis. A strategy to respond to changes in the social movement is also required, or the resulting tweets will only reflect the discussions and strategies the movement used at the time the keyword list was created. In a way, keyword creation is part strategy and part art.
In this paper we describe strategies for the creation of a social media archive, specifically tweets related to the Occupy Wall Street movement, and methods for continuing to adapt data collection strategies as the movement’s presence in Twitter changes over time. We also discuss the opportunities and methods to extract smaller slices of data from an archive of social media data to support a multitude of research projects in multiple fields of study. The common theme amongst these papers is that of constructing a data set, filtering it for a specific purpose, and then using the resulting information to aid in future data collection. The intention is that through the papers presented, and subsequent discussion, the panel will inform the wider research community not only on the objectives and limitations of data collection, live analytics, and filtering, but also on current and in-development methodologies that could be adopted by those working with such datasets, and how such approaches could be customized depending on the project stakeholders.
Abstract:
The purposes of this study were to describe and compare the specific physical activity choices and sedentary pursuits of African American and Caucasian American girls. Participants were 1,124 African American and 1,068 Caucasian American eighth-grade students from 31 middle schools. The 3-Day Physical Activity Recall (3DPAR) was used to measure participation in physical activities and sedentary pursuits. The most frequently reported physical activities were walking, basketball, jogging or running, bicycling, and social dancing. Differences between groups were found in 11 physical activities and 3 sedentary pursuits. Participation rates were higher in African American girls (p < .001) for social dancing, basketball, watching television, and church attendance but lower in calisthenics, ballet and other dance, jogging or running, rollerblading, soccer, softball or baseball, using an exercise machine, swimming, and homework. Cultural differences between groups should be considered when planning interventions to promote physical activity.
Abstract:
Digital media have contributed to significant disruptions in the business of audience measurement. Television broadcasters have long relied on simple and authoritative measures of who is watching what. The demand for ratings data, as a common currency in transactions involving advertising and program content, will likely remain, but accompanying measurements of audience engagement with media content would also be of value. Today's media environment increasingly includes social media and second-screen use, providing a data trail that affords an opportunity to measure engagement. If the limitations of using social media to indicate audience engagement can be overcome, social media use may allow for quantitative and qualitative measures of engagement. Raw social media data must be contextualized, and it is suggested that tools used by sports analysts be incorporated to do so. Inspired by baseball's Sabermetrics, the authors propose Telemetrics in an attempt to separate actual performance from contextual factors. Telemetrics facilitates measuring audience activity in a manner that controls for factors such as time slot and network. It potentially allows both descriptive and predictive measures of engagement.