971 resultados para clustered binary data
Resumo:
Queensland University of Technology (QUT) Library offers a range of resources and services to researchers as part of their research support portfolio. This poster will present key features of two of the data management services offered by research support staff at QUT Library. The first service is QUT Research Data Finder (RDF), a product of the Australian National Data Service (ANDS) funded Metadata Stores project. RDF is a data registry (metadata repository) that aims to publicise datasets that are research outputs arising from completed QUT research projects. The second is a software and code registry, which is currently under development with the sole purpose of improving discovery of source code and software as QUT research outputs. RESEARCH DATA FINDER As an integrated metadata repository, Research Data Finder aligns with institutional sources of truth, such as QUT’s research administration system, ResearchMaster, as well as QUT’s Academic Profiles system to provide high quality data descriptions that increase awareness of, and access to, shareable research data. The repository and its workflows are designed to foster better data management practices, enhance opportunities for collaboration and research, promote cross-disciplinary research and maximise the impact of existing research data sets. SOFTWARE AND CODE REGISTRY The QUT Library software and code registry project stems from concerns amongst researchers with regards to development activities, storage, accessibility, discoverability and impact, sharing, copyright and IP ownership of software and code. As a result, the Library is developing a registry for code and software research outputs, which will use existing Research Data Finder architecture. The underpinning software for both registries is VIVO, open source software developed by Cornell University. The registry will use the Research Data Finder service instance of VIVO and will include a searchable interface, links to code/software locations and metadata feeds to Research Data Australia. Key benefits of the project include:improving the discoverability and reuse of QUT researchers’ code and software amongst QUT and the QUT research community; increasing the profile of QUT research outputs on a national level by providing a metadata feed to Research Data Australia, and; improving the metrics for access and reuse of code and software in the repository.
Resumo:
Children are encountering more and more graphic representations of data in their learning and everyday life. Much of this data occurs in quantitative forms as different forms of measurement are incorporated into the graphics during their construction. In their formal education, children are required to learn to use a range of these quantitative representations in subjects across the school curriculum. Previous research that focuses on the use of information processing and traditional approaches to cognitive psychology concludes that the development of an understanding of such representations of data is a complex process. An alternative approach is to investigate the experiences of children as they interact with graphic representations of quantitative data in their own life-worlds. This paper demonstrates how a phenomenographic approach may be used to reveal the qualitatively different ways in which children in Australian primary and secondary education understand the phenomenon of graphic representations of quantitative data. Seven variations of the children’s understanding were revealed. These have been described interpretively in the article and confirmed through the words of the children. A detailed outcome space demonstrates how these seven variations are structurally related.
Resumo:
Objectives: This study examines the accuracy of Gestational Diabetes Mellitus (GDM) case-ascertainment in routinely collected data. Methods: Retrospective cohort study analysed routinely collected data from all births at Cairns Base Hospital, Australia, from 1 January 2004 to 31 December 2010 in the Cairns Base Hospital Clinical Coding system (CBHCC) and the Queensland Perinatal Data Collection (QPDC). GDM case ascertainment in the National Diabetes Services Scheme (NDSS) and Cairns Diabetes Centre (CDC) data were compared. Results: From 2004 to 2010, the specificity of GDM case-ascertainment in the QPDC was 99%. In 2010, only 2 of 225 additional cases were identified from the CDC and CBHCC, suggesting QPDC sensitivity is also over 99%. In comparison, the sensitivity of the CBHCC data was 80% during 2004–2010. The sensitivity of CDC data was 74% in 2010. During 2010, 223 births were coded as GDM in the QPDC, and the NDSS registered 247 women with GDM from the same postcodes, suggesting reasonable uptake on the NDSS register. However, the proportion of Aboriginal and Torres Strait Islander women was lower than expected. Conclusion: The accuracy of GDM case ascertainment in the QPDC appears high, with lower accuracy in routinely collected hospital and local health service data. This limits capacity of local data for planning and evaluation, and developing structured systems to improve post-pregnancy care, and may underestimate resources required. Implications: Data linkage should be considered to improve accuracy of routinely collected local health service data. The accuracy of the NDSS for Aboriginal and Torres Strait Islander women requires further evaluation.
Resumo:
Operational modal analysis (OMA) is prevalent in modal identifi cation of civil structures. It asks for response measurements of the underlying structure under ambient loads. A valid OMA method requires the excitation be white noise in time and space. Although there are numerous applications of OMA in the literature, few have investigated the statistical distribution of a measurement and the infl uence of such randomness to modal identifi cation. This research has attempted modifi ed kurtosis to evaluate the statistical distribution of raw measurement data. In addition, a windowing strategy employing this index has been proposed to select quality datasets. In order to demonstrate how the data selection strategy works, the ambient vibration measurements of a laboratory bridge model and a real cable-stayed bridge have been respectively considered. The analysis incorporated with frequency domain decomposition (FDD) as the target OMA approach for modal identifi cation. The modal identifi cation results using the data segments with different randomness have been compared. The discrepancy in FDD spectra of the results indicates that, in order to fulfi l the assumption of an OMA method, special care shall be taken in processing a long vibration measurement data. The proposed data selection strategy is easy-to-apply and verifi ed effective in modal analysis.
Resumo:
Currently there are ~3000 known species of Sarcophagidae (Diptera), which are classified into 173 genera in three subfamilies. Almost 25% of sarcophagids belong to the genus Sarcophaga (sensu lato) however little is known about the validity of, and relationships between the ~150 (or more) subgenera of Sarcophaga s.l. In this preliminary study, we evaluated the usefulness of three sources of data for resolving relationships between 35 species from 14 Sarcophaga s.l. subgenera: the mitochondrial COI barcode region, ~800. bp of the nuclear gene CAD, and 110 morphological characters. Bayesian, maximum likelihood (ML) and maximum parsimony (MP) analyses were performed on the combined dataset. Much of the tree was only supported by the Bayesian and ML analyses, with the MP tree poorly resolved. The genus Sarcophaga s.l. was resolved as monophyletic in both the Bayesian and ML analyses and strong support was obtained at the species-level. Notably, the only subgenus consistently resolved as monophyletic was Liopygia. The monophyly of and relationships between the remaining Sarcophaga s.l. subgenera sampled remain questionable. We suggest that future phylogenetic studies on the genus Sarcophaga s.l. use combined datasets for analyses. We also advocate the use of additional data and a range of inference strategies to assist with resolving relationships within Sarcophaga s.l.
Resumo:
Big Data is a rising IT trend similar to cloud computing, social networking or ubiquitous computing. Big Data can offer beneficial scenarios in the e-health arena. However, one of the scenarios can be that Big Data needs to be kept secured for a long period of time in order to gain its benefits such as finding cures for infectious diseases and protecting patient privacy. From this connection, it is beneficial to analyse Big Data to make meaningful information while the data is stored securely. Therefore, the analysis of various database encryption techniques is essential. In this study, we simulated 3 types of technical environments, namely, Plain-text, Microsoft Built-in Encryption, and custom Advanced Encryption Standard, using Bucket Index in Data-as-a-Service. The results showed that custom AES-DaaS has a faster range query response time than MS built-in encryption. Furthermore, while carrying out the scalability test, we acknowledged that there are performance thresholds depending on physical IT resources. Therefore, for the purpose of efficient Big Data management in eHealth it is noteworthy to examine their scalability limits as well even if it is under a cloud computing environment. In addition, when designing an e-health database, both patient privacy and system performance needs to be dealt as top priorities.
Resumo:
This study is an inquiry into early childhood teacher professional identities. In Australia, workforce reforms in early childhood include major shifts in qualification requirements that call for a university four-year degree-qualified teacher to be employed in child care. This marks a shift in the early years workforce, where previously there was no such requirement. At the same time as these reforms to quality measures are being implemented, and requiring a substantive up skilling of the workforce, there is a growing body of evidence through recent studies that suggests these same four-year degree-qualified early childhood teachers have an aversion to working in child care. Their preferred employment option is to work in the early years of more formal schooling, not in before-school contexts. This collision of agendas warrants investigation. This inquiry is designed to investigate the site at which advocacy for higher qualification requirements meets early childhood teachers who are reluctant to choose child care as a possible career pathway. The key research question for this study is: How are early childhood teachers’ professional identities currently produced? The work of this thesis is to problematise the early childhood teacher in child care through a particular method of discourse analysis. There are two sets of data. The first was a key early childhood political document that read as a "moment of arising" (Foucault, 1984a, p. 83). It is a political document which was selected for its current influence on the early childhood field, and in particular, workforce reforms that call for four-year degree-qualified teachers to work in before-school contexts, including child care. The second data set was generated through four focus group discussions conducted with preservice early childhood teachers. The document and transcripts of the focus groups were both analysed as text, as conceptualised by Foucault (1981). Foucault’s work spans a number of years and a range of philosophical matters. This thesis draws particularly on Foucault’s writings on discourse, power/knowledge, regimes of truth and resistance. In order to consider the production of early childhood teachers’ professional identities, the study is also informed by identity theorists, who have worked on gender, performativity and investment (Davies, 2004/2006; McNay, 1992; Osgood, 2012; Walkerdine, 1990; Weedon, 1997). The ways in which discourses intersect, compete and collide produce the subject (Foucault, 1981) and, in the case of this inquiry, there are a number of competing discourses at play, which produce the early childhood teacher. These particular theories turn particular lenses on the question of professional identities in early childhood, and such a study calls for the application of particular methodologies. Discourse analysis was used as the methodological framework, and the analysis was informed by Foucauldian concepts of discourse. While Foucault did not prescribe a form of discourse analysis as a method, his writings nonetheless provide a valuable framework for illuminating discursive practices and, in turn, how people are affected, through the shifts and distribution of power (Foucault, 1980a). The treatment used with both data sets involved redescription. For the policy document, a technique for reading document-as-text applied a genealogical approach (Foucault, 1984a). For the focus groups, the process of redescription (Rorty, 1989) involved reading talk-as-text. As a method, redescription involves describing "lots and lots of things in new ways until you have created a pattern of linguistic behaviour which will tempt the new generation to adopt it" (Rorty, 1989, p. 9). The development and application of categories (Davies, 2004/2006) built on a poststructuralist theoretical framework and the literature review informed the data analysis method of discourse analysis. Irony provided a rhetorical and playful tool (Haraway, 1991; Rorty, 1989), to look to how seemingly opposing discourses are held together. This opens a space to collapse binary thinking and consider seemingly contradictory terms in a way in which both terms are possible and both are true. Irony resists the choice of one or the other being right, and holds the opposites together in tension. The thesis concludes with proposals for new, ironic categories, which work to bring together seemingly opposing terms, located at sites in the field of early childhood where discourses compete, collide and intersect to produce and maintain early childhood teacher professional identities. The process of mapping these discourses goes some way to investigating the complexities about identities and career choices of early childhood teachers. The category of "the cost of loving" captures the collision between care/love, inherent in child care, and new discourses of investment/economics. Investment/economics has not completely replaced care/love, and these apparent opposites were not read as a binary because both are necessary and both are true (Haraway, 1991). They are held together in tension to produce early childhood teacher professional identities. The policy document under scrutiny was New Directions, released in 2007 by the then opposition ALP leader, Kevin Rudd. The claim was made strongly that the "economic prosperity" of Australia relies on investment in early childhood. The arguments to invest are compelling and the neuroscience/brain research/child development together with economic/investment discourses demand that early childhood is funding is increased. The intersection of these discourses produces professional identities of early childhood teachers as a necessary part of the country’s economy, and thus, worthy of high status. The child care sector and work in child care settings are necessary, with children and the early childhood teacher playing key roles in the economy of the nation. Through New Directions it becomes sayable (Foucault, 1972/1989) that the work the early childhood teacher performs is legitimated and valued. The children are produced as "economic units". A focus on what children are able to contribute to the future economy of the nation re-positions children and produces these "smart productive citizens", making future economic contribution. The early childhood teacher is produced through this image of a child and "the cost of loving" is emphasised. A number of these categories were produced through the readings of the document-as text and the talk-as-text. Two ironic categories were read in the analysis of the transcripts of the focus group discussions, when treated as talk-as-text data: the early childhood teacher as a "heroic victim"; and the early childhood teacher as a "glorified babysitter". This thesis raises new questions about professional identities in early childhood. These new questions might go some way to prompt re-thinking of some government policy, as well as some aspects of early childhood teacher education course design. The images of children and images of child care provide provocations to consider preservice teacher education course design. In particular, how child care, as one of the early childhood contexts, is located, conceptualised and spoken throughout the course. Consideration by course designers and teacher educators of what discourses are privileged in course content —what discourses are diminished or silenced—would go some way to reconceptualising child care within preservice teacher education and challenging dominant ways of speaking child care, and work in child care. This inquiry into early childhood teachers’ professional identities has gone some way to exploring the complexities around the early childhood teacher in child care. It is anticipated that the significance of this study will thus have immediate applicably and relevance for the Australian early childhood policy landscape. The early childhood field is in a state of rapid change, and this inquiry has examined some of the disconnects between policy and practice. Awareness of the discourses that are in play in the field will continue to allow space for conversations that challenge dominant assumptions about child care, work in child care and ways of being an early childhood teacher in child care.
Resumo:
This paper describes the work being conducted in the baseline rail level crossing project, supported by the Australian rail industry and the Cooperative Research Centre for Rail Innovation. The paper discusses the limitations of near-miss data for analysis obtained using current level crossing occurrence reporting practices. The project is addressing these limitations through the development of a data collection and analysis system with an underlying level crossing accident causation model. An overview of the methodology and improved data recording process are described. The paper concludes with a brief discussion of benefits this project is expected to provide the Australian rail industry.
Resumo:
This research aims to develop a reliable density estimation method for signalised arterials based on cumulative counts from upstream and downstream detectors. In order to overcome counting errors associated with urban arterials with mid-link sinks and sources, CUmulative plots and Probe Integration for Travel timE estimation (CUPRITE) is employed for density estimation. The method, by utilizing probe vehicles’ samples, reduces or cancels the counting inconsistencies when vehicles’ conservation is not satisfied within a section. The method is tested in a controlled environment, and the authors demonstrate the effectiveness of CUPRITE for density estimation in a signalised section, and discuss issues associated with the method.
Resumo:
Background: Multiple sclerosis (MS) is the most common cause of chronic neurologic disability beginning in early to middle adult life. Results from recent genome-wide association studies (GWAS) have substantially lengthened the list of disease loci and provide convincing evidence supporting a multifactorial and polygenic model of inheritance. Nevertheless, the knowledge of MS genetics remains incomplete, with many risk alleles still to be revealed. Methods: We used a discovery GWAS dataset (8,844 samples, 2,124 cases and 6,720 controls) and a multi-step logistic regression protocol to identify novel genetic associations. The emerging genetic profile included 350 independent markers and was used to calculate and estimate the cumulative genetic risk in an independent validation dataset (3,606 samples). Analysis of covariance (ANCOVA) was implemented to compare clinical characteristics of individuals with various degrees of genetic risk. Gene ontology and pathway enrichment analysis was done using the DAVID functional annotation tool, the GO Tree Machine, and the Pathway-Express profiling tool. Results: In the discovery dataset, the median cumulative genetic risk (P-Hat) was 0.903 and 0.007 in the case and control groups, respectively, together with 79.9% classification sensitivity and 95.8% specificity. The identified profile shows a significant enrichment of genes involved in the immune response, cell adhesion, cell communication/ signaling, nervous system development, and neuronal signaling, including ionotropic glutamate receptors, which have been implicated in the pathological mechanism driving neurodegeneration. In the validation dataset, the median cumulative genetic risk was 0.59 and 0.32 in the case and control groups, respectively, with classification sensitivity 62.3% and specificity 75.9%. No differences in disease progression or T2-lesion volumes were observed among four levels of predicted genetic risk groups (high, medium, low, misclassified). On the other hand, a significant difference (F = 2.75, P = 0.04) was detected for age of disease onset between the affected misclassified as controls (mean = 36 years) and the other three groups (high, 33.5 years; medium, 33.4 years; low, 33.1 years). Conclusions: The results are consistent with the polygenic model of inheritance. The cumulative genetic risk established using currently available genome-wide association data provides important insights into disease heterogeneity and completeness of current knowledge in MS genetics.
Resumo:
We present a method for optical encryption of information, based on the time-dependent dynamics of writing and erasure of refractive index changes in a bulk lithium niobate medium. Information is written into the photorefractive crystal with a spatially amplitude modulated laser beam which when overexposed significantly degrades the stored data making it unrecognizable. We show that the degradation can be reversed and that a one-to-one relationship exists between the degradation and recovery rates. It is shown that this simple relationship can be used to determine the erasure time required for decrypting the scrambled index patterns. In addition, this method could be used as a straightforward general technique for determining characteristic writing and erasure rates in photorefractive media.
Resumo:
During the current (1995-present) eruptive phase of the Soufrière Hills volcano on Montserrat, voluminous pyroclastic flows entered the sea off the eastern flank of the island, resulting in the deposition of well-defined submarine pyroclastic lobes. Previously reported bathymetric surveys documented the sequential construction of these deposits, but could not image their internal structure, the morphology or extent of their base, or interaction with the underlying sediments. We show, by combining these bathymetric data with new high-resolution three dimensional (3D) seismic data, that the sequence of previously detected pyroclastic deposits from different phases of the ongoing eruptive activity is still well preserved. A detailed interpretation of the 3D seismic data reveals the absence of significant (> 3. m) basal erosion in the distal extent of submarine pyroclastic deposits. We also identify a previously unrecognized seismic unit directly beneath the stack of recent lobes. We propose three hypotheses for the origin of this seismic unit, but prefer an interpretation that the deposit is the result of the subaerial flank collapse that formed the English's Crater scarp on the Soufrière Hills volcano. The 1995-recent volcanic activity on Montserrat accounts for a significant portion of the sediments on the southeast slope of Montserrat, in places forming deposits that are more than 60. m thick, which implies that the potential for pyroclastic flows to build volcanic island edifices is significant.
Resumo:
The Bluetooth technology is being increasingly used to track vehicles throughout their trips, within urban networks and across freeway stretches. One important opportunity offered by this type of data is the measurement of Origin-Destination patterns, emerging from the aggregation and clustering of individual trips. In order to obtain accurate estimations, however, a number of issues need to be addressed, through data filtering and correction techniques. These issues mainly stem from the use of the Bluetooth technology amongst drivers, and the physical properties of the Bluetooth sensors themselves. First, not all cars are equipped with discoverable Bluetooth devices and the Bluetooth-enabled vehicles may belong to some small socio-economic groups of users. Second, the Bluetooth datasets include data from various transport modes; such as pedestrian, bicycles, cars, taxi driver, buses and trains. Third, the Bluetooth sensors may fail to detect all of the nearby Bluetooth-enabled vehicles. As a consequence, the exact journey for some vehicles may become a latent pattern that will need to be extracted from the data. Finally, sensors that are in close proximity to each other may have overlapping detection areas, thus making the task of retrieving the correct travelled path even more challenging. The aim of this paper is twofold. We first give a comprehensive overview of the aforementioned issues. Further, we propose a methodology that can be followed, in order to cleanse, correct and aggregate Bluetooth data. We postulate that the methods introduced by this paper are the first crucial steps that need to be followed in order to compute accurate Origin-Destination matrices in urban road networks.
Resumo:
Background: Little is known about the health effects of worksite wellness programs on police department staff. Objective: To examine 1-2 year changes in health profiles of participants in the Queensland Police Service’s wellness program. Methods: Participants underwent yearly physical assessments. Health profile data collected during assessments from 2008 to 2012 were included in the analysis. Data Analysis: Repeated-measures ANOVA was used for continuous outcome variables, related-samples Wilcoxon Signed Rank test for non-normally continuous variables, and McNemar’s test for binary variables. Results: Significant changes in physical measures included decreases in waist circumference and percent body fat, and increases in cardiorespiratory fitness and flexibility (p<0.01). Changes in serum cholesterol, haemoglobin, total cholesterol ratios, HDL, LDL and Triglyceride levels were also significant (p<0.01). Conclusion: Participants’ health profiles mostly improved between cycles although most changes were not clinically significant. As this evaluation used a single-group pre-test post-test design, it provides initial indications that wellness programs can benefit staff in police departments.
Resumo:
This thesis is a study for automatic discovery of text features for describing user information needs. It presents an innovative data-mining approach that discovers useful knowledge from both relevance and non-relevance feedback information. The proposed approach can largely reduce noises in discovered patterns and significantly improve the performance of text mining systems. This study provides a promising method for the study of Data Mining and Web Intelligence.