3 resultados para use pattern analysis
em CORA - Cork Open Research Archive - University College Cork - Ireland
Resumo:
A substantial amount of information on the Internet is present in the form of text. The value of this semi-structured and unstructured data has been widely acknowledged, with consequent scientific and commercial exploitation. The ever-increasing data production, however, pushes data analytic platforms to their limit. This thesis proposes techniques for more efficient textual big data analysis suitable for the Hadoop analytic platform. This research explores the direct processing of compressed textual data. The focus is on developing novel compression methods with a number of desirable properties to support text-based big data analysis in distributed environments. The novel contributions of this work include the following. Firstly, a Content-aware Partial Compression (CaPC) scheme is developed. CaPC makes a distinction between informational and functional content in which only the informational content is compressed. Thus, the compressed data is made transparent to existing software libraries which often rely on functional content to work. Secondly, a context-free bit-oriented compression scheme (Approximated Huffman Compression) based on the Huffman algorithm is developed. This uses a hybrid data structure that allows pattern searching in compressed data in linear time. Thirdly, several modern compression schemes have been extended so that the compressed data can be safely split with respect to logical data records in distributed file systems. Furthermore, an innovative two layer compression architecture is used, in which each compression layer is appropriate for the corresponding stage of data processing. Peripheral libraries are developed that seamlessly link the proposed compression schemes to existing analytic platforms and computational frameworks, and also make the use of the compressed data transparent to developers. The compression schemes have been evaluated for a number of standard MapReduce analysis tasks using a collection of real-world datasets. In comparison with existing solutions, they have shown substantial improvement in performance and significant reduction in system resource requirements.
Resumo:
Introduction Seizures are harmful to the neonatal brain; this compels many clinicians and researchers to persevere further in optimizing every aspects of managing neonatal seizures. Aims To delineate the seizure profile between non-cooled versus cooled neonates with hypoxic-ischaemic encephalopathy (HIE), in neonates with stroke, the response of seizure burden to phenobarbitone and to quantify the degree of electroclinical dissociation (ECD) of seizures. Methods The multichannel video-EEG was used in this research study as the gold standard to detect seizures, allowing accurate quantification of seizure burden to be ascertained in term neonates. The entire EEG recording for each neonate was independently reviewed by at least 1 experienced neurophysiologist. Data were expressed in medians and interquartile ranges. Linear mixed models results were presented as mean (95% confidence interval); p values <0.05 were deemed as significant. Results Seizure burden in cooled neonates was lower than in non-cooled neonates [60(39-224) vs 203(141-406) minutes; p=0.027]. Seizure burden was reduced in cooled neonates with moderate HIE [49(26-89) vs 162(97-262) minutes; p=0.020] when compared with severe HIE. In neonates with stroke, the background pattern showed suppression over the infarcted side and seizures demonstrated a characteristic pattern. Compared with 10 mg/kg, phenobarbitone doses at 20 mg/kg reduced seizure burden (p=0.004). Seizure burden was reduced within 1 hour of phenobarbitone administration [mean (95% confidence interval): -14(-20 to -8) minutes/hour; p<0.001], but seizures returned to pre-treatment levels within 4 hours (p=0.064). The ECD index in cooled, non-cooled neonates with HIE, stroke and in neonates with other diagnoses were 88%, 94%, 64% and 75% respectively. Conclusions Further research exploring the treatment effects on seizure burden in the neonatal brain is required. A change to our current treatment strategy is warranted as we continue to strive for more effective seizure control, anchored with use of the multichannel EEG as the surveillance tool.
Resumo:
Since children already use and explore applications on smartphones, we use this as the starting point for design. Our monitoring and analysis framework, BaranC, enables us to discover and analyse which applications children uses and precisely how they interact with them. The monitoring happens unobtrusively in the background so children interact normally in their own natural environment without artificial constraints. Thus, we can discover to what extent a child of a particular age engages with, and how they physically interact with, existing applications. This information in turn provides the basis for design of new child-centred applications which can then be subject to the same comprehensive child use analysis using our framework. The work focuses on the first aspect, namely, the monitoring and analysis of current child use of smartphones. Experiments show the value of this approach and interesting results have been obtained from this precise monitoring of child smartphone usage.