121 resultados para Anotación de corpus
Resumo:
Livecoding is an artistic programming practice in which an artist's low-level interaction can be observed with sufficiently high fidelity to allow for transcription and analysis. This paper presents the first reported" coding" of livecoding videos. From an identified corpus of videos available on the web, we coded performances of two different livecoding artists, recording both the (textual) programming edit events and the musical effect of these edits.
Resumo:
Acoustic sensors allow scientists to scale environmental monitoring over large spatiotemporal scales. The faunal vocalisations captured by these sensors can answer ecological questions, however, identifying these vocalisations within recorded audio is difficult: automatic recognition is currently intractable and manual recognition is slow and error prone. In this paper, a semi-automated approach to call recognition is presented. An automated decision support tool is tested that assists users in the manual annotation process. The respective strengths of human and computer analysis are used to complement one another. The tool recommends the species of an unknown vocalisation and thereby minimises the need for the memorization of a large corpus of vocalisations. In the case of a folksonomic tagging system, recommending species tags also minimises the proliferation of redundant tag categories. We describe two algorithms: (1) a “naïve” decision support tool (16%–64% sensitivity) with efficiency of O(n) but which becomes unscalable as more data is added and (2) a scalable alternative with 48% sensitivity and an efficiency ofO(log n). The improved algorithm was also tested in a HTML-based annotation prototype. The result of this work is a decision support tool for annotating faunal acoustic events that may be utilised by other bioacoustics projects.
Resumo:
With the overwhelming increase in the amount of data on the web and data bases, many text mining techniques have been proposed for mining useful patterns in text documents. Extracting closed sequential patterns using the Pattern Taxonomy Model (PTM) is one of the pruning methods to remove noisy, inconsistent, and redundant patterns. However, PTM model treats each extracted pattern as whole without considering included terms, which could affect the quality of extracted patterns. This paper propose an innovative and effective method that extends the random set to accurately weigh patterns based on their distribution in the documents and their terms distribution in patterns. Then, the proposed approach will find the specific closed sequential patterns (SCSP) based on the new calculated weight. The experimental results on Reuters Corpus Volume 1 (RCV1) data collection and TREC topics show that the proposed method significantly outperforms other state-of-the-art methods in different popular measures.
Resumo:
Like many other cataclysmic events September 11, a day now popularly believed to have 'changed the world', has become a topic taken up by children's writers. This thesis, titled The Whole World Shook: Ethnic, National and Heroic Identities in Children's Fiction About 9/11, examines how cultural identities are constructed within fictional texts for young people written about the attacks on the Twin Towers. It identifies three significant identity categories encoded in 9/11 books for children: ethnic identities, national identities, and heroic identities. The thesis argues that the identities formed within the selected children's texts are in flux, privileging performances of identities that are contingent on post-9/11 politics. This study is located within the field of children's literature criticism, which supports the understanding that children's books, like all texts, play a role in the production of identities. Children's literature is highly significant both in its pedagogical intent (to instruct and induct children into cultural practices and beliefs) and in its obscurity (in making the complex simple enough for children, and from sometimes intentionally shying away from difficult things). This literary criticism informed the study that the texts, if they were to be written at all, would be complex, varied and most likely as ambiguous and contradictory as the responses to the attacks on New York themselves. The theoretical framework for this thesis draws on a range of critical theories including literary theory, cultural studies, studies of performativity and postmodernism. This critical framework informs the approach by providing ways for: (i) understanding how political and ideological work is performed in children's literature; (ii) interrogating the constructed nature of cultural identities; (iii) developing a nuanced methodology for carrying out a close textual analysis. The textual analysis examines a representative sample of children's texts about 9/11, including picture books, young adult fiction, and a selection of DC Comics. Each chapter focuses on a different though related identity category. Chapter Four examines the performance of ethnic identities and race politics within a sample of picture books and young adult fiction; Chapter Five analyses the construction of collective, national identities in another set of texts; and Chapter Six does analytic work on a third set of texts, demonstrating the strategic performance of particular kinds of heroic identities. I argue that performances of cultural identities constructed in these texts draw on familiar versions of identities as well as contribute to new ones. These textual constructions can be seen as offering some certainties in increasingly uncertain times. The study finds, in its sample of books a co-mingling of xenophobia and tolerance; a binaried competition between good and evil and global harmony and national insularity; and a lauding of both the commonplace hero and the super-human. Being a recent corpus of texts about 9/11, these texts provide information on the kinds of 'selves' that appear to be privileged in the West since 2001. The thesis concludes that the shifting identities evident in texts that are being produced for children about 9/11 offer implicit and explicit accounts of what constitute good citizenship, loyalty to nation and community, and desirable attributes in a Western post-9/11 context. This thesis makes an original contribution to the field of children's literature by providing a focussed and sustained analysis of how texts for children about 9/11 contribute to formations of identity in these complex times of cultural unease and global unrest.
Resumo:
The celebrated work of Lortie (1975) alerted teacher educators to the extended period of 'apprenticeship' that student teachers have been through before they arrive at teacher education programmes. The subjective implicit theories (Marland, 1992) developed by prospective teachers are shaped by their lifeworld experiences at school and in the case of physical education teachers, their experiences in sport. The biography of physical education teacher education (PETE) students tends to be characterised by ecto-mesomorphic individuals who have been socialised by the rigours of highly competitive sport (Gore, 1990; Macdonald, 1992; Rossi, 1996). We can add to this, the requirements of teacher preparation in physical education which for the most part are dominated by the traditions and rhetoric of the 'natural' bio-physical sciences; largely a legacy of Henry's (1964) work on physical education as an academic discipline, as well as that of Abernathy and Waltz the same year (Abernathy & Waltz, 1964). In the United Kingdom, Curl (1973) further advanced the argument in an attempt to justify human movement as an independent field of study with its own corpus of knowledge. It is little wonder then, that the dominant pedagogical discourse in physical education is, as Tinning (1991) discusses, one of performance pedagogy (see also Hendry, 1986 for an earlier discussion). The knowledge required to support such a discourse could be described as 'official' (Apple, 1993) and it assumes such status by virtue of the power appropriated by and bestowed upon the scientific community in PETE (Macdonald & Tinning, 1995; Sparkes, 1989, 1993). However, there are social reifiers too, and these tend to relate to the social construction of the body (Kirk, 1993; Kirk & Spiller, 1994; Gilroy, 1994) and what Tinning (1985) has termed the Cult of Slenderness. Furthermore the 'slender image' has become a signifier of 'good health'. This is inextricably linked to what might be considered as a health triplex—'exercise = fitness = health' (see Kirk & Colquhoun, 1989; Tinning & Kirk, 1991) which in Australia, underpins curriculum packages such as Daily Physical Education which teachers (often including physical education primary...
Resumo:
This article begins with the premise that morality is an intrinsic, although often invisible, aspect of everyday social action. Drawn from a corpus of fifty audiorecorded telephone calls to Kids Helpline, an Australian helpline for children and young people, we examine one call to show how the young caller and counsellor co-construct ‘morality-in-action’. Ethnomethodological understandings and, in particular, Sacks’ (1992) description of ‘Class 2’ rules and infractions show how an adolescent caller and counsellor collaboratively assemble moral versions of the caller. In puzzling out possible motives, the caller and counsellor can be seen to be attending to the implications of different moral versions of the caller. This attribution of motives is moral work in action, with motives contingently assembled, displayed and evaluated, with such work understood as displays of moral reasoning. The counselling call makes visible the counsellor’s interactional work to support and empower the client. Analysis such as this offers counsellors ways of understanding and making visible their interactional and moral work within helpline call interactions.
Resumo:
The early years are significant in optimising children’s educational, emotional and social outcomes and have become a major international policy priority. Within Australia, policy levers have prioritised early childhood education, with a focus on program quality, as it is associated with lifelong success. Longitudinal studies have found that high quality teacher-child interactions are an essential element of high quality programs, and teacher questioning is one aspect of teacher-child interactions that has been attributed to affecting the quality of education, linking open ended questioning to higher cognitive achievement. Teachers, however, overwhelmingly ask more closed than open questions. In the classroom, like everyday interaction, questions in interaction require answers. They are used to request, offer, repair, challenge, seek agreement (Curl & Drew, 2008; Enfield, Stivers, & Levinson, 2010; Hayano, 2013; Schegloff, 2007). Teachers use questions to set agendas and manage lessons (McHoul, 1978; Mehan, 1979; Sacks, 1995), and to gauge students’ knowledge and understanding (Lerner, 1995; McHoul, 1978; Mehan, 1979). Drawing on data from the Australian Research Council project Interacting with Knowledge: Interacting with people: Web searching in early childhood, this paper focuses on an extended sequence of talk between a teacher with two students aged between 3.5 and 5 years in a preschool classroom. The episode, drawn from a corpus of over 200 hours of video recorded data, captures how the teacher and children undertake an online search for images of lady beetles and hairy caterpillars on the Web. Ethnomethodological and conversation analysis approaches examine how the teacher asks questions, which call on the children to display their factual knowledge about the search topic. The fine grained analysis shows how teachers design their interactions to prompt children’s displays of factual knowledge, and how the design of factual questions affect a student’s response in terms of what and how they respond. In focussing on how the teacher designs factual questions and how children respond to these questions it shows that question design can close down a student’s reply; or elicit a range of answers, from one word to extended more detailed responses. Understanding how the design of teachers’ questions can influence students’ responses has pedagogic implications and may support educators to make intentional decisions regarding their own questioning techniques.
Resumo:
This is the third (but first edited) volume in Sen and Hill’s corpus on Indonesian media. An anthology built from contributions to a 2006 workshop, it is necessarily more fragmented than the editors’ earlier monographs. While this fragmented character helps to evoke a fractured context, it also makes for unwieldiness...
Resumo:
This article presents and evaluates a model to automatically derive word association networks from text corpora. Two aspects were evaluated: To what degree can corpus-based word association networks (CANs) approximate human word association networks with respect to (1) their ability to quantitatively predict word associations and (2) their structural network characteristics. Word association networks are the basis of the human mental lexicon. However, extracting such networks from human subjects is laborious, time consuming and thus necessarily limited in relation to the breadth of human vocabulary. Automatic derivation of word associations from text corpora would address these limitations. In both evaluations corpus-based processing provided vector representations for words. These representations were then employed to derive CANs using two measures: (1) the well known cosine metric, which is a symmetric measure, and (2) a new asymmetric measure computed from orthogonal vector projections. For both evaluations, the full set of 4068 free association networks (FANs) from the University of South Florida word association norms were used as baseline human data. Two corpus based models were benchmarked for comparison: a latent topic model and latent semantic analysis (LSA). We observed that CANs constructed using the asymmetric measure were slightly less effective than the topic model in quantitatively predicting free associates, and slightly better than LSA. The structural networks analysis revealed that CANs do approximate the FANs to an encouraging degree.
Resumo:
Traditional text classification technology based on machine learning and data mining techniques has made a big progress. However, it is still a big problem on how to draw an exact decision boundary between relevant and irrelevant objects in binary classification due to much uncertainty produced in the process of the traditional algorithms. The proposed model CTTC (Centroid Training for Text Classification) aims to build an uncertainty boundary to absorb as many indeterminate objects as possible so as to elevate the certainty of the relevant and irrelevant groups through the centroid clustering and training process. The clustering starts from the two training subsets labelled as relevant or irrelevant respectively to create two principal centroid vectors by which all the training samples are further separated into three groups: POS, NEG and BND, with all the indeterminate objects absorbed into the uncertain decision boundary BND. Two pairs of centroid vectors are proposed to be trained and optimized through the subsequent iterative multi-learning process, all of which are proposed to collaboratively help predict the polarities of the incoming objects thereafter. For the assessment of the proposed model, F1 and Accuracy have been chosen as the key evaluation measures. We stress the F1 measure because it can display the overall performance improvement of the final classifier better than Accuracy. A large number of experiments have been completed using the proposed model on the Reuters Corpus Volume 1 (RCV1) which is important standard dataset in the field. The experiment results show that the proposed model has significantly improved the binary text classification performance in both F1 and Accuracy compared with three other influential baseline models.
Resumo:
Purpose – The purpose of this paper is to describe an innovative compliance control architecture for hybrid multi‐legged robots. The approach was verified on the hybrid legged‐wheeled robot ASGUARD, which was inspired by quadruped animals. The adaptive compliance controller allows the system to cope with a variety of stairs, very rough terrain, and is also able to move with high velocity on flat ground without changing the control parameters. Design/methodology/approach – The paper shows how this adaptivity results in a versatile controller for hybrid legged‐wheeled robots. For the locomotion control we use an adaptive model of motion pattern generators. The control approach takes into account the proprioceptive information of the torques, which are applied on the legs. The controller itself is embedded on a FPGA‐based, custom designed motor control board. An additional proprioceptive inclination feedback is used to make the same controller more robust in terms of stair‐climbing capabilities. Findings – The robot is well suited for disaster mitigation as well as for urban search and rescue missions, where it is often necessary to place sensors or cameras into dangerous or inaccessible areas to get a better situation awareness for the rescue personnel, before they enter a possibly dangerous area. A rugged, waterproof and dust‐proof corpus and the ability to swim are additional features of the robot. Originality/value – Contrary to existing approaches, a pre‐defined walking pattern for stair‐climbing was not used, but an adaptive approach based only on internal sensor information. In contrast to many other walking pattern based robots, the direct proprioceptive feedback was used in order to modify the internal control loop, thus adapting the compliance of each leg on‐line.
Resumo:
We present a clustering-only approach to the problem of speaker diarization to eliminate the need for the commonly employed and computationally expensive Viterbi segmentation and realignment stage. We use multiple linear segmentations of a recording and carry out complete-linkage clustering within each segmentation scenario to obtain a set of clustering decisions for each case. We then collect all clustering decisions, across all cases, to compute a pairwise vote between the segments and conduct complete-linkage clustering to cluster them at a resolution equal to the minimum segment length used in the linear segmentations. We use our proposed cluster-voting approach to carry out speaker diarization and linking across the SAIVT-BNEWS corpus of Australian broadcast news data. We compare our technique to an equivalent baseline system with Viterbi realignment and show that our approach can outperform the baseline technique with respect to the diarization error rate (DER) and attribution error rate (AER).
Resumo:
The NTRK1 gene (also known as TRKA) encodes a high-affinity receptor for NGF, a neurotrophin involved in nervous system development and myelination. NTRK1 has been implicated in neurological function via links between the T allele at rs6336 (NTRK1-T) and schizophrenia risk. A variant in the neurotrophin gene, BDNF, was previously associated with white matter integrity in young adults, highlighting the importantce of neurotrophins to white matter development. We hypothesized that NTRK1-T would relate to lower fractional anisotropy in healthy adults. We scanned 391 healthy adult human twins and their siblings (mean age: 23.6 ± 2.2 years; 31 NTRK1-T carriers, 360 non-carriers) using 105-gradient diffusion tensor imaging at 4 tesla. We evaluated in brain white matter how NTRK1-T and NTRK1 rs4661063 allele A (rs4661063-A, which is in moderate linkage disequilibrium with rs6336) related to voxelwise fractional anisotropy-acommondiffusion tensor imaging measure of white matter microstructure. We used mixed-model regression to control for family relatedness, age, and sex. The sample was split in half to test reproducibility of results. The false discovery rate method corrected for voxelwise multiple comparisons. NTRK1-T and rs4661063-A correlated with lower white matter fractional anisotropy, independent of age and sex (multiple-comparisons corrected: false discovery rate critical p=0.038 forNTRK1-Tand0.013 for rs4661063-A). In each half-sample, theNTRK1-T effectwasreplicated in the cingulum, corpus callosum, superior and inferior longitudinal fasciculi, inferior fronto-occipital fasciculus, superior corona radiata, and uncinate fasciculus. Our results suggest that NTRK1-T is important for developing white matter microstructure.
Resumo:
There is a strong genetic risk for late-onset Alzheimer's disease (AD), but so far few gene variants have been identified that reliably contribute to that risk. A newly confirmed genetic risk allele C of the clusterin (CLU) gene variant rs11136000 is carried by ~88% of Caucasians. The C allele confers a 1.16 greater odds of developing late-onset AD than the T allele. AD patients have reductions in regional white matter integrity. We evaluated whether the CLU risk variant was similarly associated with lower white matter integrity in healthy young humans. Evidence of early brain differences would offer a target for intervention decades before symptom onset. We scanned 398 healthy young adults (mean age, 23.6 ± 2.2 years) with diffusion tensor imaging, a variation of magnetic resonance imaging sensitive to white matter integrity in the living brain. We assessed genetic associations using mixed-model regression at each point in the brain to map the profile of these associations with white matter integrity. Each C allele copy of the CLUvariant was associated with lower fractional anisotropy-a widely accepted measure of white matter integrity-in multiple brain regions, including several known to degenerate in AD. These regions included the splenium of the corpus callosum, the fornix, cingulum, and superior and inferior longitudinal fasciculi in both brain hemispheres. Young healthy carriers of the CLU gene risk variant showed a distinct profile of lower white matter integrity that may increase vulnerability to developing AD later in life.
Resumo:
The NTRK3 gene (also known as TRKC) encodes a high affinity receptor for the neurotrophin 3'-nucleotidase (NT3), which is implicated in oligodendrocyte and myelin development. We previously found that white matter integrity in young adults is related to common variants in genes encoding neurotrophins and their receptors. This underscores the importance of neurotrophins for white matter development. NTRK3 variants are putative risk factors for schizophrenia, bipolar disorder, and obsessive-compulsive disorder hoarding, suggesting that some NTRK3 variants may affect the brain.To test this, we scanned 392 healthy adult twins and their siblings (mean age, 23.6. ±. 2.2. years; range: 20-29. years) with 105-gradient 4-Tesla diffusion tensor imaging (DTI). We identified 18 single nucleotide polymorphisms (SNPs) in the NTRK3 gene that have been associated with neuropsychiatric disorders. We used a multi-SNP model, adjusting for family relatedness, age, and sex, to relate these variants to voxelwise fractional anisotropy (FA) - a DTI measure of white matter integrity.FA was optimally predicted (based on the highest false discovery rate critical p), by five SNPs (rs1017412, rs2114252, rs16941261, rs3784406, and rs7176429; overall FDR critical p=. 0.028). Gene effects were widespread and included the corpus callosum genu and inferior longitudinal fasciculus - regions implicated in several neuropsychiatric disorders and previously associated with other neurotrophin-related genetic variants in an overlapping sample of subjects. NTRK3 genetic variants, and neurotrophins more generally, may influence white matter integrity in brain regions implicated in neuropsychiatric disorders.