918 resultados para second language, spelling errors
Resumo:
Il a été avancé que des apprenants expérimentés développeraient des niveaux élevés de conscience métalinguistique (MLA), ce qui leur faciliterait l’apprentissage de langues subséquentes (p.ex., Singleton & Aronin, 2007). De plus, des chercheurs dans le domaine de l’acquisition des langues tierces insistent sur les influences positives qu’exercent les langues précédemment apprises sur l’apprentissage formel d’une langue étrangère (p.ex., Cenoz & Gorter, 2015), et proposent de délaisser le regard traditionnel qui mettait l’accent sur l’interférence à l’origine des erreurs des apprenants pour opter pour une vision plus large et positive de l’interaction entre les langues. Il a été démontré que la similarité typologique ainsi que la compétence dans la langue source influence tous les types de transfert (p.ex., Ringbom, 1987, 2007). Cependant, le défi méthodologique de déterminer, à la fois l’usage pertinent d’une langue cible en tant que résultat d’une influence translinguistique (p.ex., Falk & Bardel, 2010) et d’établir le rôle crucial de la MLA dans l’activation consciente de mots ou de constructions reliés à travers différentes langues, demeure. La présente étude avait pour but de relever ce double défi en faisant appel à des protocoles oraux (TAPs) pour examiner le transfert positif de l’anglais (L2) vers l’allemand (L3) chez des Québécois francophones après cinq semaines d’enseignement formel de la L3. Les participants ont été soumis à une tâche de traduction développée aux fins de la présente étude. Les 42 items ont été sélectionnés sur la base de jugements de similarité et d’imagibilité ainsi que de fréquence des mots provenant d’une étude de cognats allemands-anglais (Friel & Kennison, 2001). Les participants devaient réfléchir à voix haute pendant qu’ils traduisaient des mots inconnus de l’allemand (L3) vers le français (L1). Le transfert positif a été opérationnalisé par des traductions correctes qui étaient basées sur un cognat anglais. La MLA a été mesurée par le biais du THAM (Test d’habiletés métalinguistiques) (Pinto & El Euch, 2015) ainsi que par l’analyse des TAPs. Les niveaux de compétence en anglais ont été établis sur la base du Michigan Test (Corrigan et al., 1979), tandis que les niveaux d’exposition ainsi que l’intérêt envers la langue et la culture allemandes ont été mesurés à l’aide d’un questionnaire. Une analyse fine des TAPs a révélé de la variabilité inter- et intra-individuelle dans l’activation consciente du vocabulaire en L2, tout en permettant l’identification de niveaux distincts de prise de conscience. Deux modèles indépendants de régressions logistiques ont permis d’identifier les deux dimensions de MLA comme prédicteurs de transfert positif. Le premier modèle, dans lequel le THAM était la mesure exclusive de MLA, a déterminé cette dimension réflexive comme principal prédicteur, suivie de la compétence en anglais, tandis qu’aucune des autres variables indépendantes pouvait prédire le transfert positif de l’anglais. Dans le second modèle, incluant le THAM ainsi que les TAPs comme mesures complémentaires de MLA, la dimension appliquée de MLA, telle que mesurée par les TAPs, était de loin le prédicteur principal, suivie de la dimension réflexive, telle que mesurée par le THAM, tandis que la compétence en anglais ne figurait plus parmi les facteurs ayant une influence significative sur la variable réponse. Bien que la verbalisation puisse avoir influencé la performance dans une certaine mesure, nos observations mettent en évidence la contribution précieuse de données introspectives comme complément aux résultats basés sur des caractéristiques purement linguistiques du transfert. Nos analyses soulignent la complexité des processus métalinguistiques et des stratégies individuelles, ce qui reflète une perspective dynamique du multilinguisme (p.ex., Jessner, 2008).
Resumo:
Global Network for the Molecular Surveillance of Tuberculosis 2010: A. Miranda (Tuberculosis Laboratory of the National Institute of Health, Porto, Portugal)
Resumo:
This study reports on research that examines the family language policy (FLP) and biliteracy practices of middle-class Chinese immigrant families in a metropolitan area in the southwest of the U.S. by exploring language practices pattern among family members, language and literacy environment at home, parents’ language management, parents’ language attitudes and ideologies, and biliteracy practices. In this study, I employed mixed methods, including survey and interviews, to investigate Chinese immigrant parents’ FLP, biliteracy practices, their life stories, and their experience of raising and nurturing children in an English-dominant society. Survey questionnaires were distributed to 55 Chinese immigrant parents and interviews were conducted with five families, including mothers and children. One finding from this study is that the language practices pattern at home shows the trend of language shift among the Chinese immigrants’ children. Children prefer speaking English with parents, siblings, and peers, and home literacy environment for children manifests an English-dominant trend. Chinese immigrant parents’ language attitudes and ideologies are largely influenced by English-only ideology. The priority for learning English surpasses the importance of Chinese learning, which is demonstrated by the English-dominant home literacy practices and an English-dominant language policy. Parents invest more in English literacy activities and materials for children, and very few parents implement Chinese-only policy for their children. A second finding from this study is that a multitude of factors from different sources shape and influence Chinese immigrants’ FLP and biliteracy practices. The factors consist of family-related factors, social factors, linguistic factors, and individual factors. A third finding from this study is that a wide variety of strategies are adopted by Chinese immigrant families, which have raised quite balanced bilingual children, to help children maintain Chinese heritage language (HL) and develop both English and Chinese literacy. The close examination and comparison of different families with English monolingual children, with children who have limited knowledge of HL, and with quite balanced bilingual children, this study discovers that immigrant parents, especially mothers, play a fundamental and irreplaceable role in their children’s HL maintenance and biliteracy development and it recommends to immigrant parents in how to implement the findings of this study to nurture their children to become bilingual and biliterate. Due to the limited number and restricted area and group of participant sampling, the results of this study may not be generalized to other groups in different contexts.
Resumo:
The purpose of the current thesis is to develop a better understanding of the interaction between Spanish and Quichua in the Salcedo region and provide more information for the processes that might have given rise to Media Lengua, a ‘mixed’ language comprised of a Quichua grammar and Spanish lexicon. Muysken attributes the formation of Media Lengua to relexification, ruling out any influence from other bilingual phenomena. I argue that the only characteristic that distinguishes Media Lengua from other language contact varieties in central Ecuador is the quantity of the overall Spanish borrowings and not the type of processes that might have been employed by Quichua speakers during the genesis of Media Lengua. The results from the Salcedo data that I have collected show how processes such as adlexification, code-mixing, and structural convergence produce Media Lengua-type sentences, evidence that supports an alternative analysis to Muysken’s relexification hypothesis. Overall, this dissertation is developed around four main objectives: (1) to describe the variation of Spanish loanwords within a bilingual community in Salcedo; (2) to analyze some of the prominent and recent structural changes in Quichua and Spanish; (3) to determine whether Spanish loanword use can be explained by the relationship consultants have with particular social categories; and (4) to analyze the consultants’ language ideologies toward syncretic uses of Spanish and Quichua. Overall, 58% of the content words, 39% of the basic vocabulary, and 50% of the subject pronouns in the Salcedo corpus were derived from Spanish. When compared to Muysken’s description of highlander Quichua in the 1970’s, Spanish loanwords have more than doubled in each category. The overall level of Spanish loanwords in Salcedo Quichua has grown to a level between highlander Quichua in the 1970’s and Media Lengua. Similar to Spanish’s lexical influence in Media Lengua, the increase of Spanish borrowings in today’s rural Quichua can be seen in non-basic and basic vocabularies as well as the subject pronoun system. Significantly, most of the growth has occurred through forms of adlexification i.e., doublets, well-established borrowings, and cultural borrowings, suggesting that ‘ordinary’ lexical borrowing is also capable of producing Media Lengua-type sentences. I approach the second objective by investigating two separate phenomena related to structural convergence. The first examines the complex verbal constructions that have developed in Quichua through Spanish loan translations while the second describes the type of Quichua particles that are attached to Spanish lexemes while speaking Spanish. The calquing of the complex verbal constructions from Spanish were employed when speaking standard Quichua. Since this standard form is typically used by language purists, I argue that their use of calques is a strategy of exploiting the full range of expression from Spanish without incorporating any of the Spanish lexemes which would give the appearance of ‘contamination’. The use of Quichua particles in local varieties of Spanish is a defining characteristic of Quichuacized Spanish, spoken most frequently by women and young children in the community. Although the use of Quichua particles was probably not the main catalyst engendering Media Lengua, I argue that its contribution as a source language to other ‘mixed’ varieties, such as Media Lengua, needs to be accounted for in descriptions of BML genesis. Contrary to Muysken’s representation of relatively ‘unmixed’ Spanish and Quichua as the two source languages of Media Lengua, I propose that local varieties of Spanish might have already been ‘mixed’ to a large degree before Media Lengua was created. The third objective attempts to draw a relationship between particular social variables and the use of Spanish loanwords. Whisker Boxplots and ANOVAs were used to determine which social group, if any, have been introducing new Spanish borrowings into the bilingual communities in Salcedo. Specifically, I controlled for age, education, native language, urban migration, and gender. The results indicate that none of the groups in each of the five social variables indicate higher or lower loanword use. The implication of these results are twofold: (a) when lexical borrowing occurs, it is immediately adopted as the community-wide norm and spoken by members from different backgrounds and generations, or (b) this level of Spanish borrowing (58%) is not a recent phenomenon. The fourth and final objective draws on my ethnographic research that addresses the attitudes of syncretic language use. I observed that Quichuacized Spanish and Hispanicized Quichua are highly stigmatized varieties spoken by the country’s most marginalized populations and families, yet within the community, syncretic ways of speaking are in fact the norm. It was shown that there exists a range of different linguistic definitions for ‘Chaupi Lengua’ and other syncretic language practices as well as many contrasting connotations, most of which were negative. One theme that emerged from the interviews was that speaking syncretic varieties of Quichua weakened the consultant’s claim to an indigenous identity. The linguistic and social data presented in this dissertation supports an alternative view to Muysken’s relexification hypothesis, one that has the advantage of operating with well-precedented linguistic processes and which is actually observable in the present-day Salcedo area. The results from the study on lexical borrowing are significant because they demonstrate how a dynamic bilingual speech community has gradually diversified their Quichua lexicon under intense pressure to shift toward Spanish. They also show that Hispanicized Quichua (Quichua with heavy lexical borrowing) clearly arose from adlexification and prolonged lexical borrowing, and is one of at least six identifiable speech styles found in Salcedo. These results challenge particular interpretations of language contact outcomes, such as, ones that depict sources languages as discrete and ‘unmixed.’ The bilingual continuum presented in this thesis shows on the one hand, the range of speech styles that are accessible to different speakers, and on the other hand, the overlapping, syncretic features that are shared among the different registers and language varieties. It was observed that syncretic speech styles in Salcedo are employed by different consultants in varied interactional contexts, and in turn, produce different evaluations by other fellow community members. In the current dissertation, I challenge the claim that relexification and Media Lengua-type sentences develop in isolation and without the influence of other bilingual phenomena. Based on Muysken's Media Lengua example sentences and the speech styles from the Salcedo corpus, I argue that Media Lengua may have arisen as an institutionalized variant of the highly mixed "middle ground" within the range of the Salcedo bilingual continuum discussed above. Such syncretic forms of Spanish and Quichua strongly resemble Media Lengua sentences in Muysken’s research, and therefore demonstrate how its development could have occurred through several different language contact processes and not only through relexification.
Resumo:
Our aim was to assess the impact of an invented spelling programme conducted in small groups on children’s written language acquisition in Portuguese. We expected the experimental group to have better post-test results than the control group in spelling and reading. Participants were 160 preschool-age children who were randomly divided into an experimental and a control group. Their age, cognitive ability, knowledge of letters and phonological abilities were controlled. Children’s spelling and reading were evaluated in a pre- and a post-test. Inbetween, experimental group participated in an invented spelling programme in small groups and the control group in story readings. The experimental group showed better results in spelling and reading in the post-test than the control one. Different dynamics occurred in the small groups which had different impacts on children’s acquisitions. These results provide empirical support for the proposal that invented spelling should be incorporated into early literacy instruction.
Resumo:
Dissertação de Mestrado apresentada ao Instituto Superior de Psicologia Aplicada para obtenção de grau de Mestre na especialidade de Psicologia Educacional.
Resumo:
In the context of computer numerical control (CNC) and computer aided manufacturing (CAM), the capabilities of programming languages such as symbolic and intuitive programming, program portability and geometrical portfolio have special importance -- They allow to save time and to avoid errors during part programming and permit code re-usage -- Our updated literature review indicates that the current state of art presents voids in parametric programming, program portability and programming flexibility -- In response to this situation, this article presents a compiler implementation for EGCL (Extended G-code Language), a new, enriched CNC programming language which allows the use of descriptive variable names, geometrical functions and flow-control statements (if-then-else, while) -- Our compiler produces low-level generic, elementary ISO-compliant Gcode, thus allowing for flexibility in the choice of the executing CNC machine and in portability -- Our results show that readable variable names and flow control statements allow a simplified and intuitive part programming and permit re-usage of the programs -- Future work includes allowing the programmer to define own functions in terms of EGCL, in contrast to the current status of having them as library built-in functions
Resumo:
The preparation and administration of medications is one of the most common and relevant functions of nurses, demanding great responsibility. Incorrect administration of medication, currently constitutes a serious problem in health services, and is considered one of the main adverse effects suffered by hospitalized patients. Objectives: Identify the major errors in the preparation and administration of medication by nurses in hospitals and know what factors lead to the error occurred in the preparation and administration of medication. Methods: A systematic review of the literature. Deined as inclusion criteria: original scientiic papers, complete, published in the period 2011 to May 2016, the SciELO and LILACS databases, performed in a hospital environment, addressing errors in preparation and administration of medication by nurses and in Portuguese language. After application of the inclusion criteria obtained a sample of 7 articles. Results: The main errors identiied in the pr eparation and administration of medication were wrong dose 71.4%, wrong time 71.4%, 57.2% dilution inadequate, incorrect selection of the patient 42.8% and 42.8% via inadequate. The factors that were most commonly reported by the nursing staff, as the cause of the error was the lack of human appeal 57.2%, inappropriate locations for the preparation of medication 57.2%, the presence of noise and low brightness in preparation location 57, 2%, professionals untrained 42.8%, fatigue and stress 42.8% and inattention 42.8%. Conclusions: The literature shows a high error rate in the preparation and administration of medication for various reasons, making it important that preventive measures of this occurrence are implemented.
MINING AND VERIFICATION OF TEMPORAL EVENTS WITH APPLICATIONS IN COMPUTER MICRO-ARCHITECTURE RESEARCH
Resumo:
Computer simulation programs are essential tools for scientists and engineers to understand a particular system of interest. As expected, the complexity of the software increases with the depth of the model used. In addition to the exigent demands of software engineering, verification of simulation programs is especially challenging because the models represented are complex and ridden with unknowns that will be discovered by developers in an iterative process. To manage such complexity, advanced verification techniques for continually matching the intended model to the implemented model are necessary. Therefore, the main goal of this research work is to design a useful verification and validation framework that is able to identify model representation errors and is applicable to generic simulators. The framework that was developed and implemented consists of two parts. The first part is First-Order Logic Constraint Specification Language (FOLCSL) that enables users to specify the invariants of a model under consideration. From the first-order logic specification, the FOLCSL translator automatically synthesizes a verification program that reads the event trace generated by a simulator and signals whether all invariants are respected. The second part consists of mining the temporal flow of events using a newly developed representation called State Flow Temporal Analysis Graph (SFTAG). While the first part seeks an assurance of implementation correctness by checking that the model invariants hold, the second part derives an extended model of the implementation and hence enables a deeper understanding of what was implemented. The main application studied in this work is the validation of the timing behavior of micro-architecture simulators. The study includes SFTAGs generated for a wide set of benchmark programs and their analysis using several artificial intelligence algorithms. This work improves the computer architecture research and verification processes as shown by the case studies and experiments that have been conducted.
Resumo:
In this thesis we aimed to explore the potential of gamification - defined as “the use of game elements in non-game contexts” [30] - in increasing children's (aged 5 to 6) engagement with the task. This is mainly due to the fact that our world is living a technological era, and videogames are an example of this engagement by being able to maintain children’s (and adults) engagement for hours straight. For the purpose of limiting complexity, we only addressed the feedback element by introducing it with an anthropomorphic virtual agent (human-like aspect), because research shows that virtual agents (VA’s) can influence behavioural change [17], or even induce emotions on humans both through the use of feedback provided and their facial expressions, which can interpreted in the same way as of humans’ [2]. By pairing the VA with the gamification concept, we wanted to 1) create a VA that is likely to be well-received by children (appearance and behaviour), and 2) have the immediate feedback that games have, so we can give children an assessment of their actions in real-time, as opposed to waiting for feedback from someone (traditional teaching), and with this give students more chances to succeed [32, 43]. Our final system consisted on a virtual environment, where children formed words that corresponded to a given image. In order to measure the impact that the VA had on engagement, the system was developed in two versions: one version of the system was limited to provide a simple feedback environment, where the VA provided feedback, by responding with simple phrases (i.e. “correct” or “incorrect”); for the second version, the VA had a more complex approach where it tried to encourage children to complete the word – a motivational feedback - even when they weren’t succeeding. Lastly we conducted a field study with two groups of children, where one group tested the version with the simple feedback, and the other group tested the ‘motivational’ version of the system. We used a quantitative approach to analyze the collected data that measured the engagement, based on the number of tasks (words) completed and time spent with system. The results of the evaluation showed that the use of motivational feedback may carry a positive effect on engaging children.
Resumo:
Literature is not generally considered as a coherent branch of the curriculum in relation to language development in either native or foreign language teaching. As teachers of English in multicultural Indian classrooms, we come across students with varying degrees of competence in English language learning. Although language learning is a natural process for natives, students of other languages put in colossal efforts to learn it. Despite their sincere efforts, they face challenges regarding pronunciation, spelling, and vocabulary. Indian classrooms are a microcosm of the larger society, so teaching English language in a manner that equips the students to face the cutthroat competition has become a necessity and a challenge for English language teachers. English today has become the key determinant for being successful in their careers. The hackneyed and stereotypical methods of teaching are not acceptable now. Teachers are no longer arbitrary dispensers of knowledge, but they are playing the role of a guide and facilitator for the students. Teachers of English are using innovative ideas to make English language teaching and learning interesting and simple. Teachers have started using literary texts and their analyses to explore and ignite the imagination and creative skills of the students. One needs to think and rethink the contribution of literature to intelligent thinking as well as its role in the process of teaching/learning. This article is, therefore, an attempt at exploring the nature of the literary experience in the present-day classrooms and the broader role of literature in life.
Resumo:
Dans son milieu familial, le jeune enfant développe ses habiletés langagières en plus de s’initier à la lecture et à l’écriture. Ce chapitre se divise en deux sections. Dans la première, nous décrivons un ensemble d’études qui convergent vers un modèle théorique de la littératie familiale et de son lien avec le développement du langage et de la lecture. Ce modèle, proposé par Sénéchal et ses collègues, suggère une association robuste entre lecture partagée et langage oral, d’une part, et entre enseignement parental et habiletés de littératie, d’autre part. Dans la deuxième section du chapitre, nous montrons, en résumant des études corrélationnelles et quasi-expérimentales, comment l’entrée de l’enfant dans le monde de la lecture peut être facilitée par ses premières tentatives, même non conventionnelles, d’écriture de mots. Dans chacune des deux sections, nous nous intéressons aux trajectoires d’apprentissage allant d’habiletés émergentes à la compétence en lecture.
Resumo:
This study focuses on the learning and teaching of Reading in English as a Foreign Language (REFL), in Libya. The study draws on an action research process in which I sought to look critically at students and teachers of English as a Foreign Language (EFL) in Libya as they learned and taught REFL in four Libyan research sites. The Libyan EFL educational system is influenced by two main factors: the method of teaching the Holy-Quran and the long-time ban on teaching EFL by the former Libyan regime under Muammar Gaddafi. Both of these factors have affected the learning and teaching of REFL and I outline these contextual factors in the first chapter of the thesis. This investigation, and the exploration of the challenges that Libyan university students encounter in their REFL, is supported by attention to reading models. These models helped to provide an analytical framework and starting point for understanding the many processes involved in reading for meaning and in reading to satisfy teacher instructions. The theoretical framework I adopted was based, mainly and initially, on top-down, bottom-up, interactive and compensatory interactive models. I drew on these models with a view to understanding whether and how the processes of reading described in the models could be applied to the reading of EFL students and whether these models could help me to better understand what was going on in REFL. The diagnosis stage of the study provided initial data collected from four Libyan research sites with research tools including video-recorded classroom observations, semi-structured interviews with teachers before and after lesson observation, and think-aloud protocols (TAPs) with 24 students (six from each university) in which I examined their REFL reading behaviours and strategies. This stage indicated that the majority of students shared behaviours such as reading aloud, reading each word in the text, articulating the phonemes and syllables of words, or skipping words if they could not pronounce them. Overall this first stage indicated that alternative methods of teaching REFL were needed in order to encourage ‘reading for meaning’ that might be based on strategies related to eventual interactive reading models adapted for REFL. The second phase of this research project was an Intervention Phase involving two team-teaching sessions in one of the four stage one universities. In each session, I worked with the teacher of one group to introduce an alternative method of REFL. This method was based on teaching different reading strategies to encourage the students to work towards an eventual interactive way of reading for meaning. A focus group discussion and TAPs followed the lessons with six students in order to discuss the 'new' method. Next were two video-recorded classroom observations which were followed by an audio-recorded discussion with the teacher about these methods. Finally, I conducted a Skype interview with the class teacher at the end of the semester to discuss any changes he had made in his teaching or had observed in his students' reading with respect to reading behaviour strategies, and reactions and performance of the students as he continued to use the 'new' method. The results of the intervention stage indicate that the teacher, perhaps not surprisingly, can play an important role in adding to students’ knowledge and confidence and in improving their REFL strategies. For example, after the intervention stage, students began to think about the title, and to use their own background knowledge to comprehend the text. The students employed, also, linguistic strategies such as decoding and, above all, the students abandoned the behaviour of reading for pronunciation in favour of reading for meaning. Despite the apparent efficacy of the alternative method, there are, inevitably, limitations related to the small-scale nature of the study and the time I had available to conduct the research. There are challenges, too, related to the students’ first language, the idiosyncrasies of the English language, the teacher training and continuing professional development of teachers, and the continuing political instability of Libya. The students’ lack of vocabulary and their difficulties with grammatical functions such as phrasal and prepositional verbs, forms which do not exist in Arabic, mean that REFL will always be challenging. Given such constraints, the ‘new’ methods I trialled and propose for adoption can only go so far in addressing students’ difficulties in REFL. Overall, the study indicates that the Libyan educational system is underdeveloped and under resourced with respect to REFL. My data indicates that the teacher participants have received little to no professional developmental that could help them improve their teaching in REFL and skills in teaching EFL. These circumstances, along with the perennial problem of large but varying class sizes; student, teacher and assessment expectations; and limited and often poor quality resources, affect the way EFL students learn to read in English. Against this background, the thesis concludes by offering tentative conclusions; reflections on the study, including a discussion of its limitations, and possible recommendations designed to improve REFL learning and teaching in Libyan universities.
Resumo:
In this thesis we aimed to explore the potential of gamification - defined as “the use of game elements in non-game contexts” [30] - in increasing children's (aged 5 to 6) engagement with the task. This is mainly due to the fact that our world is living a technological era, and videogames are an example of this engagement by being able to maintain children’s (and adults) engagement for hours straight. For the purpose of limiting complexity, we only addressed the feedback element by introducing it with an anthropomorphic virtual agent (human-like aspect), because research shows that virtual agents (VA’s) can influence behavioural change [17], or even induce emotions on humans both through the use of feedback provided and their facial expressions, which can interpreted in the same way as of humans’ [2]. By pairing the VA with the gamification concept, we wanted to 1) create a VA that is likely to be well-received by children (appearance and behaviour), and 2) have the immediate feedback that games have, so we can give children an assessment of their actions in real-time, as opposed to waiting for feedback from someone (traditional teaching), and with this give students more chances to succeed [32, 43]. Our final system consisted on a virtual environment, where children formed words that corresponded to a given image. In order to measure the impact that the VA had on engagement, the system was developed in two versions: one version of the system was limited to provide a simple feedback environment, where the VA provided feedback, by responding with simple phrases (i.e. “correct” or “incorrect”); for the second version, the VA had a more complex approach where it tried to encourage children to complete the word – a motivational feedback - even when they weren’t succeeding. Lastly we conducted a field study with two groups of children, where one group tested the version with the simple feedback, and the other group tested the ‘motivational’ version of the system. We used a quantitative approach to analyze the collected data that measured the engagement, based on the number of tasks (words) completed and time spent with system. The results of the evaluation showed that the use of motivational feedback may carry a positive effect on engaging children.
Resumo:
Although the debate of what data science is has a long history and has not reached a complete consensus yet, Data Science can be summarized as the process of learning from data. Guided by the above vision, this thesis presents two independent data science projects developed in the scope of multidisciplinary applied research. The first part analyzes fluorescence microscopy images typically produced in life science experiments, where the objective is to count how many marked neuronal cells are present in each image. Aiming to automate the task for supporting research in the area, we propose a neural network architecture tuned specifically for this use case, cell ResUnet (c-ResUnet), and discuss the impact of alternative training strategies in overcoming particular challenges of our data. The approach provides good results in terms of both detection and counting, showing performance comparable to the interpretation of human operators. As a meaningful addition, we release the pre-trained model and the Fluorescent Neuronal Cells dataset collecting pixel-level annotations of where neuronal cells are located. In this way, we hope to help future research in the area and foster innovative methodologies for tackling similar problems. The second part deals with the problem of distributed data management in the context of LHC experiments, with a focus on supporting ATLAS operations concerning data transfer failures. In particular, we analyze error messages produced by failed transfers and propose a Machine Learning pipeline that leverages the word2vec language model and K-means clustering. This provides groups of similar errors that are presented to human operators as suggestions of potential issues to investigate. The approach is demonstrated on one full day of data, showing promising ability in understanding the message content and providing meaningful groupings, in line with previously reported incidents by human operators.