918 resultados para concept learning


Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper focuses on two basic issues: the anxiety-generating nature of the interpreting task and the relevance of interpreter trainees’ academic self-concept. The first has already been acknowledged, although not extensively researched, in several papers, and the second has only been mentioned briefly in interpreting literature. This study seeks to examine the relationship between the anxiety and academic self-concept constructs among interpreter trainees. An adapted version of the Foreign Language Anxiety Scale (Horwitz et al., 1986), the Academic Autoconcept Scale (Schmidt, Messoulam & Molina, 2008) and a background information questionnaire were used to collect data. Students’ t-Test analysis results indicated that female students reported experiencing significantly higher levels of anxiety than male students. No significant gender difference in self-concept levels was found. Correlation analysis results suggested, on the one hand, that younger would-be interpreters suffered from higher anxiety levels and students with higher marks tended to have lower anxiety levels; and, on the other hand, that younger students had lower self-concept levels and higher-ability students held higher self-concept levels. In addition, the results revealed that students with higher anxiety levels tended to have lower self-concept levels. Based on these findings, recommendations for interpreting pedagogy are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The last couple of years there has been a lot of attention for MOOCs. More and more universities start offering MOOCs. Although the open dimension of MOOC indicates that it is open in every aspect, in most cases it is a course with a structure and a timeline within which learning activities are positioned. There is a contradiction there. The open aspect puts MOOCs more in the non-formal professional learning domain, while the course structure takes it into the formal, traditional education domain. Accordingly, there is no consensus yet on solid pedagogical approaches for MOOCs. Something similar can be said for learning analytics, another upcoming concept that is receiving a lot of attention. Given its nature, learning analytics offers a large potential to support learners in particular in MOOCs. Learning analytics should then be applied to assist the learners and teachers in understanding the learning process and could predict learning, provide opportunities for pro-active feedback, but should also results in interventions aimed at improving progress. This paper illustrates pedagogical and learning analytics approaches based on practices developed in formal online and distance teaching university education that have been fine-tuned for MOOCs and have been piloted in the context of the EU-funded MOOC projects ECO (Elearning, Communication, Open-Data: http://ecolearning.eu) and EMMA (European Multiple MOOC Aggregator: http://platform.europeanmoocs.eu).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The angle concept is a multifaceted concept having static and dynamic definitions. The static definition of the angle refers to “the space between two rays” or “the intersection of two rays at the same end point” (Mitchelmore & White, 1998), whereas the dynamic definition of the angle concept highlights that the size of angle is the amount of rotation in direction (Fyhn, 2006). Since both definitions represent two diverse situations and have unique limitations (Henderson & Taimina, 2005), students may hold misconceptions about the angle concept. In this regard, the aim of this research was to explore high achievers’ knowledge regarding the definition of the angle concept as well as to investigate their erroneous answers on the angle concept.

104 grade 6 students drawn from four well-established elementary schools of Yozgat, Turkey were participated in this research. All participants were selected via a purposive sampling method and their mathematics grades were 4 or 5 out of 5, and. Data were collected through four questions prepared by considering the learning competencies set out in the grade 6 curriculum in Turkey and the findings of previous studies whose purposes were to identify students’ misconceptions of the angle concept. The findings were analyzed by two researchers, and their inter-rater agreement was calculated as 0.91, or almost perfect. Thereafter, coding discrepancies were resolved, and consensus was established.

The angle concept is a multifaceted concept having static and dynamic definitions.The static definition of the angle refers to “the space between two rays” or“the intersection of two rays at the same end point” (Mitchelmore & White, 1998), whereas the dynamicdefinition of the angle concept highlights that the size of angle is the amountof rotation in direction (Fyhn, 2006). Since both definitionsrepresent two diverse situations and have unique limitations (Henderson & Taimina, 2005), students may holdmisconceptions about the angle concept. In this regard, the aim of thisresearch was to explore high achievers’ knowledge regarding the definition ofthe angle concept as well as to investigate their erroneous answers on theangle concept.

104grade 6 students drawn from four well-established elementary schools of Yozgat,Turkey were participated in this research. All participants were selected via a purposive sampling method and their mathematics grades were 4 or 5 out of 5,and. Data were collected through four questions prepared by considering the learning competencies set out in the grade 6 curriculum in Turkey and the findings of previous studies whose purposes were to identify students’ misconceptions of the angle concept. The findings were analyzed by two researchers, and their inter-rater agreement was calculated as 0.91, or almost perfect. Thereafter, coding discrepancies were resolved, and consensus was established.

In the first question, students were asked to answer a multiple choice questions consisting of two statics definitions and one dynamic definition of the angle concept. Only 38 of 104 students were able to recognize these three definitions. Likewise, Mitchelmore and White (1998) investigated that less than10% of grade 4 students knew the dynamic definition of the angle concept. Additionally,the purpose of the second question was to figure out how well students could recognize 0-degree angle. We found that 49 of 104 students were unable to recognize MXW as an angle. While 6 students indicated that the size of MXW is0, other 6 students revealed that the size of MXW is 360. Therefore, 12 of 104students correctly answered this questions. On the other hand, 28 of 104students recognized the MXW angle as 180-degree angle. This finding demonstrated that these students have difficulties in naming the angles.Moreover, the third question consisted of three concentric circles with center O and two radiuses of the outer circle, and the intersection of the radiuses with these circles were named. Then, students were asked to compare the size of AOB, GOD and EOF angles. Only 36 of 104 students answered correctly by indicating that all three angles are equal, whereas 68 of 104 students incorrectly responded this question by revealing AOB<GOD< EOF. These students erroneously thought the size of the angle is related to either the size of the arc marking the angle or the area between the arms of the angle and the arc marking angle. These two erroneous strategies for determining the size of angles have been found by a few studies (Clausen-May,2008; Devichi & Munier, 2013; Kim & Lee, 2014; Mithcelmore, 1998;Wilson & Adams, 1992). The last question, whose aim was to determine how well students can adapt theangle concept to real life, consisted of an observer and a barrier, and students were asked to color the hidden area behind the barrier. Only 2 of 104students correctly responded this question, whereas 19 of 104 students drew rays from the observer to both sides of the barrier, and colored the area covered by the rays, the observer and barrier. While 35 of 104 students just colored behind the barrier without using any strategies, 33 of 104 students constructed two perpendicular lines at the both end of the barrier, and colored behind the barrier. Similarly, Munier, Devinci and Merle (2008) found that this incorrect strategy was used by 27% of students.

Consequently, we found that although the participants in this study were high achievers, they still held several misconceptions on the angle concept and had difficulties in adapting the angle concept to real life.

Keywords: the angle concept;misconceptions; erroneous answers; high achievers

References

Clausen-May, T. (2008). AnotherAngle on Angles. Australian Primary Mathematics Classroom, 13(1),4–8.

Devichi, C., & Munier, V.(2013). About the concept of angle in elementary school: Misconceptions andteaching sequences. The Journal of Mathematical Behavior, 32(1),1–19. http://doi.org/10.1016/j.jmathb.2012.10.001

Fyhn, A. B. (2006). A climbinggirl’s reflections about angles. The Journal of Mathematical Behavior, 25(2),91–102. http://doi.org/10.1016/j.jmathb.2006.02.004

Henderson, D. W., & Taimina,D. (2005). Experiencing geometry: Euclidean and non-Euclidean with history(3rd ed.). New York, USA: Prentice Hall.

Kim, O.-K., & Lee, J. H.(2014). Representations of Angle and Lesson Organization in Korean and AmericanElementary Mathematics Curriculum Programs. KAERA Research Forum, 1(3),28–37.

Mitchelmore, M. C., & White,P. (1998). Development of angle concepts: A framework for research. MathematicsEducation Research Journal, 10(3), 4–27.

Mithcelmore, M. C. (1998). Youngstudents’ concepts of turning and angle. Cognition and Instruction, 16(3),265–284.

Munier, V., Devichi, C., &Merle, H. (2008). A Physical Situation as a Way to Teach Angle. TeachingChildren Mathematics, 14(7), 402–407.

Wilson, P. S., & Adams, V.M. (1992). A Dynamic Way to Teach Angle and Angle Measure. ArithmeticTeacher, 39(5), 6–13.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Algorithms for concept drift handling are important for various applications including video analysis and smart grids. In this paper we present decision tree ensemble classication method based on the Random Forest algorithm for concept drift. The weighted majority voting ensemble aggregation rule is employed based on the ideas of Accuracy Weighted Ensemble (AWE) method. Base learner weight in our case is computed for each sample evaluation using base learners accuracy and intrinsic proximity measure of Random Forest. Our algorithm exploits both temporal weighting of samples and ensemble pruning as a forgetting strategy. We present results of empirical comparison of our method with îriginal random forest with incorporated replace-the-looser forgetting andother state-of-the-art concept-drift classiers like AWE2.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, we suggest that portrayal of research is often undervalued and seen as ‘unwork’ (Galloway, 2012). Portrayal is often seen as an issue that is relatively straight forward by qualitative researchers, and invariably refers to putting the findings of the study together with excerpts from participants and usually, but not always, some interpretation. It tends to be seen as the means by which the researcher has chosen to position people and their perspectives, and it is imbued with a sense of not only positioning but also a contextual painting of a person in a particular way. Yet there are an array of issues and challenges about what portrayal can or might mean in digital spaces. In this paper we argue that researching education in a digital age provides greater or different opportunities to represent and portray data differently and suggest that these ways are underutilised. For example, for many researchers legitimacy comes through the use of participants’ voices in the form of quotations. However, we argue that this stance towards plausibility and legitimacy is problematic and needs to be reconsidered in terms of understanding differences in types of portrayal, recognizing how researchers position themselves in relation to portrayal, and understanding decision-making in relation to portrayal. We suggest that there need to be new perspectives about portrayal and concept, and ideas are provided that offer a different view. Three key recommendations are made: Portrayal should be reconceptualised as four overlapping concepts: mustering, folding, cartography, and portrayal. Adopting such an approach will enable audiences, researchers and other stakeholders to critique the assumptions that researchers on tour bring to portrayal and encourage reflexivity. Researchers on tour should highlight the temporal, mutable and shifting nature of portrayed research findings, emphasising the need for continued and varied research to inform understanding. There is a significant need for greater insight into the influence of portrayal, as well as the difference between representation and portrayal. Future studies should prioritise this, and ensure that portrayal is considered and critiqued from the outset.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Given the need of a growing internationalization of business, to have a good command of English is, most of the times important for the development of technical (specific) competences. It is, thus, critical that professionals use accurate terminology to set grounds for a well-succeeded communication. Furthermore, business communication is increasingly moving to ICT-mediated sets and professionals have to be able to promptly adjust to these needs, resorting to trustworthy online information sources, but also using technologies that better serve their business purposes. In this scenario, the main objective of this study is to find evidence as to the utility of concept mapping as a teaching and learning strategy for the appropriation of business English terminology, enabling students to use English more efficiently as language of communication in business context. This study was based on a case study methodology, mainly of exploratory nature. Participants were students (n= 30) enrolled in the subject English Applied to Management II at Águeda School of Technology and Management – University of Aveiro (2013/14 edition). They were asked to create and peer review two concept maps (cmaps), one individually and another in pairs. The data gathered were treated and analysed resorting qualitative (content analysis) and to quantitative (descriptive statistical analysis) techniques. Results of the data analysis unveil that the use of a collaborative concept mapping tool promotes the development of linguistic competences as to the use of business terminology, but also of communication and collaboration competences. Besides, it was also a very important motivation element in the students’ engagement with the subject content.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Performing Macroscopy in Pathology implies to plan and implement methods of selection, description and collection of biological material from human organs and tissues, actively contributing to the clinical pathology analysis by preparing macroscopic report and the collection and identification of fragments, according to the standardized protocols and recognizing the criteria internationally established for determining the prognosis. The Macroscopy in Pathology course is a full year program with theoretical and pratical components taught by Pathologists. It is divided by organ/system surgical pathology into weekly modules and includes a practical "hands-on" component in Pathology Departments. The students are 50 biomedical scientists aged from 22 to 50 years old from all across the country that want to acquire competences in macroscopy. A blended learning strategy was used in order to: give students the opportunity to attend from distance; support the contents, lessons and the interaction with colleagues and teachers; facilitate the formative/summative assessment.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Scheduling problems are generally NP-hard combinatorial problems, and a lot of research has been done to solve these problems heuristically. However, most of the previous approaches are problem-specific and research into the development of a general scheduling algorithm is still in its infancy. Mimicking the natural evolutionary process of the survival of the fittest, Genetic Algorithms (GAs) have attracted much attention in solving difficult scheduling problems in recent years. Some obstacles exist when using GAs: there is no canonical mechanism to deal with constraints, which are commonly met in most real-world scheduling problems, and small changes to a solution are difficult. To overcome both difficulties, indirect approaches have been presented (in [1] and [2]) for nurse scheduling and driver scheduling, where GAs are used by mapping the solution space, and separate decoding routines then build solutions to the original problem. In our previous indirect GAs, learning is implicit and is restricted to the efficient adjustment of weights for a set of rules that are used to construct schedules. The major limitation of those approaches is that they learn in a non-human way: like most existing construction algorithms, once the best weight combination is found, the rules used in the construction process are fixed at each iteration. However, normally a long sequence of moves is needed to construct a schedule and using fixed rules at each move is thus unreasonable and not coherent with human learning processes. When a human scheduler is working, he normally builds a schedule step by step following a set of rules. After much practice, the scheduler gradually masters the knowledge of which solution parts go well with others. He can identify good parts and is aware of the solution quality even if the scheduling process is not completed yet, thus having the ability to finish a schedule by using flexible, rather than fixed, rules. In this research we intend to design more human-like scheduling algorithms, by using ideas derived from Bayesian Optimization Algorithms (BOA) and Learning Classifier Systems (LCS) to implement explicit learning from past solutions. BOA can be applied to learn to identify good partial solutions and to complete them by building a Bayesian network of the joint distribution of solutions [3]. A Bayesian network is a directed acyclic graph with each node corresponding to one variable, and each variable corresponding to individual rule by which a schedule will be constructed step by step. The conditional probabilities are computed according to an initial set of promising solutions. Subsequently, each new instance for each node is generated by using the corresponding conditional probabilities, until values for all nodes have been generated. Another set of rule strings will be generated in this way, some of which will replace previous strings based on fitness selection. If stopping conditions are not met, the Bayesian network is updated again using the current set of good rule strings. The algorithm thereby tries to explicitly identify and mix promising building blocks. It should be noted that for most scheduling problems the structure of the network model is known and all the variables are fully observed. In this case, the goal of learning is to find the rule values that maximize the likelihood of the training data. Thus learning can amount to 'counting' in the case of multinomial distributions. In the LCS approach, each rule has its strength showing its current usefulness in the system, and this strength is constantly assessed [4]. To implement sophisticated learning based on previous solutions, an improved LCS-based algorithm is designed, which consists of the following three steps. The initialization step is to assign each rule at each stage a constant initial strength. Then rules are selected by using the Roulette Wheel strategy. The next step is to reinforce the strengths of the rules used in the previous solution, keeping the strength of unused rules unchanged. The selection step is to select fitter rules for the next generation. It is envisaged that the LCS part of the algorithm will be used as a hill climber to the BOA algorithm. This is exciting and ambitious research, which might provide the stepping-stone for a new class of scheduling algorithms. Data sets from nurse scheduling and mall problems will be used as test-beds. It is envisaged that once the concept has been proven successful, it will be implemented into general scheduling algorithms. It is also hoped that this research will give some preliminary answers about how to include human-like learning into scheduling algorithms and may therefore be of interest to researchers and practitioners in areas of scheduling and evolutionary computation. References 1. Aickelin, U. and Dowsland, K. (2003) 'Indirect Genetic Algorithm for a Nurse Scheduling Problem', Computer & Operational Research (in print). 2. Li, J. and Kwan, R.S.K. (2003), 'Fuzzy Genetic Algorithm for Driver Scheduling', European Journal of Operational Research 147(2): 334-344. 3. Pelikan, M., Goldberg, D. and Cantu-Paz, E. (1999) 'BOA: The Bayesian Optimization Algorithm', IlliGAL Report No 99003, University of Illinois. 4. Wilson, S. (1994) 'ZCS: A Zeroth-level Classifier System', Evolutionary Computation 2(1), pp 1-18.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Scheduling problems are generally NP-hard combinatorial problems, and a lot of research has been done to solve these problems heuristically. However, most of the previous approaches are problem-specific and research into the development of a general scheduling algorithm is still in its infancy. Mimicking the natural evolutionary process of the survival of the fittest, Genetic Algorithms (GAs) have attracted much attention in solving difficult scheduling problems in recent years. Some obstacles exist when using GAs: there is no canonical mechanism to deal with constraints, which are commonly met in most real-world scheduling problems, and small changes to a solution are difficult. To overcome both difficulties, indirect approaches have been presented (in [1] and [2]) for nurse scheduling and driver scheduling, where GAs are used by mapping the solution space, and separate decoding routines then build solutions to the original problem. In our previous indirect GAs, learning is implicit and is restricted to the efficient adjustment of weights for a set of rules that are used to construct schedules. The major limitation of those approaches is that they learn in a non-human way: like most existing construction algorithms, once the best weight combination is found, the rules used in the construction process are fixed at each iteration. However, normally a long sequence of moves is needed to construct a schedule and using fixed rules at each move is thus unreasonable and not coherent with human learning processes. When a human scheduler is working, he normally builds a schedule step by step following a set of rules. After much practice, the scheduler gradually masters the knowledge of which solution parts go well with others. He can identify good parts and is aware of the solution quality even if the scheduling process is not completed yet, thus having the ability to finish a schedule by using flexible, rather than fixed, rules. In this research we intend to design more human-like scheduling algorithms, by using ideas derived from Bayesian Optimization Algorithms (BOA) and Learning Classifier Systems (LCS) to implement explicit learning from past solutions. BOA can be applied to learn to identify good partial solutions and to complete them by building a Bayesian network of the joint distribution of solutions [3]. A Bayesian network is a directed acyclic graph with each node corresponding to one variable, and each variable corresponding to individual rule by which a schedule will be constructed step by step. The conditional probabilities are computed according to an initial set of promising solutions. Subsequently, each new instance for each node is generated by using the corresponding conditional probabilities, until values for all nodes have been generated. Another set of rule strings will be generated in this way, some of which will replace previous strings based on fitness selection. If stopping conditions are not met, the Bayesian network is updated again using the current set of good rule strings. The algorithm thereby tries to explicitly identify and mix promising building blocks. It should be noted that for most scheduling problems the structure of the network model is known and all the variables are fully observed. In this case, the goal of learning is to find the rule values that maximize the likelihood of the training data. Thus learning can amount to 'counting' in the case of multinomial distributions. In the LCS approach, each rule has its strength showing its current usefulness in the system, and this strength is constantly assessed [4]. To implement sophisticated learning based on previous solutions, an improved LCS-based algorithm is designed, which consists of the following three steps. The initialization step is to assign each rule at each stage a constant initial strength. Then rules are selected by using the Roulette Wheel strategy. The next step is to reinforce the strengths of the rules used in the previous solution, keeping the strength of unused rules unchanged. The selection step is to select fitter rules for the next generation. It is envisaged that the LCS part of the algorithm will be used as a hill climber to the BOA algorithm. This is exciting and ambitious research, which might provide the stepping-stone for a new class of scheduling algorithms. Data sets from nurse scheduling and mall problems will be used as test-beds. It is envisaged that once the concept has been proven successful, it will be implemented into general scheduling algorithms. It is also hoped that this research will give some preliminary answers about how to include human-like learning into scheduling algorithms and may therefore be of interest to researchers and practitioners in areas of scheduling and evolutionary computation. References 1. Aickelin, U. and Dowsland, K. (2003) 'Indirect Genetic Algorithm for a Nurse Scheduling Problem', Computer & Operational Research (in print). 2. Li, J. and Kwan, R.S.K. (2003), 'Fuzzy Genetic Algorithm for Driver Scheduling', European Journal of Operational Research 147(2): 334-344. 3. Pelikan, M., Goldberg, D. and Cantu-Paz, E. (1999) 'BOA: The Bayesian Optimization Algorithm', IlliGAL Report No 99003, University of Illinois. 4. Wilson, S. (1994) 'ZCS: A Zeroth-level Classifier System', Evolutionary Computation 2(1), pp 1-18.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

8th International Symposium on Project Approaches in Engineering Education (PAEE)

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Proceedings of the 8th International Symposium on Project Approaches in Engineering Education (PAEE), Guimarães, 2016

Relevância:

30.00% 30.00%

Publicador:

Resumo:

As an effect of marketisation, the importance of workplace learning in Germany has increased. The article follows up on the long-standing discourse around the question of how economic and pedagogical ideals interact in this context. In order to develop a theoretical framework for empirical research, three major positions of the discipline of business ethics are introduced. Business ethics in more abstract ways deals with the very same question, namely how do ideas such as profit orientation interact with other norms and values? The new perspectives show that the discourse has been hitherto based on a specific understanding of economy. In order to derive an empirical answer to the research question, the question is re-formulated as follows: Which values are inherent in the decisions taken? Consequently, it suggests using the concept of ‘rationalities of justification’ for empirical research. The article shows how this concept can be applied by conducting a test run. (DIPF/Orig.)

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Digital divide is an issue that concerns our technology dominated society. The parents of ubiquitous computing dreamt of a total proliferation of information technology. But the reality we live in is not yet prepared for this future. There is current a need to develop programs in order to diminish this difference between the digitally included and the excluded one. PROEJA-Transiarte is a project ran by Universidade de Brasília in the city of Ceilândia, Federal District of Brazil. It proposes a different approach on the issue of digital divide, by introducing the cooperative creation of cyberart, based on the life stories of each participant, into the regular curriculum of EJA (Educação de Jovens e Adultos) classes, thus implementing the concept of solidary education. This research project investigated the role played by the cooperative learning the students put in practice during the workshops of the project in the diminishing of the digital exclusion a great part of the students feel. It looked into their activities, analyzing the development of their cooperation, putting it next in the context of the digital and social inclusion. After a multi-dimensional research on the theme, in the context of PROEJA-Transiarte, the conclusion shows the impact cooperative learning has in the reduction of the digital divide, analyzing the perception of the currently involved students, the researchers active in the project, or the former students that had their lives improved because of the workshops they participated in.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

My dissertation emphasizes a cognitive account of multimodality that explicitly integrates experiential knowledge work into the rhetorical pedagogy that informs so many composition and technical communication programs. In these disciplines, multimodality is widely conceived in terms of what Gunther Kress calls “socialsemiotic” modes of communication shaped primarily by culture. In the cognitive and neurolinguistic theories of Vittorio Gallese and George Lakoff, however, multimodality is described as a key characteristic of our bodies’ sensory-motor systems which link perception to action and action to meaning, grounding all communicative acts in knowledge shaped through body-engaged experience. I argue that this “situated” account of cognition – which closely approximates Maurice Merleau-Ponty’s phenomenology of perception, a major framework for my study – has pedagogical precedence in the mimetic pedagogy that informed ancient Sophistic rhetorical training, and I reveal that training’s multimodal dimensions through a phenomenological exegesis of the concept mimesis. Plato’s denigration of the mimetic tradition and his elevation of conceptual contemplation through reason, out of which developed the classic Cartesian separation of mind from body, resulted in a general degradation of experiential knowledge in Western education. But with the recent introduction into college classrooms of digital technologies and multimedia communication tools, renewed emphasis is being placed on the “hands-on” nature of inventive and productive praxis, necessitating a revision of methods of instruction and assessment that have traditionally privileged the acquisition of conceptual over experiential knowledge. The model of multimodality I construct from Merleau-Ponty’s phenomenology, ancient Sophistic rhetorical pedagogy, and current neuroscientific accounts of situated cognition insists on recognizing the significant role knowledges we acquire experientially play in our reading and writing, speaking and listening, discerning and designing practices.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis presents a study of the Grid data access patterns in distributed analysis in the CMS experiment at the LHC accelerator. This study ranges from the deep analysis of the historical patterns of access to the most relevant data types in CMS, to the exploitation of a supervised Machine Learning classification system to set-up a machinery able to eventually predict future data access patterns - i.e. the so-called dataset “popularity” of the CMS datasets on the Grid - with focus on specific data types. All the CMS workflows run on the Worldwide LHC Computing Grid (WCG) computing centers (Tiers), and in particular the distributed analysis systems sustains hundreds of users and applications submitted every day. These applications (or “jobs”) access different data types hosted on disk storage systems at a large set of WLCG Tiers. The detailed study of how this data is accessed, in terms of data types, hosting Tiers, and different time periods, allows to gain precious insight on storage occupancy over time and different access patterns, and ultimately to extract suggested actions based on this information (e.g. targetted disk clean-up and/or data replication). In this sense, the application of Machine Learning techniques allows to learn from past data and to gain predictability potential for the future CMS data access patterns. Chapter 1 provides an introduction to High Energy Physics at the LHC. Chapter 2 describes the CMS Computing Model, with special focus on the data management sector, also discussing the concept of dataset popularity. Chapter 3 describes the study of CMS data access patterns with different depth levels. Chapter 4 offers a brief introduction to basic machine learning concepts and gives an introduction to its application in CMS and discuss the results obtained by using this approach in the context of this thesis.