887 resultados para LINK-BASED AND MULTIDIMENSIONAL QUERY LANGUAGE (LMDQL)
Resumo:
This thesis addressed the problem of risk analysis in mental healthcare, with respect to the GRiST project at Aston University. That project provides a risk-screening tool based on the knowledge of 46 experts, captured as mind maps that describe relationships between risks and patterns of behavioural cues. Mind mapping, though, fails to impose control over content, and is not considered to formally represent knowledge. In contrast, this thesis treated GRiSTs mind maps as a rich knowledge base in need of refinement; that process drew on existing techniques for designing databases and knowledge bases. Identifying well-defined mind map concepts, though, was hindered by spelling mistakes, and by ambiguity and lack of coverage in the tools used for researching words. A novel use of the Edit Distance overcame those problems, by assessing similarities between mind map texts, and between spelling mistakes and suggested corrections. That algorithm further identified stems, the shortest text string found in related word-forms. As opposed to existing approaches’ reliance on built-in linguistic knowledge, this thesis devised a novel, more flexible text-based technique. An additional tool, Correspondence Analysis, found patterns in word usage that allowed machines to determine likely intended meanings for ambiguous words. Correspondence Analysis further produced clusters of related concepts, which in turn drove the automatic generation of novel mind maps. Such maps underpinned adjuncts to the mind mapping software used by GRiST; one such new facility generated novel mind maps, to reflect the collected expert knowledge on any specified concept. Mind maps from GRiST are stored as XML, which suggested storing them in an XML database. In fact, the entire approach here is ”XML-centric”, in that all stages rely on XML as far as possible. A XML-based query language allows user to retrieve information from the mind map knowledge base. The approach, it was concluded, will prove valuable to mind mapping in general, and to detecting patterns in any type of digital information.
Resumo:
Methods for accessing data on the Web have been the focus of active research over the past few years. In this thesis we propose a method for representing Web sites as data sources. We designed a Data Extractor data retrieval solution that allows us to define queries to Web sites and process resulting data sets. Data Extractor is being integrated into the MSemODB heterogeneous database management system. With its help database queries can be distributed over both local and Web data sources within MSemODB framework. ^ Data Extractor treats Web sites as data sources, controlling query execution and data retrieval. It works as an intermediary between the applications and the sites. Data Extractor utilizes a twofold “custom wrapper” approach for information retrieval. Wrappers for the majority of sites are easily built using a powerful and expressive scripting language, while complex cases are processed using Java-based wrappers that utilize specially designed library of data retrieval, parsing and Web access routines. In addition to wrapper development we thoroughly investigate issues associated with Web site selection, analysis and processing. ^ Data Extractor is designed to act as a data retrieval server, as well as an embedded data retrieval solution. We also use it to create mobile agents that are shipped over the Internet to the client's computer to perform data retrieval on behalf of the user. This approach allows Data Extractor to distribute and scale well. ^ This study confirms feasibility of building custom wrappers for Web sites. This approach provides accuracy of data retrieval, and power and flexibility in handling of complex cases. ^
Resumo:
Moving objects database systems are the most challenging sub-category among Spatio-Temporal database systems. A database system that updates in real-time the location information of GPS-equipped moving vehicles has to meet even stricter requirements. Currently existing data storage models and indexing mechanisms work well only when the number of moving objects in the system is relatively small. This dissertation research aimed at the real-time tracking and history retrieval of massive numbers of vehicles moving on road networks. A total solution has been provided for the real-time update of the vehicles' location and motion information, range queries on current and history data, and prediction of vehicles' movement in the near future. ^ To achieve these goals, a new approach called Segmented Time Associated to Partitioned Space (STAPS) was first proposed in this dissertation for building and manipulating the indexing structures for moving objects databases. ^ Applying the STAPS approach, an indexing structure of associating a time interval tree to each road segment was developed for real-time database systems of vehicles moving on road networks. The indexing structure uses affordable storage to support real-time data updates and efficient query processing. The data update and query processing performance it provides is consistent without restrictions such as a time window or assuming linear moving trajectories. ^ An application system design based on distributed system architecture with centralized organization was developed to maximally support the proposed data and indexing structures. The suggested system architecture is highly scalable and flexible. Finally, based on a real-world application model of vehicles moving in region-wide, main issues on the implementation of such a system were addressed. ^
Resumo:
Graph-structured databases are widely prevalent, and the problem of effective search and retrieval from such graphs has been receiving much attention recently. For example, the Web can be naturally viewed as a graph. Likewise, a relational database can be viewed as a graph where tuples are modeled as vertices connected via foreign-key relationships. Keyword search querying has emerged as one of the most effective paradigms for information discovery, especially over HTML documents in the World Wide Web. One of the key advantages of keyword search querying is its simplicity—users do not have to learn a complex query language, and can issue queries without any prior knowledge about the structure of the underlying data. The purpose of this dissertation was to develop techniques for user-friendly, high quality and efficient searching of graph structured databases. Several ranked search methods on data graphs have been studied in the recent years. Given a top-k keyword search query on a graph and some ranking criteria, a keyword proximity search finds the top-k answers where each answer is a substructure of the graph containing all query keywords, which illustrates the relationship between the keyword present in the graph. We applied keyword proximity search on the web and the page graph of web documents to find top-k answers that satisfy user’s information need and increase user satisfaction. Another effective ranking mechanism applied on data graphs is the authority flow based ranking mechanism. Given a top- k keyword search query on a graph, an authority-flow based search finds the top-k answers where each answer is a node in the graph ranked according to its relevance and importance to the query. We developed techniques that improved the authority flow based search on data graphs by creating a framework to explain and reformulate them taking in to consideration user preferences and feedback. We also applied the proposed graph search techniques for Information Discovery over biological databases. Our algorithms were experimentally evaluated for performance and quality. The quality of our method was compared to current approaches by using user surveys.
Resumo:
The purpose of this phenomenological study was to describe how Colombian adult English language learners (ELL) select and use language learning strategies (LLS). This study used Oxford’s (1990a) taxonomy for LLS as its theoretical framework. Semi-structured interviews and a focus group interview, were conducted, transcribed, and analyzed for 12 Colombian adult ELL. A communicative activity known as strip story (Gibson, 1975) was used to elicit participants’ use of LLS. This activity preceded the focus group session. Additionally, participants’ reflective journals were collected and analyzed. Data were analyzed using inductive, deductive, and comparative analyses. Four themes emerged from the inductive analysis of the data: (a) learning conditions, (b) problem-solving resources, (c) information processing, and (d) target language practice. Oxford’s classification of LLS was used as a guide in deductively analyzing data concerning the participants’ experiences. The deductive analysis revealed that participants do not use certain strategies included in Oxford’s taxonomy at the third level. For example, semantic mapping, or physical response or sensation was not reported by participants. The findings from the inductive and deductive analyses were then compared to look for patterns and answers to the research questions. The comparative analysis revealed that participants used additional LLS that are not included in Oxford’s taxonomy. Some examples of these strategies are: using sound transcription in native language and help from children. The study was conducted at the MDC InterAmerican campus in South Florida, one of the largest Hispanic-influenced communities in the U.S. Based on the findings from this study, the researcher proposed a framework to study LLS that includes both external (i.e., learning context, community) and internal (i.e., culture, prior education) factors that influence the selection and use of LLS. The findings from this study imply that given the importance of the both external and internal factors in learners’ use of LLS, these factors should be considered for inclusion in any study of language learner strategies use by adult learners. Implications for teaching and learning as well as recommendations for further research are provided.
Resumo:
Hispanic Generation 1.5 students are foreign-born, U.S. high school graduates who are socialized in the English dominant K-12 school system while still maintaining the native language and culture at home (Allison, 2006; Blumenthal, 2002; Harklau, Siegal, & Losey, 1999; Rumbault & Ima, 1988). When transitioning from high school to college, these students sometimes assess into ESL courses based on their English language abilities, and because of this ESL placement, Hispanic Generation 1.5 students might have different engagement experiences than their mainstream peers. Engagement is a critical factor in student success and long-term retention because students’ positive and negative engagement experiences affect their membership and sense of belonging at the institution. The purpose of this study was to describe the engagement and membership experiences of Hispanic Generation 1.5 students’ at a Massachusetts community college. This study employed naturalistic inquiry within an embedded descriptive case study design that included three units of analysis: the students’ engagement experiences in (a) ESL courses, (b) developmental courses, and (c) mainstream courses. The main source of data was in-depth interviews with Hispanic Generation 1.5 students at Commonwealth of Massachusetts Community College. Criterion sampling was used to select the interview participants, ensuring that all participants were native Spanish speakers and were taking or had taken at least one ESL course at the institution. The study findings show that these Hispanic Generation 1.5 students at the college did not perceive peer engagement as critical to academic success. Most times the participants avoided peer engagement outside of the classroom, especially with fellow Hispanic students, who they felt would deter them from their English language development and general academic work. Engagement with ESL faculty and ESL academic support staff played the most critical role in the participants’ sense of belonging and success, and students who were required to engage with faculty and academic support staff outside of the classroom were the most satisfied with their educational experiences. While the participants were all disappointed with some aspect of their ESL placement, they valued the ESL engagement experiences more than the engagement experiences while completing developmental and credit coursework.
Resumo:
The article argues against an ahistorical deficit model of Spanish/English bilingualism in educational practice based on interlinguistic research. The bidirectional facilitative effects of Hispanic bilingualism allow Spanish-speaking minorities to exploit their language background while learning academic English and integrating their language and culture into the American mainstream.
Resumo:
Changing demographics impact our schools as children come from more linguistically and culturally diverse backgrounds. The various social, cultural, and economic backgrounds of the students affect their early language learning experiences which expose them to the academic language needed to succeed in school. Teachers can help students acquire academic language by introducing words that are within their Zone of Proximal Development and increasing exposure to and use of academic language. This study investigated the effects of increasing structured activities for students to orally interact with informational text on their scientific academic language development and comprehension of expository text. ^ The Academic Text Talk activities, designed to scaffold verbalization of new words and ideas, included discussion, retelling, games, and sentence walls. This study also evaluated if there were differences in scientific language proficiency and comprehension between boys and girls, and between English language learners and native English speakers. ^ A quasi-experimental design was used to determine the relationship between increasing students' oral practice with academic language and their academic language proficiency. Second graders (n = 91) from an urban public school participated in two science units over an 8 week period and were pre and post tested using the Woodcock Muñoz Language Survey-Revised and vocabulary tests from the National Energy Education Project. Analysis of covariance was performed on the pre to post scores by treatment group to determine differences in academic language proficiency for students taught using Academic Text Talk compared to students taught using a text-centered method, using the initial Florida Assessment for Instruction in Reading test as a covariate. Students taught using Academic Text Talk multimodal strategies showed significantly greater increases in their pre to posttest means on the Woodcock Muñoz Language Survey-Revised Oral Language Totals and National Energy Education Development Project Vocabulary tests than students taught using the text-centered method, ps < .05. Boys did not show significantly greater increases than girls, nor did English language learners show significantly greater increases than the native English speakers. ^ This study informs the field of reading research by evaluating the effectiveness of a multimodal combination of strategies emphasizing discourse to build academic language.^
Resumo:
Understanding the language of one’s cultural environment is important for effective communication and function. As such, students entering U.S. schools from foreign countries are given access to English to Speakers of Other Languages (ESOL) programs and they are referred to as English Language Learner (ELL) students. This dissertation examined the correlation of ELL ACCESS Composite Performance Level (CPL) score to the End of Course tests (EOCTs) and the Georgia High School Graduation Tests (GHSGTs) in the four content courses (language arts, mathematics, science, and social studies). A premise of this study was that English language proficiency is critical in meeting or exceeding state and county assessment standards. A quantitative descriptive research design was conducted using Cross-sectional archival data from a secondary source. There were 148 participants from school years 2011-2012 to 2013- 2014 from Grades 9-12. A Pearson product moment correlation was run to assess the relationship between the ACCESS CPL (independent variable) and the EOCT scores and the GHSGT scores (dependent variables). The findings showed that there was a positive correlation between ACCESS CPL scores and the EOCT scores where language arts showed a strong positive correlation and mathematics showed a positive weak correlation. Also, there was a positive correlation between ACCESS CPL scores and GHSGT scores where language arts showed a weak positive correlation. The results of this study indicated that that there is a relationship between the stated variables, ACCESS CPL, EOCT and GHSGT. Also, the results of this study showed that there were positive correlations at varying degrees for each grade levels. While the null hypothesis for Research Question 1 and Research Question 2 were rejected, there was a slight relationship between the variables.
Resumo:
Since 2004 the Colombian Ministry of Education has been implementing the Programa Nacional de Bilingüismo (PNB) with the goal of having bilingual high school graduates in English and Spanish by 2019. However, implementation of the PNB has been criticized by English Language Teaching (ELT) specialists in the country who say, among other things, that the PNB introduced a discourse associated exclusively with bilingualism in English and Spanish. This study analyzed interviews with 15 participants of a public school of the Colombian Escuela Nueva, a successful model of community-based education that has begun a process of internationalization, regarding the participants’ perceptions of foreign language education and the policies of the PNB. Six students, five teachers, and four administrators were each interviewed twice using semi-structured interviews. To offer a critique of the PNB, this study tried to determine to what extent the school implemented the elements of Responsible ELT, a model developed by the researcher incorporating the concepts of hegemony of English, critical language-policy research, and resistance in ELT. Findings included the following: (a) students and teachers saw English as the universal language whereas most administrators saw English imposed due to political and economic reasons; (b) some teachers misinterpreted the 1994 General Law of Education mandating the teaching of a foreign language as a law mandating English; and (c) some teachers and administrators saw the PNB’s adoption of competence standards based on the Common European Framework of Reference for languages as beneficial whereas others saw it as arbitrary. Conclusions derived from this study of this Escuela Nueva school were: (a) most participants found the goal of the PNB unrealistic; (b) most teachers and administrators saw the policies of the PNB as top-down policies without assessment or continuity; and (c) teachers and administrators mentioned a disarticulation between elementary and high school ELT policies that may be discouraging students in public schools from learning English. Thus, this study suggests that the policies of the PNB may be contributing to English becoming a gatekeeper for higher education and employment thereby becoming a tool for sustaining inequality in Colombia.
Resumo:
Individual cues to deception are subtle and often missed by lay people and law enforcement alike. Linguistic statement analysis remains a potentially useful way of overcoming individual diagnostic limitations (e.g. Criteria based Content Analysis; Steller & Köhnken, 1989; Reality monitoring; Johnson & Raye, 1981; Scientific Content Analysis; Sapir, 1996). Unfortunately many of these procedures are time-consuming, require in-depth training, as well as lack empirical support and/or external validity. The current dissertation develops a novel approach to statement veracity analysis that is simple to learn, easy to administer, theoretically sound, and empirically validated. Two strategies were proposed for detecting differences between liars' and truth-tellers' statements. Liars were hypothesized to strategically write statements with the goal of self-exoneration. Liars' statements were predicted to contain more first person pronouns and fewer third person pronouns. Truth-tellers were hypothesized to be motivated toward being informative and thus produce statements with fewer first person pronouns and more third person pronouns. Three studies were conducted to test this hypothesis. The first study explored the verbal patterns of exoneration and informativeness focused statements. The second study used a traditional theft paradigm to examine these verbal patterns in guilty liars and innocent truth tellers. In the third study to better match the context of a criminal investigation a cheating paradigm was used in which spontaneous lying was induced and written statements were taken. Support for the first person pronoun hypothesis was found. Limited support was found for the third person pronoun hypothesis. Results, implications, and future directions for the current research are discussed.
Resumo:
Moving objects database systems are the most challenging sub-category among Spatio-Temporal database systems. A database system that updates in real-time the location information of GPS-equipped moving vehicles has to meet even stricter requirements. Currently existing data storage models and indexing mechanisms work well only when the number of moving objects in the system is relatively small. This dissertation research aimed at the real-time tracking and history retrieval of massive numbers of vehicles moving on road networks. A total solution has been provided for the real-time update of the vehicles’ location and motion information, range queries on current and history data, and prediction of vehicles’ movement in the near future. To achieve these goals, a new approach called Segmented Time Associated to Partitioned Space (STAPS) was first proposed in this dissertation for building and manipulating the indexing structures for moving objects databases. Applying the STAPS approach, an indexing structure of associating a time interval tree to each road segment was developed for real-time database systems of vehicles moving on road networks. The indexing structure uses affordable storage to support real-time data updates and efficient query processing. The data update and query processing performance it provides is consistent without restrictions such as a time window or assuming linear moving trajectories. An application system design based on distributed system architecture with centralized organization was developed to maximally support the proposed data and indexing structures. The suggested system architecture is highly scalable and flexible. Finally, based on a real-world application model of vehicles moving in region-wide, main issues on the implementation of such a system were addressed.
Resumo:
Methods for accessing data on the Web have been the focus of active research over the past few years. In this thesis we propose a method for representing Web sites as data sources. We designed a Data Extractor data retrieval solution that allows us to define queries to Web sites and process resulting data sets. Data Extractor is being integrated into the MSemODB heterogeneous database management system. With its help database queries can be distributed over both local and Web data sources within MSemODB framework. Data Extractor treats Web sites as data sources, controlling query execution and data retrieval. It works as an intermediary between the applications and the sites. Data Extractor utilizes a two-fold "custom wrapper" approach for information retrieval. Wrappers for the majority of sites are easily built using a powerful and expressive scripting language, while complex cases are processed using Java-based wrappers that utilize specially designed library of data retrieval, parsing and Web access routines. In addition to wrapper development we thoroughly investigate issues associated with Web site selection, analysis and processing. Data Extractor is designed to act as a data retrieval server, as well as an embedded data retrieval solution. We also use it to create mobile agents that are shipped over the Internet to the client's computer to perform data retrieval on behalf of the user. This approach allows Data Extractor to distribute and scale well. This study confirms feasibility of building custom wrappers for Web sites. This approach provides accuracy of data retrieval, and power and flexibility in handling of complex cases.
Resumo:
This study subdivides the Weddell Sea, Antarctica, into seafloor regions using multivariate statistical methods. These regions are categories used for comparing, contrasting and quantifying biogeochemical processes and biodiversity between ocean regions geographically but also regions under development within the scope of global change. The division obtained is characterized by the dominating components and interpreted in terms of ruling environmental conditions. The analysis uses 28 environmental variables for the sea surface, 25 variables for the seabed and 9 variables for the analysis between surface and bottom variables. The data were taken during the years 1983-2013. Some data were interpolated. The statistical errors of several interpolation methods (e.g. IDW, Indicator, Ordinary and Co-Kriging) with changing settings have been compared for the identification of the most reasonable method. The multivariate mathematical procedures used are regionalized classification via k means cluster analysis, canonical-correlation analysis and multidimensional scaling. Canonical-correlation analysis identifies the influencing factors in the different parts of the cove. Several methods for the identification of the optimum number of clusters have been tested. For the seabed 8 and 12 clusters were identified as reasonable numbers for clustering the Weddell Sea. For the sea surface the numbers 8 and 13 and for the top/bottom analysis 8 and 3 were identified, respectively. Additionally, the results of 20 clusters are presented for the three alternatives offering the first small scale environmental regionalization of the Weddell Sea. Especially the results of 12 clusters identify marine-influenced regions which can be clearly separated from those determined by the geological catchment area and the ones dominated by river discharge.
Resumo:
Since the emergence of the European Landscape Convention (ELC) in 2000, the important link between landscape and planning has greatly intensified. Now, more than ever, the fundamental role of the planning system in delivering the ELC’s requirements is recognised. This has been further substantiated within Ireland’s recently published National Landscape Strategy. However it has continually been suggested that decision-making processes need to adapt better to the holistic, valueladen and multidimensional approaches underpinning the ELC. In light of these milestones for the preservation, management and planning of landscape, this research sets out to establish synergies and disparities in the existing relationship between landscape and planning. It investigates detailed evidence of the presence and manifestations of landscape in key processes of day-to-day planning practice in Ireland, from individual planning appeals and ‘special’ cases, to the major strategic instruments that inform the making of landscape policies within development plans. This is set within wider theoretical and policy contexts where the compatibility of landscape and planning is subjected to critical scrutiny and then explored through these practical case studies. Driving this research is the intention to make a case for the planning domain to be an ideal ‘home’ for landscape – in all its deep, multidimensional meaning – and for enhancing landscape arguments and objectives in the face of conflict, competing values and power-plays in the real world. Emerging out of this research is a set of recommendations for how, at a national level, new approaches for decision making for and about landscape can be more effective and meaningful.