900 results for Hierarchy of text classifiers
Abstract:
A hierarchy of enzyme-catalyzed positive feedback loops is examined by mathematical and numerical analysis. Four systems are described, from the simplest, in which an enzyme catalyzes its own formation from an inactive precursor, to the most complex, in which two sequential feedback loops act in a cascade. In the latter we also examine the function of a long-range feedback, in which the final enzyme produced in the second loop activates the initial step in the first loop. When the enzymes generated are subject to inhibition or inactivation, all four systems exhibit threshold properties akin to excitable systems like neuron firing. For those that are amenable to mathematical analysis, expressions are derived that relate the excitation threshold to the kinetics of enzyme generation and inhibition and the initial conditions. For the most complex system, it was expedient to employ numerical simulation to demonstrate threshold behavior, and in this case long-range feedback was seen to have two distinct effects. At sufficiently high catalytic rates, this feedback is capable of exciting an otherwise subthreshold system. At lower catalytic rates, where the long-range feedback does not significantly affect the threshold, it nonetheless has a major effect in potentiating the response above the threshold. In particular, oscillatory behavior observed in simulations of sequential feedback loops is abolished when a long-range feedback is present.
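As a rough illustration of the threshold behavior described in the abstract above, the simplest of the four systems (an enzyme catalyzing its own formation from an inactive precursor, with first-order inactivation) can be sketched numerically. The rate law, the parameter names `k` and `ki`, and the condition `k * z0 > ki` are assumptions made for this sketch; they are not expressions taken from the paper.

```python
def is_excitable(z0, k, ki):
    """Assumed threshold condition: the enzyme initially grows only if
    autocatalytic production (k * z0) outpaces inactivation (ki)."""
    return k * z0 > ki


def simulate_peak(z0, e0, k, ki, dt=0.001, t_end=50.0):
    """Forward-Euler integration of the assumed model
        dZ/dt = -k * Z * E
        dE/dt =  k * Z * E - ki * E
    where Z is the inactive precursor and E the active enzyme.
    Returns the highest enzyme level reached during the run."""
    z, e = z0, e0
    peak = e
    for _ in range(int(t_end / dt)):
        rate = k * z * e  # autocatalytic conversion of precursor to enzyme
        z, e = z - rate * dt, e + (rate - ki * e) * dt
        peak = max(peak, e)
    return peak


if __name__ == "__main__":
    for z0 in (0.5, 2.0):
        print(z0, is_excitable(z0, k=1.0, ki=1.0),
              simulate_peak(z0, e0=0.01, k=1.0, ki=1.0))
```

Under these assumed parameters, an initial precursor level above `ki / k` lets the enzyme rise to a peak before decaying, while a level below it gives only monotonic decay from the starting value.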
Abstract:
Calcium-dependent homotypic cell-cell adhesion, mediated by molecules such as E-cadherin, guides the establishment of classical epithelial cell polarity and contributes to the control of migration, growth, and differentiation. These actions involve additional proteins, including alpha- and beta-catenin (or plakoglobin) and p120, as well as linkage to the cortical actin cytoskeleton. The molecular basis for these interactions and their hierarchy of interaction remain controversial. We demonstrate a direct interaction between F-actin and alpha (E)-catenin, an activity not shared by either the cytoplasmic domain of E-cadherin or beta-catenin. Sedimentation assays and direct visualization by transmission electron microscopy reveal that alpha 1(E)-catenin binds and bundles F-actin in vitro with micromolar affinity at a catenin/G-actin monomer ratio of approximately 1:7 (mol/mol). Recombinant human beta-catenin can simultaneously bind to the alpha-catenin/actin complex but does not bind actin directly. Recombinant fragments encompassing the amino-terminal 228 residues of alpha 1(E)-catenin or the carboxyl-terminal 447 residues individually bind actin in cosedimentation assays with reduced affinity compared with the full-length protein, and neither fragment bundles actin. Except for similarities to vinculin, neither region contains sequences homologous to established actin-binding proteins. Collectively these data indicate that alpha 1 (E)-catenin is a novel actin-binding and -bundling protein and support a model in which alpha 1(E)-catenin is responsible for organizing and tethering actin filaments at the zones of E-cadherin-mediated cell-cell contact.
Abstract:
There is a strong relationship between orthographic-motor integration related to handwriting and students' ability to produce creative and well-structured written text. This relationship is thought to be due to the cognitive load which results when attention is required by writers to write letters and words on the page. Lack of automaticity in orthographic-motor integration means that writers do not have sufficient cognitive resources to accomplish the more demanding aspects of text production such as ideation, text monitoring, and pragmatic awareness. A systematic handwriting program can significantly improve the quality of written text by young children experiencing problems with orthographic-motor integration. This study investigated the effectiveness of a handwriting program in remediating older students' problems in orthographic-motor integration and consequently enhancing their written language skills. Two groups of students in Grades 8 and 9 were provided with either practice in handwriting or daily completion of a written journal. There were no differences between the two groups at pre-test. However, at post-test, the handwriting group had significantly higher scores in orthographic-motor integration as well as for the length and quality of the text they wrote.
Abstract:
This thesis sets out to investigate the role of cohesion in the organisation and processing of three text types in English and Arabic. In other words, it attempts to shed some light on the descriptive and explanatory power of cohesion in different text typologies. To this effect, three text types, namely, literary fictional narrative, newspaper editorial and science were analysed to ascertain the intra- and inter-sentential trends in textual cohesion characteristic of each text type in each language. In addition, two small scale experiments which aimed at exploring the facilitatory effect of one cohesive device (i.e. lexical repetition) on the comprehension of three English text types by Arab learners were carried out. The first experiment examined this effect in an English science text; the second covered three English text types, i.e. fictional narrative, culturally-oriented and science. Some interesting and significant results have emerged from the textual analysis and the pilot studies. Most importantly, each text type tends to utilize the cohesive trends that are compatible with its readership, reader knowledge, reading style and pedagogical purpose. Whereas fictional narratives largely cohere through pronominal co-reference, editorials and science texts derive much cohesion from lexical repetition. As for cross-language differences English opts for economy in the use of cohesive devices, while Arabic largely coheres through the redundant effect created by the high frequency of most of those devices. Thus, cohesion is proved to be a variable rather than a homogeneous phenomenon which is dictated by text type among other factors. The results of the experiments suggest that lexical repetition does facilitate the comprehension of English texts by Arab learners. Fictional narratives are found to be easier to process and understand than expository texts. Consequently, cohesion can assist in the processing of text as it can in its creation.
Abstract:
The present work studies the overall structuring of radio news discourse by investigating three metatextual/interactive functions: (1) Discourse Organizing Elements (DOEs), (2) Attribution and (3) Sentential and Nominal Background Information (SBI & NBI). An extended corpus of about 73,000 words from BBC and Radio Damascus news is used to study DOEs and a restricted corpus of 38,000 words for Attribution and SBI & NBI. A situational approach is adopted to assess the influence of factors such as medium and audience on these functions and their frequency. It is found that: (1) DOEs are organizational and their frequency is determined by length of text; (2) Attribution functions in accordance with the editor's strategy and its frequency is audience sensitive; and (3) BI provides background information and is determined by audience and news topics. Secondly, the salient grammatical elements in DOEs are discourse deictic demonstratives, address pronouns and nouns referring to 'the news'. Attribution is realized in reporting/reported clauses, and BI in a sentence, a clause or a nominal group. Thirdly, DOEs establish a hierarchy of (1) news, (2) summary/expansion and (3) item: including topic introduction and details. While Attribution is generally, and SBI solely, a function of detailing, NBI and proper names are generally a function of summary and topic introduction. Being primarily addressed to audience and referring metatextually, the functions investigated support Sinclair's interactive and autonomous planes of discourse. They also shed light on the part(s) of the linguistic system which realize the metatextual/interactive function. Strictly, 'discourse structure' inevitably involves a rank-scale; but news discourse also shows a convention of item 'listing'. Hence only within the boundary of variety (ultimately interpreted across language and in its situation) can textual functions and discourse structure be studied.
Finally, interlingual variety study provides invaluable insights into a level of translation that goes beyond matching grammatical systems or situational factors, an interpretive level which has to be described in linguistic analysis of translation data.
Abstract:
Working within the framework of the branch of Linguistics known as discourse analysis, and more specifically within the current approach of genre analysis, this thesis presents an analysis of the English of economic forecasting. The language of economic forecasting is highly specialised and follows certain conventions of structure and style. This research project identifies these characteristics and explains them in terms of their communicative function. The work is based on a corpus of texts published in economic reports and surveys by major corporate bodies. These documents are targeted at an international expert readership familiar with this genre. The data is analysed at two broad levels: firstly, the macro-level of text structure which is described in terms of schema-theory, a currently influential model of analysis, and, secondly, the micro-level of authors' strategies for modulating the predictions which form the key move in the forecasting schema. The thesis aims to contribute to the newly developing field of genre analysis in a number of ways: firstly, by a coverage of a hitherto neglected but intrinsically interesting and important genre (Economic Forecasting); secondly, by testing the applicability of existing models of analysis at the level of schematic structure and proposing a genre-specific model; thirdly, by offering insights into the nature of modulation of propositions which is often broadly classified as 'hedging' or 'modality', and which has been recently described as 'an area for prolonged fieldwork'. This phenomenon is shown to be a key feature of this particular genre. It is suggested that this thesis, in addition to its contribution to the theory of genre analysis, provides a useful basis for work by teachers of English for Economics, an important area of English for Specific Purposes.
Abstract:
Ontology construction for any domain is a labour intensive and complex process. Any methodology that can reduce the cost and increase efficiency has the potential to make a major impact in the life sciences. This paper describes an experiment in ontology construction from text for the Animal Behaviour domain. Our objective was to see how much could be done in a simple and rapid manner using a corpus of journal papers. We used a sequence of text processing steps, and describe the different choices made to clean the input, to derive a set of terms and to structure those terms in a hierarchy. We were able in a very short space of time to construct a 17000 term ontology with a high percentage of suitable terms. We describe some of the challenges, especially that of focusing the ontology appropriately given a starting point of a heterogeneous corpus.
Abstract:
This work is devoted to the development of a computer-aided system for semantic text analysis of technical specifications. Its purpose is to increase the efficiency of software engineering by automating the semantic analysis of the text of a technical specification. A model for analysing the text of a technical specification is proposed and investigated; an attribute grammar of the technical specification, intended to formalize a restricted subset of Russian, is constructed for analysing the sentences of the specification text; the stylistic features of the technical specification as a class of documents are considered; and recommendations on preparing the text of a technical specification for automated processing are formulated. The computer-aided system for semantic text analysis of a technical specification is described. It consists of the following subsystems: preliminary text processing, syntactic and semantic analysis and construction of software models, document storage, and the interface.
Abstract:
This work is devoted to the development of a computer-aided system for semantic text analysis of technical specifications. Its purpose is to increase the efficiency of software engineering by automating the semantic analysis of the text of a technical specification. A technique for analysing the text of a technical specification is proposed and investigated; an extended fuzzy attribute grammar of the technical specification, intended to formalize a restricted subset of the Russian language, is constructed for analysing the sentences of the specification text; the stylistic features of the technical specification as a class of documents are considered; and recommendations on preparing the text of a technical specification for automated processing are formulated. The computer-aided system for semantic text analysis of a technical specification is described. It consists of the following subsystems: preliminary text processing, syntactic and semantic analysis and construction of software models, document storage, and the interface.
Abstract:
This study investigated the effects of word prediction and text-to-speech on the narrative composition writing skills of six fifth-grade Hispanic boys with specific learning disabilities (SLD). A multiple baseline design across subjects was used to explore the efficacy of word prediction and text-to-speech alone and in combination on four dependent variables: writing fluency (words per minute), syntax (T-units), spelling accuracy, and overall organization (holistic scoring rubric). Data were collected and analyzed during baseline, assistive technology interventions, and at 2-, 4-, and 6-week maintenance probes. Participants were equally divided into Cohorts A and B, and two separate but related studies were conducted. Throughout all phases of the study, participants wrote narrative compositions for 15-minute sessions. During baseline, participants used word processing only. During the assistive technology intervention condition, Cohort A participants used word prediction followed by word prediction with text-to-speech. Concurrently, Cohort B participants used text-to-speech followed by text-to-speech with word prediction. The results of this study indicate that word prediction alone or in combination with text-to-speech has a positive effect on the narrative writing compositions of students with SLD. Overall, participants in Cohorts A and B wrote more words, more T-units, and spelled more words correctly. A sign test indicated that these perceived effects were not likely due to chance. Additionally, the quality of writing improved as measured by holistic rubric scores. When participants in Cohort B used text-to-speech alone, with the exception of spelling accuracy, inconsequential results were observed on all dependent variables. This study demonstrated that word prediction alone or in combination assists students with SLD to write longer, improved-quality, narrative compositions.
These results suggest that word prediction or word prediction with text-to-speech be considered as a writing support to facilitate the production of a first draft of a narrative composition. However, caution should be given to the use of text-to-speech alone as its effectiveness has not been established. Recommendations for future research include investigating the use of these technologies in other phases of the writing process, with other student populations, and with other writing styles. Further, these technologies should be investigated while integrated into classroom composition instruction.
Abstract:
Physiological signals, which are controlled by the autonomic nervous system (ANS), could be used to detect the affective state of computer users and therefore find applications in medicine and engineering. The Pupil Diameter (PD) seems to provide a strong indication of the affective state, as found by previous research, but it has not been investigated fully yet. In this study, new approaches based on monitoring and processing the PD signal for off-line and on-line affective assessment ("relaxation" vs. "stress") are proposed. Wavelet denoising and Kalman filtering methods are first used to remove abrupt changes in the raw PD signal. Then three features (PDmean, PDmax and PDWalsh) are extracted from the preprocessed PD signal for the affective state classification. In order to select more relevant and reliable physiological data for further analysis, two types of data selection methods are applied, which are based on the paired t-test and subject self-evaluation, respectively. In addition, five different kinds of classifiers are implemented on the selected data, which achieve average accuracies up to 86.43% and 87.20%, respectively. Finally, the receiver operating characteristic (ROC) curve is utilized to investigate the discriminating potential of each individual feature by evaluation of the area under the ROC curve, which reaches values above 0.90. For the on-line affective assessment, a hard threshold is implemented first in order to remove the eye blinks from the PD signal and then a moving average window is utilized to obtain the representative value PDr for every one-second time interval of PD. There are three main steps for the on-line affective assessment algorithm, which are preparation, feature-based decision voting and affective determination.
The final results show that the accuracies are 72.30% and 73.55% for the data subsets, which were respectively chosen using two types of data selection methods (paired t-test and subject self-evaluation). In order to further analyze the efficiency of affective recognition through the PD signal, the Galvanic Skin Response (GSR) was also monitored and processed. The highest affective assessment classification rate obtained from GSR processing is only 63.57% (based on the off-line processing algorithm). The overall results confirm that the PD signal should be considered as one of the most powerful physiological signals to involve in future automated real-time affective recognition systems, especially for detecting the "relaxation" vs. "stress" states.
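The on-line preprocessing step described in the abstract above (a hard threshold to discard eye blinks, then an average over each one-second interval to obtain PDr) might look roughly like the following sketch. The threshold value `min_valid`, the sampling rate `rate_hz`, the use of non-overlapping one-second windows, and both function names are assumptions made for illustration; the abstract does not specify them.

```python
def remove_blinks(samples, min_valid=1.0):
    """Drop samples at or below a hard threshold. This assumes blinks show
    up as zero or near-zero pupil-diameter readings (hypothetical cutoff)."""
    return [s for s in samples if s > min_valid]


def representative_values(samples, rate_hz, min_valid=1.0):
    """Return one averaged value per one-second block of samples.

    Each block of rate_hz samples is cleaned of blink readings and then
    averaged. A block made up entirely of blink readings yields None."""
    values = []
    for start in range(0, len(samples), rate_hz):
        window = remove_blinks(samples[start:start + rate_hz], min_valid)
        values.append(sum(window) / len(window) if window else None)
    return values


if __name__ == "__main__":
    # Two seconds of made-up readings at 4 Hz, with one blink sample (0.0).
    signal = [4.0, 4.0, 0.0, 4.0, 5.0, 5.0, 5.0, 5.0]
    print(representative_values(signal, rate_hz=4))
```

Filtering inside each window, rather than over the whole signal first, keeps every output value tied to its own one-second interval even when some samples are dropped.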
Abstract:
Online international introduction sites that offer romance tours to American men in search of a foreign bride are an important and rapidly growing component of the internet dating industry; the number of these agencies in the U.S. tripled from two hundred to six hundred in the past 10 years. Previous scholars have examined the so-called 'mail order bride' industry in order to demonstrate that the women involved are agents and not victims. Many scholars have also highlighted the importance of race in shaping American men's desires in one particular region or country. My dissertation provides an important addition to the literature surrounding romance tourism by including participants from all three major regions associated with romance tourism: Eastern Europe, South America, and Southeast Asia. I collected the data for the dissertation by becoming a participant observer of a romance tour in Ukraine, Colombia, and the Philippines. I argue that romance tourism is an important example of the global intimate, and of the ways in which globalized processes are created and sustained through everyday intimate emotions and interactions. By examining the ways in which the emotions of desire, disgust, and anxiety influence individual romance tour participants' constructions of racialized hierarchies, the links between individual emotions and global systems are revealed. The concept of the global intimate challenges the hierarchy of scale that places the body, the home, and the intimate on a much lower level than the scale of the global or the national, and at the same time challenges the binary that divides the individual from the global. Through highlighting the different emotional negotiations that are constantly occurring in the romance tour industry, I highlight the important ways in which individual emotions and affects influence global processes on a large scale and vice versa.
Abstract:
In the last decade, large numbers of social media services have emerged and been widely used in people's daily life as important information sharing and acquisition tools. With a substantial amount of user-contributed text data on social media, it becomes a necessity to develop methods and tools for text analysis for this emerging data, in order to better utilize it to deliver meaningful information to users. Previous work on text analytics in the last several decades has mainly focused on traditional types of text such as emails, news and academic literature, and several issues critical to text data on social media have not been well explored: 1) how to detect sentiment from text on social media; 2) how to make use of social media's real-time nature; 3) how to address information overload for flexible information needs. In this dissertation, we focus on these three problems. First, to detect sentiment of text on social media, we propose a non-negative matrix tri-factorization (tri-NMF) based dual active supervision method to minimize human labeling efforts for the new type of data. Second, to make use of social media's real-time nature, we propose approaches to detect events from text streams on social media. Third, to address information overload for flexible information needs, we propose two summarization frameworks: a dominating-set-based summarization framework and a learning-to-rank-based summarization framework. The dominating-set-based summarization framework can be applied to different types of summarization problems, while the learning-to-rank-based summarization framework helps utilize the existing training data to guide the new summarization tasks. In addition, we integrate these techniques in an application study of event summarization for sports games as an example of how to better utilize social media data.
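To give a feel for the dominating-set idea mentioned in the abstract above, here is a minimal sketch that treats sentences as graph nodes, links sentences that are similar, and greedily picks a small set of sentences that "covers" all the others. The Jaccard word-overlap similarity, the `threshold` value, and the function names are assumptions chosen for illustration. This is the classic greedy approximation for a dominating set, not necessarily the framework developed in the dissertation.

```python
def jaccard(a, b):
    """Word-overlap similarity between two token sets (assumed measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def greedy_dominating_summary(sentences, threshold=0.3):
    """Pick sentences so that every sentence is either chosen or is
    similar (>= threshold) to a chosen one. Greedy approximation."""
    tokens = [set(s.lower().split()) for s in sentences]
    n = len(sentences)
    # neighbors[i] holds i itself plus every sentence similar enough to i.
    neighbors = [
        {j for j in range(n)
         if j == i or jaccard(tokens[i], tokens[j]) >= threshold}
        for i in range(n)
    ]
    uncovered = set(range(n))
    chosen = []
    while uncovered:
        # Take the sentence covering the most uncovered ones (ties: lowest index).
        best = max(range(n), key=lambda i: (len(neighbors[i] & uncovered), -i))
        chosen.append(best)
        uncovered -= neighbors[best]
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    posts = [
        "the home team scored a goal",
        "the home team scored again",
        "rain delayed the match",
        "heavy rain delayed the match",
    ]
    print(greedy_dominating_summary(posts))
```

Every sentence counts as its own neighbor, so each round covers at least one new sentence and the loop always terminates.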
Abstract:
Research and discourse on Eating Disorders (ED) have a tendency to perpetuate certain stereotypes regarding women of color, however unintentional or innocuous this may appear. The research conducted does show that the prominent ED within communities of minorities or of lower social status is Binge Eating Disorder (BED). A Foucauldian discourse analysis of online forums by young women with ED, such as myproana.com, would be the primary method of documenting how these women form a hierarchy of the community's EDs, and of analyzing whether it correlates with the social status primarily associated with each Eating Disorder. Within these forums and ED-based communities, a clear hierarchy is formed, with Anorexia Nervosa at the top and BED at the bottom. From the minimal research available on women of color and ED, it is clear that, for Black women at least, BED is the most common, and thus it is my goal to explore any possible correlation. Because so little exists that focuses on EDs in women of color, this analysis may help to promote more research on the race and class dynamics of Eating Disorders, as well as on how those factors affect how women view their own bodies.