791 resultados para Rule-Based Classification
Resumo:
Softcatalà is a non-profit associationcreated more than 10 years ago to fightthe marginalisation of the Catalan languagein information and communicationtechnologies. It has led the localisationof many applications and thecreation of a website which allows itsusers to translate texts between Spanishand Catalan using an external closed-sourcetranslation engine. Recently,the closed-source translation back-endhas been replaced by a free/open-sourcesolution completely managed by Softcatalà: the Apertium machine translationplatform and the ScaleMT web serviceframework. Thanks to the opennessof the new solution, it is possibleto take advantage of the huge amount ofusers of the Softcatalà translation serviceto improve it, using a series ofmethods presented in this paper. In addition,a study of the translations requestedby the users has been carriedout, and it shows that the translationback-end change has not affected theusage patterns.
Resumo:
This paper presents an Italian to CatalanRBMT system automatically built bycombining the linguistic data of theexisting pairs Spanish-Catalan andSpanish-Italian. A lightweight manualpostprocessing is carried out in order tofix inconsistencies in the automaticallyderived dictionaries and to add very frequentwords that are missing accordingto a corpus analysis. The system isevaluated on the KDE4 corpus and outperformsGoogle Translate by approximatelyten absolute points in terms ofboth TER and GTM.
Resumo:
The General Assembly Line Balancing Problem with Setups (GALBPS) was recently defined in the literature. It adds sequence-dependent setup time considerations to the classical Simple Assembly Line Balancing Problem (SALBP) as follows: whenever a task is assigned next to another at the same workstation, a setup time must be added to compute the global workstation time, thereby providing the task sequence inside each workstation. This paper proposes over 50 priority-rule-based heuristic procedures to solve GALBPS, many of which are an improvement upon heuristic procedures published to date.
Resumo:
Existing digital rights management (DRM) systems, initiatives like Creative Commons or research works as some digital rights ontologies provide limited support for content value chains modelling and management. This is becoming a critical issue as content markets start to profit from the possibilities of digital networks and the World Wide Web. The objective is to support the whole copyrighted content value chain across enterprise or business niches boundaries. Our proposal provides a framework that accommodates copyright law and a rich creation model in order to cope with all the creation life cycle stages. The dynamic aspects of value chains are modelled using a hybrid approach that combines ontology-based and rule-based mechanisms. The ontology implementation is based on Web Ontology Language and Description Logic (OWL-DL) reasoners, are directly used for license checking. On the other hand, for more complex aspects of the dynamics of content value chains, rule languages are the choice.
Resumo:
Aim: Modelling species at the assemblage level is required to make effective forecast of global change impacts on diversity and ecosystem functioning. Community predictions may be achieved using macroecological properties of communities (MEM), or by stacking of individual species distribution models (S-SDMs). To obtain more realistic predictions of species assemblages, the SESAM framework suggests applying successive filters to the initial species source pool, by combining different modelling approaches and rules. Here we provide a first test of this framework in mountain grassland communities. Location: The western Swiss Alps. Methods: Two implementations of the SESAM framework were tested: a "Probability ranking" rule based on species richness predictions and rough probabilities from SDMs, and a "Trait range" rule that uses the predicted upper and lower bound of community-level distribution of three different functional traits (vegetative height, specific leaf area and seed mass) to constraint a pool of environmentally filtered species from binary SDMs predictions. Results: We showed that all independent constraints expectedly contributed to reduce species richness overprediction. Only the "Probability ranking" rule allowed slightly but significantly improving predictions of community composition. Main conclusion: We tested various ways to implement the SESAM framework by integrating macroecological constraints into S-SDM predictions, and report one that is able to improve compositional predictions. We discuss possible improvements, such as further improving the causality and precision of environmental predictors, using other assembly rules and testing other types of ecological or functional constraints.
Resumo:
Children with Wiskott-Aldrich syndrome (WAS) are often first diagnosed with immune thrombocytopenia (ITP), potentially leading to both inappropriate treatment and the delay of life-saving definitive therapy. WAS is traditionally differentiated from ITP based on the small size of WAS platelets. In practice, microthrombocytopenia is often not present or not appreciated in children with WAS. To develop an alternative method of differentiating WAS from ITP, we retrospectively reviewed all complete blood counts and measurements of immature platelet fraction (IPF) in 18 subjects with WAS and 38 subjects with a diagnosis of ITP treated at our hospital. Examination of peripheral blood smears revealed a wide range of platelet sizes in subjects with WAS. Mean platelet volume (MPV) was not reported in 26% of subjects, and subjects in whom MPV was not reported had lower platelet counts than did subjects in whom MPV was reported. Subjects with WAS had a lower IPF than would be expected for their level of thrombocytopenia, and the IPF in subjects with WAS was significantly lower than in subjects with a diagnosis of ITP. Using logistic regression, we developed and validated a rule based on platelet count and IPF that was more sensitive for the diagnosis of WAS than was the MPV, and was applicable regardless of the level of platelets or the availability of the MPV. Our observations demonstrate that MPV is often not available in severely thrombocytopenic subjects, which may hinder the diagnosis of WAS. In addition, subjects with WAS have a low IPF, which is consistent with the notion that a platelet production defect contributes to the thrombocytopenia of WAS. Knowledge of this detail of WAS pathophysiology allows to differentiate WAS from ITP with increased sensitivity, thereby allowing a physician to spare children with WAS from inappropriate treatment, and make definitive therapy available in a timely manner.
Resumo:
PURPOSE: We conducted a comprehensive review of the design, implementation, and outcome of first-in-human (FIH) trials of monoclonal antibodies (mAbs) to clearly determine early clinical development strategies for this class of compounds. METHODS: We performed a PubMed search using appropriate terms to identify reports of FIH trials of mAbs published in peer-reviewed journals between January 2000 and April 2013. RESULTS: A total of 82 publications describing FIH trials were selected for analysis. Only 27 articles (33%) reported the criteria used for selecting the starting dose (SD). Dose escalation was performed using rule-based methods in 66 trials (80%). The median number of planned dose levels was five (range, two to 13). The median of the ratio between the highest planned dose and the SD was 27 (range, two to 3,333). Although in 56 studies (68%) at least one grade 3 or 4 toxicity event was reported, no dose-limiting toxicity was observed in 47 trials (57%). The highest planned dose was reached in all trials, but the maximum-tolerated dose (MTD) was defined in only 13 studies (16%). The median of the ratio between MTD and SD was eight (range, four to 1,000). The recommended phase II dose was indicated in 34 studies (41%), but in 25 (73%) of these trials, this dose was chosen without considering toxicity as the main selection criterion. CONCLUSION: This literature review highlights the broad design heterogeneity of FIH trials testing mAbs. Because of the limited observed toxicity, the MTD was infrequently reached, and therefore, the recommended phase II dose for subsequent clinical trials was only tentatively defined.
Resumo:
Päivittäistavarakaupassa energiankulutus on kohtuullisen suurta. Etenkin kylmälaitteet, ilmanvaihto ja valaistus kuluttavat paljon sähköä. Kaupan alalla on viime vuosina tehty paljon energiansäästötoimenpiteitä, joiden ansiosta myymälöiden energiatehokkuutta on saatu merkittävästi parannettua. Yksi tärkeimmistä toimenpiteistä on lämmön talteenotto, jolla lämmönkulutusta on saatu pienennettyä. Tässä opinnäytetyössä on selvitetty kahdenkymmenen ympäri Suomea sijaitsevan Prisma-hypermarketin energiatehokkuus. Sähkön- ja lämmön sekä veden kulutusta on arvioitu suhteessa rakennusvuoteen, pinta-alaan, rakennuksen tilavuuteen, lämmitystarvelukuun, kylmätehoon sekä myyntiin. Työssä on hyödynnetty internet-pohjaista Promise-luokitustyökalua.
Resumo:
UNLABELLED: Glioblastoma (GBM) is the most aggressive human brain tumor. Although several molecular subtypes of GBM are recognized, a robust molecular prognostic marker has yet to be identified. Here, we report that the stemness regulator Sox2 is a new, clinically important target of microRNA-21 (miR-21) in GBM, with implications for prognosis. Using the MiR-21-Sox2 regulatory axis, approximately half of all GBM tumors present in the Cancer Genome Atlas (TCGA) and in-house patient databases can be mathematically classified into high miR-21/low Sox2 (Class A) or low miR-21/high Sox2 (Class B) subtypes. This classification reflects phenotypically and molecularly distinct characteristics and is not captured by existing classifications. Supporting the distinct nature of the subtypes, gene set enrichment analysis of the TCGA dataset predicted that Class A and Class B tumors were significantly involved in immune/inflammatory response and in chromosome organization and nervous system development, respectively. Patients with Class B tumors had longer overall survival than those with Class A tumors. Analysis of both databases indicated that the Class A/Class B classification is a better predictor of patient survival than currently used parameters. Further, manipulation of MiR-21-Sox2 levels in orthotopic mouse models supported the longer survival of the Class B subtype. The MiR-21-Sox2 association was also found in mouse neural stem cells and in the mouse brain at different developmental stages, suggesting a role in normal development. Therefore, this mechanism-based classification suggests the presence of two distinct populations of GBM patients with distinguishable phenotypic characteristics and clinical outcomes. SIGNIFICANCE STATEMENT: Molecular profiling-based classification of glioblastoma (GBM) into four subtypes has substantially increased our understanding of the biology of the disease and has pointed to the heterogeneous nature of GBM. However, this classification is not mechanism based and its prognostic value is limited. Here, we identify a new mechanism in GBM (the miR-21-Sox2 axis) that can classify ∼50% of patients into two subtypes with distinct molecular, radiological, and pathological characteristics. Importantly, this classification can predict patient survival better than the currently used parameters. Further, analysis of the miR-21-Sox2 relationship in mouse neural stem cells and in the mouse brain at different developmental stages indicates that miR-21 and Sox2 are predominantly expressed in mutually exclusive patterns, suggesting a role in normal neural development.
Resumo:
This paper describes an approach for the colour-based classification of RGB (red-green-blue) images, acquired using a common scanner, of commercial carbonated soft drinks. Mean histograms of image colour channels were evaluated for the PCA classification of 29 brands of Guaraná, Cola, and orange flavors. Loadings for principal component axes resulted in different patterns for sample grouping on score plots according to RGB histograms. pH, sorbic acid and sucrose measurements were also correlated to the analyzed brands through PCA score plots of the digitalized images.
Resumo:
This dissertation considers the segmental durations of speech from the viewpoint of speech technology, especially speech synthesis. The idea is that better models of segmental durations lead to higher naturalness and better intelligibility. These features are the key factors for better usability and generality of synthesized speech technology. Even though the studies are based on a Finnish corpus the approaches apply to all other languages as well. This is possibly due to the fact that most of the studies included in this dissertation are about universal effects taking place on utterance boundaries. Also the methods invented and used here are suitable for any other study of another language. This study is based on two corpora of news reading speech and sentences read aloud. The other corpus is read aloud by a 39-year-old male, whilst the other consists of several speakers in various situations. The use of two corpora is twofold: it involves a comparison of the corpora and a broader view on the matters of interest. The dissertation begins with an overview to the phonemes and the quantity system in the Finnish language. Especially, we are covering the intrinsic durations of phonemes and phoneme categories, as well as the difference of duration between short and long phonemes. The phoneme categories are presented to facilitate the problem of variability of speech segments. In this dissertation we cover the boundary-adjacent effects on segmental durations. In initial positions of utterances we find that there seems to be initial shortening in Finnish, but the result depends on the level of detail and on the individual phoneme. On the phoneme level we find that the shortening or lengthening only affects the very first ones at the beginning of an utterance. However, on average, the effect seems to shorten the whole first word on the word level. We establish the effect of final lengthening in Finnish. The effect in Finnish has been an open question for a long time, whilst Finnish has been the last missing piece for it to be a universal phenomenon. Final lengthening is studied from various angles and it is also shown that it is not a mere effect of prominence or an effect of speech corpus with high inter- and intra-speaker variation. The effect of final lengthening seems to extend from the final to the penultimate word. On a phoneme level it reaches a much wider area than the initial effect. We also present a normalization method suitable for corpus studies on segmental durations. The method uses an utterance-level normalization approach to capture the pattern of segmental durations within each utterance. This prevents the impact of various problematic variations within the corpora. The normalization is used in a study on final lengthening to show that the results on the effect are not caused by variation in the material. The dissertation shows an implementation and prowess of speech synthesis on a mobile platform. We find that the rule-based method of speech synthesis is a real-time software solution, but the signal generation process slows down the system beyond real time. Future aspects of speech synthesis on limited platforms are discussed. The dissertation considers ethical issues on the development of speech technology. The main focus is on the development of speech synthesis with high naturalness, but the problems and solutions are applicable to any other speech technology approaches.
Resumo:
Endometriosis is a common hormone-dependent gynecological disease leading to severe menstrual and/or chronic pelvic pain with or without subfertility. The disease is defined by the presence of endometrium-like tissue outside the uterine cavity, primarily on the pelvic peritoneum, ovaries and infiltrating organs of the peritoneal cavity. The current tools for diagnosis and treatment of endometriosis need to be improved to ensure reliable diagnosis and effective treatment. In addition, endometriosis is associated with increased risk of ovarian cancer and, therefore, the differential diagnosis between the benign and malignant ovarian cysts is of importance. The long-term objective of the present study was to support the discovery of novel tools for diagnosis and treatment of endometriosis. This was approached by exploiting genome-wide expression analysis of endometriosis specimens. A novel expression profiling -based classification of endometriosis indicated specific subgroups of lesions partially consistent with the clinical appearance, but partially according to unknown factors. The peritoneum of women with endometriosis appeared to be altered in comparison to that of healthy control subjects, suggesting a novel aspect on the pathogenesis of the disease. The evaluation of action and metabolism of sex hormones in endometrium and endometriosis tissue indicated a novel role of androgens in regulation of the tissues. In addition, an enzyme involved in androgen and neurosteroid metabolism, hydroxysteroid (17beta) dehydrogenase 6, was found to be highly up-regulated in endometriosis tissue as compared to healthy endometrium. The enzyme may have a role in the pathogenesis of endometriosis or in the endometriosis associated pain generation. Finally, a new diagnostic biomarker, HE4, was discovered distinguishing patients with ovarian endometriotic cysts from those with malignant ovarian cancer. The information acquired in this study enables deeper understanding of endometriosis and facilitates the development of improved diagnostic tools and more specific treatments of the disease
Resumo:
Physical activity (PA) is an important field of healthcare research internationally and within Finland. As technology devices and services penetrate deeper levels within society, the need for studying the usefulness for PA turns vital. We started this research work by reviewing literature consisting of two hundred research journals, all of which have found technology to significantly improve an individual’s ability to get motivation and achieve officially recommended levels of physical activity, like the 10000 steps a day, being tracked with the help of pedometers. Physical activity recommendations require sustained encouragement, consistent performance in order to achieve the long term benefits. We surveyed within the city of Turku, how the motivation levels and thirty three other criterions encompassing technology awareness, adoption and usage attitudes are impacted. Our aim was to know the factors responsible for achieving consistent growth in activity levels within the individuals and focus groups, as well as to determine the causes of failures and for collecting user experience feedback. The survey results were quite interesting and contain impeccable information for this field. While the focus groups confirmed the theory established by past studies within our literature review, it also establishes our research propositions that ict tools and services have provided and can further add higher benefits and value to individuals in tracking and maintain their activity levels consistently for longer time durations. This thesis includes two new models which dictate technology and physical activity adoption patterns based on four easy to evaluate criterions, thereby helping the healthcare providers to recommend improvements and address issues with an easy rule based approach. This research work provides vital clues on technology based healthcare objectives and achievement of standard PA recommendations by people within Turku and nearby regions.
Resumo:
The purpose of this doctoral thesis is to widen and develop our theoretical frameworks for discussion and analyses of feedback practices in management accounting, particularly shedding light on its formal and informal aspects. The concept of feedback in management accounting has conventionally been analyzed within cybernetic control theory, in which feedback flows as a diagnostic or comparative loop between measurable outputs and pre-set goals (see e.g. Flamholtz et al. 1985; Flamholtz 1996, 1983), i.e. as a formal feedback loop. However, the everyday feedback practices in organizations are combinations of formal and informal elements. In addition to technique-driven feedback approaches (like budgets, measurement, and reward systems) we could also categorize social feedback practices that managers see relevant and effective in the pursuit of organizational control. While cybernetics or control theories successfully capture rational and measured aspects of organizational performance and offer a broad organizational context for the analysis, many individual and informal aspects remain vague and isolated. In order to discuss and make sense of the heterogeneous field of interpretations of formal and informal feedback, both in theory and practice, dichotomous approaches seem to be insufficient. Therefore, I suggest an analytical framework of formal and informal feedback with three dimensions (3D’s): source, time, and rule. Based on an abductive analysis of the theoretical and empirical findings from an interpretive case study around a business unit called Division Steelco, the 3Dframework and formal and informal feedback practices are further elaborated vis-á-vis the four thematic layers in the organizational control model by Flamholtz et al. (1985; Flamholtz 1996, 1983): core control system, organizational structure, organizational culture, and external environment. Various personal and cultural meanings given to the formal and informal feedback practices (“feedback as something”) create multidimensional interpretative contexts. Multidimensional frameworks aim to capture and better understand both the variety of interpretations and their implications to the functionality of feedback practices, important in interpretive research.
Resumo:
Human activity recognition in everyday environments is a critical, but challenging task in Ambient Intelligence applications to achieve proper Ambient Assisted Living, and key challenges still remain to be dealt with to realize robust methods. One of the major limitations of the Ambient Intelligence systems today is the lack of semantic models of those activities on the environment, so that the system can recognize the speci c activity being performed by the user(s) and act accordingly. In this context, this thesis addresses the general problem of knowledge representation in Smart Spaces. The main objective is to develop knowledge-based models, equipped with semantics to learn, infer and monitor human behaviours in Smart Spaces. Moreover, it is easy to recognize that some aspects of this problem have a high degree of uncertainty, and therefore, the developed models must be equipped with mechanisms to manage this type of information. A fuzzy ontology and a semantic hybrid system are presented to allow modelling and recognition of a set of complex real-life scenarios where vagueness and uncertainty are inherent to the human nature of the users that perform it. The handling of uncertain, incomplete and vague data (i.e., missing sensor readings and activity execution variations, since human behaviour is non-deterministic) is approached for the rst time through a fuzzy ontology validated on real-time settings within a hybrid data-driven and knowledgebased architecture. The semantics of activities, sub-activities and real-time object interaction are taken into consideration. The proposed framework consists of two main modules: the low-level sub-activity recognizer and the high-level activity recognizer. The rst module detects sub-activities (i.e., actions or basic activities) that take input data directly from a depth sensor (Kinect). The main contribution of this thesis tackles the second component of the hybrid system, which lays on top of the previous one, in a superior level of abstraction, and acquires the input data from the rst module's output, and executes ontological inference to provide users, activities and their in uence in the environment, with semantics. This component is thus knowledge-based, and a fuzzy ontology was designed to model the high-level activities. Since activity recognition requires context-awareness and the ability to discriminate among activities in di erent environments, the semantic framework allows for modelling common-sense knowledge in the form of a rule-based system that supports expressions close to natural language in the form of fuzzy linguistic labels. The framework advantages have been evaluated with a challenging and new public dataset, CAD-120, achieving an accuracy of 90.1% and 91.1% respectively for low and high-level activities. This entails an improvement over both, entirely data-driven approaches, and merely ontology-based approaches. As an added value, for the system to be su ciently simple and exible to be managed by non-expert users, and thus, facilitate the transfer of research to industry, a development framework composed by a programming toolbox, a hybrid crisp and fuzzy architecture, and graphical models to represent and con gure human behaviour in Smart Spaces, were developed in order to provide the framework with more usability in the nal application. As a result, human behaviour recognition can help assisting people with special needs such as in healthcare, independent elderly living, in remote rehabilitation monitoring, industrial process guideline control, and many other cases. This thesis shows use cases in these areas.