922 results for Supervised and Unsupervised Classification
Abstract:
Fossil associations from the middle and upper Eocene (Bartonian and Priabonian) sedimentary succession of the Pamplona Basin are described. This succession accumulated in the western part of the South Pyrenean peripheral foreland basin and extends from deep-marine turbiditic (Ezkaba Sandstone Formation) to deltaic (Pamplona Marl, Ardanatz Sandstone and Ilundain Marl formations) and marginal marine deposits (Gendulain Formation). The micropalaeontological content is high. It is dominated by foraminifera, and common ostracods and other microfossils are also present. The fossil ichnoassemblages include at least 23 ichnogenera and 28 ichnospecies indicative of Nereites, Cruziana, Glossifungites and ?Scoyenia-Mermia ichnofacies. Body macrofossils of 78 taxa corresponding to macroforaminifera, sponges, corals, bryozoans, brachiopods, annelids, molluscs, arthropods, echinoderms and vertebrates have been identified. Both the number of ichnotaxa and of species (e.g. bryozoans, molluscs and chondrichthyans) may be considerably higher. Body fossil assemblages are comparable to those from the Eocene of the North Pyrenean area (Basque Coast), and also to those from the Eocene of the west-central and eastern parts of the South Pyrenean area (Aragon and Catalonia). At the European scale, the mollusc assemblages seem endemic to the Pyrenean area, although several Tethyan (Italy and Alps) and Northern elements (Paris basin and Normandy) have been recorded. Palaeontological data from the studied sedimentary units fit well with the shallowing process that occurred in the area throughout the middle and late Eocene, according to the sedimentological and stratigraphical data.
Abstract:
Recommendation systems aim to help users make decisions more efficiently. The most widely used method in recommendation systems is collaborative filtering, a critical step of which is to analyze a user's preferences and recommend products or services based on similarity analysis with other users' ratings. However, collaborative filtering is less usable when facing the "cold start" problem, i.e. when few comments have been given to products or services. To tackle this problem, we propose an improved method that combines collaborative filtering and data classification. We use hotel recommendation data to test the proposed method. The accuracy of the recommendation is determined by the rankings. The accuracies of Top-3 and Top-10 recommendation lists are evaluated using 10-fold cross-validation and ROC curves. The results show that, under the cold start condition, the Top-3 hotel recommendation list produced by the combined method outperforms the Top-10 list in most cases.
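The similarity step of collaborative filtering described above can be illustrated with a minimal sketch. This is plain user-based filtering with cosine similarity, not the combined method proposed in the paper; the ratings, user names and hotel identifiers are hypothetical.

```python
from math import sqrt

# Hypothetical toy ratings (user -> {hotel: rating}); not data from the paper.
ratings = {
    "ana":   {"h1": 5, "h2": 3, "h3": 4},
    "bruno": {"h1": 4, "h2": 2, "h3": 5, "h4": 4},
    "carla": {"h1": 1, "h2": 5, "h4": 2},
}

def cosine(u, v):
    """Cosine similarity computed over the items both users rated."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = sqrt(sum(u[i] ** 2 for i in common))
    norm_v = sqrt(sum(v[i] ** 2 for i in common))
    return dot / (norm_u * norm_v)

def predict(user, item):
    """Similarity-weighted mean of other users' ratings for `item`."""
    num = den = 0.0
    for other, other_ratings in ratings.items():
        if other == user or item not in other_ratings:
            continue
        sim = cosine(ratings[user], other_ratings)
        num += sim * other_ratings[item]
        den += abs(sim)
    # With no overlapping ratings there is nothing to weight by.
    return num / den if den else None

def top_n(user, n=3):
    """Rank the items the user has not rated by predicted rating."""
    unseen = {i for r in ratings.values() for i in r} - set(ratings[user])
    scored = [(predict(user, i), i) for i in unseen]
    scored = [(s, i) for s, i in scored if s is not None]
    return [i for _, i in sorted(scored, reverse=True)[:n]]
```

A user with no ratings at all has zero similarity to everyone, so `predict` returns `None` for every item; this is the cold-start situation in which pure collaborative filtering has nothing to work with.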
Abstract:
Background Physical activity in children with intellectual disabilities is a neglected area of study, which is most apparent in relation to physical activity measurement research. Although objective measures, specifically accelerometers, are widely used in research involving children with intellectual disabilities, existing research is based on measurement methods and data interpretation techniques generalised from typically developing children. However, due to physiological and biomechanical differences between these populations, questions have been raised in the existing literature on the validity of generalising data interpretation techniques from typically developing children to children with intellectual disabilities. Therefore, there is a need to conduct population-specific measurement research for children with intellectual disabilities and develop valid methods to interpret accelerometer data, which will increase our understanding of physical activity in this population. Methods Study 1: A systematic review was initially conducted to increase the knowledge base on how accelerometers were used within existing physical activity research involving children with intellectual disabilities and to identify important areas for future research. A systematic search strategy was used to identify relevant articles which used accelerometry-based monitors to quantify activity levels in ambulatory children with intellectual disabilities. Based on best practice guidelines, a novel form was developed to extract data based on 17 research components of accelerometer use. Accelerometer use in relation to best practice guidelines was calculated using percentage scores on a study-by-study and component-by-component basis. Study 2: To investigate the effect of data interpretation methods on the estimation of physical activity intensity in children with intellectual disabilities, a secondary data analysis was conducted. 
Nine existing sets of child-specific ActiGraph intensity cut points were applied to accelerometer data collected from 10 children with intellectual disabilities during an activity session. Four one-way repeated measures ANOVAs were used to examine differences in estimated time spent in sedentary, moderate, vigorous, and moderate to vigorous intensity activity. Post-hoc pairwise comparisons with Bonferroni adjustments were additionally used to identify where significant differences occurred. Study 3: The feasibility of a laboratory-based calibration protocol developed for typically developing children was investigated in children with intellectual disabilities. Specifically, the feasibility of activities, measurements, and recruitment was investigated. Five children with intellectual disabilities and five typically developing children participated in 14 treadmill-based and free-living activities. In addition, resting energy expenditure was measured and a treadmill-based graded exercise test was used to assess cardiorespiratory fitness. Breath-by-breath respiratory gas exchange and accelerometry were continually measured during all activities. Feasibility was assessed using observations, activity completion rates, and respiratory data. Study 4: Thirty-six children with intellectual disabilities participated in a semi-structured school-based physical activity session to calibrate accelerometry for the estimation of physical activity intensity. Participants wore a hip-mounted ActiGraph wGT3X+ accelerometer, with direct observation (SOFIT) used as the criterion measure. Receiver operating characteristic curve analyses were conducted to determine the optimal accelerometer cut points for sedentary, moderate, and vigorous intensity physical activity. 
Study 5: To cross-validate the calibrated cut points and compare classification accuracy with existing cut points developed in typically developing children, a sub-sample of 14 children with intellectual disabilities who participated in the school-based sessions, as described in Study 4, were included in this study. To examine the validity, classification agreement was investigated between the criterion measure of SOFIT and each set of cut points using sensitivity, specificity, total agreement, and Cohen’s kappa scores. Results Study 1: Ten full text articles were included in this review. The percentage of review criteria met ranged from 12%−47%. Various methods of accelerometer use were reported, with most use decisions not based on population-specific research. A lack of measurement research, specifically the calibration/validation of accelerometers for children with intellectual disabilities, is limiting the ability of researchers to make appropriate and valid accelerometer use decisions. Study 2: The choice of cut points had significant and clinically meaningful effects on the estimation of physical activity intensity and sedentary behaviour. For the 71-minute session, estimations for time spent in each intensity between cut points ranged from: sedentary = 9.50 (± 4.97) to 31.90 (± 6.77) minutes; moderate = 8.10 (± 4.07) to 40.40 (± 5.74) minutes; vigorous = 0.00 (± .00) to 17.40 (± 6.54) minutes; and moderate to vigorous = 8.80 (± 4.64) to 46.50 (± 6.02) minutes. Study 3: All typically developing participants and one participant with intellectual disabilities completed the protocol. No participant met the maximal criteria for the graded exercise test or attained a steady state during the resting measurements. Limitations were identified with the usability of respiratory gas exchange equipment and the validity of measurements. The school-based recruitment strategy was not effective, with a participation rate of 6%. 
Therefore, a laboratory-based calibration protocol was not feasible for children with intellectual disabilities. Study 4: The optimal vertical axis cut points (cpm) were ≤ 507 (sedentary), 1008−2300 (moderate), and ≥ 2301 (vigorous). Sensitivity scores ranged from 81−88%, specificity 81−85%, and AUC .87−.94. The optimal vector magnitude cut points (cpm) were ≤ 1863 (sedentary), ≥ 2610 (moderate) and ≥ 4215 (vigorous). Sensitivity scores ranged from 80−86%, specificity 77−82%, and AUC .86−.92. Therefore, the vertical axis cut points provide a higher level of accuracy in comparison to the vector magnitude cut points. Study 5: Substantial to excellent classification agreement was found for the calibrated cut points. The calibrated sedentary cut point (ĸ = .66) provided comparable classification agreement with existing cut points (ĸ = .55−.67). However, the existing moderate and vigorous cut points demonstrated low sensitivity (0.33−33.33% and 1.33−53.00%, respectively) and disproportionately high specificity (75.44−98.12% and 94.61−100.00%, respectively), indicating that cut points developed in typically developing children are too high to accurately classify physical activity intensity in children with intellectual disabilities. Conclusions The studies reported in this thesis are the first to calibrate and validate accelerometry for the estimation of physical activity intensity in children with intellectual disabilities. In comparison with typically developing children, children with intellectual disabilities require lower cut points for the classification of moderate and vigorous intensity activity. Therefore, generalising existing cut points to children with intellectual disabilities will underestimate physical activity and introduce systematic measurement error, which could be a contributing factor to the low levels of physical activity reported for children with intellectual disabilities in previous research.
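As a rough illustration of how such cut points are applied, the sketch below classifies accelerometer counts using the vertical axis cut points reported for Study 4 and compares the result with a criterion measure. The "light" band (508−1007 cpm) is not stated in the abstract and is assumed here to be the gap between the sedentary and moderate cut points; the counts and observer codes are hypothetical, and the function names are illustrative rather than taken from the thesis.

```python
from collections import Counter

# Vertical-axis cut points (counts per minute) as reported for Study 4.
# The "light" band is an assumption: the abstract does not state it.
def classify(cpm):
    if cpm <= 507:
        return "sedentary"
    if cpm <= 1007:
        return "light"  # assumed band between sedentary and moderate
    if cpm <= 2300:
        return "moderate"
    return "vigorous"

def minutes_per_intensity(counts):
    """Tally one-minute epochs by intensity category."""
    return Counter(classify(c) for c in counts)

def sensitivity_specificity(predicted, observed, positive):
    """Binary agreement of predicted labels against a criterion measure."""
    pairs = list(zip(predicted, observed))
    tp = sum(p == positive and o == positive for p, o in pairs)
    fn = sum(p != positive and o == positive for p, o in pairs)
    tn = sum(p != positive and o != positive for p, o in pairs)
    fp = sum(p == positive and o != positive for p, o in pairs)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical one-minute epochs and observer codes, for illustration only.
counts = [120, 507, 800, 1008, 2300, 2301, 4000, 50]
observed = ["sedentary", "sedentary", "light", "moderate",
            "moderate", "vigorous", "vigorous", "light"]
predicted = [classify(c) for c in counts]
sens, spec = sensitivity_specificity(predicted, observed, "sedentary")
```

Raising a cut point moves borderline epochs into the lower category, which is why cut points that are too high for a population yield low sensitivity and high specificity for the more intense categories.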
Abstract:
We develop some new techniques to calculate the Schur indicator for self-dual irreducible Langlands quotients of the principal series representations. Using these techniques we derive some new formulas for the Schur indicator and the real-quaternionic indicator. We make progress towards developing an algorithm to decide whether or not two root data are isomorphic. When the derived group has cyclic center, we solve the isomorphism problem completely. An immediate consequence is a clean and precise classification theorem for connected complex reductive groups whose derived groups have cyclic center.
Abstract:
Objective Structured Clinical Examinations (OSCE) improved communication skills of students of Pharmacology in Medicine and Podiatry degrees. Bellido I, Blanco E, Gomez-Luque A. D. Pharmacology and Clinical Therapeutic. Medicine School. University of Malaga. IBIMA. Malaga, Spain. Objective Structured Clinical Examinations (OSCEs) are versatile multipurpose evaluative tools that can be used to assess health care professionals in a clinical setting, including communication skills and the ability to handle unpredictable patient behavior, which are usually not included in the traditional clinical exam. Having students design and perform OSCEs is a novelty that students really like and that may improve their argumentation and planning capacities and their communication skills. Aim: To evaluate the impact of students designing, developing and presenting Objective Structured Clinical Examinations (OSCEs) on communication skills development and on learning about medicines in Medicine and Podiatry undergraduate students. Methods: A one-year study in which students were invited to voluntarily form groups (4 students maximum). Each group had to design and perform an OSCE (10 min maximum) showing a clinical situation or problem in which the use of medicines was needed. The use of a clinical history, a camera, a mobile-phone video editor, photos, actors, dolls, simulators or any other resource was allowed. The work of each group was supervised and assisted by a teacher. The students were invited to present their work to the rest of the class. After each OSCE performance the students were encouraged to ask questions if they wished. After all the OSCE performances the students voluntarily answered a satisfaction survey. Results: Students of Pharmacology in the Medicine and Podiatry degrees, N=80, 53.75% female, 21±2.3 years old, were enrolled. Twenty-six OSCEs showing a clinical situation or clinical problem were made. The average time spent by students in making the OSCE was 21.5±9 h. 
The percentage of students who were satisfied with this way of presenting the OSCE was 89.7%. Conclusion: Objective Structured Clinical Examinations (OSCEs) designed and performed by students of Pharmacology in the Medicine and Podiatry degrees improved their communication skills.
Abstract:
The purpose was to determine the prevalence of vitamin D (VitD) insufficiency and its related factors in adolescents and young adults with perinatally acquired human immunodeficiency virus. A cohort of 65 patients (17.6 ± 2 years) at the Federal University of Rio de Janeiro, Brazil, was examined for pubertal development, nutrition, serum parathormone and serum 25-hydroxyvitamin D [s25(OH)D]. s25(OH)D levels < 30 ng/mL (< 75 nmol/L) were defined as VitD insufficiency. CD4+ T-cell counts and viral load, history of worst clinical status, immunologic status as nadir, current immunologic status, and antiretroviral (ART) regimen were also evaluated as risk factors for VitD insufficiency. Mean s25(OH)D was 37.7 ± 13.9 ng/mL and 29.2% had VitD insufficiency. There was no association between VitD status and gender, age, nutritional status, clinical and immunological classification, or type of ART. Only VitD consumption showed a tendency towards association with s25(OH)D (p = 0.064). Individuals analysed in the summer/autumn season had higher s25(OH)D than those analysed in winter/spring (42.6 ± 14.9 vs. 34.0 ± 11.9 ng/mL, p = 0.011). Although the frequency of VitD insufficiency did not differ statistically between the groups (summer/autumn 17.9% vs. winter/spring 37.8%, p = 0.102), we suggest monitoring s25(OH)D in seropositive adolescents and young adults, especially during winter/spring months, even in sunny regions.
Abstract:
This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Abstract:
This thesis presents a study of Grid data access patterns in distributed analysis in the CMS experiment at the LHC accelerator. The study ranges from a deep analysis of the historical patterns of access to the most relevant data types in CMS to the exploitation of a supervised Machine Learning classification system to set up machinery able to eventually predict future data access patterns, i.e. the so-called “popularity” of the CMS datasets on the Grid, with a focus on specific data types. All the CMS workflows run on the Worldwide LHC Computing Grid (WLCG) computing centers (Tiers), and in particular the distributed analysis systems sustain hundreds of users and applications submitted every day. These applications (or “jobs”) access different data types hosted on disk storage systems at a large set of WLCG Tiers. The detailed study of how this data is accessed, in terms of data types, hosting Tiers, and different time periods, makes it possible to gain valuable insight into storage occupancy over time and different access patterns, and ultimately to extract suggested actions based on this information (e.g. targeted disk clean-up and/or data replication). In this sense, the application of Machine Learning techniques makes it possible to learn from past data and to gain predictive potential for future CMS data access patterns. Chapter 1 provides an introduction to High Energy Physics at the LHC. Chapter 2 describes the CMS Computing Model, with special focus on the data management sector, also discussing the concept of dataset popularity. Chapter 3 describes the study of CMS data access patterns with different depth levels. Chapter 4 offers a brief introduction to basic machine learning concepts, introduces their application in CMS, and discusses the results obtained by using this approach in the context of this thesis.
Abstract:
In the context of the International Society for Knowledge Organization, we often consider knowledge organization systems to comprise catalogues, thesauri, and bibliothecal classification schemes – schemes for library arrangement. In recent years we have added ontologies and folksonomies to our sphere of study. In all of these cases it seems we are concerned with improving access to information. We want a good system. And much of the literature from the late 19th into the late 20th century took that as its goal – to analyze the world of knowledge and the structures of representing it as its objects of study; again, with the ethos of creating a good system. In most cases this meant we had to be correct in our assertions about the universe of knowledge and the relationships that obtain between its constituent parts. As a result much of the literature of knowledge organization is prescriptive – instructing designers and professionals how to build or use the schemes correctly – that is, to maximize redundant success in accessing information. In 2005, there was a turn in some of the knowledge organization literature. It has been called the descriptive turn. This is in relation to the otherwise prescriptive efforts of researchers in KO. And it is the descriptive turn that makes me think of context, languages, and cultures in knowledge organization – the theme of this year’s conference. Work in the descriptive turn questions the basic assumptions about what we want to do when we create, implement, maintain, and evaluate knowledge organization systems. Following on these assumptions researchers have examined a wider range of systems and questioned the motivations behind system design. Websites that allow users to curate their own collections are one such addition, for example Pinterest (cf., Feinberg, 2011). However, researchers have also looked back at other lineages of organizing to compare forms and functions. 
Examples include encyclopedias, catalogues raisonnés, archival description, and winter counts designed and used by Native Americans. In the first case, that of online curated collections, Melanie Feinberg has started to examine the craft of curation, as she calls it. In this line of research purpose, voice, and rhetorical stance surface as design considerations. For example, in the case of Pinterest, users are able and encouraged to create boards. The process of putting together these boards is an act of curation in contemporary terminology. It is describing this craft that comes from the descriptive turn in KO. In the second case, when researchers in the descriptive turn look back at older and varied examples of knowledge organization systems, we are looking for a full inventory of intent and inspiration for future design. Encyclopedias, catalogues raisonnés, archival description, and works of knowledge organization in other cultures provide a rich world for the descriptive turn. And researchers have availed themselves of this.
Abstract:
In this paper, we describe one of the approaches used in the participation of Universidade de Évora. Our approach is similar to the usual methods, in which text is preprocessed and features are extracted and then used in SVMs with cross-validation. The main difference is that the features come from averages of word embeddings, specifically word2vec vectors. Using the PAN 2016 dataset, we achieved 44.8% and 68.2% accuracy for English age and gender classification respectively. We also achieved 51.3% and 67.1% accuracy for Spanish age and gender classification. Finally, we report 71.9% accuracy for Dutch age classification.
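The feature-extraction idea, representing a text by the average of its word vectors, can be sketched as follows. The three-dimensional vectors are hypothetical stand-ins for word2vec vectors, the whitespace tokenizer is a placeholder for real preprocessing, and the SVM stage is omitted; none of this is the authors' code.

```python
# Hypothetical 3-dimensional embeddings; real word2vec vectors usually have
# a few hundred dimensions and come from a trained model.
embeddings = {
    "good":  [1.0, 0.0, 2.0],
    "movie": [0.0, 2.0, 0.0],
    "great": [2.0, 1.0, 1.0],
}

def preprocess(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing)."""
    return text.lower().split()

def average_embedding(tokens, table, dim=3):
    """Mean of the vectors of known tokens; zero vector if none are known."""
    vectors = [table[t] for t in tokens if t in table]
    if not vectors:
        return [0.0] * dim
    # zip(*vectors) iterates over dimensions, so each `column` holds one
    # coordinate from every token vector.
    return [sum(column) / len(vectors) for column in zip(*vectors)]

# Out-of-vocabulary tokens are skipped rather than treated as zeros.
features = average_embedding(preprocess("Good movie unknownword"), embeddings)
```

The resulting fixed-length vector can then be passed to any classifier, regardless of how long the original text was.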
Abstract:
This paper describes various experiments done to investigate author profiling of tweets in 4 different languages – English, Dutch, Italian, and Spanish. Profiling consists of age and gender classification, as well as regression on 5 different personality dimensions – extroversion, stability, agreeableness, openness, and conscientiousness. Different sets of features were tested – bag-of-words, word n-grams, POS n-grams, and average of word embeddings. SVM was used as the classifier. TF-IDF worked best for most English tasks, while for most of the tasks in the other languages the combination of the best features worked better.
Abstract:
One of the great challenges of the scientific community on theories of genetic information, genetic communication and genetic coding is to determine a mathematical structure related to DNA sequences. In this paper we propose a model of an intra-cellular transmission system of genetic information similar to a model of a power and bandwidth efficient digital communication system in order to identify a mathematical structure in DNA sequences where such sequences are biologically relevant. The model of a transmission system of genetic information is concerned with the identification, reproduction and mathematical classification of the nucleotide sequence of single stranded DNA by the genetic encoder. Hence, a genetic encoder is devised where labelings and cyclic codes are established. The establishment of the algebraic structure of the corresponding code alphabets, mappings, labelings, primitive polynomials (p(x)) and code generator polynomials (g(x)) is quite important in characterizing error-correcting code subclasses of G-linear codes. These latter codes are useful for the identification, reproduction and mathematical classification of DNA sequences. The characterization of this model may contribute to the development of a methodology that can be applied in mutational analysis and polymorphisms, production of new drugs and genetic improvement, among other things, resulting in the reduction of time and laboratory costs.
Abstract:
The objective of this work was to compare soybean crop mapping in the west of Parana State using MODIS/Terra and TM/Landsat 5 images. First, a soybean crop mask was generated using six TM images covering the crop season, and this mask was used as a reference. The images were submitted to the Parallelepiped and Maximum Likelihood digital classification algorithms, followed by visual inspection. Four MODIS images, covering the vegetative peak, were classified using the Parallelepiped method. The quality assessment of the MODIS and TM classifications was carried out through an Error Matrix, considering 100 sample points, labelled soybean or not soybean, randomly allocated in each of the eight municipalities within the study area. The results showed that both the Overall Classification (OC) and the Kappa Index (KI) produced values ranging from 0.55 to 0.80, considered good to very good performance, for both TM and MODIS images. When the OC and KI from both sensors were compared, no statistical difference was found between them. The soybean mapping using MODIS yielded 70% reliability in terms of user's accuracy. The main conclusion is that mapping soybean with MODIS is feasible, with the advantages of better temporal resolution than Landsat and free availability on the internet.
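For readers unfamiliar with the two accuracy measures, the sketch below computes overall accuracy and the kappa index from an error (confusion) matrix using the standard definitions. The 2x2 matrix is hypothetical and is not taken from the study.

```python
def overall_accuracy_and_kappa(matrix):
    """matrix[i][j] = number of samples of reference class i mapped as class j."""
    total = sum(sum(row) for row in matrix)
    # Observed agreement: share of samples on the main diagonal.
    observed = sum(matrix[i][i] for i in range(len(matrix))) / total
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    # Agreement expected by chance, from the marginal totals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa

# Hypothetical error matrix for 100 sample points (soybean, not soybean).
matrix = [[40, 10],
          [10, 40]]
accuracy, kappa = overall_accuracy_and_kappa(matrix)
```

Kappa is lower than overall accuracy here because it discounts the agreement that two balanced classes would reach by chance alone.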
Abstract:
Universidade Estadual de Campinas. Faculdade de Educação Física
Abstract:
Universidade Estadual de Campinas. Faculdade de Educação Física