879 resultados para Gender classification model
Resumo:
This paper examines the quantitative effects of gender gaps in entrepreneurship and labor force participation on aggregate productivity and income per capita. We simulate an occupational choice model with heterogeneous agents in entrepreneurial ability, where agents choose to be workers, self-employed or employers. The model assumes that men and women have the same talent distribution, but we impose several frictions on women's opportunities and pay in the labor market. In particular, we restrict the fraction of women participating in the labor market.
Resumo:
This paper examines the quantitative effects of gender gaps in entrepreneurship and labor force participation on aggregate productivity and income per capita. We simulate an occupational choice model with heterogeneous agents in entrepreneurial ability, where agents choose to be workers, self-employed or employers. The model assumes that men and women have the same talent distribution, but we impose several frictions on women's opportunities and pay in the labor market. In particular, we restrict the fraction of women participating in the labor market.
Resumo:
This paper presents a validation study on statistical nonsupervised brain tissue classification techniques in magnetic resonance (MR) images. Several image models assuming different hypotheses regarding the intensity distribution model, the spatial model and the number of classes are assessed. The methods are tested on simulated data for which the classification ground truth is known. Different noise and intensity nonuniformities are added to simulate real imaging conditions. No enhancement of the image quality is considered either before or during the classification process. This way, the accuracy of the methods and their robustness against image artifacts are tested. Classification is also performed on real data where a quantitative validation compares the methods' results with an estimated ground truth from manual segmentations by experts. Validity of the various classification methods in the labeling of the image as well as in the tissue volume is estimated with different local and global measures. Results demonstrate that methods relying on both intensity and spatial information are more robust to noise and field inhomogeneities. We also demonstrate that partial volume is not perfectly modeled, even though methods that account for mixture classes outperform methods that only consider pure Gaussian classes. Finally, we show that simulated data results can also be extended to real data.
Resumo:
In this work, a new one-class classification ensemble strategy called approximate polytope ensemble is presented. The main contribution of the paper is threefold. First, the geometrical concept of convex hull is used to define the boundary of the target class defining the problem. Expansions and contractions of this geometrical structure are introduced in order to avoid over-fitting. Second, the decision whether a point belongs to the convex hull model in high dimensional spaces is approximated by means of random projections and an ensemble decision process. Finally, a tiling strategy is proposed in order to model non-convex structures. Experimental results show that the proposed strategy is significantly better than state of the art one-class classification methods on over 200 datasets.
Resumo:
Using a large prospective cohort of over 12,000 women, we determined 2 thresholds (high risk and low risk of hip fracture) to use in a 10-yr hip fracture probability model that we had previously described, a model combining the heel stiffness index measured by quantitative ultrasound (QUS) and a set of easily determined clinical risk factors (CRFs). The model identified a higher percentage of women with fractures as high risk than a previously reported risk score that combined QUS and CRF. In addition, it categorized women in a way that was quite consistent with the categorization that occurred using dual X-ray absorptiometry (DXA) and the World Health Organization (WHO) classification system; the 2 methods identified similar percentages of women with and without fractures in each of their 3 categories, but the 2 identified only in part the same women. Nevertheless, combining our composite probability model with DXA in a case findings strategy will likely further improve the detection of women at high risk of fragility hip fracture. We conclude that the currently proposed model may be of some use as an alternative to the WHO classification criteria for osteoporosis, at least when access to DXA is limited.
Resumo:
In this paper, we consider active sampling to label pixels grouped with hierarchical clustering. The objective of the method is to match the data relationships discovered by the clustering algorithm with the user's desired class semantics. The first is represented as a complete tree to be pruned and the second is iteratively provided by the user. The active learning algorithm proposed searches the pruning of the tree that best matches the labels of the sampled points. By choosing the part of the tree to sample from according to current pruning's uncertainty, sampling is focused on most uncertain clusters. This way, large clusters for which the class membership is already fixed are no longer queried and sampling is focused on division of clusters showing mixed labels. The model is tested on a VHR image in a multiclass classification setting. The method clearly outperforms random sampling in a transductive setting, but cannot generalize to unseen data, since it aims at optimizing the classification of a given cluster structure.
Resumo:
Resume : Mieux comprendre les stromatolithes et les tapis microbiens est un sujet important en biogéosciences puisque cela aide à l'étude des premières formes de vie sur Terre, a mieux cerner l'écologie des communautés microbiennes et la contribution des microorganismes a la biominéralisation, et même à poser certains fondements dans les recherches en exobiologie. D'autre part, la modélisation est un outil puissant utilisé dans les sciences naturelles pour appréhender différents phénomènes de façon théorique. Les modèles sont généralement construits sur un système d'équations différentielles et les résultats sont obtenus en résolvant ce système. Les logiciels disponibles pour implémenter les modèles incluent les logiciels mathématiques et les logiciels généraux de simulation. L'objectif principal de cette thèse est de développer des modèles et des logiciels pour aider a comprendre, via la simulation, le fonctionnement des stromatolithes et des tapis microbiens. Ces logiciels ont été développés en C++ en ne partant d'aucun pré-requis de façon a privilégier performance et flexibilité maximales. Cette démarche permet de construire des modèles bien plus spécifiques et plus appropriés aux phénomènes a modéliser. Premièrement, nous avons étudié la croissance et la morphologie des stromatolithes. Nous avons construit un modèle tridimensionnel fondé sur l'agrégation par diffusion limitée. Le modèle a été implémenté en deux applications C++: un moteur de simulation capable d'exécuter un batch de simulations et de produire des fichiers de résultats, et un outil de visualisation qui permet d'analyser les résultats en trois dimensions. Après avoir vérifié que ce modèle peut en effet reproduire la croissance et la morphologie de plusieurs types de stromatolithes, nous avons introduit un processus de sédimentation comme facteur externe. Ceci nous a mené a des résultats intéressants, et permis de soutenir l'hypothèse que la morphologie des stromatolithes pourrait être le résultat de facteurs externes autant que de facteurs internes. Ceci est important car la classification des stromatolithes est généralement fondée sur leur morphologie, imposant que la forme d'un stromatolithe est dépendante de facteurs internes uniquement (c'est-à-dire les tapis microbiens). Les résultats avancés dans ce mémoire contredisent donc ces assertions communément admises. Ensuite, nous avons décidé de mener des recherches plus en profondeur sur les aspects fonctionnels des tapis microbiens. Nous avons construit un modèle bidimensionnel de réaction-diffusion fondé sur la simulation discrète. Ce modèle a été implémenté dans une application C++ qui permet de paramétrer et exécuter des simulations. Nous avons ensuite pu comparer les résultats de simulation avec des données du monde réel et vérifier que le modèle peut en effet imiter le comportement de certains tapis microbiens. Ainsi, nous avons pu émettre et vérifier des hypothèses sur le fonctionnement de certains tapis microbiens pour nous aider à mieux en comprendre certains aspects, comme la dynamique des éléments, en particulier le soufre et l'oxygène. En conclusion, ce travail a abouti à l'écriture de logiciels dédiés à la simulation de tapis microbiens d'un point de vue tant morphologique que fonctionnel, suivant deux approches différentes, l'une holistique, l'autre plus analytique. Ces logiciels sont gratuits et diffusés sous licence GPL (General Public License). Abstract : Better understanding of stromatolites and microbial mats is an important topic in biogeosciences as it helps studying the early forms of life on Earth, provides clues re- garding the ecology of microbial ecosystems and their contribution to biomineralization, and gives basis to a new science, exobiology. On the other hand, modelling is a powerful tool used in natural sciences for the theoretical approach of various phenomena. Models are usually built on a system of differential equations and results are obtained by solving that system. Available software to implement models includes mathematical solvers and general simulation software. The main objective of this thesis is to develop models and software able to help to understand the functioning of stromatolites and microbial mats. Software was developed in C++ from scratch for maximum performance and flexibility. This allows to build models much more specific to a phenomenon rather than general software. First, we studied stromatolite growth and morphology. We built a three-dimensional model based on diffusion-limited aggregation. The model was implemented in two C++ applications: a simulator engine, which can run a batch of simulations and produce result files, and a Visualization tool, which allows results to be analysed in three dimensions. After verifying that our model can indeed reproduce the growth and morphology of several types of stromatolites, we introduced a sedimentation process as an external factor. This lead to interesting results, and allowed to emit the hypothesis that stromatolite morphology may be the result of external factors as much as internal factors. This is important as stromatolite classification is usually based on their morphology, imposing that a stromatolite shape is dependant on internal factors only (i.e. the microbial mat). This statement is contradicted by our findings, Second, we decided to investigate deeper the functioning of microbial mats, We built a two-dimensional reaction-diffusion model based on discrete simulation, The model was implemented in a C++ application that allows setting and running simulations. We could then compare simulation results with real world data and verify that our model can indeed mimic the behaviour of some microbial mats. Thus, we have proposed and verified hypotheses regarding microbial mats functioning in order to help to better understand them, e.g. the cycle of some elements such as oxygen or sulfur. ln conclusion, this PhD provides a simulation software, dealing with two different approaches. This software is free and available under a GPL licence.
Resumo:
INTRODUCTION: Preoperative scores are widely used predictors of complications after major surgery. These scores, however, are not widely used in transurethral procedures. The aim of this study was to assess the value of the Charlson Comorbidity Index (CCI), the age-adjusted CCI, the American Society of Anesthesiologist score (ASA) and the Nutritional Risk Score (NRS) in predicting early morbidity after transurethral urological procedures. METHODS: Consecutive patients undergoing transurethral resection of the bladder or the prostate were prospectively enrolled. The scores were calculated preoperatively; 30-day complications were prospectively recorded according to the Dindo-Clavien classification. Univariate logistic regression was performed to investigate the value of each score and of other factors (i.e., age, sex, body mass index, anemia, smoking habit, type of operation and anaesthesia) as predictors of complications. A multivariate model was then calculated using these predictors. RESULTS: Overall, 197 patients were included. The mean age was 72 (standard deviation ± 10). In total, 26.9% patients had at least 1 complication. Using univariate analysis, we found that each score significantly predicted complications. In multivariate analysis, only the ASA (odds ration [OR] 2.11; 95% confidence interval [CI] 1.01-4.43) and the NRS (OR 2.42; 95% CI 1.56-3.74) remained independent predictors. The best model incorporated ASA, NRS and gender, and predicted morbidity with an area under the curve of 76%. Our study's main limitations are population heterogeneity and limited sample size. CONCLUSION: The ASA and the NRS are important and independent determinants of early morbidity after transurethral procedures. The use of these indices may assist clinicians in the decision-making process to balance the possible benefits of transurethral procedures with the potential risks.
Resumo:
Background: Development of three classification trees (CT) based on the CART (Classification and Regression Trees), CHAID (Chi-Square Automatic Interaction Detection) and C4.5 methodologies for the calculation of probability of hospital mortality; the comparison of the results with the APACHE II, SAPS II and MPM II-24 scores, and with a model based on multiple logistic regression (LR). Methods: Retrospective study of 2864 patients. Random partition (70:30) into a Development Set (DS) n = 1808 and Validation Set (VS) n = 808. Their properties of discrimination are compared with the ROC curve (AUC CI 95%), Percent of correct classification (PCC CI 95%); and the calibration with the Calibration Curve and the Standardized Mortality Ratio (SMR CI 95%). Results: CTs are produced with a different selection of variables and decision rules: CART (5 variables and 8 decision rules), CHAID (7 variables and 15 rules) and C4.5 (6 variables and 10 rules). The common variables were: inotropic therapy, Glasgow, age, (A-a)O2 gradient and antecedent of chronic illness. In VS: all the models achieved acceptable discrimination with AUC above 0.7. CT: CART (0.75(0.71-0.81)), CHAID (0.76(0.72-0.79)) and C4.5 (0.76(0.73-0.80)). PCC: CART (72(69- 75)), CHAID (72(69-75)) and C4.5 (76(73-79)). Calibration (SMR) better in the CT: CART (1.04(0.95-1.31)), CHAID (1.06(0.97-1.15) and C4.5 (1.08(0.98-1.16)). Conclusion: With different methodologies of CTs, trees are generated with different selection of variables and decision rules. The CTs are easy to interpret, and they stratify the risk of hospital mortality. The CTs should be taken into account for the classification of the prognosis of critically ill patients.
Resumo:
Near-infrared spectroscopy (NIRS) was used to analyse the crude protein content of dried and milled samples of wheat and to discriminate samples according to their stage of growth. A calibration set of 72 samples from three growth stages of wheat (tillering, heading and harvest) and a validation set of 28 samples was collected for this purpose. Principal components analysis (PCA) of the calibration set discriminated groups of samples according to the growth stage of the wheat. Based on these differences, a classification procedure (SIMCA) showed a very accurate classification of the validation set samples : all of them were successfully classified in each group using this procedure when both the residual and the leverage were used in the classification criteria. Looking only at the residuals all the samples were also correctly classified except one of tillering stage that was assigned to both tillering and heading stages. Finally, the determination of the crude protein content of these samples was considered in two ways: building up a global model for all the growth stages, and building up local models for each stage, separately. The best prediction results for crude protein were obtained using a global model for samples in the two first growth stages (tillering and heading), and using a local model for the harvest stage samples.
Resumo:
Background: Model organisms are used for research because they provide a framework on which to develop and optimize methods that facilitate and standardize analysis. Such organisms should be representative of the living beings for which they are to serve as proxy. However, in practice, a model organism is often selected ad hoc, and without considering its representativeness, because a systematic and rational method to include this consideration in the selection process is still lacking. Methodology/Principal Findings: In this work we propose such a method and apply it in a pilot study of strengths and limitations of Saccharomyces cerevisiae as a model organism. The method relies on the functional classification of proteins into different biological pathways and processes and on full proteome comparisons between the putative model organism and other organisms for which we would like to extrapolate results. Here we compare S. cerevisiae to 704 other organisms from various phyla. For each organism, our results identify the pathways and processes for which S. cerevisiae is predicted to be a good model to extrapolate from. We find that animals in general and Homo sapiens in particular are some of the non-fungal organisms for which S. cerevisiae is likely to be a good model in which to study a significant fraction of common biological processes. We validate our approach by correctly predicting which organisms are phenotypically more distant from S. cerevisiae with respect to several different biological processes. Conclusions/Significance: The method we propose could be used to choose appropriate substitute model organisms for the study of biological processes in other species that are harder to study. For example, one could identify appropriate models to study either pathologies in humans or specific biological processes in species with a long development time, such as plants.
Resumo:
BACKGROUND: Literature on the disease profile of prisoners that differentiates by age and gender remains sparse. This study aimed to describe the health of correctional inmates in terms of substance abuse problems and mental and somatic health conditions, and compare it by gender and age. METHODS: This study examined cross-sectional data from the Canton of Vaud in Switzerland on the health conditions of detainees who were in prison on January 1, 2011 or entered prison in 2011. Health conditions validated by physician examination were reported using the International Classification of Diseases (ICD) version 10. The analyses were descriptive by groups of prisoners: the entire sample (All), Men, Older adults and Women. RESULTS: A total of 1,664 individuals were included in the analysis. Men comprised 91.5 % of the sample and had a mean age of 33 years. The other 8.5 % were women and had an average age of 39. Older adults (i.e., age 50 and older) represented 7 % of the total sample. Overall, 80 % of inmates were non-Swiss citizens, but the proportion of Swiss prisoners was higher among the older adults (51 %) and women (29 %). Overall, 41 % of inmates self-reported substance abuse problems. Of those, 27 % were being treated by psychiatrists for behavioral disorders related to substance abuse. Chronic infectious diseases were found in 9 % of the prison population. In addition, 27 % of detainees suffered from serious mental health conditions. Gender and age had an influence on the disease profile of this sample: compared to the entire prison population, the older inmates were less likely to misuse illegal drugs and to suffer from communicable infections but exhibited more problems with alcohol and a higher burden of chronic health conditions. Female prisoners were more disposed to mental health problems (including drug abuse) and infectious diseases. In terms of chronic diseases, women suffered from the same conditions as men, but the diseases were more prevalent in women. CONCLUSION: It is important to understand the different disease profiles of prisoners by gender and age, as it helps identify the needs of different groups and tailor age-and gender-specific interventions.
Resumo:
Tutkielman tarkoituksena oli mallintaa varastonhallintajärjestelmä, joka olisi sopiva case yritykselle. Tutkimus aloitettiin case yrityksen varastonhallinan nykytilan kartoituksella, jonka jälkeen tutkittiin varastonhallinnan eri osa-alueisiin. Varastonhallinnan osa-alueista käsiteltiin varastotyyppejä, motiiveja, tavoitteita, kysynnän ennustamista sekä erilaisia varastonhallinnan työkaluja. Sen lisäksi tutkittiin erilaisia varaston täydennysmalleja. Teoriaosuudessa käsiteltiin lisäksi kolmea erilaista tietojärjestelmätyyppiä: toiminnanohjausjärjestelmää, sähköisen kaupankäynnin järjestelmää sekä räätälöityä järjestelmää. Tutkimussuunnitelmassa nämä kolme järjestelmää rajattiin vaihtoehdoiksi, joista jokin valittaisiin case yrityksen varastonhallintajärjestelmäksi. Teorian ja nykytilan pohjalta tehtiin viitekehys, jossa esiteltiin varastonhallintajärjestelmän tieto- ja toiminnallisuusominaisuuksia. Nämä ominaisuudet priorisoitiin neljään eri luokkaan ominaisuuden kriittisyyden mukaan. Järjestelmävaihtoehdot arvioitiin viitekehyksen kriteerien mukaisesti, miten helposti ominaisuudet olisivat toteutettavissa eri vaihtoehdoissa. Tulokset laskettiin näiden arviointien perusteella, jonka jälkeen tulosten analysoinnissa huomattiin, että toiminnanohjausjärjestelmä sopisi parhaiten case yrityksen varastonhallintajärjestelmäksi.
Resumo:
Background. We elaborated a model that predicts the centiles of the 25(OH)D distribution taking into account seasonal variation. Methods. Data from two Swiss population-based studies were used to generate (CoLaus) and validate (Bus Santé) the model. Serum 25(OH)D was measured by ultra high pressure LC-MS/MS and immunoassay. Linear regression models on square-root transformed 25(OH)D values were used to predict centiles of the 25(OH)D distribution. Distribution functions of the observations from the replication set predicted with the model were inspected to assess replication. Results. Overall, 4,912 and 2,537 Caucasians were included in original and replication sets, respectively. Mean (SD) 25(OH)D, age, BMI, and % of men were 47.5 (22.1) nmol/L, 49.8 (8.5) years, 25.6 (4.1) kg/m(2), and 49.3% in the original study. The best model included gender, BMI, and sin-cos functions of measurement day. Sex- and BMI-specific 25(OH)D centile curves as a function of measurement date were generated. The model estimates any centile of the 25(OH)D distribution for given values of sex, BMI, and date and the quantile corresponding to a 25(OH)D measurement. Conclusions. We generated and validated centile curves of 25(OH)D in the general adult Caucasian population. These curves can help rank vitamin D centile independently of when 25(OH)D is measured.
Resumo:
Le cancer testiculaire, bien que peu fréquent, revêt une importance particulière en oncologie ; il représente actuellement un modèle pour optimiser un suivi radiologique tout en essayant de diminuer l'apparition de tumeurs radio-induites.En effet, cette pathologie présente un taux très élevé de survie nécessitant, au vu du jeune âge des patients, des bilans radiologiques à long terme, auxquels pourront être liés des effets secondaires, en particulier les tumeurs secondaires.Afin de diminuer cela, les recommandations de prise en charge ont évolué et les protocoles de radiologie s'améliorent afin d'exposer à moins de rayonnements ionisants pour un résultat identique.Il est donc devenu primordial de maintenir un suivi optimal tout en essayant d'en minimiser la toxicité. Despite being rare cancers, testicular seminoma and non-seminoma play an important role in oncology: they represent a model on how to optimize radiological follow-up, aiming at a lowest possible radiation exposure and secondary cancer risk. Males diagnosed with testicular cancer undergo frequently prolonged follow-up with CT-scans with potential toxic side effects, in particular secondary cancers. To reduce the risks linked to ionizing radiation, precise follow-up protocols have been developed. The number of recommended CT-scanners has been significantly reduced over the last 10 years. The CT scanners have evolved technically and new acquisition protocols have the potential to reduce the radiation exposure further.