916 results for Logistic regression mixture models
Abstract:
This thesis is entitled Reliability Modelling and Analysis in Discrete Time: Some Concepts and Models Useful in the Analysis of Discrete Life Time Data. The present study consists of five chapters. In Chapter II we derive some general results useful in reliability modelling that involves two-component mixtures. Expressions for the failure rate, mean residual life and second moment of residual life of the mixture distributions in terms of the corresponding quantities in the component distributions are investigated. Some applications of these results are also pointed out. The role of the geometric, Waring and negative hypergeometric distributions as models of life lengths in the discrete time domain has been discussed already. While describing various reliability characteristics, it was found that they can often be considered as a class. The applicability of these models in single populations naturally extends to the case of populations composed of sub-populations, making mixtures of these distributions worth investigating. Accordingly, the general properties, various reliability characteristics and characterizations of these models are discussed in Chapter III. Inference of parameters in mixture distributions is usually a difficult problem because the mass function of the mixture is a linear function of the component masses, which makes manipulation of the likelihood equations, least-squares function, etc., and the resulting computations very difficult. We show that one of our characterizations helps in inferring the parameters of the geometric mixture without involving computational hazards. As mentioned in the review of results in the previous sections, partial moments have not been studied extensively in the literature, especially in the case of discrete distributions. Chapters IV and V deal with descending and ascending partial factorial moments.
Apart from studying their properties, we prove characterizations of distributions by functional forms of partial moments and establish recurrence relations between successive moments for some well known families. It is further demonstrated that partial moments are equally efficient and convenient compared to many of the conventional tools to resolve practical problems in reliability modelling and analysis. The study concludes by indicating some new problems that surfaced during the course of the present investigation which could be the subject for a future work in this area.
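The idea of expressing the mixture failure rate through the component failure rates can be illustrated with a minimal Python sketch. It assumes geometric lifetimes on 0, 1, 2, ... with failure rate h(x) = P(X = x) / P(X >= x); the function names and the parameter values are hypothetical and are not taken from the thesis. The sketch checks the standard identity that the mixture failure rate is a weighted average of the component rates, with weights proportional to each component's share of the survival function.

```python
def geometric_survival(p, x):
    """P(X >= x) for a geometric lifetime on 0, 1, 2, ... with constant failure rate p."""
    return (1.0 - p) ** x


def mixture_failure_rate(alpha, p1, p2, x):
    """Failure rate P(X = x) / P(X >= x) of the mixture alpha*Geo(p1) + (1 - alpha)*Geo(p2)."""
    s1 = geometric_survival(p1, x)
    s2 = geometric_survival(p2, x)
    mass = alpha * p1 * s1 + (1.0 - alpha) * p2 * s2
    survival = alpha * s1 + (1.0 - alpha) * s2
    return mass / survival


def weighted_component_rate(alpha, p1, p2, x):
    """Same quantity written as w(x)*h1(x) + (1 - w(x))*h2(x), where h1 = p1 and h2 = p2."""
    s1 = geometric_survival(p1, x)
    s2 = geometric_survival(p2, x)
    weight = alpha * s1 / (alpha * s1 + (1.0 - alpha) * s2)
    return weight * p1 + (1.0 - weight) * p2


# Hypothetical parameters: 40% of units fail fast (p1 = 0.5), 60% fail slowly (p2 = 0.1).
rates = [mixture_failure_rate(0.4, 0.5, 0.1, x) for x in range(10)]
```

In this illustration the mixture failure rate decreases with age even though each component has a constant rate, because the fast-failing sub-population is progressively removed.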
Abstract:
Production Planning and Control (PPC) systems have grown and changed because of developments in planning tools and models as well as the use of computers and information systems in this area. Though much is available in research journals, the practice of PPC lags behind and makes little use of published research. PPC practices in SMEs lag behind for many reasons, which need to be explored. This research work deals with the effect of identified variables such as forecasting, planning and control methods adopted, demographics of the key person, standardization practices followed, effect of training, learning and IT usage on firm performance. A model and framework have been developed based on the literature. Empirical testing of the model has been done after collecting data using a questionnaire schedule administered among the selected respondents from Small and Medium Enterprises (SMEs) in India. Final data included 382 responses. Hypotheses linking SME performance with the use of forecasting, planning and controlling were formed and tested. Exploratory factor analysis was used for data reduction and for identifying the factor structure. High and low performing firms were classified using a Logistic Regression model. A confirmatory factor analysis was used to study the structural relationship between firm performance and dependent variables.
Abstract:
Modeling and predicting co-occurrences of events is a fundamental problem of unsupervised learning. In this contribution we develop a statistical framework for analyzing co-occurrence data in a general setting where elementary observations are joint occurrences of pairs of abstract objects from two finite sets. The main challenge for statistical models in this context is to overcome the inherent data sparseness and to estimate the probabilities for pairs which were rarely observed or even unobserved in a given sample set. Moreover, it is often of considerable interest to extract grouping structure or to find a hierarchical data organization. A novel family of mixture models is proposed which explain the observed data by a finite number of shared aspects or clusters. This provides a common framework for statistical inference and structure discovery and also includes several recently proposed models as special cases. Adopting the maximum likelihood principle, EM algorithms are derived to fit the model parameters. We develop improved versions of EM which largely avoid overfitting problems and overcome the inherent locality of EM-based optimization. Among the broad variety of possible applications, e.g., in information retrieval, natural language processing, data mining, and computer vision, we have chosen document retrieval, the statistical analysis of noun/adjective co-occurrence and the unsupervised segmentation of textured images to test and evaluate the proposed algorithms.
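As a rough illustration of fitting a co-occurrence mixture by maximum likelihood, the following Python sketch runs plain EM on a small count matrix. It assumes one simple member of the model family, P(x, y) = sum over k of P(k) P(x|k) P(y|k). The function name, the toy counts and the choice of two aspects are hypothetical, and the sketch does not implement the improved EM variants mentioned in the abstract.

```python
import math
import random


def fit_aspect_model(counts, n_aspects, n_iter=50, seed=0):
    """Plain EM for P(x, y) = sum_k P(k) P(x|k) P(y|k), given counts[x][y] >= 0."""
    rng = random.Random(seed)
    n_x, n_y = len(counts), len(counts[0])

    def random_dist(n):
        weights = [rng.random() + 0.1 for _ in range(n)]
        total = sum(weights)
        return [w / total for w in weights]

    p_k = [1.0 / n_aspects] * n_aspects
    p_x_k = [random_dist(n_x) for _ in range(n_aspects)]
    p_y_k = [random_dist(n_y) for _ in range(n_aspects)]
    history = []  # log-likelihood under the parameters used in each E-step

    for _ in range(n_iter):
        new_k = [0.0] * n_aspects
        new_x = [[0.0] * n_x for _ in range(n_aspects)]
        new_y = [[0.0] * n_y for _ in range(n_aspects)]
        loglik = 0.0
        for x in range(n_x):
            for y in range(n_y):
                n = counts[x][y]
                if n == 0:
                    continue
                joint = [p_k[k] * p_x_k[k][x] * p_y_k[k][y] for k in range(n_aspects)]
                total = sum(joint)
                loglik += n * math.log(total)
                for k in range(n_aspects):
                    # E-step: expected number of (x, y) observations generated by aspect k.
                    expected = n * joint[k] / total
                    new_k[k] += expected
                    new_x[k][x] += expected
                    new_y[k][y] += expected
        history.append(loglik)
        # M-step: renormalise the expected counts into probabilities.
        grand_total = sum(new_k)
        for k in range(n_aspects):
            if new_k[k] > 0:
                p_x_k[k] = [v / new_k[k] for v in new_x[k]]
                p_y_k[k] = [v / new_k[k] for v in new_y[k]]
            p_k[k] = new_k[k] / grand_total
    return p_k, p_x_k, p_y_k, history


# Hypothetical toy data with two visible blocks of co-occurring objects.
toy_counts = [[5, 4, 0, 0], [6, 3, 0, 1], [0, 0, 7, 5], [0, 1, 4, 6]]
p_k, p_x_k, p_y_k, history = fit_aspect_model(toy_counts, n_aspects=2)
```

Because the result depends on the random start, this kind of plain EM only finds a local optimum, which is the locality problem the abstract refers to.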
Abstract:
Objective: To determine the relationship between leisure-time physical activity (PA) and self-perceived health status in Colombia. Methods: From complex-sample data, 14,601 records were obtained for subjects aged 18 to 64 years in Colombia. Logistic regression models were fitted for self-perceived health. Results: The prevalence of leisure-time PA was 5.8% in women and 13% in men (p < 0.001), and for self-reported health, 27.7% of women and 19.7% of men perceived their health as fair or poor (p < 0.001). Older age groups, lower educational attainment, affiliation with the social security system and rural residence were associated with poor self-reported health. Women with low levels of leisure-time PA had an OR of 1.92 (95% CI 1.19 to 3.10) for perceiving their health as poor compared with women with high leisure-time PA. No such evidence was found in men. Discussion: The influence of a vigorous level of leisure-time PA on self-perceived health status among women is one of the main findings. These results can guide public policies aimed at promoting PA and at ensuring the population's access to education and to enrolment in a health system.
Abstract:
Induction of labor has been shown to simultaneously increase cesarean rates, especially in nulliparous women with a clinically unfavorable cervix. Because clinical assessment of the cervix is a subjective, albeit widely used, method, the aim of this study was to determine the usefulness of ultrasonographic measurement of cervical length, compared with the Bishop score, in predicting successful induction of labor in nulliparous patients at the Obstetrics service of the Hospital Universitario Clínica San Rafael, Bogotá. Materials and methods: An observational study was conducted, evaluating a prospective cohort of 80 pregnant women who underwent ultrasonographic and clinical assessment of the cervix before induction of labor was started. Results: Bivariate analysis showed that patients with cervical length >20 mm were 1.57 times as likely to deliver by cesarean section (RR 1.57, 95% CI 1.03-2.39, p < 0.05). Similarly, patients with a Bishop score of 0 to 3 were 2.33 times as likely to deliver by cesarean section (RR 2.33, 95% CI 1.28-4.23, p < 0.05). Binary logistic regression showed that maternal age and cervical length were the only independent parameters with statistical significance for predicting successful induction. Conclusions: Ultrasonographic measurement of cervical length is more useful than clinical assessment of the cervix for predicting successful induction of labor in nulliparous women.
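For readers unfamiliar with how a relative risk and its 95% confidence interval are obtained in a bivariate analysis, the Python sketch below uses the common log-transformation (Katz) interval. The abstract does not report the underlying 2x2 counts or the interval method, so the counts in the example are hypothetical and the choice of method is an assumption.

```python
import math


def relative_risk_ci(events_exposed, n_exposed, events_unexposed, n_unexposed, z=1.959964):
    """Relative risk with a log-scale Wald interval; z = 1.959964 gives about 95% coverage."""
    risk_exposed = events_exposed / n_exposed
    risk_unexposed = events_unexposed / n_unexposed
    rr = risk_exposed / risk_unexposed
    # Standard error of log(RR) for two independent binomial proportions.
    se_log_rr = math.sqrt(
        1.0 / events_exposed - 1.0 / n_exposed
        + 1.0 / events_unexposed - 1.0 / n_unexposed
    )
    lower = math.exp(math.log(rr) - z * se_log_rr)
    upper = math.exp(math.log(rr) + z * se_log_rr)
    return rr, lower, upper


# Hypothetical counts: 20 cesareans among 40 exposed women, 10 among 40 unexposed.
rr, lower, upper = relative_risk_ci(20, 40, 10, 40)
```

An interval whose lower bound lies above 1, as in the two results reported in the abstract, is what corresponds to p < 0.05 for an increased risk.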
Abstract:
Introduction: In Colombia, research on working conditions and health in coal mining is scarce and does not consider the perceptions of the exposed population or its behavior toward the inherent risks. Objective: To determine the association between perceived working conditions and perceived morbidity among coal mine workers in Guachetá, Cundinamarca. Materials and methods: A cross-sectional study was conducted with 154 workers randomly selected from the full list registered at the municipal mayor's office. Sociodemographic characteristics, working conditions and health in the mines were surveyed. Prevalences of respiratory, musculoskeletal and hearing disorders were estimated, and associations between selected working conditions and events with a prevalence above 30% were explored in bivariate and multiple analyses using Poisson regression with robust variance. Results: The workers were mostly men, aged 18 to 77 years. The most frequently reported health problems were low back pain (46.10%), upper limb pain (40.26%), lower limb pain (34.42%), respiratory disorders (17.53%) and hearing problems (13.64%). Perception differed substantially depending on length of employment and on whether or not the work was underground. Conclusion: The risks most recognized by workers are those related to musculoskeletal disorders, apparently because they are more evident in daily life. Occupational health actions can take these findings into account in disease prevention plans for Colombian coal mines.
Abstract:
Introduction: Delirium is an acute-onset disorder of consciousness associated with confusion or cognitive dysfunction. It can occur in up to 42% of patients, of which up to 80% occur in the ICU. Delirium increases length of hospital stay, duration of mechanical ventilation, and morbidity and mortality. The aim was to assess the period prevalence of delirium in adults admitted to the ICU of a fourth-level hospital during 2012 and the factors associated with its development. Methodology: An analytical cross-sectional study was conducted, including patients hospitalized in the medical ICU and the surgical ICU. The CAM-ICU scale and the Mini-Mental State Examination were applied to assess mental status. Significant associations were adjusted using multivariate analysis. Results: A total of 110 patients were included; mean length of stay was 5 days, the period prevalence of delirium was 19.9% and median age was 64.5 years. A statistically significant association was found between delirium and baseline cognitive impairment, depression, administration of anticholinergics and sepsis (p < 0.05). Discussion: To date, this is the first such study at the institution. The associations between ICU delirium and sepsis, anticholinergic use and baseline cognitive impairment are consistent with and comparable to risk factors described in the international literature.
Abstract:
In the midst of health care reform, Colombia has succeeded in increasing health insurance coverage and the quality of health care. In spite of this, efficiency continues to be a matter of concern, and small-area variations in health care are one of the plausible causes of such inefficiencies. In order to understand this issue, we use individual data on all births from a Contributory-Regimen insurer in Colombia. We estimate two different specifications of a multilevel logistic regression model. Our results reveal that hospitals account for 20% of the variation in the probability of performing cesarean sections. Geographic area explains only 1/3 of the variance attributable to the hospital. Furthermore, some variables from both the demand and supply sides are also found to be relevant to the probability of undergoing a cesarean section. This paper contributes to previous research by using a hierarchical model and by defining hospitals as clusters. Moreover, we also include clinical and supply-induced demand variables.
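A statement such as "hospitals account for 20% of the variation" is often obtained from a multilevel logistic model through the variance partition coefficient. The Python sketch below uses the latent-variable approximation, in which the individual-level residual variance on the logit scale is fixed at pi^2 / 3. The abstract does not say which method the authors used, so this formula and the function names are assumptions made for illustration.

```python
import math

# Variance of the standard logistic distribution, used as the level-1 residual variance.
LOGISTIC_RESIDUAL_VARIANCE = math.pi ** 2 / 3.0


def variance_partition_coefficient(cluster_variance):
    """Share of latent-scale variance attributable to the cluster (e.g. hospital) level."""
    return cluster_variance / (cluster_variance + LOGISTIC_RESIDUAL_VARIANCE)


def cluster_variance_for_vpc(vpc):
    """Random-intercept variance that would yield a given variance partition coefficient."""
    return vpc * LOGISTIC_RESIDUAL_VARIANCE / (1.0 - vpc)


# Under this approximation, a 20% hospital share corresponds to a random-intercept
# variance of about 0.82 on the logit scale.
hospital_variance = cluster_variance_for_vpc(0.20)
```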
Abstract:
Introduction: The WHO reports that in 2010 about 43 million children under 5 years of age were overweight. In Colombia, the 2005 edition of the National Survey of the Nutritional Situation in Colombia (Encuesta Nacional de Situación Nutricional en Colombia) showed an overall overweight prevalence of 3.1% in children aged 0 to 4 years. Overweight is a health condition of multifactorial origin involving genetic, environmental, maternal and perinatal factors. Objective: To establish the risk association between low birth weight and the development of overweight and obesity in children aged 4 to 5 years. Methodology: A retrospective, descriptive, observational cross-sectional study was conducted using nutritional, maternal and perinatal data from the 2010 National Demographic and Health Survey (Encuesta Nacional de Demografía en Salud) in Colombia. The association between the independent variable, low birth weight, and the outcome, overweight and obesity in children aged 4 to 5 years, was analyzed using BMI-for-age as the measure. Univariate, bivariate and logistic regression analyses were performed, with a risk model based on the variables that affect the outcome and the independent variable. Results: The study sample comprised 2166 children aged 4 to 5 years who met the inclusion criteria. The prevalence of overweight or obesity in early childhood was 21.8% (472) and low birth weight. The results suggest that the association between low birth weight and overweight or obesity is adjusted OR = 0.560 (0.356 - 0.881). Conclusions: The results suggest a protective association between low birth weight and overweight or obesity in early childhood. However, owing to the behavior of the variables considered in the sample, there is not enough information to completely reject the null hypothesis.
Abstract:
In this work we study the use of distributional semantics and machine learning to improve statistical machine translation. To that end, we propose a machine learning model based on logistic regression to dynamically model the translation probability of phrases. We prove that the proposed model is a generalization of the standard translation probabilities of statistical machine translation, and we use it to incorporate contextual and distributional-semantic information by means of lexical features, word clusters and vector representations of words. In addition, we develop another approach for integrating distributional-semantic knowledge into statistical machine translation: using bilingual word vector representations to model the similarity of phrase translations. Our experiments show the usefulness of the proposed models, obtaining promising results over a strong baseline system. Likewise, our work makes important contributions regarding bilingual mappings of vector representations and phrase similarity measures based on word vector representations, which have value of their own in the field of distributional semantics beyond machine translation.
Abstract:
To migrate successfully, birds need to store adequate fat reserves to fuel each leg of the journey. Migrants acquire their fuel reserves at stopover sites; this often entails exposure to predators. Therefore, the safety attributes of sites may be as important as the feeding opportunities. Furthermore, site choice might depend on fuel load, with lean birds more willing to accept danger to obtain good feeding. Here, we evaluate the factors underlying stopover-site usage by migrant Western Sandpipers (Calidris mauri) on a landscape scale. We measured the food and danger attributes of 17 potential stopover sites in the Strait of Georgia and Puget Sound region. We used logistic regression models to test whether food, safety, or both were best able to predict usage of these sites by Western Sandpipers. Eight of the 17 sites were used by sandpipers on migration. Generally, sites that were high in food and safety were used, whereas sites that were low in food and safety were not. However, dangerous sites were used if food was abundant, and sites with low food abundance were used if they were safe. The model including both food and safety best predicted site usage by sandpipers. Furthermore, lean sandpipers used the most dangerous sites, whereas heavier birds (which do not need to risk feeding in dangerous locations) used safer sites. This study demonstrates that both food and danger attributes are considered by migrant birds when selecting stopover sites; thus both attributes should be considered when prioritizing and managing stopover sites for conservation.
Abstract:
The Marbled Murrelet (Brachyramphus marmoratus) is a threatened alcid that nests almost exclusively in old-growth forests along the Pacific coast of North America. Nesting habitat has significant economic importance. Murrelet nests are extremely difficult and costly to find, which adds uncertainty to management and conservation planning. Models based on air photo interpretation of forest cover maps or assessments by low-level helicopter flights are currently used to rank presumed Marbled Murrelet nesting habitat quality in British Columbia. These rankings are assumed to correlate with nest usage and murrelet breeding productivity. Our goal was to find the models that best predict Marbled Murrelet nesting habitat in the ground-accessible portion of the two regions studied. We generated Resource Selection Functions (RSF) using logistic regression models of ground-based forest stand variables gathered at plots around 64 nests, located using radio-telemetry, versus 82 random habitat plots. The RSF scores are proportional to the probability of nests occurring in a forest patch. The best models differed somewhat between the two regions, but include both ground variables at the patch scale (0.2-2.0 ha), such as platform tree density, height and trunk diameter of canopy trees and canopy complexity, and landscape scale variables such as elevation, aspect, and slope. Collecting ground-based habitat selection data would not be cost-effective for widespread use in forestry management; air photo interpretation and low-level aerial surveys are much more efficient methods for ranking habitat suitability on a landscape scale. This study provides one method for ground-truthing the remote methods, an essential step made possible using the numerical RSF scores generated herein.
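To show how numerical RSF scores can rank forest patches, the Python sketch below uses the exponential form w(x) = exp(b1*x1 + ... + bk*xk), a common convention when logistic regression compares used and available plots. The abstract does not give the fitted form or coefficients, so the coefficients, covariates and patch names here are entirely hypothetical.

```python
import math


def rsf_score(coefficients, covariates):
    """Relative selection score exp(sum of b_i * x_i); only ratios between patches matter."""
    return math.exp(sum(b * x for b, x in zip(coefficients, covariates)))


def rank_patches(coefficients, patches):
    """Return patch names ordered from highest to lowest RSF score."""
    scores = {name: rsf_score(coefficients, values) for name, values in patches.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical coefficients for (platform tree density, slope), both in arbitrary units.
example_coefficients = [0.5, -0.2]
example_patches = {"A": [2.0, 1.0], "B": [0.0, 1.0]}
ranking = rank_patches(example_coefficients, example_patches)
```

Because the scores are only proportional to the probability of use, they support ranking and ground-truthing of remote habitat classifications rather than absolute probability statements.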
Abstract:
Detailed knowledge of waterfowl abundance and distribution across Canada is lacking, which limits our ability to effectively conserve and manage their populations. We used 15 years of data from an aerial transect survey to model the abundance of 17 species or species groups of ducks within southern and boreal Canada. We included 78 climatic, hydrological, and landscape variables in Boosted Regression Tree models, allowing flexible response curves and multiway interactions among variables. We assessed predictive performance of the models using four metrics and calculated uncertainty as the coefficient of variation of predictions across 20 replicate models. Maps of predicted relative abundance were generated from resulting models, and they largely match spatial patterns evident in the transect data. We observed two main distribution patterns: a concentrated prairie-parkland distribution and a more dispersed pan-Canadian distribution. These patterns were congruent with the relative importance of predictor variables and model evaluation statistics among the two groups of distributions. Most species had a hydrological variable as the most important predictor, although the specific hydrological variable differed somewhat among species. In some cases, important variables had clear ecological interpretations, but in some instances, e.g., topographic roughness, they may simply reflect chance correlations between species distributions and environmental variables identified by the model-building process. Given the performance of our models, we suggest that the resulting prediction maps can be used in future research and to guide conservation activities, particularly within the bounds of the survey area.
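The uncertainty measure described above, the coefficient of variation of predictions across replicate models, can be sketched in a few lines of Python. The data layout (one list of per-cell predictions per replicate) and the use of the sample standard deviation are assumptions; the abstract does not specify either.

```python
import statistics


def coefficient_of_variation(replicate_predictions):
    """Per-cell CV, where replicate_predictions[r][c] is replicate r's prediction for cell c."""
    cvs = []
    for cell_values in zip(*replicate_predictions):
        mean = statistics.fmean(cell_values)
        spread = statistics.stdev(cell_values)  # sample standard deviation (n - 1 denominator)
        cvs.append(spread / mean if mean > 0 else float("nan"))
    return cvs


# Hypothetical predictions from three replicate models for two map cells.
example_predictions = [[10.0, 2.0], [12.0, 2.0], [8.0, 2.0]]
example_cvs = coefficient_of_variation(example_predictions)
```

Cells where replicates agree get a CV near zero, while a high CV flags areas where the predicted abundance should be treated with more caution.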
Abstract:
Investigation of preferred structures of planetary wave dynamics is addressed using multivariate Gaussian mixture models. The number of components in the mixture is obtained using order statistics of the mixing proportions, hence avoiding previous difficulties related to sample sizes and independence issues. The method is first applied to a few low-order stochastic dynamical systems and data from a general circulation model. The method is next applied to winter daily 500-hPa heights from 1949 to 2003 over the Northern Hemisphere. A spatial clustering algorithm is first applied to the leading two principal components (PCs) and shows significant clustering. The clustering is particularly robust for the first half of the record and less so for the second half. The mixture model is then used to identify the clusters. Two highly significant extratropical planetary-scale preferred structures are obtained within the state space of the first two to four EOFs. The first pattern shows a Pacific-North American (PNA) pattern and a negative North Atlantic Oscillation (NAO), and the second pattern is nearly opposite to the first one. It is also observed that some subspaces show multivariate Gaussianity, compatible with linearity, whereas others show multivariate non-Gaussianity. The same analysis is also applied to two subperiods, before and after 1978, and shows a similar regime behavior, with slightly stronger support for the first subperiod. In addition, a significant regime shift is observed between the two periods, as well as a change in the shape of the distribution. The patterns associated with the regime shifts reflect essentially a PNA pattern and an NAO pattern consistent with the observed global warming effect on climate and the observed shift in sea surface temperature around the mid-1970s.
Abstract:
1. Jerdon's courser Rhinoptilus bitorquatus is a nocturnally active cursorial bird that is only known to occur in a small area of scrub jungle in Andhra Pradesh, India, and is listed as critically endangered by the IUCN. Information on its habitat requirements is needed urgently to underpin conservation measures. We quantified the habitat features that correlated with the use of different areas of scrub jungle by Jerdon's coursers, and developed a model to map potentially suitable habitat over large areas from satellite imagery and facilitate the design of surveys of Jerdon's courser distribution. 2. We used 11 arrays of 5-m long tracking strips consisting of smoothed fine soil to detect the footprints of Jerdon's coursers, and measured tracking rates (tracking events per strip night). We counted the number of bushes and trees, and described other attributes of vegetation and substrate in a 10-m square plot centred on each strip. We obtained reflectance data from Landsat 7 satellite imagery for the pixel within which each strip lay. 3. We used logistic regression models to describe the relationship between tracking rate by Jerdon's coursers and characteristics of the habitat around the strips, using ground-based survey data and satellite imagery. 4. Jerdon's coursers were most likely to occur where the density of large (>2 m tall) bushes was in the range 300-700 ha⁻¹ and where the density of smaller bushes was less than 1000 ha⁻¹. This habitat was detectable using satellite imagery. 5. Synthesis and applications. The occurrence of Jerdon's courser is strongly correlated with the density of bushes and trees, and is in turn affected by grazing by domestic livestock, woodcutting and mechanical clearance of bushes to create pasture, orchards and farmland. It is likely that there is an optimal level of grazing and woodcutting that would maintain or create suitable conditions for the species. 
Knowledge of the species' distribution is incomplete and there is considerable pressure from human use of apparently suitable habitats. Hence, distribution mapping is a high conservation priority. A two-step procedure is proposed, involving the use of ground surveys of bush density to calibrate satellite image-based mapping of potential habitat. These maps could then be used to select priority areas for Jerdon's courser surveys. The use of tracking strips to study habitat selection and distribution has potential in studies of other scarce and secretive species.