994 resultados para statistical software


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Swain corrects the chi-square overidentification test (i.e., likelihood ratio test of fit) for structural equation models whethr with or without latent variables. The chi-square statistic is asymptotically correct; however, it does not behave as expected in small samples and/or when the model is complex (cf. Herzog, Boomsma, & Reinecke, 2007). Thus, particularly in situations where the ratio of sample size (n) to the number of parameters estimated (p) is relatively small (i.e., the p to n ratio is large), the chi-square test will tend to overreject correctly specified models. To obtain a closer approximation to the distribution of the chi-square statistic, Swain (1975) developed a correction; this scaling factor, which converges to 1 asymptotically, is multiplied with the chi-square statistic. The correction better approximates the chi-square distribution resulting in more appropriate Type 1 reject error rates (see Herzog & Boomsma, 2009; Herzog, et al., 2007).

Relevância:

100.00% 100.00%

Publicador:

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The topic of this thesis is the development of knowledge based statistical software. The shortcomings of conventional statistical packages are discussed to illustrate the need to develop software which is able to exhibit a greater degree of statistical expertise, thereby reducing the misuse of statistical methods by those not well versed in the art of statistical analysis. Some of the issues involved in the development of knowledge based software are presented and a review is given of some of the systems that have been developed so far. The majority of these have moved away from conventional architectures by adopting what can be termed an expert systems approach. The thesis then proposes an approach which is based upon the concept of semantic modelling. By representing some of the semantic meaning of data, it is conceived that a system could examine a request to apply a statistical technique and check if the use of the chosen technique was semantically sound, i.e. will the results obtained be meaningful. Current systems, in contrast, can only perform what can be considered as syntactic checks. The prototype system that has been implemented to explore the feasibility of such an approach is presented, the system has been designed as an enhanced variant of a conventional style statistical package. This involved developing a semantic data model to represent some of the statistically relevant knowledge about data and identifying sets of requirements that should be met for the application of the statistical techniques to be valid. Those areas of statistics covered in the prototype are measures of association and tests of location.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Accurate detection of subpopulation size determinations in bimodal populations remains problematic yet it represents a powerful way by which cellular heterogeneity under different environmental conditions can be compared. So far, most studies have relied on qualitative descriptions of population distribution patterns, on population-independent descriptors, or on arbitrary placement of thresholds distinguishing biological ON from OFF states. We found that all these methods fall short of accurately describing small population sizes in bimodal populations. Here we propose a simple, statistics-based method for the analysis of small subpopulation sizes for use in the free software environment R and test this method on real as well as simulated data. Four so-called population splitting methods were designed with different algorithms that can estimate subpopulation sizes from bimodal populations. All four methods proved more precise than previously used methods when analyzing subpopulation sizes of transfer competent cells arising in populations of the bacterium Pseudomonas knackmussii B13. The methods' resolving powers were further explored by bootstrapping and simulations. Two of the methods were not severely limited by the proportions of subpopulations they could estimate correctly, but the two others only allowed accurate subpopulation quantification when this amounted to less than 25% of the total population. In contrast, only one method was still sufficiently accurate with subpopulations smaller than 1% of the total population. This study proposes a number of rational approximations to quantifying small subpopulations and offers an easy-to-use protocol for their implementation in the open source statistical software environment R.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Background: The repertoire of statistical methods dealing with the descriptive analysis of the burden of a disease has been expanded and implemented in statistical software packages during the last years. The purpose of this paper is to present a web-based tool, REGSTATTOOLS http://regstattools.net intended to provide analysis for the burden of cancer, or other group of disease registry data. Three software applications are included in REGSTATTOOLS: SART (analysis of disease"s rates and its time trends), RiskDiff (analysis of percent changes in the rates due to demographic factors and risk of developing or dying from a disease) and WAERS (relative survival analysis). Results: We show a real-data application through the assessment of the burden of tobacco-related cancer incidence in two Spanish regions in the period 1995-2004. Making use of SART we show that lung cancer is the most common cancer among those cancers, with rising trends in incidence among women. We compared 2000-2004 data with that of 1995-1999 to assess percent changes in the number of cases as well as relative survival using RiskDiff and WAERS, respectively. We show that the net change increase in lung cancer cases among women was mainly attributable to an increased risk of developing lung cancer, whereas in men it is attributable to the increase in population size. Among men, lung cancer relative survival was higher in 2000-2004 than in 1995-1999, whereas it was similar among women when these time periods were compared. Conclusions: Unlike other similar applications, REGSTATTOOLS does not require local software installation and it is simple to use, fast and easy to interpret. It is a set of web-based statistical tools intended for automated calculation of population indicators that any professional in health or social sciences may require.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

As in any field of scientific inquiry, advancements in the field of second language acquisition (SLA) rely in part on the interpretation and generalizability of study findings using quantitative data analysis and inferential statistics. While statistical techniques such as ANOVA and t-tests are widely used in second language research, this review article provides a review of a class of newer statistical models that have not yet been widely adopted in the field, but have garnered interest in other fields of language research. The class of statistical models called mixed-effects models are introduced, and the potential benefits of these models for the second language researcher are discussed. A simple example of mixed-effects data analysis using the statistical software package R (R Development Core Team, 2011) is provided as an introduction to the use of these statistical techniques, and to exemplify how such analyses can be reported in research articles. It is concluded that mixed-effects models provide the second language researcher with a powerful tool for the analysis of a variety of types of second language acquisition data.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Statistical software is now commonly available to calculate Power (P') and sample size (N) for most experimental designs. In many circumstances, however, sample size is constrained by lack of time, cost, and in research involving human subjects, the problems of recruiting suitable individuals. In addition, the calculation of N is often based on erroneous assumptions about variability and therefore such estimates are often inaccurate. At best, we would suggest that such calculations provide only a very rough guide of how to proceed in an experiment. Nevertheless, calculation of P' is very useful especially in experiments that have failed to detect a difference which the experimenter thought was present. We would recommend that P' should always be calculated in these circumstances to determine whether the experiment was actually too small to test null hypotheses adequately.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

2010 Mathematics Subject Classification: 60J80.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

For a long time, electronic data analysis has been associated with quantitative methods. However, Computer Assisted Qualitative Data Analysis Software (CAQDAS) are increasingly being developed. Although the CAQDAS has been there for decades, very few qualitative health researchers report using it. This may be due to the difficulties that one has to go through to master the software and the misconceptions that are associated with using CAQDAS. While the issue of mastering CAQDAS has received ample attention, little has been done to address the misconceptions associated with CAQDAS. In this paper, the author reflects on his experience of interacting with one of the popular CAQDAS (NVivo) in order to provide evidence-based implications of using the software. The key message is that unlike statistical software, the main function of CAQDAS is not to analyse data but rather to aid the analysis process, which the researcher must always remain in control of. In other words, researchers must equally know that no software can analyse qualitative data. CAQDAS are basically data management packages, which support the researcher during analysis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Universidade Estadual de Campinas . Faculdade de Educação Física

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Neste trabalho são descritas as técnicas de análise estatística utilizadas e a acessibilidade estatística em uma amostra dos artigos originais publicados no período 1996-2006 em duas revistas de pesquisa na área de fruticultura: a Revista Brasileira de Fruticultura (RBF) e a revista francesa Fruits. No total foram classificados 986 artigos em 16 categorias de análise estatística, ordenadas em grau ascendente de complexidade. No período analisado, foi constatado um aumento no uso de análises mais sofisticadas ao longo do tempo em ambos as revistas. Os trabalhos publicados pela RBF aplicaram com maior freqüência técnicas estatísticas mais complexas, com maior utilização de delineamentos em blocos aleatorizados, arranjos fatoriais, parcelas subdivididas e modelos hierárquicos, e do teste de Tukey para comparações múltiplas de médias. Nos trabalhos publicados pela revista Fruits, predominou o uso de outros testes paramétricos e do teste de Duncan. O pacote estatístico SAS foi o mais utilizado nos artigos publicados em ambas as revistas. Os leitores da revista RBF precisaram de um nível de conhecimento estatístico mais elevado para ter acesso à maior parte dos artigos publicados no período, em comparação com os leitores da revista francesa.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: To determine the prevalence of diabetes mellitus (DM) and impaired glucose tolerance (IGT) in a rural community (Bengo) of Angola. Methods: A random sample of 421 subjects aged 30 to 69 years (30% men and 70% women) was selected from three villages of Bengo province. This cross-sectional home survey was conducted using a sampling design of stage conglomerates. First, clinical and anthropometric data were obtained and fasting capillary glucose level was determined. Subjects who screened positive (fasting capillary glucose >= 100 mg/dl and < 200 mg/dl) and each sixth consecutive subject who screened negative (fasting capillary glucose < 100 mg/dl) were submitted to the second phase of survey, consisting of the 75 g oral glucose tolerance test. Data was analyzed by the use of SAS statistical software. Results: The prevalence rates of diabetes mellitus and IGT were 2.8% and 8.1%, respectively. The age group with the highest prevalence of diabetes was 60 to 69 years (42%). Impaired glucose tolerance prevalence was 38% in the 40 to 49 year age group and it increased with age, considering that the 50 to 59 and 60 to 69 year age groups as a whole represent 50% of all subjects with impaired glucose tolerance. The prevalence of diabetes mellitus did not differ significantly between men (3.2%) and women (2.7%) (p = 0.47). On the other hand, the prevalence of impaired glucose tolerance among women showed almost twice that found in men (9.1% vs. 5.6%, respectively). Overweight was present in 66.7% of the individuals with diabetes mellitus and 26.5% of individuals with impaired glucose tolerance showed overweight or obesity. Conclusions: Although the prevalence of diabetes mellitus was low, the prevalence of impaired glucose tolerance is considered to be within an intermediary range, suggesting a future increase in the frequency of diabetes in this population.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

OBJECTIVE To evaluate the frequency and clinical features of endemic and other opportunistic infections in liver or kidney transplant recipients in four transplant centres in different geographical areas of Brazil. METHODS Retrospective analysis of medical and laboratory records of four transplant centres on endemic and other opportunistic infections in liver or kidney transplant recipients. Analyses were performed with spss statistical software. RESULTS From 2001 to 2006, 1046 kidney and 708 liver transplants were registered in all centres. The average age was 42 years. Among 82 (4.7%) cases with infections, the most frequent was tuberculosis (2.0%), followed by systemic protozoal infections (0.7%), toxoplasmosis (0.4%) and visceral leishmaniasis (0.3%). Systemic fungal infections occurred in 0.6%, of which 0.4% were cryptococcosis and 0.2% were histoplasmosis. Dengue was the only systemic viral infection and was registered in two cases (0.1%), of which one was classified as the classic form and the other as dengue haemorrhagic fever. Nocardiosis was described in one case (0.05%). The infectious agents most frequently associated with diarrhoea were Blastocystis sp., Schistosoma mansoni and Strongyloides stercoralis. CONCLUSIONS Opportunistic Infections in transplant patients have a wide spectrum and may vary from asymptomatic to severe infections with high mortality. A better understanding of the epidemiology of endemic pathogens and clinical manifestations can contribute to the establishment of an early diagnosis as well as correct treatment aimed at decreasing morbidity and mortality.