772 results for Random Sample Size
Abstract:
Obesity and related chronic diseases represent a tremendous public health burden among Mexican Americans, a young and rapidly expanding population. This study investigated the impact of variation within eight candidate obesity genes, which include leptin (LEP), leptin receptor (LEPR), neuropeptide Y (NPY), NPYY1 receptor (NPYY1), glucagon-like peptide-1 (GLP-1), GLP-1 receptor (GLP1R), beta-3 adrenergic receptor (β3AR), and uncoupling protein (UCP1), on variation in human obesity status and/or quantitative traits related to obesity in Mexican Americans from Starr County, Texas. The Trp64Arg polymorphism within β3AR was typed in 820 random individuals and 240 pedigrees (N = 2,044). The Arg allele frequency was significantly greater in obese versus non-obese individuals (0.20 versus 0.15, respectively). In addition, within the random sample, the Arg allele was associated with significantly greater body weight (p = 0.031) and body mass index (BMI, p = 0.008) than the Trp allele. In the family sample, the Trp64Arg locus was also linked to percent fat (p = 0.045) but not to body weight or BMI. No linkage between obesity, diabetes, hypertension, or gallbladder disease and the Trp64Arg mutation was observed in families using affected sib pair linkage analysis or the transmission disequilibrium test. Microsatellite markers proximate to the remaining seven genes were typed in 302 individuals from 59 families. Sib pair linkage analysis provided evidence for linkage between obesity and NPY within affected sibling pairs (p = 0.042; n = 170 pairs). NPY was also linked to weight (p = 0.020), abdominal circumference (p = 0.031), hip circumference (p = 0.012), DBP (p ≤ 0.005), and a composite measure of body mass/fat (p ≤ 0.048) in all sibling pairs (n = 545 pairs). Additionally, LEP was linked to waist/hip ratio (p ≤ 0.009), total cholesterol (p ≤ 0.030), and HDL cholesterol (p ≤ 0.026), and LEPR was linked to fasting blood glucose (p ≤ 0.018) and DBP (p ≤ 0.003).
Subsequent to the linkage analyses, the NPY gene was sequenced and eight variant sites identified. Two variant sites (-880I/D and 69I/D) were typed in a random sample of 914 individuals. The -880I/D variant was significantly associated with waist/hip ratio (p = 0.035) in the entire sample (N = 914) and with BMI (p = 0.031), abdominal circumference (p = 0.044), and waist/hip ratio (p = 0.041) in a non-obese subsample (BMI < 30 kg/m², n = 594). The 69I/D variant was a rare mutation observed in only one pedigree and was not associated with obesity or body size/mass within this pedigree. Results of this study indicate that variation at or near β3AR, LEP, LEPR, and NPY may exert effects which increase obesity susceptibility and influence obesity-related measures in this population.
Abstract:
Sample size calculations are advocated by the CONSORT group to justify sample sizes in randomized controlled trials (RCTs). The aim of this study was primarily to evaluate the reporting of sample size calculations, to establish the accuracy of these calculations in dental RCTs and to explore potential predictors associated with adequate reporting. Electronic searching was undertaken in eight leading specific and general dental journals. Replication of sample size calculations was undertaken where possible. Assumed variances or odds for control and intervention groups were also compared against those observed. The relationship between parameters including journal type, number of authors, trial design, involvement of methodologist, single-/multi-center study and region and year of publication, and the accuracy of sample size reporting was assessed using univariable and multivariable logistic regression. Of 413 RCTs identified, sufficient information to allow replication of sample size calculations was provided in only 121 studies (29.3%). Recalculations demonstrated an overall median overestimation of sample size of 15.2% after provisions for losses to follow-up. There was evidence that journal, methodologist involvement (OR = 1.97, CI: 1.10, 3.53), multi-center settings (OR = 1.86, CI: 1.01, 3.43) and time since publication (OR = 1.24, CI: 1.12, 1.38) were significant predictors of adequate description of sample size assumptions. Among journals JCP had the highest odds of adequately reporting sufficient data to permit sample size recalculation, followed by AJODO and JDR, with 61% (OR = 0.39, CI: 0.19, 0.80) and 66% (OR = 0.34, CI: 0.15, 0.75) lower odds, respectively. Both assumed variances and odds were found to underestimate the observed values. Presentation of sample size calculations in the dental literature is suboptimal; incorrect assumptions may have a bearing on the power of RCTs.
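To make concrete what replicating a sample size calculation involves, here is a minimal sketch for the simplest case: a two-arm trial comparing means with equal allocation, under the normal approximation. The function name `n_per_group` and the numbers in the usage note are illustrative assumptions and are not taken from the study, which does not state the formulas it used.

```python
import math
from statistics import NormalDist


def n_per_group(delta, sd, alpha=0.05, power=0.80, dropout=0.0):
    """Participants per arm for a two-sided comparison of two means.

    delta   -- smallest difference in means worth detecting (assumed)
    sd      -- assumed common standard deviation
    dropout -- expected proportion lost to follow-up (0 <= dropout < 1)
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    z_beta = NormalDist().inv_cdf(power)           # quantile for target power
    n = 2 * ((z_alpha + z_beta) * sd / delta) ** 2
    # Inflate so the target n is still expected after losses to follow-up.
    return math.ceil(n / (1 - dropout))
```

For example, `n_per_group(delta=5, sd=10)` gives 63 per arm, and allowing for 10% loss to follow-up raises this to 70. Because the required n scales with the square of the assumed standard deviation, an assumed variance that underestimates the observed one leaves a trial with less power than planned.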
Abstract:
Stroke is one of the most common conditions requiring rehabilitation, and its motor impairments are a major cause of permanent disability. Hemiparesis is observed in 80% of patients after acute stroke. Neuroimaging studies have shown that real and imagined movements produce similar brain activation, supplying evidence that both rely on the same process. Within this context, combining mental practice (MP) with physical and occupational therapy appears to be a natural complement based on neurorehabilitation concepts. Our study investigates whether MP is an effective adjunct therapy for upper limb rehabilitation after stroke. Searches of PubMed (Medline), ISI Knowledge (Institute for Scientific Information) and SciELO (Scientific Electronic Library) were completed on 20 February 2015. Data were collected on the following variables: sample size, type of supervision, configuration of mental practice, setting of the physical practice (intensity, number of sets and repetitions, duration of contractions, rest interval between sets, weekly and total duration), measures of sensorimotor deficits used in the main studies, and significant results. Random effects models that take into account the variance within and between studies were used. Seven articles were selected. There was no statistically significant difference between the two groups (MP vs. control) in upper limb motor restoration after stroke, with a pooled estimate of –0.6 (95% CI: –1.27 to 0.04). The present meta-analysis concluded that MP is not effective as an adjunct therapeutic strategy for upper limb motor restoration after stroke.
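A random effects model of the kind mentioned above can be sketched as follows. The abstract does not say which estimator was used, so the DerSimonian-Laird method shown here is an assumption, and the function name `dersimonian_laird` is hypothetical. Inputs are per-study effect estimates and their within-study variances.

```python
import math


def dersimonian_laird(effects, variances):
    """Pool study effects under a random effects model (DerSimonian-Laird).

    Returns (pooled_effect, standard_error, tau2), where tau2 is the
    estimated between-study variance.
    """
    w = [1.0 / v for v in variances]                 # fixed-effect weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Cochran's Q measures heterogeneity around the fixed-effect mean.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    df = len(effects) - 1
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0  # truncated at zero
    # Random-effects weights add tau2 to each within-study variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2


def confidence_interval(pooled, se, z=1.96):
    """Approximate 95% confidence interval for the pooled effect."""
    return pooled - z * se, pooled + z * se
```

A confidence interval that includes zero, such as the –1.27 to 0.04 reported here, corresponds to a non-significant pooled effect at the 5% level.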
Abstract:
This paper examines how the geospatial accuracy of samples and sample size influence conclusions from geospatial analyses. It does so using the example of a study investigating the global phenomenon of large-scale land acquisitions and the socio-ecological characteristics of the areas they target. First, we analysed land deal datasets of varying geospatial accuracy and varying sizes and compared the results in terms of land cover, population density, and two indicators for agricultural potential: yield gap and availability of uncultivated land that is suitable for rainfed agriculture. We found that an increase in geospatial accuracy led to a substantial and greater change in conclusions about the land cover types targeted than an increase in sample size, suggesting that using a sample of higher geospatial accuracy does more to improve results than using a larger sample. The same finding emerged for population density, yield gap, and the availability of uncultivated land suitable for rainfed agriculture. Furthermore, the statistical median proved to be more consistent than the mean when comparing the descriptive statistics for datasets of different geospatial accuracy. Second, we analysed effects of geospatial accuracy on estimations regarding the potential for advancing agricultural development in target contexts. Our results show that the target contexts of the majority of land deals in our sample whose geolocation is known with a high level of accuracy contain smaller amounts of suitable, but uncultivated land than regional- and national-scale averages suggest. Consequently, the more target contexts vary within a country, the more detailed the spatial scale of analysis has to be in order to draw meaningful conclusions about the phenomena under investigation. We therefore advise against using national-scale statistics to approximate or characterize phenomena that have a local-scale impact, particularly if key indicators vary widely within a country.
Abstract:
BACKGROUND: There is increasing evidence that a history of childhood abuse and neglect is not uncommon among individuals who experience mental disorder and that childhood trauma experiences are associated with adult psychopathology. Although several interview and self-report instruments for retrospective trauma assessment have been developed, many focus on sexual abuse (SexAb) rather than on multiple types of trauma or adversity. METHODS: Within the European Prediction of Psychosis Study, the Trauma and Distress Scale (TADS) was developed as a new self-report assessment of multiple types of childhood trauma and distressing experiences. The TADS includes 43 items and, following previous measures including the Childhood Trauma Questionnaire, focuses on five core domains: emotional neglect (EmoNeg), emotional abuse (EmoAb), physical neglect (PhyNeg), physical abuse (PhyAb), and SexAb. This study explores the psychometric properties of the TADS (internal consistency and concurrent validity) in 692 participants drawn from the general population who completed a mailed questionnaire, including the TADS, a depression self-report and questions on help-seeking for mental health problems. Inter-method reliability was examined in a random sample of 100 responders who were reassessed in telephone interviews. RESULTS: After minor revisions of PhyNeg and PhyAb, internal consistencies were good for TADS totals and the domain raw score sums. Intra-class coefficients for TADS total score and the five revised core domains were all good to excellent when compared to the interviewed TADS as a gold standard. In the concurrent validity analyses, the total TADS and all its core domains were significantly associated with depression and help-seeking for mental problems as proxy measures for traumatisation. In addition, robust cutoffs for the total TADS and its domains were calculated.
CONCLUSIONS: Our results suggest the TADS as a valid, reliable, and clinically useful instrument for assessing retrospectively reported childhood traumatisation.
Abstract:
Currently there is no general method to study the impact of population admixture within families on the assumptions of random mating and consequently, Hardy-Weinberg equilibrium (HWE) and linkage equilibrium (LE) and on the inference obtained from traditional linkage analysis. First, through simulation, the effect of admixture of two populations on the log of the odds (LOD) score was assessed, using Prostate Cancer as the typical disease model. Comparisons between simulated mixed and homogeneous families were performed. LOD scores under both models of admixture (within families and within a data set of homogeneous families) were closest to the homogeneous family scores of the population having the highest mixing proportion. Random sampling of families or ascertainment of families with disease affection status did not affect this observation, nor did the mode of inheritance (dominant/recessive) or sample size. Second, after establishing the effect of admixture on the LOD score and inference for linkage, the presence of induced disequilibria by population admixture within families was studied and an adjustment procedure was developed. The adjustment did not force all disequilibria to disappear but because the families were adjusted for the population admixture, those replicates where the disequilibria exist are no longer affected by the disequilibria in terms of maximization for linkage. Furthermore, the adjustment was able to exclude uninformative families or families that had such a high departure from HWE and/or LE that their LOD scores were not reliable. Together these observations imply that the presence of families of mixed population ancestry impacts linkage analysis in terms of the LOD score and the estimate of the recombination fraction.
Abstract:
With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r2 measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. 
We show that an empirical genotype-based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well as or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies.
Abstract:
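The idea of a global test based on the permutation distribution of the maximum statistic can be sketched as below. This is an illustration under stated assumptions, not the dissertation's procedure: the per-SNP statistic is taken to be the absolute Pearson correlation between genotype codes and a quantitative phenotype, and the names `abs_corr`, `max_stat`, and `global_permutation_p` are hypothetical.

```python
import math
import random


def abs_corr(x, y):
    """Absolute Pearson correlation; 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / math.sqrt(sxx * syy)


def max_stat(snp_columns, phenotype):
    """Largest single-SNP statistic across all typed SNPs."""
    return max(abs_corr(col, phenotype) for col in snp_columns)


def global_permutation_p(snp_columns, phenotype, n_perm=999, seed=0):
    """Global p-value from the permutation distribution of the maximum.

    snp_columns -- one list of genotype codes (0, 1, 2) per SNP
    phenotype   -- quantitative trait values, one per individual
    """
    rng = random.Random(seed)
    observed = max_stat(snp_columns, phenotype)
    shuffled = list(phenotype)
    exceed = 0
    for _ in range(n_perm):
        # Shuffling phenotypes breaks genotype-phenotype association
        # while preserving the correlation (LD) among the SNPs.
        rng.shuffle(shuffled)
        if max_stat(snp_columns, shuffled) >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)
```

Because each permutation recomputes the maximum over all SNPs, a single p-value covers the whole set of tests, which is how this kind of procedure holds the overall type I error rate without a separate multiple-testing correction.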
With the recognition of the importance of evidence-based medicine, there is an emerging need for methods to systematically synthesize available data. Specifically, methods to provide accurate estimates of test characteristics for diagnostic tests are needed to help physicians make better clinical decisions. To provide more flexible approaches for meta-analysis of diagnostic tests, we developed three Bayesian generalized linear models. Two of these models, a bivariate normal and a binomial model, analyzed pairs of sensitivity and specificity values while incorporating the correlation between these two outcome variables. Noninformative independent uniform priors were used for the variance of sensitivity, specificity and correlation. We also applied an inverse Wishart prior to check the sensitivity of the results. The third model was a multinomial model where the test results were modeled as multinomial random variables. All three models can include specific imaging techniques as covariates in order to compare performance. Vague normal priors were assigned to the coefficients of the covariates. The computations were carried out using the 'Bayesian inference using Gibbs sampling' implementation of Markov chain Monte Carlo techniques. We investigated the properties of the three proposed models through extensive simulation studies. We also applied these models to a previously published meta-analysis dataset on cervical cancer as well as to an unpublished melanoma dataset. In general, our findings show that the point estimates of sensitivity and specificity were consistent among Bayesian and frequentist bivariate normal and binomial models. However, in the simulation studies, the estimates of the correlation coefficient from Bayesian bivariate models are not as good as those obtained from frequentist estimation regardless of which prior distribution was used for the covariance matrix. 
The Bayesian multinomial model consistently underestimated the sensitivity and specificity regardless of the sample size and correlation coefficient. In conclusion, the Bayesian bivariate binomial model provides the most flexible framework for future applications because of its following strengths: (1) it facilitates direct comparison between different tests; (2) it captures the variability in both sensitivity and specificity simultaneously as well as the intercorrelation between the two; and (3) it can be directly applied to sparse data without ad hoc correction.
Abstract:
Background. Each year thousands of people participate in mass health screenings for diabetes and hypertension, but little is known about whether or not those who receive higher than normal screening results obtain the recommended follow-up medical care, or what barriers they perceive to doing so. Methods. Study participants were recruited from attendees at three health fairs in low-income neighborhoods in Houston, Texas. Potential participants had higher than normal blood pressure (> 140/90 mmHg) or blood glucose readings (100 mg/dL fasting or 140 mg/dL random). Study participants were called at one, two, and three months and asked if they had obtained follow-up medical care; those who had not yet obtained follow-up care were asked to identify barriers. Using a modified Aday-Andersen model of health service access, the independent variables were individual and community characteristics and self-perceived need. The dependent variable was obtaining follow-up care, with barriers to care a secondary outcome. Results. Eighty-two study participants completed the initial questionnaire and 59 participants completed the study protocol. Forty-eight participants (59% under an intent-to-treat analysis, 81% of those completing the study protocol) obtained follow-up care. Those who completed the initial questionnaire and who reported a regular source of care were significantly more likely to obtain follow-up care. For those who completed the study protocol the relationship between having a regular source of care and obtaining follow-up care approached but did not reach significance. For those who completed the initial questionnaire, self-described health status, when examined as a binary variable (good, very good, excellent, or poor, fair, not sure) was associated with obtaining follow-up care for those who rated their health as poor, fair, or not sure.
While the group who completed the study protocol did not reach statistical significance, the same relationship between self-described health status of poor, fair, or not sure and obtaining follow-up care was present. The participants who completed the study protocol and described their blood pressure as OK or a little high were statistically more likely to get follow-up care than those who described it as high or very high. All those on oral medications for hypertension (12/12) and diabetes (4/4) who were told to obtain follow-up care did so; however, the small sample size allows this correlation to be of statistical significance only for those treating hypertension. The variables significantly associated with obtaining follow-up care were having a regular source of care, self-described health status of poor, fair, or not sure, self-described blood pressure of OK or a little high, and taking medication for blood pressure. At the follow-up telephone calls, 34 participants identified barriers to care; cost was a significant barrier reported by 16 participants, and 10 reported that they did not have time because they were working long hours after Hurricane Ike. The study included the offer of access assistance: information about nearby safety-net providers, a visit to or information from the Health Information Center at their Neighborhood Center location, or information from Project Safety Net (a searchable web site for safety net providers). Access assistance was offered at the health fairs and then again at follow-up telephone calls to those who had not yet obtained follow-up care. Of the 48 participants who reported obtaining follow-up care, 26 said they had made use of the access assistance to do so. The use of access assistance was associated with being Hispanic, not having health insurance or a regular source of care, and speaking Spanish. It was also associated with being worried about blood glucose. Conclusion.
Access assistance, as a community enabling characteristic, may be useful in aiding low-income people in obtaining medical care.
Abstract:
Aim. To review and explore cataract prevalence in stable and unstable countries by examining published and unpublished ocular literature about Africa from 1980 onwards. Methods. Searches using OVID, Proquest Dissertations, WHO, and Ebsco Host were done. The review was restricted to articles utilizing WHO definitions of blindness and low vision. Random cluster sampling technique with a minimum sample size of 1,500, and reporting causes of blindness categorized by age and gender were inclusion considerations in the selected articles. Results. Blindness and low vision increased with conflict. Women and the elderly were more likely to have vision impairing cataract. Cataract was the leading cause of blindness; the prevalence range was 22%–81% for the reviewed nations. Conclusion. Instability was connected to higher cataract prevalence and worse visual outcome across all characteristics examined except cataract surgical rates.
Abstract:
Health departments, research institutions, policy-makers, and healthcare providers are often interested in knowing the health status of their clients/constituents. Without the resources, financially or administratively, to go out into the community and conduct health assessments directly, these entities frequently rely on data from population-based surveys to supply the information they need. Unfortunately, these surveys are ill-equipped for the job due to sample size and privacy concerns. Small area estimation (SAE) techniques have excellent potential in such circumstances, but have been underutilized in public health due to lack of awareness and confidence in applying their methods. The goal of this research is to make model-based SAE accessible to a broad readership using clear, example-based learning. Specifically, we applied the principles of multilevel, unit-level SAE to describe the geographic distribution of HPV vaccine coverage among females aged 11-26 in Texas. Multilevel (3 level: individual, county, public health region) random-intercept logit models of HPV vaccination (receipt of ≥ 1 dose Gardasil®) were fit to data from the 2008 Behavioral Risk Factor Surveillance System (outcome and level 1 covariates) and a number of secondary sources (group-level covariates). Sampling weights were scaled (level 1) or constructed (levels 2 & 3), and incorporated at every level. Using the regression coefficients (and standard errors) from the final models, I simulated 10,000 datasets for each regression coefficient from the normal distribution and applied them to the logit model to estimate HPV vaccine coverage in each county and respective demographic subgroup. For simplicity, I only provide coverage estimates (and 95% confidence intervals) for counties. County-level coverage among females aged 11-17 varied from 6.8% to 29.0%. For females aged 18-26, coverage varied from 1.9% to 23.8%.
Aggregated to the state level, these values translate to indirect state estimates of 15.5% and 11.4%, respectively; both of which fall within the confidence intervals for the direct estimates of HPV vaccine coverage in Texas (Females 11-17: 17.7%, 95% CI: 13.6, 21.9; Females 18-26: 12.0%, 95% CI: 6.2, 17.7). Small area estimation has great potential for informing policy, program development and evaluation, and the provision of health services. Harnessing the flexibility of multilevel, unit-level SAE to estimate HPV vaccine coverage among females aged 11-26 in Texas counties, I have provided (1) practical guidance on how to conceptualize and conduct model-based SAE, (2) a robust framework that can be applied to other health outcomes or geographic levels of aggregation, and (3) HPV vaccine coverage data that may inform the development of health education programs, the provision of health services, the planning of additional research studies, and the creation of local health policies.
Abstract:
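The simulation step described above (drawing each regression coefficient from a normal distribution and passing the draws through the logit model) can be sketched as follows. This is a simplified illustration: the function name `simulate_coverage` and all inputs are hypothetical, coefficients are drawn independently, and the random intercepts and sampling weights of the actual multilevel model are not represented.

```python
import math
import random


def expit(x):
    """Inverse logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate_coverage(betas, ses, covariates, n_sims=10_000, seed=0):
    """Simulated coverage estimate with a 95% percentile interval.

    betas, ses -- coefficient estimates and their standard errors
    covariates -- covariate values for one area (1.0 for the intercept)
    Assumption: coefficients are drawn independently, ignoring any
    covariance between them.
    """
    rng = random.Random(seed)
    draws = []
    for _ in range(n_sims):
        eta = sum(rng.gauss(b, s) * x
                  for b, s, x in zip(betas, ses, covariates))
        draws.append(expit(eta))
    draws.sort()
    point = sum(draws) / n_sims
    lower = draws[int(0.025 * (n_sims - 1))]  # 2.5th percentile
    upper = draws[int(0.975 * (n_sims - 1))]  # 97.5th percentile
    return point, lower, upper
```

A call such as `simulate_coverage([-1.5, 0.4], [0.2, 0.1], [1.0, 1.0])` returns a point estimate with a percentile interval for one hypothetical area. Repeating the call with each area's covariate values would give a set of area-level estimates of the kind reported for counties.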
Investigation into the medical care utilization of elderly Medicare enrollees in an HMO (Kaiser - Portland, Oregon): The specific research topics are: (1) The utilization of medical care by selected determinants such as: place of service, type of service, type of appointment, physician status, physician specialty and number of associated morbidities. (2) The attended prevalence of 3 chronic diseases: hypertension, diabetes and arthritis in addition to pneumonias as an example of acute diseases. The selection of these examples was based on their importance in morbidity and/or mortality results among the elderly. The share of these diseases in outpatient and inpatient contacts was examined as an example of the relation between morbidity and medical care utilization. (3) The tendency of individual utilization patterns to persist in subsequent time periods. The concept of contagion or proneness was studied in a period of 2 years. Fitting the negative binomial and the Poisson distributions was applied to the utilization in the 2nd year conditional on that in the 1st year as regards outpatient and inpatient contacts. The present research is based on a longitudinal study of a 20% random sample of elderly Medicare enrollees. The sample size is 1683 individuals during the period from August 1980-December 1982. The results of the research were: (1) The distribution of contacts by selected determinants did not reveal a consistent pattern between sexes and age groups. (2) The attended prevalence of hypertension and arthritis showed excess prevalence among females. For diabetes and pneumonias no female excess was noticed. Consistent increased prevalence with increasing age was not detected. There were important findings pertaining to the relatively big share of the combined 3 chronic diseases in utilization. They accounted for 20% of male outpatient contacts vs. 25% of female outpatients. For inpatient contacts, they consumed 20% in case of males vs. 24% in case of females.
(3) Finding that the negative binomial distribution fit the utilization experience supported the research hypothesis concerning the concept of contagion in utilization. This important finding can be helpful in estimating liability functions needed for forecasting future utilization according to previous experience. Such information has its relevance to organization, administration and planning for medical care in general. (Abstract shortened with permission of author.)
Abstract:
Objectives. The central objective of this study was to systematically examine the internal structure of multihospital systems, determining the management principles used and the performance levels achieved in medical care and administrative areas. The Universe. The study universe consisted of short-term general American hospitals owned and operated by multihospital corporations. Corporations compared were the investor-owned (for-profit) and the voluntary multihospital systems. The individual hospital was the unit of analysis for the study. Theoretical Considerations. The contingency theory, using selected aspects of the classical and human relations schools of thought, seemed well suited to describe multihospital organization and was used in this research. The Study Hypotheses. The main null hypotheses generated were that there are no significant differences between the voluntary and the investor-owned multihospital sectors in their (1) hospital structures and (2) patient care and administrative performance levels. The Sample. A stratified random sample of 212 hospitals owned by multihospital systems was selected to equally represent the two study sectors. Of the sampled hospitals approached, 90.1% responded. The Analysis. Sixteen scales were constructed in conjunction with 16 structural variables developed from the major questions and sub-items of the questionnaire. This was followed by analysis of an additional 7 structural and 24 effectiveness (performance) measures, using frequency distributions. Finally, summary statistics and statistical testing for each variable and sub-items were completed and recorded in 38 tables. Study Findings. While it has been argued that there are great differences between the two sectors, this study found that with a few exceptions the null hypotheses of no difference in organizational and operational characteristics of non-profit and for-profit hospitals were accepted.
However, there were several significant differences found in the structural variables: functional specialization and autonomy were significantly higher in the voluntary sector. Only centralization was significantly higher in the investor-owned sector. Among the effectiveness measures, occupancy rate, cost of data processing, total manhours worked, F.T.E. ratios, and personnel per occupied bed were significantly higher in the voluntary sector. The findings indicated that both voluntary and for-profit systems were converging toward a common hierarchical corporate management approach. Factors of size and management style may be better descriptors to characterize a specific multihospital group than its profit or nonprofit status. (Abstract shortened with permission of author.)
Abstract:
An interim analysis is usually applied in later phase II or phase III trials to find convincing evidence of a significant treatment difference that may lead to trial termination at an earlier point than planned at the beginning. This can result in the saving of patient resources and shortening of drug development and approval time. In addition, ethics and economics are also the reasons to stop a trial earlier. In clinical trials of eyes, ears, knees, arms, kidneys, lungs, and other clustered treatments, data may include distribution-free random variables with matched and unmatched subjects in one study. It is important to properly include both subjects in the interim and the final analyses so that the maximum efficiency of statistical and clinical inferences can be obtained at different stages of the trials. So far, no publication has applied a statistical method for distribution-free data with matched and unmatched subjects in the interim analysis of clinical trials. In this simulation study, the hybrid statistic was used to estimate the empirical powers and the empirical type I errors among the simulated datasets with different sample sizes, different effect sizes, different correlation coefficients for matched pairs, and different data distributions, respectively, in the interim and final analysis with 4 different group sequential methods. Empirical powers and empirical type I errors were also compared to those estimated by using the meta-analysis t-test among the same simulated datasets. Results from this simulation study show that, compared to the meta-analysis t-test commonly used for data with normally distributed observations, the hybrid statistic has a greater power for data observed from normally, log-normally, and multinomially distributed random variables with matched and unmatched subjects and with outliers. Powers rose with the increase in sample size, effect size, and correlation coefficient for the matched pairs. 
In addition, lower type I errors were observed with the hybrid statistic, which indicates that this test is also conservative for data with outliers in the interim analysis of clinical trials.
Abstract:
The purpose of this research is to develop a new statistical method to determine the minimum set of rows (R) in an R × C contingency table of discrete data that explains the dependence of observations. The statistical power of the method will be empirically determined by computer simulation to judge its efficiency over the presently existing methods. The method will be applied to data on DNA fragment length variation at six VNTR loci in over 72 populations from five major racial groups of humans (total sample size is over 15,000 individuals; each sample having at least 50 individuals). DNA fragment lengths grouped in bins will form the basis of studying inter-population DNA variation within the racial groups. If such variation is significant, the method will provide a rigorous re-binning procedure for forensic computation of DNA profile frequencies that takes into account intra-racial DNA variation among populations.