994 resultados para transformed data
Resumo:
There are two main types of data sources of income distributions in China: household survey data and grouped data. Household survey data are typically available for isolated years and individual provinces. In comparison, aggregate or grouped data are typically available more frequently and usually have national coverage. In principle, grouped data allow investigation of the change of inequality over longer, continuous periods of time, and the identification of patterns of inequality across broader regions. Nevertheless, a major limitation of grouped data is that only mean (average) income and income shares of quintile or decile groups of the population are reported. Directly using grouped data reported in this format is equivalent to assuming that all individuals in a quintile or decile group have the same income. This potentially distorts the estimate of inequality within each region. The aim of this paper is to apply an improved econometric method designed to use grouped data to study income inequality in China. A generalized beta distribution is employed to model income inequality in China at various levels and periods of time. The generalized beta distribution is more general and flexible than the lognormal distribution that has been used in past research, and also relaxes the assumption of a uniform distribution of income within quintile and decile groups of populations. The paper studies the nature and extent of inequality in rural and urban China over the period 1978 to 2002. Income inequality in the whole of China is then modeled using a mixture of province-specific distributions. The estimated results are used to study the trends in national inequality, and to discuss the empirical findings in the light of economic reforms, regional policies, and globalization of the Chinese economy.
Resumo:
This article investigates the researcher's work in the coproduction (or not) of complaint sequences in research interviews. Using a conversation analytic approach, we show how the interviewer's management of complaint sequences in a research setting is consequential for subsequent talk and thus directly affects the data generated. In the examples shown here, researchers sharing cocategorial incumbency with respondents may well provide spaces for research participants to formulate complaints. This article examines sequences of talk surrounding complaints to show how researchers generate complaints (or not) and handle unsafe complaints. Researchers are able to provoke specific types of accounts from respondents, whereas their respondents may actively resist the researchers' direction. For researchers using the interview as a method of data generation, examination of complaint sequences and how these appear in interview data provides insight into how interview talk is coproduced and managed within a socially situated setting.
Resumo:
With the proliferation of relational database programs for PC's and other platforms, many business end-users are creating, maintaining, and querying their own databases. More importantly, business end-users use the output of these queries as the basis for operational, tactical, and strategic decisions. Inaccurate data reduce the expected quality of these decisions. Implementing various input validation controls, including higher levels of normalisation, can reduce the number of data anomalies entering the databases. Even in well-maintained databases, however, data anomalies will still accumulate. To improve the quality of data, databases can be queried periodically to locate and correct anomalies. This paper reports the results of two experiments that investigated the effects of different data structures on business end-users' abilities to detect data anomalies in a relational database. The results demonstrate that both unnormalised and higher levels of normalisation lower the effectiveness and efficiency of queries relative to the first normal form. First normal form databases appear to provide the most effective and efficient data structure for business end-users formulating queries to detect data anomalies.
Resumo:
The collection of spatial information to quantify changes to the state and condition of the environment is a fundamental component of conservation or sustainable utilization of tropical and subtropical forests, Age is an important structural attribute of old-growth forests influencing biological diversity in Australia eucalypt forests. Aerial photograph interpretation has traditionally been used for mapping the age and structure of forest stands. However this method is subjective and is not able to accurately capture fine to landscape scale variation necessary for ecological studies. Identification and mapping of fine to landscape scale vegetative structural attributes will allow the compilation of information associated with Montreal Process indicators lb and ld, which seek to determine linkages between age structure and the diversity and abundance of forest fauna populations. This project integrated measurements of structural attributes derived from a canopy-height elevation model with results from a geometrical-optical/spectral mixture analysis model to map forest age structure at a landscape scale. The availability of multiple-scale data allows the transfer of high-resolution attributes to landscape scale monitoring. Multispectral image data were obtained from a DMSV (Digital Multi-Spectral Video) sensor over St Mary's State Forest in Southeast Queensland, Australia. Local scene variance levels for different forest tapes calculated from the DMSV data were used to optimize the tree density and canopy size output in a geometric-optical model applied to a Landsat Thematic Mapper (TU) data set. Airborne laser scanner data obtained over the project area were used to calibrate a digital filter to extract tree heights from a digital elevation model that was derived from scanned colour stereopairs. The modelled estimates of tree height, crown size, and tree density were used to produce a decision-tree classification of forest successional stage at a landscape scale. The results obtained (72% accuracy), were limited in validation, but demonstrate potential for using the multi-scale methodology to provide spatial information for forestry policy objectives (ie., monitoring forest age structure).
Resumo:
In order to understand the determinants of schistosome-related hepato- and spleno-megaly better, 14 002 subjects aged 3-60 years (59% male; mean age =32 years) were randomly selected from 43 villages, all in Hunan province, China, where schistosomiasis caused by Schistosoma japonicum is endemic. The abdomen of each subject was examined along the mid-sternal (MSL) and mid-clavicular lines, for evidence of current hepato- and/or spleno-megaly, and a questionnaire was used to collect information on the medical history of each individual. Current infections with S. japonicum were detected by stool examination. Almost all (99.8%) of the subjects were ethnically Han by descent and most (77%) were engaged in farming. Although schistosomiasis appeared common (42% of the subjects claiming to have had the disease), only 45% of the subjects said they had received anti-schistosomiasis drugs. Overall, 1982 (14%) of the subjects had S. japonicum infections (as revealed by miracidium-hatching tests and/or Katon Katz smears) when examined and 22% had palpable hepatomegaly (i.e. enlargement of at least 3 cm along the MSL), although only 2.5% had any form of detectable splenomegaly (i.e. a Hackett's grade of at least 1). Multiple logistic regression revealed that male subjects, fishermen, farmers, subjects aged greater than or equal to 25 years, subjects with a history of schistosomiasis, and subjects who had had bloody stools in the previous 2 weeks were all at relatively high risk of hepato- and/or spleno-megaly. In areas moderately endemic for Schistosoma japonicum, occupational exposure and disease history appear to be good predictors of current disease status among older residents. These results reconfirm those reported earlier in the same region.
Resumo:
Medication data retrieved from Australian Repatriation Pharmaceutical Benefits Scheme (RPBS) claims for 44 veterans residing in nursing homes and Pharmaceutical Benefits Scheme (PBS) claims for 898 nursing home residents were compared with medication data from nursing home records to determine the optimal time interval for retrieving claims data and its validity. Optimal matching was achieved using 12 weeks of RPBS claims data, with 60% of medications in the RPBS claims located in nursing home administration records, and 78% of medications administered to nursing home residents identified in RPBS claims. In comparison, 48% of medications administered to nursing home residents could be found in 12 weeks of PBS data, and 56% of medications present in PBS claims could be matched with nursing home administration records. RPBS claims data was superior to PBS, due to the larger number of scheduled items available to veterans and the veteran's file number, which acts as a unique identifier. These findings should be taken into account when using prescription claims data for medication histories, prescriber feedback, drug utilisation, intervention or epidemiological studies. (C) 2001 Elsevier Science Inc. All rights reserved.
Resumo:
The Eysenck Personality Questionnaire-Revised (EPQ-R), the Eysenck Personality Profiler Short Version (EPP-S), and the Big Five Inventory (BFI-V4a) were administered to 135 postgraduate students of business in Pakistan. Whilst Extraversion and Neuroticism scales from the three questionnaires were highly correlated, it was found that Agreeableness was most highly correlated with Psychoticism in the EPQ-R and Conscientiousness was most highly correlated with Psychoticism in the EPP-S. Principal component analyses with varimax rotation were carried out. The analyses generally suggested that the five factor model rather than the three-factor model was more robust and better for interpretation of all the higher order scales of the EPQ-R, EPP-S, and BFI-V4a in the Pakistani data. Results show that the superiority of the five factor solution results from the inclusion of a broader variety of personality scales in the input data, whereas Eysenck's three factor solution seems to be best when a less complete but possibly more important set of variables are input. (C) 2001 Elsevier Science Ltd. All rights reserved.
Resumo:
OBJECTIVE: To establish body mass index (BMI) norms for standard figural stimuli using a large Caucasian population-based sample. In addition, we sought to determine the effectiveness of the figural stimuli to identify individuals as obese or thin. DESIGN: All Caucasian twins born in Virginia between 1915 and 1971 were identified by public birth record. In addition, 3347 individual twins responded to a letter published in the newsletter of the American Association of Retired Persons (AARP). All adult twins (aged 18 and over) from both of these sources and their family members were mailed a 16 page 'Health and Lifestyle' questionnaire. SUBJECTS: BMI and silhouette data were available on 16 728 females and 11 366 males ranging in age from 18- 100. MEASUREMENTS: Self-report information on height-weight, current body size, desired body size and a discrepancy score using standard figural stimuli. RESULTS: Gender- and age-specific norms are presented linking BMI to each of the figural stimuli. Additional norms for desired body size and discrepancy scores are also presented. Receiver operating curves (ROC) indicate that the figural stimuli are effective in classifying individuals as obese or thin. CONCLUSIONS: With the establishment of these norms, the silhouettes used in standard body image assessment can now be linked to BMI. Differences were observed between women and men in terms of desired body size and discrepancy scores, with women preferring smaller sizes. The figural stimuli are a robust technique for classifying individuals as obese or thin.
Resumo:
Seven hundred and nineteen samples from throughout the Cainozoic section in CRP-3 were analysed by a Malvern Mastersizer laser particle analyser, in order to derive a stratigraphic distribution of grain-size parameters downhole. Entropy analysis of these data (using the method of Woolfe and Michibayashi, 1995) allowed recognition of four groups of samples, each group characterised by a distinctive grain-size distribution. Group 1, which shows a multi-modal distribution, corresponds to mudrocks, interbedded mudrock/sandstone facies, muddy sandstones and diamictites. Group 2, with a sand-grade mode but showing wide dispersion of particle size, corresponds to muddy sandstones, a few cleaner sandstones and some conglomerates. Group 3 and Group 4 are also sand-dominated, with better grain-size sorting, and correspond to clean, well-washed sandstones of varying mean grain-size (medium and fine modes, respectively). The downhole disappearance of Group 1, and dominance of Groups 3 and 4 reflect a concomitant change from mudrock- and diamictite-rich lithology to a section dominated by clean, well-washed sandstones with minor conglomerates. Progressive downhole increases in percentage sand and principal mode also reflect these changes. Significant shifts in grain-size parameters and entropy group membership were noted across sequence boundaries and seismic reflectors, as recognised in others studies.
Resumo:
Use of specific histone deacetylase inhibitors has revealed critical roles for the histone deacetylases (HDAC) in controlling proliferation. Although many studies have correlated the function of HDAC inhibitors with the hyperacetylation of histones, few studies have specifically addressed whether the accumulation of acetylated histones, caused by HDAC inhibitor treatment, is responsible for growth inhibition. In the present study we show that HDAC inhibitors cause growth inhibition in normal and transformed keratinocytes but not in normal dermal fibroblasts, This was despite the observation that the HDAC inhibitor, suberic bishydroxamate (SBHA), caused a kinetically similar accumulation of hyperacetylated histones, This cell type-specific response to SBHA was not due to the inactivation of SBHA by fibroblasts, nor was it due to differences in the expression of specific HDAC family members. Remarkably, overexpression of HDACs 1, 4, and 6 in normal human fibroblasts resulted in cells that could be growth-inhibited by SBHA. These data suggest that, although histone acetylation is a major target for HDAC inhibitors, the accumulation of hyperacetylated histones is not sufficient to cause growth inhibition in all cell types, This suggests that growth inhibition, caused by HDAC inhibitors, may be the culmination of histone hyperacetylation acting in concert with other growth regulatory pathways.