973 results for correlated data
Abstract:
Mesoamerica, defined as the broad linguistic and cultural area from middle southern Mexico to Costa Rica, might have played a pivotal role during the colonization of the American continent. It has been suggested that the Mesoamerican isthmus could have played an important role in severely restricting prehistoric gene flow between North and South America. Although the Native American component has already been described in admixed Mexican populations, few studies have been carried out in native Mexican populations. In this study we present mitochondrial DNA (mtDNA) sequence data for the first hypervariable region (HVR-I) in 477 unrelated individuals belonging to eleven different native populations from Mexico. Almost all the Native Mexican mtDNAs could be classified into the four pan-Amerindian haplogroups (A2, B2, C1 and D1); only three of them could be allocated to the rare Native American lineage D4h3. Their haplogroup phylogenies are clearly star-like, as expected from relatively young populations that have experienced diverse episodes of genetic drift (e.g. extensive isolation and founder effects) and subsequent population expansions. Consistent with this observation, Native Mexican populations show a high degree of heterogeneity in their patterns of haplogroup frequencies. Haplogroup X2a was absent in our samples, supporting previous observations in which this clade was detected only in the northernmost areas of America. The search for identical sequences in the American continent shows that, although Native Mexican populations seem to show a closer relationship to North American populations, they cannot be related to a single geographical region within the continent. 
Finally, we did not find significant population structure on the maternal lineages when considering the four main and distinct linguistic groups represented in our Mexican samples (Oto-Manguean, Uto-Aztecan, Tarascan, and Mayan), suggesting that genetic divergence predates linguistic diversification in Mexico.
Abstract:
This paper estimates a translog stochastic frontier production function for a panel of 150 mixed Catalan farms over the period 1989-1993, in order to measure and explain variation in technical inefficiency scores with a one-stage approach. The model uses gross value added as the aggregate output measure. Total employment, fixed capital, current assets, specific costs and overhead costs are introduced into the model as inputs. Stochastic frontier estimates are compared with those obtained using a linear programming method with a two-stage approach. The translog stochastic frontier specification appears to be an appropriate representation of the data; technical change was rejected and the technical inefficiency effects were statistically significant. The mean technical efficiency in the period analyzed was estimated to be 64.0%. Farm inefficiency levels were found to be significantly (at the 5% level) and positively correlated with the number of economic size units.
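For readers unfamiliar with this model class, a translog stochastic frontier with one-stage inefficiency effects is commonly written along the following lines. This is a generic textbook form and not necessarily the exact specification estimated in the paper: the symbols $\beta$, $\delta$ and $z_{it}$, the distributional assumptions, and the omission of time-trend terms are all assumptions made here for illustration. $y_{it}$ stands for gross value added of farm $i$ in year $t$, and $x_{jit}$ for the five inputs listed in the abstract.

```latex
\begin{align}
\ln y_{it} &= \beta_0 + \sum_{j=1}^{5} \beta_j \ln x_{jit}
  + \tfrac{1}{2} \sum_{j=1}^{5} \sum_{k=1}^{5} \beta_{jk} \ln x_{jit} \ln x_{kit}
  + v_{it} - u_{it}, \qquad \beta_{jk} = \beta_{kj}, \\
v_{it} &\sim N(0, \sigma_v^2), \qquad
u_{it} \sim N^{+}\!\left(z_{it}^{\top}\delta,\ \sigma_u^2\right), \\
\mathrm{TE}_{it} &= \exp(-u_{it}).
\end{align}
```

In a one-stage approach of this kind, the frontier parameters $\beta$ and the inefficiency-effect parameters $\delta$ are estimated jointly, so that explanatory variables such as economic size enter through $z_{it}$ rather than through a second regression on estimated efficiency scores.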
Abstract:
Measurement of total energy expenditure may be crucial to understanding the relation between physical activity and disease and to framing public health interventions. To devise a self-administered physical activity frequency questionnaire (PAFQ), the following data-based approach was used. A 24-hour recall was administered to a random sample of 919 adult residents of Geneva, Switzerland. The data obtained were used to establish the list of activities (and their median duration) that contributed to 95% of the energy expended, separately for men and women. Activities that were trivial for the whole sample but that contributed ≥ 10% of an individual's energy expenditure were also selected. The final PAFQ lists 70 activities or groups of activities with their typical duration. About 20 minutes are required for respondents to indicate the number of days and the number of hours per day that they performed each activity. The PAFQ method was validated against a heart rate monitor, a more objective method. The total energy estimated by the PAFQ in 41 volunteers correlated well (r = 0.76) with estimates using a heart rate monitor. The authors conclude that their self-administered physical activity frequency questionnaire, designed from 24-hour recall data, appeared to estimate energy expenditure accurately.
Estimation of surface roughness in a semiarid region from C-band ERS-1 synthetic aperture radar data
Abstract:
In this study, we investigated the feasibility of using C-band European Remote Sensing Satellite (ERS-1) synthetic aperture radar (SAR) data to estimate surface soil roughness in a semiarid rangeland. Radar backscattering coefficients were extracted from a dry season and a wet season SAR image and were compared with 47 in situ soil roughness measurements obtained in the rocky soils of the Walnut Gulch Experimental Watershed, southeastern Arizona, USA. Both the dry and the wet season SAR data showed exponential relationships with root mean square (RMS) height measurements. The dry season C-band ERS-1 SAR data were strongly correlated (R² = 0.80), while the wet season SAR data had somewhat higher secondary variation (R² = 0.59). This lower correlation was probably caused by the stronger influence of soil moisture, which may not be negligible in the wet season SAR data. We concluded that single-configuration C-band SAR data are useful for estimating the surface roughness of rocky soils in a semiarid rangeland.
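An exponential relationship of the kind reported above can be fitted in several ways. The following is a minimal sketch, assuming the simplest approach of ordinary least squares on the logarithm of the response; the abstract does not say how the authors fitted their curves or computed R², so the function name `fit_exponential`, the log-space R², and the generic `x`/`y` roles are assumptions made here.

```python
import math


def fit_exponential(x, y):
    """Fit y = a * exp(b * x) by ordinary least squares on ln(y).

    Returns (a, b, r2), where r2 is the coefficient of determination
    of the straight-line fit in log space. Requires every y > 0.
    """
    if len(x) != len(y) or len(x) < 3:
        raise ValueError("need at least three paired observations")
    if any(v <= 0 for v in y):
        raise ValueError("a log-linear fit needs strictly positive y")

    ln_y = [math.log(v) for v in y]
    n = len(x)
    mean_x = sum(x) / n
    mean_ly = sum(ln_y) / n

    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((li - mean_ly) ** 2 for li in ln_y)
    sxy = sum((xi - mean_x) * (li - mean_ly) for xi, li in zip(x, ln_y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and ln(y) must both vary")

    b = sxy / sxx                           # slope in log space
    a = math.exp(mean_ly - b * mean_x)      # intercept back-transformed
    r2 = sxy ** 2 / (sxx * syy)             # squared correlation in log space
    return a, b, r2


# Synthetic, noise-free illustration (not data from the study).
xs = [0.5, 1.0, 1.5, 2.0, 2.5]
ys = [2.0 * math.exp(0.5 * v) for v in xs]
a, b, r2 = fit_exponential(xs, ys)
```

Note that an R² computed in log space is not identical to one computed on the original scale, so values from this sketch are not directly comparable with the figures quoted in the abstract.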
A filtering method to correct time-lapse 3D ERT data and improve imaging of natural aquifer dynamics
Abstract:
We have developed a processing methodology that allows crosshole ERT (electrical resistivity tomography) monitoring data to be used to derive temporal fluctuations of groundwater electrical resistivity and thereby characterize the dynamics of groundwater in a gravel aquifer as it is infiltrated by river water. Temporal variations of the raw ERT apparent-resistivity data were mainly sensitive to the resistivity (salinity), temperature and height of the groundwater, with the relative contributions of these effects depending on the time and the electrode configuration. To resolve the changes in groundwater resistivity, we first expressed fluctuations of temperature-detrended apparent-resistivity data as linear superpositions of (i) time series of river-water-resistivity variations convolved with suitable filter functions and (ii) linear and quadratic representations of river-water-height variations multiplied by appropriate sensitivity factors; river-water height was determined to be a reliable proxy for groundwater height. Individual filter functions and sensitivity factors were obtained for each electrode configuration via deconvolution using a one-month calibration period, and the predicted contributions related to changes in water height were then removed prior to inversion of the temperature-detrended apparent-resistivity data. Application of the filter functions and sensitivity factors accurately predicted the apparent-resistivity variations (the correlation coefficient was 0.98). Furthermore, the filtered ERT monitoring data and resultant time-lapse resistivity models correlated closely with independently measured groundwater electrical resistivity monitoring data and only weakly with the groundwater-height fluctuations. The inversion results based on the filtered ERT data also showed significantly fewer inversion artefacts than the raw data inversions. 
We observed resistivity increases of up to 10% and the arrival time peaks in the time-lapse resistivity models matched those in the groundwater resistivity monitoring data.
Abstract:
INTRODUCTION: Diverse microarray and sequencing technologies have been widely used to characterise the molecular changes in malignant epithelial cells in breast cancers. Such gene expression studies to identify markers and targets in tumour cells are, however, compromised by the cellular heterogeneity of solid breast tumours and by the lack of appropriate counterparts representing normal breast epithelial cells. METHODS: Malignant neoplastic epithelial cells from primary breast cancers and luminal and myoepithelial cells from normal human breast tissue were isolated by immunomagnetic separation methods. Pools of RNA from highly enriched preparations of these cell types were subjected to expression profiling using massively parallel signature sequencing (MPSS) and four different genome-wide microarray platforms. Functionally related transcripts of the differential tumour epithelial transcriptome were used for gene set enrichment analysis to identify enrichment of luminal and myoepithelial type genes. Clinical pathological validation of a small number of genes was performed on tissue microarrays. RESULTS: MPSS identified 6,553 differentially expressed genes between the pool of normal luminal cells and that of primary tumours substantially enriched for epithelial cells, of which 98% were represented and 60% were confirmed by microarray profiling. Significant expression level changes between these two samples detected only by microarray technology were shown by 4,149 transcripts, resulting in a combined differential tumour epithelial transcriptome of 8,051 genes. Microarray gene signatures identified a comprehensive list of 907 and 955 transcripts whose expression differed between luminal epithelial cells and myoepithelial cells, respectively. Functional annotation and gene set enrichment analysis highlighted a group of genes related to skeletal development that were associated with the myoepithelial/basal cells and upregulated in the tumour sample. 
One of the most highly overexpressed genes in this category, that encoding periostin, was analysed immunohistochemically on breast cancer tissue microarrays and its expression in neoplastic cells correlated with poor outcome in a cohort of poor prognosis estrogen receptor-positive tumours. CONCLUSION: Using highly enriched cell populations in combination with multiplatform gene expression profiling studies, a comprehensive analysis of molecular changes between the normal and malignant breast tissue was established. This study provides a basis for the identification of novel and potentially important targets for diagnosis, prognosis and therapy in breast cancer.
The combined use of reflectance, emissivity and elevation Aster/Terra data for tropical soil studies
Abstract:
Reflectance, emissivity and elevation data from the ASTER (Advanced Spaceborne Thermal Emission and Reflection Radiometer)/Terra sensor were used to characterize soil composition variations according to toposequence position. Normalized data of SWIR (shortwave infrared) reflectance and TIR (thermal infrared) emissivity, coupled with a soil-fraction image from a spectral mixture model, were evaluated to separate bare soils from nonphotosynthetic vegetation. Regression relationships of some soil properties with reflectance and emissivity data were then applied to the exposed soil pixels. The resulting estimated values were plotted on the ASTER-derived digital elevation model. Results showed that the SWIR bands 5 and 6 and the TIR bands 10 and 14 measured the clay mineral absorption band and the quartz emissivity feature, respectively. These bands also improved the discrimination between nonphotosynthetic vegetation and soils. Despite the differences in pixel size and field sampling size, some soil properties were correlated with reflectance (R² of 0.65 for Al2O3 in band 6; 0.61 for Fe2O3 in band 3) and emissivity (R² of 0.65 for total sand fraction in the 10/14 band ratio). The combined use of reflectance, emissivity and elevation data revealed variations in soil composition with topography in specific parts of the landscape. From higher to lower slope positions, a general decrease in Al2O3 and increase in total sand fraction was observed, due to the prevalence of Rhodic Acrustox at the top and its gradual transition to Typic Acrustox at the bottom.
Abstract:
The south-western part of the Iberian Peninsula, including the southern branch of the Iberian Massif, has recently been the subject of several magnetotelluric (MT) studies. This area is made up of three different tectonic terranes: the South Portuguese Zone (SPZ), the Ossa Morena Zone (OMZ) and the Central Iberian Zone (CIZ). The boundaries between these zones are considered to be sutures, which appear as high electrical conductivity anomalies in the MT surveys. The OMZ is characterised by a conductive layer at middle-lower crustal levels. To investigate the continuity of this conductive layer into the CIZ, a new MT profile was carried out. This 75-km long ENE profile goes through the boundary between the OMZ and the CIZ. The results of a two-dimensional magnetotelluric inversion revealed a high-conductivity anomaly in the transition OMZ/CIZ (the so-called Central Unit), which is interpreted as due to interconnected graphite along shear planes. High-conductivity anomalies appeared in the middle crust of the CIZ, whose geometry and location are consistent with the conductive layer previously found in the OMZ, thus confirming the prolongation of the conductive layer into the CIZ. The top of this layer correlated spatially with a broad reflector detected by a seismic profile previously acquired in the same area. This, together with other geological and petrological evidence, points to a common origin for both features.
Abstract:
OBJECTIVE: To determine whether the results of resin-dentin microtensile bond strength (µTBS) tests are correlated with the outcome parameters of clinical studies on non-retentive Class V restorations. METHODS: Resin-dentin µTBS data were obtained from one test center; the in vitro tests were all performed by the same operator. The µTBS testing was performed 8 h after bonding and after 6 months of storing the specimens in water. Pre-test failures (PTFs) of specimens were included in the analysis and assigned a value of 1 MPa. Prospective clinical studies on cervical restorations (Class V) with an observation period of at least 18 months were searched for in the literature. The clinical outcome variables were retention loss, marginal discoloration and marginal integrity. Furthermore, an index was formulated to allow better comparison of the laboratory and clinical results. Estimates of adhesive effects in a linear mixed model were used to summarize the clinical performance of each adhesive between 12 and 36 months. Spearman correlations between these clinical performances and the µTBS values were then calculated. RESULTS: Thirty-six clinical studies with 15 adhesive/restorative systems for which µTBS data were also available were included in the statistical analysis. In general, 3-step and 2-step etch-and-rinse systems showed higher bond strength values than the 2-step/3-step self-etching systems, which, however, produced higher values than the 1-step self-etching and the resin-modified glass ionomer systems. Prolonged water storage of specimens resulted in a significant decrease of the mean bond strength values in 5 adhesive systems (Wilcoxon, p<0.05). There was a significant correlation between µTBS values, both after 8 h and after 6 months of storage, and marginal discoloration (r=0.54 and r=0.67, respectively). However, no such correlation was found between µTBS values and the retention rate, clinical index or marginal integrity. 
SIGNIFICANCE: As µTBS data of adhesive systems, especially after water storage for 6 months, showed a good correlation with marginal discoloration in short-term clinical Class V restorations, longitudinal clinical trials should explore whether early marginal staining is predictive for future retention loss in non-carious cervical restorations.
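The Spearman correlation used above to relate laboratory and clinical results is the Pearson correlation of the ranks of the two variables. A minimal dependency-free sketch follows; the function names `average_ranks` and `spearman` are hypothetical, ties receive average ranks (an assumption, since the abstract does not state how ties were handled), and the example values are invented placeholders rather than data from the study.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j over the run of values tied with the one at position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired observations")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (var_x * var_y) ** 0.5


# Invented placeholder values: one bond-strength figure and one clinical
# performance estimate per adhesive system.
bond_strength = [42.0, 35.5, 28.1, 19.7, 12.3]
clinical_score = [0.9, 0.7, 0.8, 0.4, 0.2]
rho = spearman(bond_strength, clinical_score)
```

Because it works on ranks, this measure captures monotonic association without assuming a linear relation between bond strength and the clinical estimates, which suits a comparison across only 15 adhesive systems.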
Abstract:
Background: In longitudinal studies where subjects experience recurrent incidents over a period of time, such as respiratory infections, fever or diarrhea, statistical methods are required to take into account the within-subject correlation. Methods: For repeated events data with censored failure, the independent increment (AG), marginal (WLW) and conditional (PWP) models are three multiple failure models that generalize Cox's proportional hazards model. In this paper, we review the efficiency, accuracy and robustness of all three models under simulated scenarios with varying degrees of within-subject correlation, censoring levels, maximum number of possible recurrences and sample size. We also study the methods' performance on a real dataset from a cohort study with bronchial obstruction. Results: We find substantial differences between methods, and no method is optimal. AG and PWP seem to be preferable to WLW for low correlation levels, but the situation reverses for high correlations. Conclusions: All methods are robust to censoring, worsen with increasing recurrence levels and share a bias problem which, among other consequences, makes asymptotic normal confidence intervals not fully reliable, although they are well developed theoretically.
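The three models differ mainly in when a subject is considered at risk for each event, which shows up in how the data are laid out before a Cox-type regression is fitted. The sketch below illustrates the usual layouts for a single subject; the function names are hypothetical, event times are assumed to be distinct and positive, and PWP is shown in its total-time form (a gap-time variant also exists). Model fitting itself is not shown.

```python
def _check(event_times, follow_up_end):
    """Return sorted event times after basic validation."""
    times = sorted(event_times)
    if any(t <= 0 or t > follow_up_end for t in times):
        raise ValueError("event times must lie in (0, follow_up_end]")
    if len(set(times)) != len(times):
        raise ValueError("event times are assumed to be distinct")
    return times


def ag_rows(event_times, follow_up_end):
    """AG (independent increment): counting-process rows (start, stop, event).

    The subject is at risk in consecutive intervals, and all intervals
    share one baseline hazard.
    """
    rows, start = [], 0.0
    for t in _check(event_times, follow_up_end):
        rows.append((start, t, 1))
        start = t
    if start < follow_up_end:
        rows.append((start, follow_up_end, 0))  # censored tail
    return rows


def pwp_rows(event_times, follow_up_end):
    """PWP (conditional): AG rows plus a stratum for the event order.

    The subject is at risk for event k only after event k-1 has occurred.
    """
    return [
        (start, stop, event, k)
        for k, (start, stop, event) in enumerate(
            ag_rows(event_times, follow_up_end), start=1
        )
    ]


def wlw_rows(event_times, follow_up_end, max_events):
    """WLW (marginal): at risk for every event order from time zero.

    One row per event order 1..max_events; orders never reached are
    censored at the end of follow-up.
    """
    times = _check(event_times, follow_up_end)
    rows = []
    for k in range(1, max_events + 1):
        if k <= len(times):
            rows.append((0.0, times[k - 1], 1, k))
        else:
            rows.append((0.0, follow_up_end, 0, k))
    return rows


# One hypothetical subject with events at days 3 and 7, followed for 10 days.
ag = ag_rows([3, 7], 10)
pwp = pwp_rows([3, 7], 10)
wlw = wlw_rows([3, 7], 10, max_events=3)
```

In practice the AG rows would be passed to a Cox regression with a robust, subject-clustered variance, while the PWP and WLW rows would additionally be stratified on the event-order column.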
Abstract:
The objective of this work was to quantify the genetic diversity of elite genotypes of irrigated barley in the Brazilian savanna. Thirty elite barley genotypes from Embrapa Cerrados' collection were evaluated using 160 RAPD markers, 12 agronomic traits related to yield components, and 10 malting quality parameters. The genetic dissimilarity matrices based on molecular markers, quantitative traits, and malting quality characters were calculated, and a cluster analysis was performed using the unweighted pair-group method with arithmetic mean (UPGMA) as the grouping criterion. High genetic diversity among accessions was observed. The estimated genetic dissimilarities were weakly correlated, showing the complementarity of the different character groups. Selection indices and graphical dispersion analysis allowed the selection of promising genotypes and the indication of suitable crosses for maximizing the heterotic effects in breeding programs for irrigated barley in the Brazilian savanna.
Abstract:
Advances in flow cytometry and other single-cell technologies have enabled high-dimensional, high-throughput measurements of individual cells as well as the interrogation of cell population heterogeneity. However, in many instances, computational tools to analyze the wealth of data generated by these technologies are lacking. Here, we present a computational framework for unbiased combinatorial polyfunctionality analysis of antigen-specific T-cell subsets (COMPASS). COMPASS uses a Bayesian hierarchical framework to model all observed cell subsets and select those most likely to have antigen-specific responses. Cell-subset responses are quantified by posterior probabilities, and human subject-level responses are quantified by two summary statistics that describe the quality of an individual's polyfunctional response and can be correlated directly with clinical outcome. Using three clinical data sets of cytokine production, we demonstrate how COMPASS improves characterization of antigen-specific T cells and reveals cellular 'correlates of protection/immunity' in the RV144 HIV vaccine efficacy trial that are missed by other methods. COMPASS is available as open-source software.
Abstract:
BACKGROUND: Globally, Africans and African Americans experience a disproportionate burden of type 2 diabetes compared to other racial and ethnic groups. The aim of the study was to examine the association of plasma glucose with indices of glucose metabolism in young adults of African origin from 5 different countries. METHODS: We identified participants from the Modeling the Epidemiologic Transition Study, an international study of weight change and cardiovascular disease (CVD) risk in five populations of African origin: USA (US), Jamaica, Ghana, South Africa, and Seychelles. For the current study, we included 667 participants (34.8 ± 6.3 years), with measures of plasma glucose, insulin, leptin, and adiponectin, as well as moderate and vigorous physical activity (MVPA, minutes/day [min/day]), daily sedentary time (min/day), anthropometrics, and body composition. RESULTS: Body mass index (BMI) ranged from 22.1 to 29.6 kg/m² in the 282 men and from 25.8 to 34.8 kg/m² in the 385 women. MVPA ranged from 26.2 to 47.1 min/day in men and from 14.3 to 27.3 min/day in women, and correlated with adiposity (BMI, waist size, and % body fat) only among US males after controlling for age. Plasma glucose ranged from 4.6 ± 0.8 mmol/L in the South African men to 5.8 mmol/L in the US men, while the overall prevalence of diabetes was very low, except in the US men and women (6.7 and 12%, respectively). Using multivariate linear regression, glucose was associated with BMI, age, sex, smoking, hypertension, and daily sedentary time, but not with daily MVPA. CONCLUSION: Obesity, metabolic risk, and other potential determinants vary significantly between populations at differing stages of the epidemiologic transition, requiring tailored public health policies to address local population characteristics.
Abstract:
Genome-wide association studies (GWASs) have identified many genetic variants underlying complex traits. Many detected genetic loci harbor variants that associate with multiple, even distinct, traits. Most current analysis approaches focus on single traits, even though the final results from multiple traits are evaluated together. Such approaches miss the opportunity to systematically integrate the phenome-wide data available for genetic association analysis. In this study, we propose a general approach that can integrate association evidence from summary statistics of multiple traits, whether correlated, independent, continuous, or binary, which might come from the same or different studies. We allow for trait heterogeneity effects. Population structure and cryptic relatedness can also be controlled. Our simulations suggest that the proposed method has improved statistical power over single-trait analysis in most of the cases we studied. We applied our method to the Continental Origins and Genetic Epidemiology Network (COGENT) African ancestry samples for three blood pressure traits and identified four loci (CHIC2, HOXA-EVX1, IGFBP1/IGFBP3, and CDH17; p < 5.0 × 10⁻⁸) associated with hypertension-related traits that were missed by a single-trait analysis in the original report. Six additional loci with suggestive association evidence (p < 5.0 × 10⁻⁷) were also observed, including CACNA1D and WNT3. Our study strongly suggests that analyzing multiple phenotypes can improve statistical power and that such analysis can be executed with the summary statistics from GWASs. Our method also provides a way to study a cross-phenotype (CP) association by using summary statistics from GWASs of multiple phenotypes.