881 resultados para Regression-based decomposition.
Resumo:
Recently, kernel-based Machine Learning methods have gained great popularity in many data analysis and data mining fields: pattern recognition, biocomputing, speech and vision, engineering, remote sensing etc. The paper describes the use of kernel methods to approach the processing of large datasets from environmental monitoring networks. Several typical problems of the environmental sciences and their solutions provided by kernel-based methods are considered: classification of categorical data (soil type classification), mapping of environmental and pollution continuous information (pollution of soil by radionuclides), mapping with auxiliary information (climatic data from Aral Sea region). The promising developments, such as automatic emergency hot spot detection and monitoring network optimization are discussed as well.
Resumo:
Nonlinear regression problems can often be reduced to linearity by transforming the response variable (e.g., using the Box-Cox family of transformations). The classic estimates of the parameter defining the transformation as well as of the regression coefficients are based on the maximum likelihood criterion, assuming homoscedastic normal errors for the transformed response. These estimates are nonrobust in the presence of outliers and can be inconsistent when the errors are nonnormal or heteroscedastic. This article proposes new robust estimates that are consistent and asymptotically normal for any unimodal and homoscedastic error distribution. For this purpose, a robust version of conditional expectation is introduced for which the prediction mean squared error is replaced with an M scale. This concept is then used to develop a nonparametric criterion to estimate the transformation parameter as well as the regression coefficients. A finite sample estimate of this criterion based on a robust version of smearing is also proposed. Monte Carlo experiments show that the new estimates compare favorably with respect to the available competitors.
Resumo:
We present a novel numerical approach for the comprehensive, flexible, and accurate simulation of poro-elastic wave propagation in 2D polar coordinates. An important application of this method and its extensions will be the modeling of complex seismic wave phenomena in fluid-filled boreholes, which represents a major, and as of yet largely unresolved, computational problem in exploration geophysics. In view of this, we consider a numerical mesh, which can be arbitrarily heterogeneous, consisting of two or more concentric rings representing the fluid in the center and the surrounding porous medium. The spatial discretization is based on a Chebyshev expansion in the radial direction and a Fourier expansion in the azimuthal direction and a Runge-Kutta integration scheme for the time evolution. A domain decomposition method is used to match the fluid-solid boundary conditions based on the method of characteristics. This multi-domain approach allows for significant reductions of the number of grid points in the azimuthal direction for the inner grid domain and thus for corresponding increases of the time step and enhancements of computational efficiency. The viability and accuracy of the proposed method has been rigorously tested and verified through comparisons with analytical solutions as well as with the results obtained with a corresponding, previously published, and independently bench-marked solution for 2D Cartesian coordinates. Finally, the proposed numerical solution also satisfies the reciprocity theorem, which indicates that the inherent singularity associated with the origin of the polar coordinate system is adequately handled.
Resumo:
Aim This study used data from temperate forest communities to assess: (1) five different stepwise selection methods with generalized additive models, (2) the effect of weighting absences to ensure a prevalence of 0.5, (3) the effect of limiting absences beyond the environmental envelope defined by presences, (4) four different methods for incorporating spatial autocorrelation, and (5) the effect of integrating an interaction factor defined by a regression tree on the residuals of an initial environmental model. Location State of Vaud, western Switzerland. Methods Generalized additive models (GAMs) were fitted using the grasp package (generalized regression analysis and spatial predictions, http://www.cscf.ch/grasp). Results Model selection based on cross-validation appeared to be the best compromise between model stability and performance (parsimony) among the five methods tested. Weighting absences returned models that perform better than models fitted with the original sample prevalence. This appeared to be mainly due to the impact of very low prevalence values on evaluation statistics. Removing zeroes beyond the range of presences on main environmental gradients changed the set of selected predictors, and potentially their response curve shape. Moreover, removing zeroes slightly improved model performance and stability when compared with the baseline model on the same data set. Incorporating a spatial trend predictor improved model performance and stability significantly. Even better models were obtained when including local spatial autocorrelation. A novel approach to include interactions proved to be an efficient way to account for interactions between all predictors at once. Main conclusions Models and spatial predictions of 18 forest communities were significantly improved by using either: (1) cross-validation as a model selection method, (2) weighted absences, (3) limited absences, (4) predictors accounting for spatial autocorrelation, or (5) a factor variable accounting for interactions between all predictors. The final choice of model strategy should depend on the nature of the available data and the specific study aims. Statistical evaluation is useful in searching for the best modelling practice. However, one should not neglect to consider the shapes and interpretability of response curves, as well as the resulting spatial predictions in the final assessment.
Resumo:
The objective of this paper is to compare the performance of twopredictive radiological models, logistic regression (LR) and neural network (NN), with five different resampling methods. One hundred and sixty-seven patients with proven calvarial lesions as the only known disease were enrolled. Clinical and CT data were used for LR and NN models. Both models were developed with cross validation, leave-one-out and three different bootstrap algorithms. The final results of each model were compared with error rate and the area under receiver operating characteristic curves (Az). The neural network obtained statistically higher Az than LR with cross validation. The remaining resampling validation methods did not reveal statistically significant differences between LR and NN rules. The neural network classifier performs better than the one based on logistic regression. This advantage is well detected by three-fold cross-validation, but remains unnoticed when leave-one-out or bootstrap algorithms are used.
Resumo:
Human papillomavirus type 6 (HPV6) is the major etiological agent of anogenital warts and laryngeal papillomas and has been included in both the quadrivalent and nonavalent prophylactic HPV vaccines. This study investigated the global genomic diversity of HPV6, using 724 isolates and 190 complete genomes from six continents, and the association of HPV6 genomic variants with geographical location, anatomical site of infection/disease, and gender. Initially, a 2,800-bp E5a-E5b-L1-LCR fragment was sequenced from 492/530 (92.8%) HPV6-positive samples collected for this study. Among them, 130 exhibited at least one single nucleotide polymorphism (SNP), indel, or amino acid change in the E5a-E5b-L1-LCR fragment and were sequenced in full. A global alignment and maximum likelihood tree of 190 complete HPV6 genomes (130 fully sequenced in this study and 60 obtained from sequence repositories) revealed two variant lineages, A and B, and five B sublineages: B1, B2, B3, B4, and B5. HPV6 (sub)lineage-specific SNPs and a 960-bp representative region for whole-genome-based phylogenetic clustering within the L2 open reading frame were identified. Multivariate logistic regression analysis revealed that lineage B predominated globally. Sublineage B3 was more common in Africa and North and South America, and lineage A was more common in Asia. Sublineages B1 and B3 were associated with anogenital infections, indicating a potential lesion-specific predilection of some HPV6 sublineages. Females had higher odds for infection with sublineage B3 than males. In conclusion, a global HPV6 phylogenetic analysis revealed the existence of two variant lineages and five sublineages, showing some degree of ethnogeographic, gender, and/or disease predilection in their distribution. IMPORTANCE: This study established the largest database of globally circulating HPV6 genomic variants and contributed a total of 130 new, complete HPV6 genome sequences to available sequence repositories. Two HPV6 variant lineages and five sublineages were identified and showed some degree of association with geographical location, anatomical site of infection/disease, and/or gender. We additionally identified several HPV6 lineage- and sublineage-specific SNPs to facilitate the identification of HPV6 variants and determined a representative region within the L2 gene that is suitable for HPV6 whole-genome-based phylogenetic analysis. This study complements and significantly expands the current knowledge of HPV6 genetic diversity and forms a comprehensive basis for future epidemiological, evolutionary, functional, pathogenicity, vaccination, and molecular assay development studies.
Resumo:
BACKGROUND: Associations of serum calcium levels with the metabolic syndrome and other novel cardio-metabolic risk factors not classically included in the metabolic syndrome, such as those involved in oxidative stress, are largely unexplored. We analyzed the association of albumin-corrected serum calcium levels with conventional and non-conventional cardio-metabolic risk factors in a general adult population. METHODOLOGY/PRINCIPAL FINDINGS: The CoLaus study is a population-based study including Caucasians from Lausanne, Switzerland. The metabolic syndrome was defined using the Adult Treatment Panel III criteria. Non-conventional cardio-metabolic risk factors considered included: fat mass, leptin, LDL particle size, apolipoprotein B, fasting insulin, adiponectin, ultrasensitive CRP, serum uric acid, homocysteine, and gamma-glutamyltransferase. We used adjusted standardized multivariable regression to compare the association of each cardio-metabolic risk factor with albumin-corrected serum calcium. We assessed associations of albumin-corrected serum calcium with the cumulative number of non-conventional cardio-metabolic risk factors. We analyzed 4,231 subjects aged 35 to 75 years. Corrected serum calcium increased with both the number of the metabolic syndrome components and the number of non-conventional cardio-metabolic risk factors, independently of the metabolic syndrome and BMI. Among conventional and non-conventional cardio-metabolic risk factors, the strongest positive associations were found for factors related to oxidative stress (uric acid, homocysteine and gamma-glutamyltransferase). Adiponectin had the strongest negative association with corrected serum calcium. CONCLUSIONS/SIGNIFICANCE: Serum calcium was associated with the metabolic syndrome and with non-conventional cardio-metabolic risk factors independently of the metabolic syndrome. Associations with uric acid, homocysteine and gamma-glutamyltransferase were the strongest. These novel findings suggest that serum calcium levels may be associated with cardiovascular risk via oxidative stress.
Resumo:
Erosion is deleterious because it reduces the soil's productivity capacity for growing crops and causes sedimentation and water pollution problems. Surface and buried crop residue, as well as live and dead plant roots, play an important role in erosion control. An efficient way to assess the effectiveness of such materials in erosion reduction is by means of decomposition constants as used within the Revised Universal Soil Loss Equation - RUSLE's prior-land-use subfactor - PLU. This was investigated using simulated rainfall on a 0.12 m m-1 slope, sandy loam Paleudult soil, at the Agriculture Experimental Station of the Federal University of Rio Grande do Sul, in Eldorado do Sul, State of Rio Grande do Sul, Brazil. The study area had been covered by native grass pasture for about fifteen years. By the middle of March 1996, the sod was mechanically mowed and the crop residue removed from the field. Late in April 1996, the sod was chemically desiccated with herbicide and, about one month later, the following treatments were established and evaluated for sod biomass decomposition and soil erosion, from June 1996 to May 1998, on duplicated 3.5 x 11.0 m erosion plots: (a) and (b) soil without tillage, with surface residue and dead roots; (c) soil without tillage, with dead roots only; (d) soil tilled conventionally every two-and-half months, with dead roots plus incorporated residue; and (e) soil tilled conventionally every six months, with dead roots plus incorporated residue. Simulated rainfall was applied with a rotating-boom rainfall simulator, at an intensity of 63.5 mm h-1 for 90 min, eight to nine times during the experimental period (about every two-and-half months). Surface and subsurface sod biomass amounts were measured before each rainfall test along with the erosion measurements of runoff rate, sediment concentration in runoff, soil loss rate, and total soil loss. Non-linear regression analysis was performed using an exponential and a power model. Surface sod biomass decomposition was better depicted by the exponential model, while subsurface sod biomass was by the power model. Subsurface sod biomass decomposed faster and more than surface sod biomass, with dead roots in untilled soil without residue on the surface decomposing more than dead roots in untilled soil with surface residue. Tillage type and frequency did not appreciably influence subsurface sod biomass decomposition. Soil loss rates increased greatly with both surface sod biomass decomposition and decomposition of subsurface sod biomass in the conventionally tilled soil, but they were minimally affected by subsurface sod biomass decomposition in the untilled soil. Runoff rates were little affected by the studied treatments. Dead roots plus incorporated residues were effective in reducing erosion in the conventionally tilled soil, while consolidation of the soil surface was important in no-till. The residual effect of the turned soil on erosion diminished gradually with time and ceased after two years.
Resumo:
La tècnica de l’electroencefalograma (EEG) és una de les tècniques més utilitzades per estudiar el cervell. En aquesta tècnica s’enregistren els senyals elèctrics que es produeixen en el còrtex humà a través d’elèctrodes col•locats al cap. Aquesta tècnica, però, presenta algunes limitacions a l’hora de realitzar els enregistraments, la principal limitació es coneix com a artefactes, que són senyals indesitjats que es mesclen amb els senyals EEG. L’objectiu d’aquest treball de final de màster és presentar tres nous mètodes de neteja d’artefactes que poden ser aplicats en EEG. Aquests estan basats en l’aplicació de la Multivariate Empirical Mode Decomposition, que és una nova tècnica utilitzada per al processament de senyal. Els mètodes de neteja proposats s’apliquen a dades EEG simulades que contenen artefactes (pestanyeigs), i un cop s’han aplicat els procediments de neteja es comparen amb dades EEG que no tenen pestanyeigs, per comprovar quina millora presenten. Posteriorment, dos dels tres mètodes de neteja proposats s’apliquen sobre dades EEG reals. Les conclusions que s’han extret del treball són que dos dels nous procediments de neteja proposats es poden utilitzar per realitzar el preprocessament de dades reals per eliminar pestanyeigs.
Resumo:
OBJECTIVE: To identify pregnancy-related risk factors for different manifestations of congenital anorectal malformations (ARMs). DESIGN: A population-based case-control study. SETTING: Seventeen EUROCAT (European Surveillance of Congenital Anomalies) registries, 1980-2008. POPULATION: The study population consisted of 1417 cases with ARM, including 648 cases of isolated ARM, 601 cases of ARM with additional congenital anomalies, and 168 cases of ARM-VACTERL (vertebral, anal, cardiac, tracheo-esophageal, renal, and limb defects), along with 13 371 controls with recognised syndromes or chromosomal abnormalities. METHODS: Multiple logistic regression analyses were used to calculate adjusted odds ratios (ORs) for potential risk factors for ARM, such as fertility treatment, multiple pregnancy, primiparity, maternal illnesses during pregnancy, and pregnancy-related complications. MAIN OUTCOME MEASURES: Adjusted ORs for pregnancy-related risk factors for ARM. RESULTS: The ARM cases were more likely to be firstborn than the controls (OR 1.6, 95% CI 1.4-1.8). Fertility treatment and being one of twins or triplets seemed to increase the risk of ARM in cases with additional congenital anomalies or VACTERL (ORs ranging from 1.6 to 2.5). Maternal fever during pregnancy and pre-eclampsia were only associated with ARM when additional congenital anomalies were present (OR 3.9, 95% CI 1.3-11.6; OR 3.4, 95% CI 1.6-7.1, respectively), whereas maternal epilepsy during pregnancy resulted in a five-fold elevated risk of all manifestations of ARM (OR 5.1, 95% CI 1.7-15.6). CONCLUSIONS: This large European study identified maternal epilepsy, fertility treatment, multiple pregnancy, primiparity, pre-eclampsia, and maternal fever during pregnancy as potential risk factors primarily for complex manifestations of ARM with additional congenital anomalies and VACTERL.
Resumo:
Spatial resolution is a key parameter of all remote sensing satellites and platforms. The nominal spatial resolution of satellites is a well-known characteristic because it is directly related to the area in ground that represents a pixel in the detector. Nevertheless, in practice, the actual resolution of a specific image obtained from a satellite is difficult to know precisely because it depends on many other factors such as atmospheric conditions. However, if one has two or more images of the same region, it is possible to compare their relative resolutions. In this paper, a wavelet-decomposition-based method for the determination of the relative resolution between two remotely sensed images of the same area is proposed. The method can be applied to panchromatic, multispectral, and mixed (one panchromatic and one multispectral) images. As an example, the method was applied to compute the relative resolution between SPOT-3, Landsat-5, and Landsat-7 panchromatic and multispectral images taken under similar as well as under very different conditions. On the other hand, if the true absolute resolution of one of the images of the pair is known, the resolution of the other can be computed. Thus, in the last part of this paper, a spatial calibrator that is designed and constructed to help compute the absolute resolution of a single remotely sensed image is described, and an example of its use is presented.
Resumo:
ABSTRACT: BACKGROUND: Serologic testing algorithms for recent HIV seroconversion (STARHS) provide important information for HIV surveillance. We have shown that a patient's antibody reaction in a confirmatory line immunoassay (INNO-LIATM HIV I/II Score, Innogenetics) provides information on the duration of infection. Here, we sought to further investigate the diagnostic specificity of various Inno-Lia algorithms and to identify factors affecting it. METHODS: Plasma samples of 714 selected patients of the Swiss HIV Cohort Study infected for longer than 12 months and representing all viral clades and stages of chronic HIV-1 infection were tested blindly by Inno-Lia and classified as either incident (up to 12 m) or older infection by 24 different algorithms. Of the total, 524 patients received HAART, 308 had HIV-1 RNA below 50 copies/mL, and 620 were infected by a HIV-1 non-B clade. Using logistic regression analysis we evaluated factors that might affect the specificity of these algorithms. RESULTS: HIV-1 RNA <50 copies/mL was associated with significantly lower reactivity to all five HIV-1 antigens of the Inno-Lia and impaired specificity of most algorithms. Among 412 patients either untreated or with HIV-1 RNA ≥50 copies/mL despite HAART, the median specificity of the algorithms was 96.5% (range 92.0-100%). The only factor that significantly promoted false-incident results in this group was age, with false-incident results increasing by a few percent per additional year. HIV-1 clade, HIV-1 RNA, CD4 percentage, sex, disease stage, and testing modalities exhibited no significance. Results were similar among 190 untreated patients. CONCLUSIONS: The specificity of most Inno-Lia algorithms was high and not affected by HIV-1 variability, advanced disease and other factors promoting false-recent results in other STARHS. Specificity should be good in any group of untreated HIV-1 patients.
Resumo:
BACKGROUND: Several studies have reported increased levels of inflammatory biomarkers in chronic kidney disease (CKD), but data from the general population are sparse. In this study, we assessed levels of the inflammatory markers C-reactive protein (hsCRP), tumor necrosis factor α (TNF-α), interleukin (IL)-1β and IL-6 across all ranges of renal function. METHODS: We conducted a cross-sectional study in a random sample of 6,184 Caucasian subjects aged 35-75 years in Lausanne, Switzerland. Serum levels of hsCRP, TNF-α, IL-6, and IL-1β were measured in 6,067 participants (98.1%); serum creatinine-based estimated glomerular filtration rate (eGFR(creat), CKD-EPI formula) was used to assess renal function, and albumin/creatinine ratio on spot morning urine to assess microalbuminuria (MAU). RESULTS: Higher serum levels of IL-6, TNF-α and hsCRP and lower levels of IL-1β were associated with a lower renal function, CKD (eGFR(creat) <60 ml/min/1.73 m(2); n = 283), and MAU (n = 583). In multivariate linear regression analysis adjusted for age, sex, hypertension, smoking, diabetes, body mass index, lipids, antihypertensive and hypolipemic therapy, only log-transformed TNF-α remained independently associated with lower renal function (β -0.54 ±0.19). In multivariate logistic regression analysis, higher TNF-α levels were associated with CKD (OR 1.17; 95% CI 1.01-1.35), whereas higher levels of IL-6 (OR 1.09; 95% CI 1.02-1.16) and hsCRP (OR 1.21; 95% CI 1.10-1.32) were associated with MAU. CONCLUSION: We did not confirm a significant association between renal function and IL-6, IL-1β and hsCRP in the general population. However, our results demonstrate a significant association between TNF-α and renal function, suggesting a potential link between inflammation and the development of CKD. These data also confirm the association between MAU and inflammation.
Resumo:
Aims: We performed a randomised controlled trial in children of both gender and different pubertal stages to determine whether a school-based physical activity (PA) program during a full schoolyear influences bone mineral content (BMC) and whether there are differences in response for boys and girls before and during puberty. Methods: Twenty-eight 1st and 5th grade classes were cluster randomised to an intervention (INT, 16 classes, n=297) and control (CON; 12 classes, n=205) group. The intervention consisted of a multi-component PA intervention including daily physical education during a full school year. Each lesson was predetermined, included about ten minutes of jumping or strength training exercises of various intensity and was the same for all children. Measurements included anthropometry (height and weight), tanner stages (by self-assessment), PA (by accelerometry) and BMC for total body, femoral neck, total hip and lumbar spine using dualenergy X-ray absorptiometry (DXA). Bone parameters were normalized for gender and tanner stage (pre- vs. puberty). Analyses were performed by a regression model adjusted for gender, baseline height, baseline weight, baseline PA, post-intervention tanner stage, baseline BMC, and cluster. Researchers were blinded to group allocation. Children in the control group did not know about the intervention arm. Results: 217 (57%) of 380 children who initially agreed to have DXA measurements had also post-intervention DXA and PA data. Mean age of prepubertal and pubertal children at baseline was 9.0±2.1 and 11.2±0.6 years, respectively. 47/114 girls and 68/103 boys were prepubertal at the end of the intervention. Compared to CON, children in INT showed statistically significant increases in BMC of total body (adjusted z-score differences: 0.123; 95%>CI 0.035 to 0.212), femoral neck (0.155; 95%>CI 0.007 to 0.302), and lumbar spine (0.127; 95%>CI 0.026 to 0.228). Importantly, there was no gender*group, but a tanner*group interaction consistently favoring prepubertal children. Conclusions: Our findings show that a general, but stringent school-based PA intervention can improve BMC in elementary school children. Pubertal stage, but not gender seems to determine bone sensitivity to physical activity loading.
Resumo:
Background and aims: Because of the changing epidemiology of Inflammatory Bowel Diseases 0131)), we set out to characterize the population-based prevalence of Crohn's Disease (CD) and Ulcerative Colitis (UC) in a defined population of Switzerland. Methods: Adult IBD patients were identified by a cross-matched review of histological, hospital and gastroenterologist files throughout a geographical defined population (Canton of Vaud). Demographic factors statistically significantly associated with prevalence were evaluated using a stepwise Poisson regression analysis. Results were compared to IBD prevalence rates in other population-based studies and time trends were performed, based on a systematic literature review. Results: Age and sex-adjusted prevalence rates were 205.7 IBD (100.7 CD and 105.0 UC) cases per 105 inhabitants. Among 1016 IBD patients (519 CD and 497 UC), females outnumbered mates in CD (p < 0.001), but mates were more represented in elderly UC patients (p = 0.008). Thus, being a mate was statistically associated with UC (Relative Risk (RR) 1.25; p = 0.013), whereas being a female was associated with CD (RR 1.27; p = 0.007). Living in an urban zone was associated with both CD and UC (RR 1.49; p < 0.001, 1.63; p < 0.001, respectively). From 1960 to 2005, increases in UC and CD prevalences of 2.4% (95%CI, 2.1%-2.8%; p < 0.001) and 3.6% (95%CI, 3.1%-4.1%; p < 0.001) per annum were found in industrialised countries.