38 resultados para Feature selection
Resumo:
Human respiratory syncytial virus (HRSV) is the major cause of lower respiratory tract infections in children under 5 years of age and the elderly, causing annual disease outbreaks during the fall and winter. Multiple lineages of the HRSVA and HRSVB serotypes co-circulate within a single outbreak and display a strongly temporal pattern of genetic variation, with a replacement of dominant genotypes occurring during consecutive years. In the present study we utilized phylogenetic methods to detect and map sites subject to adaptive evolution in the G protein of HRSVA and HRSVB. A total of 29 and 23 amino acid sites were found to be putatively positively selected in HRSVA and HRSVB, respectively. Several of these sites defined genotypes and lineages within genotypes in both groups, and correlated well with epitopes previously described in group A. Remarkably, 18 of these positively selected tended to revert in time to a previous codon state, producing a ""flipflop'' phylogenetic pattern. Such frequent evolutionary reversals in HRSV are indicative of a combination of frequent positive selection, reflecting the changing immune status of the human population, and a limited repertoire of functionally viable amino acids at specific amino acid sites.
Resumo:
Background: The malaria parasite Plasmodium falciparum exhibits abundant genetic diversity, and this diversity is key to its success as a pathogen. Previous efforts to study genetic diversity in P. falciparum have begun to elucidate the demographic history of the species, as well as patterns of population structure and patterns of linkage disequilibrium within its genome. Such studies will be greatly enhanced by new genomic tools and recent large-scale efforts to map genomic variation. To that end, we have developed a high throughput single nucleotide polymorphism (SNP) genotyping platform for P. falciparum. Results: Using an Affymetrix 3,000 SNP assay array, we found roughly half the assays (1,638) yielded high quality, 100% accurate genotyping calls for both major and minor SNP alleles. Genotype data from 76 global isolates confirm significant genetic differentiation among continental populations and varying levels of SNP diversity and linkage disequilibrium according to geographic location and local epidemiological factors. We further discovered that nonsynonymous and silent (synonymous or noncoding) SNPs differ with respect to within-population diversity, interpopulation differentiation, and the degree to which allele frequencies are correlated between populations. Conclusions: The distinct population profile of nonsynonymous variants indicates that natural selection has a significant influence on genomic diversity in P. falciparum, and that many of these changes may reflect functional variants deserving of follow-up study. Our analysis demonstrates the potential for new high-throughput genotyping technologies to enhance studies of population structure, natural selection, and ultimately enable genome-wide association studies in P. falciparum to find genes underlying key phenotypic traits.
Resumo:
Background: Plasmodium vivax malaria is a major public health challenge in Latin America, Asia and Oceania, with 130-435 million clinical cases per year worldwide. Invasion of host blood cells by P. vivax mainly depends on a type I membrane protein called Duffy binding protein (PvDBP). The erythrocyte-binding motif of PvDBP is a 170 amino-acid stretch located in its cysteine-rich region II (PvDBP(II)), which is the most variable segment of the protein. Methods: To test whether diversifying natural selection has shaped the nucleotide diversity of PvDBP(II) in Brazilian populations, this region was sequenced in 122 isolates from six different geographic areas. A Bayesian method was applied to test for the action of natural selection under a population genetic model that incorporates recombination. The analysis was integrated with a structural model of PvDBP(II), and T-and B-cell epitopes were localized on the 3-D structure. Results: The results suggest that: (i) recombination plays an important role in determining the haplotype structure of PvDBP(II), and (ii) PvDBP(II) appears to contain neutrally evolving codons as well as codons evolving under natural selection. Diversifying selection preferentially acts on sites identified as epitopes, particularly on amino acid residues 417, 419, and 424, which show strong linkage disequilibrium. Conclusions: This study shows that some polymorphisms of PvDBP(II) are present near the erythrocyte-binding domain and might serve to elude antibodies that inhibit cell invasion. Therefore, these polymorphisms should be taken into account when designing vaccines aimed at eliciting antibodies to inhibit erythrocyte invasion.
Resumo:
Context tree models have been introduced by Rissanen in [25] as a parsimonious generalization of Markov models. Since then, they have been widely used in applied probability and statistics. The present paper investigates non-asymptotic properties of two popular procedures of context tree estimation: Rissanen's algorithm Context and penalized maximum likelihood. First showing how they are related, we prove finite horizon bounds for the probability of over- and under-estimation. Concerning overestimation, no boundedness or loss-of-memory conditions are required: the proof relies on new deviation inequalities for empirical probabilities of independent interest. The under-estimation properties rely on classical hypotheses for processes of infinite memory. These results improve on and generalize the bounds obtained in Duarte et al. (2006) [12], Galves et al. (2008) [18], Galves and Leonardi (2008) [17], Leonardi (2010) [22], refining asymptotic results of Buhlmann and Wyner (1999) [4] and Csiszar and Talata (2006) [9]. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Objective: We carry out a systematic assessment on a suite of kernel-based learning machines while coping with the task of epilepsy diagnosis through automatic electroencephalogram (EEG) signal classification. Methods and materials: The kernel machines investigated include the standard support vector machine (SVM), the least squares SVM, the Lagrangian SVM, the smooth SVM, the proximal SVM, and the relevance vector machine. An extensive series of experiments was conducted on publicly available data, whose clinical EEG recordings were obtained from five normal subjects and five epileptic patients. The performance levels delivered by the different kernel machines are contrasted in terms of the criteria of predictive accuracy, sensitivity to the kernel function/parameter value, and sensitivity to the type of features extracted from the signal. For this purpose, 26 values for the kernel parameter (radius) of two well-known kernel functions (namely. Gaussian and exponential radial basis functions) were considered as well as 21 types of features extracted from the EEG signal, including statistical values derived from the discrete wavelet transform, Lyapunov exponents, and combinations thereof. Results: We first quantitatively assess the impact of the choice of the wavelet basis on the quality of the features extracted. Four wavelet basis functions were considered in this study. Then, we provide the average accuracy (i.e., cross-validation error) values delivered by 252 kernel machine configurations; in particular, 40%/35% of the best-calibrated models of the standard and least squares SVMs reached 100% accuracy rate for the two kernel functions considered. Moreover, we show the sensitivity profiles exhibited by a large sample of the configurations whereby one can visually inspect their levels of sensitiveness to the type of feature and to the kernel function/parameter value. Conclusions: Overall, the results evidence that all kernel machines are competitive in terms of accuracy, with the standard and least squares SVMs prevailing more consistently. Moreover, the choice of the kernel function and parameter value as well as the choice of the feature extractor are critical decisions to be taken, albeit the choice of the wavelet family seems not to be so relevant. Also, the statistical values calculated over the Lyapunov exponents were good sources of signal representation, but not as informative as their wavelet counterparts. Finally, a typical sensitivity profile has emerged among all types of machines, involving some regions of stability separated by zones of sharp variation, with some kernel parameter values clearly associated with better accuracy rates (zones of optimality). (C) 2011 Elsevier B.V. All rights reserved.
Anthropometric characteristics and motor skills in talent selection and development in indoor soccer
Resumo:
Kick performance, anthropometric characteristics, slalom, and linear running were assessed in 49 (24 elite, 25 nonelite) postpubertal indoor soccer players in order to (a) verify whether anthropometric characteristics and physical and technical capacities can distinguish players of different competitive levels, (b) compare the kicking kinematics of these groups, with and without a defined target, and (c) compare results on the assessments and coaches` subjective rankings of the players. Thigh circumference and specific technical capacities differentiated the players by level of play; cluster analysis correctly classified 77.5% of the players. The correlation between players` standardized measures and the coaches` rankings was 0.29. Anthropometric characteristics and physical capacities do not necessarily differentiate players at post-pubertal stages and should not be overvalued during early development. Considering the coaches` rankings, performance measures outside the specific game conditions may not be useful in identification of talented players.
Resumo:
To evaluate the potential for fermentation of raspberry pulp, sixteen yeast strains (S. cerevisiae and S. bayanus) were studied. Volatile compounds were determined by GC-MS, GC-FID, and GC-PFPD. Ethanol. glycerol and organic acids were determined by HPLC. HPLC-DAD was used to analyse phenolic acids. Sensory analysis was performed by trained panellists. After a screening step, CAT-1, UFLA FW 15 and S. bayanus CBS 1505 were previously selected based on their fermentative characteristics and profile of the metabolites identified. The beverage produced with CAT-1 showed the highest volatile fatty acid concentration (1542.6 mu g/L), whereas the beverage produced with UFLA FIN 15 showed the highest concentration of acetates (2211.1 mu g/L) and total volatile compounds (5835 mu g/L). For volatile sulphur compounds. 566.5 mu g/L were found in the beverage produced with S. bayanus CBS 1505. The lowest concentration of volatile sulphur compounds (151.9 mu g/L) was found for the beverage produced with UFLA FW 15. In the sensory analysis, the beverage produced with UFLA FW 15 was characterised by the descriptors raspberry, cherry, sweet, strawberry, floral and violet. In conclusion, strain UFLA FW 15 was the yeast that produced a raspberry wine with a good chemical and sensory quality. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
A combination of an extension of the topological instability ""lambda criterion"" and a thermodynamic criterion were applied to the Al-La system, indicating the best range of compositions for glass formation. Alloy compositions in this range were prepared by melt-spinning and casting in an arc-melting furnace with a wedge-section copper mold. The GFA of these samples was evaluated by X-ray diffraction, differential scanning calorimetry and scanning electron microscopy. The results indicated that the gamma* parameter of compositions with high GFA is higher, corresponding to a range in which the lambda parameter is greater than 0.1, which are compositions far from Al solid solution. A new alloy was identified with the best GFA reported so far for this system, showing a maximum thickness of 286 mu m in a wedge-section copper mold. Crown Copyright (C) 2009 Published by Elsevier B.V. All rights reserved.
Resumo:
This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control policies using any RL algorithm can be very time consuming, we propose to combine RL algorithms with heuristic functions for selecting promising actions during the learning process. With this aim, we investigate the use of heuristics for increasing the rate of convergence of RL algorithms and contribute with a new learning algorithm, Heuristically Accelerated Q-learning (HAQL), which incorporates heuristics for action selection to the Q-Learning algorithm. Experimental results on robot navigation show that the use of even very simple heuristic functions results in significant performance enhancement of the learning rate.
Resumo:
This study presents a decision-making method for maintenance policy selection of power plants equipment. The method is based on risk analysis concepts. The method first step consists in identifying critical equipment both for power plant operational performance and availability based on risk concepts. The second step involves the proposal of a potential maintenance policy that could be applied to critical equipment in order to increase its availability. The costs associated with each potential maintenance policy must be estimated, including the maintenance costs and the cost of failure that measures the critical equipment failure consequences for the power plant operation. Once the failure probabilities and the costs of failures are estimated, a decision-making procedure is applied to select the best maintenance policy. The decision criterion is to minimize the equipment cost of failure, considering the costs and likelihood of occurrence of failure scenarios. The method is applied to the analysis of a lubrication oil system used in gas turbines journal bearings. The turbine has more than 150 MW nominal output, installed in an open cycle thermoelectric power plant. A design modification with the installation of a redundant oil pump is proposed for lubricating oil system availability improvement. (C) 2009 Elsevier Ltd. All rights reserved.
Resumo:
The aim of this work was the development of miniaturized structures useful for retention and/or selection of particles and viscous substances from a liquid flow. The proposed low costs structures are similar to macroscopic wastewater treatment systems, named baffles, and allow disassemble. They were simulated using FEMLAB 3.2b package and manufactured in acrylic with conventional tools. Tests for retention or selection of particles in water or air and viscous fluids in water were carried out. Either in air or water particles with 50 mu m diameter will be retained but not with 13 mu m diameter. In aqueous flow, it is also possible the retention of viscous samples, such as silicone 350 cSt. The simulated results showed good agreement with experimental measurements. These miniaturized structures can be useful in sample pretreatment for chemical analysis and microorganism manipulation. (C) 2007 Elsevier B.V. All rights reserved.
Resumo:
Intravascular ultrasound (IVUS) image segmentation can provide more detailed vessel and plaque information, resulting in better diagnostics, evaluation and therapy planning. A novel automatic segmentation proposal is described herein; the method relies on a binary morphological object reconstruction to segment the coronary wall in IVUS images. First, a preprocessing followed by a feature extraction block are performed, allowing for the desired information to be extracted. Afterward, binary versions of the desired objects are reconstructed, and their contours are extracted to segment the image. The effectiveness is demonstrated by segmenting 1300 images, in which the outcomes had a strong correlation to their corresponding gold standard. Moreover, the results were also corroborated statistically by having as high as 92.72% and 91.9% of true positive area fraction for the lumen and media adventitia border, respectively. In addition, this approach can be adapted easily and applied to other related modalities, such as intravascular optical coherence tomography and intravascular magnetic resonance imaging. (E-mail: matheuscardosomg@hotmail.com) (C) 2011 World Federation for Ultrasound in Medicine & Biology.
Resumo:
Saccharomyces cerevisiae hexokinase-less strains were produced to study the production of ethanol and fructose from sucrose. These strains do not have the hexokinases A and B. Twenty-three double-mutant strains were produced, and then, three were selected for presenting a smaller growth in yeast extract-peptone-fructose. In fermentations with a medium containing sucrose (180.3 g L-1) and with cell recycles, simulating industrial conditions, the capacity of these mutant yeasts in inverting sucrose and fermenting only glucose was well characterized. Besides that, we could also see their great tolerance to the stresses of fermentative recycles, where fructose production (until 90 g L-1) and ethanol production (until 42.3 g L-1) occurred in cycles of 12 h, in which hexokinase-less yeasts performed high growth (51.2% of wet biomass) and viability rates (77% of viable cells) after nine consecutive cycles.
Resumo:
Maize (Zea mays L.) is a very important cereal to world-wide economy which is also true for Brazil, particularly in the South region. Grain yield and plant height have been chosen as important criteria by breeders and farmers from Santa Catarina State (SC), Brazil. The objective of this work was to estimate genetic-statistic parameters associated with genetic gain for grain yield and plant height, in the first cycle of convergent-divergent half-sib selection in a maize population (MPA1) cultivated by farmers within the municipality of Anchieta (SC). Three experiments were carried out in different small farms at Anchieta using low external agronomic inputs; each experiment represented independent samples of half-sib families, which were evaluated in randomized complete blocks with three replications per location. Significant differences among half-sib families were observed for both variables in all experiments. The expected responses to truncated selection of the 25% better families in each experiment were 5.1, 5.8 and 5.2% for reducing plant height and 3.9, 5.7 and 5.0% for increasing grain yield, respectively. The magnitudes of genetic-statistic parameters estimated evidenced that the composite population MPA1 exhibits enough genetic variability to be used in cyclical process of recurrent selection. There were evidences that the genetic structure of the base population MPA1, as indicated by its genetic variability, may lead to expressive changes in the traits under selection, even under low selection pressure.
Resumo:
Causal inference methods - mainly path analysis and structural equation modeling - offer plant physiologists information about cause-and-effect relationships among plant traits. Recently, an unusual approach to causal inference through stepwise variable selection has been proposed and used in various works on plant physiology. The approach should not be considered correct from a biological point of view. Here, it is explained why stepwise variable selection should not be used for causal inference, and shown what strange conclusions can be drawn based upon the former analysis when one aims to interpret cause-and-effect relationships among plant traits.