892 resultados para Two-stage classification
Resumo:
This Ph.D. project aimed to the development and improvement of analytical solutions for control of quality and authenticity of virgin olive oils. According to this main objective, different research activities were carried out: concerning the quality control of olive oil, two of the official parameters defined by regulations (free acidity and fatty acid ethyl esters) were taken into account, and more sustainable and easier analytical solutions were developed and validated in-house. Regarding authenticity, two different issues were faced: verification of the geographical origin of extra virgin (EVOOs) and virgin olive oils (VOOs), and assessment of soft-deodorized oils illegally mixed with EVOOs. About fatty acid ethyl esters, a revised method based on the application of off-line HPLC-GC-FID (with PTV injector), revising both the preparative phase and the GC injector required in the official method, was developed. Next, the method was in-house validated evaluating several parameters. Concerning free acidity, a portable system suitable for in-situ measurements of VOO free acidity was developed and in-house validated. Its working principle is based on the estimation of the olive oil free acidity by measuring the conductance of an emulsion between a hydro-alcoholic solution and the sample to be tested. The procedure is very quick and easy and, therefore, suitable for people without specific training. Another study developed during the Ph.D. was about the application of flash gas chromatography for volatile compounds analysis, combined with untargeted chemometric data elaborations, for discrimination of EVOOs and VOOs of different geographical origin. A set of 210 samples coming from different EU member states and extra-EU countries were collected and analyzed. Data were elaborated applying two different classification techniques, one linear (PLS-DA) and one non-linear (ANN). Finally, a preliminary study about the application of GC-IMS (Gas Chromatograph - Ion Mobility Spectrometer) for assessment of soft-deodorized olive oils was carried out.
Resumo:
The Three-Dimensional Single-Bin-Size Bin Packing Problem is one of the most studied problem in the Cutting & Packing category. From a strictly mathematical point of view, it consists of packing a finite set of strongly heterogeneous “small” boxes, called items, into a finite set of identical “large” rectangles, called bins, minimizing the unused volume and requiring that the items are packed without overlapping. The great interest is mainly due to the number of real-world applications in which it arises, such as pallet and container loading, cutting objects out of a piece of material and packaging design. Depending on these real-world applications, more objective functions and more practical constraints could be needed. After a brief discussion about the real-world applications of the problem and a exhaustive literature review, the design of a two-stage algorithm to solve the aforementioned problem is presented. The algorithm must be able to provide the spatial coordinates of the placed boxes vertices and also the optimal boxes input sequence, while guaranteeing geometric, stability, fragility constraints and a reduced computational time. Due to NP-hard complexity of this type of combinatorial problems, a fusion of metaheuristic and machine learning techniques is adopted. In particular, a hybrid genetic algorithm coupled with a feedforward neural network is used. In the first stage, a rich dataset is created starting from a set of real input instances provided by an industrial company and the feedforward neural network is trained on it. After its training, given a new input instance, the hybrid genetic algorithm is able to run using the neural network output as input parameter vector, providing as output the optimal solution. The effectiveness of the proposed works is confirmed via several experimental tests.
Resumo:
Over the last century, mathematical optimization has become a prominent tool for decision making. Its systematic application in practical fields such as economics, logistics or defense led to the development of algorithmic methods with ever increasing efficiency. Indeed, for a variety of real-world problems, finding an optimal decision among a set of (implicitly or explicitly) predefined alternatives has become conceivable in reasonable time. In the last decades, however, the research community raised more and more attention to the role of uncertainty in the optimization process. In particular, one may question the notion of optimality, and even feasibility, when studying decision problems with unknown or imprecise input parameters. This concern is even more critical in a world becoming more and more complex —by which we intend, interconnected —where each individual variation inside a system inevitably causes other variations in the system itself. In this dissertation, we study a class of optimization problems which suffer from imprecise input data and feature a two-stage decision process, i.e., where decisions are made in a sequential order —called stages —and where unknown parameters are revealed throughout the stages. The applications of such problems are plethora in practical fields such as, e.g., facility location problems with uncertain demands, transportation problems with uncertain costs or scheduling under uncertain processing times. The uncertainty is dealt with a robust optimization (RO) viewpoint (also known as "worst-case perspective") and we present original contributions to the RO literature on both the theoretical and practical side.
Resumo:
In this PhD thesis a new firm level conditional risk measure is developed. It is named Joint Value at Risk (JVaR) and is defined as a quantile of a conditional distribution of interest, where the conditioning event is a latent upper tail event. It addresses the problem of how risk changes under extreme volatility scenarios. The properties of JVaR are studied based on a stochastic volatility representation of the underlying process. We prove that JVaR is leverage consistent, i.e. it is an increasing function of the dependence parameter in the stochastic representation. A feasible class of nonparametric M-estimators is introduced by exploiting the elicitability of quantiles and the stochastic ordering theory. Consistency and asymptotic normality of the two stage M-estimator are derived, and a simulation study is reported to illustrate its finite-sample properties. Parametric estimation methods are also discussed. The relation with the VaR is exploited to introduce a volatility contribution measure, and a tail risk measure is also proposed. The analysis of the dynamic JVaR is presented based on asymmetric stochastic volatility models. Empirical results with S&P500 data show that accounting for extreme volatility levels is relevant to better characterize the evolution of risk. The work is complemented by a review of the literature, where we provide an overview on quantile risk measures, elicitable functionals and several stochastic orderings.
Resumo:
Latency can be defined as the sum of the arrival times at the customers. Minimum latency problems are specially relevant in applications related to humanitarian logistics. This thesis presents algorithms for solving a family of vehicle routing problems with minimum latency. First the latency location routing problem (LLRP) is considered. It consists of determining the subset of depots to be opened, and the routes that a set of homogeneous capacitated vehicles must perform in order to visit a set of customers such that the sum of the demands of the customers assigned to each vehicle does not exceed the capacity of the vehicle. For solving this problem three metaheuristic algorithms combining simulated annealing and variable neighborhood descent, and an iterated local search (ILS) algorithm, are proposed. Furthermore, the multi-depot cumulative capacitated vehicle routing problem (MDCCVRP) and the multi-depot k-traveling repairman problem (MDk-TRP) are solved with the proposed ILS algorithm. The MDCCVRP is a special case of the LLRP in which all the depots can be opened, and the MDk-TRP is a special case of the MDCCVRP in which the capacity constraints are relaxed. Finally, a LLRP with stochastic travel times is studied. A two-stage stochastic programming model and a variable neighborhood search algorithm are proposed for solving the problem. Furthermore a sampling method is developed for tackling instances with an infinite number of scenarios. Extensive computational experiments show that the proposed methods are effective for solving the problems under study.
Resumo:
Background and purpose: Breast cancer continues to be a health problem for women, representing 28 percent of all female cancers and remaining one of the leading causes of death for women. Breast cancer incidence rates become substantial before the age of 50. After menopause, breast cancer incidence rates continue to increase with age creating a long-lasting source of concern (Harris et al., 1992). Mammography, a technique for the detection of breast tumors in their nonpalpable stage when they are most curable, has taken on considerable importance as a public health measure. The lifetime risk of breast cancer is approximately 1 in 9 and occurs over many decades. Recommendations are that screening be periodic in order to detect cancer at early stages. These recommendations, largely, are not followed. Not only are most women not getting regular mammograms, but this circumstance is particularly the case among older women where regular mammography has been proven to reduce mortality by approximately 30 percent. The purpose of this project was to increase our understanding of factors that are associated with stage of readiness to obtain subsequent mammograms. A secondary purpose of this research was to suggest further conceptual considerations toward the extension of the Transtheoretical Model (TTM) of behavior change to repeat screening mammography. ^ Methods. A sample (n = 1,222) of women 50 years and older in a large multi-specialty clinic in Houston, Texas was surveyed by mail questionnaire regarding their previous screening experience and stage of readiness to obtain repeat screening. A computerized database, maintained on all women who undergo mammography at the clinic, was used to identify women who are eligible for the project. The major statistical technique employed to select the significant variables and to examine the man and interaction effects of independent variables on dependent variables was polychotomous stepwise, logistic regression. A prediction model for each stage of readiness definition was estimated. The expected probabilities for stage of readiness were calculated to assess the magnitude and direction of significant predictors. ^ Results. Analysis showed that both ways of defining stage of readiness for obtaining a screening mammogram were associated with specific constructs, including decisional balance and processes of the change. ^ Conclusions. The results of the present study demonstrate that the TTM appears to translate to repeat mammography screening. Findings in the current study also support finding of previous studies that suggest that stage of readiness is associated with respondent decisional balance and the processes of change. ^
Resumo:
Near-infrared spectroscopy (NIRS) was used to analyse the crude protein content of dried and milled samples of wheat and to discriminate samples according to their stage of growth. A calibration set of 72 samples from three growth stages of wheat (tillering, heading and harvest) and a validation set of 28 samples was collected for this purpose. Principal components analysis (PCA) of the calibration set discriminated groups of samples according to the growth stage of the wheat. Based on these differences, a classification procedure (SIMCA) showed a very accurate classification of the validation set samples : all of them were successfully classified in each group using this procedure when both the residual and the leverage were used in the classification criteria. Looking only at the residuals all the samples were also correctly classified except one of tillering stage that was assigned to both tillering and heading stages. Finally, the determination of the crude protein content of these samples was considered in two ways: building up a global model for all the growth stages, and building up local models for each stage, separately. The best prediction results for crude protein were obtained using a global model for samples in the two first growth stages (tillering and heading), and using a local model for the harvest stage samples.
Resumo:
The pivotal role of spleen CD4(+) T cells in the development of both malaria pathogenesis and protective immunity makes necessary a profound comprehension of the mechanisms involved in their activation and regulation during Plasmodium infection. Herein, we examined in detail the behaviour of non-conventional and conventional splenic CD4(+) T cells during P. chabaudi malaria. We took advantage of the fact that a great proportion of CD4(+) T cells generated in CD1d(-/-) mice are I-A(b)-restricted (conventional cells), while their counterparts in I-Ab(-/-) mice are restricted by CD1d and other class IB major histocompatibility complex (MHC) molecules (non-conventional cells). We found that conventional CD4(+) T cells are the main protagonists of the immune response to infection, which develops in two consecutive phases concomitant with acute and chronic parasitaemias. The early phase of the conventional CD4(+) T cell response is intense and short lasting, rapidly providing large amounts of proinflammatory cytokines and helping follicular and marginal zone B cells to secrete polyclonal immunoglobulin. Both TNF-alpha and IFN-gamma production depend mostly on conventional CD4(+) T cells. IFN-gamma is produced simultaneously by non-conventional and conventional CD4(+) T cells. The early phase of the response finishes after a week of infection, with the elimination of a large proportion of CD4(+) T cells, which then gives opportunity to the development of acquired immunity. Unexpectedly, the major contribution of CD1d-restricted CD4(+) T cells occurs at the beginning of the second phase of the response, but not earlier, helping both IFN-gamma and parasite-specific antibody production. We concluded that conventional CD4(+) T cells have a central role from the onset of P. chabaudi malaria, acting in parallel with non-conventional CD4(+) T cells as a link between innate and acquired immunity. This study contributes to the understanding of malaria immunology and opens a perspective for future studies designed to decipher the molecular mechanisms behind immune responses to Plasmodium infection.
Resumo:
Gasteruptiinae is the largest Gasteruptiidae subfamily, with circa 400 species that have been grouped into the worldwide Gasteruption Latreille. Based on a cladistic analysis with 43 morphological characters, 40 ingroup taxa representing all biogeographic regions, and seven outgroups (four Hyptiogastrinae, two Aulacidae and one Evaniidae), I confirm the monophyly of Gasteruptiinae and Gasteruption and recognize three exclusively Neotropical small genera: Plutofoenus Kieffer (revalidated) (southern South America), Spinolafoenus Macedo n. gen. (Chile) and Trilobitofoenus Macedo n. gen. (Central and South America). Gasteruption, supported by four synapomorphies, remains the most speciose genus in the subfamily. The four Gasteruptiinae genera are keyed and described. Seven species are keyed and described or redescribed: Plutofoenus chaeturus (Schletterer) n. comb., P. edwardsi Turner, P. paraguayensis (Schrottky), Spinolafoenus ruficornis (Spinola) n. comb., Trilobitofoenus alvarengai Macedo n. sp., T. plaumanni Macedo n. sp. and T. sericeus (Cameron) n. comb. (lectotype designated).
Resumo:
Chronic Liver Disease is a progressive, most of the time asymptomatic, and potentially fatal disease. In this paper, a semi-automatic procedure to stage this disease is proposed based on ultrasound liver images, clinical and laboratorial data. In the core of the algorithm two classifiers are used: a k nearest neighbor and a Support Vector Machine, with different kernels. The classifiers were trained with the proposed multi-modal feature set and the results obtained were compared with the laboratorial and clinical feature set. The results showed that using ultrasound based features, in association with laboratorial and clinical features, improve the classification accuracy. The support vector machine, polynomial kernel, outperformed the others classifiers in every class studied. For the Normal class we achieved 100% accuracy, for the chronic hepatitis with cirrhosis 73.08%, for compensated cirrhosis 59.26% and for decompensated cirrhosis 91.67%.
Resumo:
In order to classify mosquito immature stage habitats, samples were taken in 42 localities of Córdoba Province, Argentina, representing the phytogeographic regions of Chaco, Espinal and Pampa. Immature stage habitats were described and classified according to the following criteria: natural or artificial; size; location related to light and neighboring houses; vegetation; water: permanence, movement, turbidity and pH. Four groups of species were associated based on the habitat similarity by means of cluster analysis: Aedes albifasciatus, Culex saltanensis, Cx. mollis, Cx. brethesi, Psorophora ciliata, Anopheles albitarsis, and Uranotaenia lowii (Group A); Cx. acharistus, Cx. quinquefasciatus, Cx. bidens, Cx. dolosus, Cx. maxi and Cx. apicinus (Group B); Cx. coronator, Cx. chidesteri, Mansonia titillans and Ps. ferox (Group C); Ae. fluviatilis and Ae. milleri (Group D). The principal component analysis (ordination method) pointed out that the different types of habitats, their nature (natural or artificial), plant species, water movement and depth are the main characters explaining the observed variation among the mosquito species. The distribution of mosquito species by phytogeographic region did not affect the species groups, since species belonging to different groups were collected in the same region.
Resumo:
This study presents a classification criteria for two-class Cannabis seedlings. As the cultivation of drug type cannabis is forbidden in Switzerland, law enforcement authorities regularly ask laboratories to determine cannabis plant's chemotype from seized material in order to ascertain that the plantation is legal or not. In this study, the classification analysis is based on data obtained from the relative proportion of three major leaf compounds measured by gas-chromatography interfaced with mass spectrometry (GC-MS). The aim is to discriminate between drug type (illegal) and fiber type (legal) cannabis at an early stage of the growth. A Bayesian procedure is proposed: a Bayes factor is computed and classification is performed on the basis of the decision maker specifications (i.e. prior probability distributions on cannabis type and consequences of classification measured by losses). Classification rates are computed with two statistical models and results are compared. Sensitivity analysis is then performed to analyze the robustness of classification criteria.
Resumo:
PURPOSE: To retrospectively assess the influence of prophylactic cranial irradiation (PCI) timing on brain relapse rates in patients treated with two different chemoradiotherapy (CRT) regimens for Stage IIIB non-small-cell lung cancer (NSCLC). METHODS AND MATERIALS: A cohort of 134 patients, with Stage IIIB NSCLC in recursive partitioning analysis Group 1, was treated with PCI (30 Gy at 2 Gy/fr) following one of two CRT regimens. Regimen 1 (n = 58) consisted of three cycles of induction chemotherapy (ICT) followed by concurrent CRT (C-CRT). Regimen 2 (n = 76) consisted of immediate C-CRT during thoracic radiotherapy. RESULTS: At a median follow-up of 27.6 months (range, 7.2-40.4), 65 patients were alive. Median, progression-free, and brain metastasis-free survival (BMFS) times for the whole study cohort were 23.4, 15.4, and 23.0 months, respectively. Median survival time and the 3-year survival rate for regimens 1 and 2 were 19.3 vs. 26.1 months (p = 0.001) and 14.4% vs. 34.4% (p < .001), respectively. Median time from the initiation of primary treatment to PCI was 123.2 (range, 97-161) and 63.4 (range, 55-74) days for regimens 1 and 2, respectively (p < 0.001). Overall, 11 (8.2%) patients developed brain metastasis (BM) during the follow-up period: 8 (13.8%) in regimen 1 and 3 (3.9%) in regimen 2 (p = 0.03). Only 3 (2.2%) patients developed BM at the site of first failure, and for 2 of them, it was also the sole site of recurrence. Median BMFS for regimens 1 and 2 were 17.4 (13.5-21.3) vs. 26.0 (22.9-29.1 months), respectively (p < 0.001). CONCLUSION: These results suggest that in Stage IIIB NSCLC patients treated with PCI, lower BM incidence and longer survival rates result from immediate C-CRT rather than ITC-first regimens. This indicates the benefit of earlier PCI use without delay because of induction protocols.