927 resultados para Negative Selection Algorithm
Resumo:
Doutoramento em Matemática
Resumo:
The K-means algorithm is one of the most popular clustering algorithms in current use as it is relatively fast yet simple to understand and deploy in practice. Nevertheless, its use entails certain restrictive assumptions about the data, the negative consequences of which are not always immediately apparent, as we demonstrate. While more flexible algorithms have been developed, their widespread use has been hindered by their computational and technical complexity. Motivated by these considerations, we present a flexible alternative to K-means that relaxes most of the assumptions, whilst remaining almost as fast and simple. This novel algorithm which we call MAP-DP (maximum a-posteriori Dirichlet process mixtures), is statistically rigorous as it is based on nonparametric Bayesian Dirichlet process mixture modeling. This approach allows us to overcome most of the limitations imposed by K-means. The number of clusters K is estimated from the data instead of being fixed a-priori as in K-means. In addition, while K-means is restricted to continuous data, the MAP-DP framework can be applied to many kinds of data, for example, binary, count or ordinal data. Also, it can efficiently separate outliers from the data. This additional flexibility does not incur a significant computational overhead compared to K-means with MAP-DP convergence typically achieved in the order of seconds for many practical problems. Finally, in contrast to K-means, since the algorithm is based on an underlying statistical model, the MAP-DP framework can deal with missing data and enables model testing such as cross validation in a principled way. We demonstrate the simplicity and effectiveness of this algorithm on the health informatics problem of clinical sub-typing in a cluster of diseases known as parkinsonism.
Resumo:
A dedicated algorithm for sparse spectral representation of music sound is presented. The goal is to enable the representation of a piece of music signal as a linear superposition of as few spectral components as possible, without affecting the quality of the reproduction. A representation of this nature is said to be sparse. In the present context sparsity is accomplished by greedy selection of the spectral components, from an overcomplete set called a dictionary. The proposed algorithm is tailored to be applied with trigonometric dictionaries. Its distinctive feature being that it avoids the need for the actual construction of the whole dictionary, by implementing the required operations via the fast Fourier transform. The achieved sparsity is theoretically equivalent to that rendered by the orthogonal matching pursuit (OMP) method. The contribution of the proposed dedicated implementation is to extend the applicability of the standard OMP algorithm, by reducing its storage and computational demands. The suitability of the approach for producing sparse spectral representation is illustrated by comparison with the traditional method, in the line of the short time Fourier transform, involving only the corresponding orthonormal trigonometric basis.
Resumo:
The relative role of drift versus selection underlying the evolution of bacterial species within the gut microbiota remains poorly understood. The large sizes of bacterial populations in this environment suggest that even adaptive mutations with weak effects, thought to be the most frequently occurring, could substantially contribute to a rapid pace of evolutionary change in the gut. We followed the emergence of intra-species diversity in a commensal Escherichia coli strain that previously acquired an adaptive mutation with strong effect during one week of colonization of the mouse gut. Following this first step, which consisted of inactivating a metabolic operon, one third of the subsequent adaptive mutations were found to have a selective effect as high as the first. Nevertheless, the order of the adaptive steps was strongly affected by a mutational hotspot with an exceptionally high mutation rate of 10-5. The pattern of polymorphism emerging in the populations evolving within different hosts was characterized by periodic selection, which reduced diversity, but also frequency-dependent selection, actively maintaining genetic diversity. Furthermore, the continuous emergence of similar phenotypes due to distinct mutations, known as clonal interference, was pervasive. Evolutionary change within the gut is therefore highly repeatable within and across hosts, with adaptive mutations of selection coefficients as strong as 12% accumulating without strong constraints on genetic background. In vivo competitive assays showed that one of the second steps (focA) exhibited positive epistasis with the first, while another (dcuB) exhibited negative epistasis. The data shows that strong effect adaptive mutations continuously recur in gut commensal bacterial species.
Resumo:
The depredation of semi-domesticated reindeer by large carnivores reflects an important human-wildlife conflict in Fennoscandia. Recent studies have revealed that brown bears (Ursus arctos) may kill substantial numbers of reindeer calves (Rangifer tarandus tarandus) in forest areas in Sweden. Several authors have suggested that predation risk is an important driver of habitat selection in wild Rangifer populations where predation is a limiting factor, but little is known about these mechanisms in semi-domesticated populations. We examined the habitat selection of female reindeer in relation to spatial and temporal variations in brown bear predation risk on the reindeer calving grounds and evaluated the simultaneous responses of brown bears and reindeer to landscape characteristics. We used GPS data from 110 reindeer years (97 individuals) and 29 brown bear years (19 individuals), from two reindeer herding districts in the forest area of northern Sweden. Our results did not indicate that reindeer alter their behavior in response to spatiotemporal variation in brown bear predation risk, on the scale of the calving range. Instead, we suggest that spatiotemporal behavioral adjustments by brown bears were the main driver of prey-predator interactions in our study system. Contrasting responses by brown bears and reindeer to clear-cuts and young forest indicate that forestry can influence species interactions and possibly yield negative consequences for the reindeer herd. Even if clear-cuts may be beneficial in terms of calf survival, logging activity will eventually cause greater abundance of young regenerating forest, reducing available reindeer habitats and increasing habitat preferred by brown bears. Domestication may have made semi-domesticated reindeer in Fennoscandia less adapted to cope with predators. Areal restrictions, limiting the opportunity for dispersion and escape, possibly make the calves more susceptible to predation. Also, a generally higher population density in semi-domesticated herds compared to wild populations can make dispersion a less efficient strategy and the reindeer calves easier prey. Overall, the lack of ability of the reindeer females to reduce brown bear encounter risk on the scale of the calving range is probably an important reason for the high brown bear predation rates on reindeer calves documented in our study areas.
Resumo:
This thesis focuses on finding the optimum block cutting dimensions in terms of the environmental and economic factors by using a 3D algorithm for a limestone quarry in Foggia, Italy. The environmental concerns of quarrying operations are mainly: energy consumption, material waste, and pollution. The main economic concerns are the block recovery, the selling prices, and the production costs. Fractures adversely affect the block recovery ratio. With a fracture model, block production can be optimized. In this research, the waste volume produced by quarrying was minimised to increase the recovery ratio and ensure economic benefits. SlabCutOpt is a software developed at DICAM–University of Bologna for block cutting optimization which tests different cutting angles on the x-y-z planes to offer up alternative cutting methods. The program tests several block sizes and outputs the optimal result for each entry. By using SlabCutOpt, ten different block dimensions were analysed, the results indicated the maximum number of non-intersecting blocks for each dimension. After analysing the outputs, the block named number 1 with the dimensions ‘1mx1mx1m’ had the highest recovery ratio as 43% and the total Relative Money Value (RMV) with a value of 22829. Dimension number 1, also had the lowest waste volume, with a value of 3953.25 m3, for the total bench. For cutting the total bench volume of 6932.25m3, the diamond wire cutter had the lowest dust emission values for the block with the dimension ‘2mx2mx2m’, with a value of 24m3. When compared with the Eco-Label standards, block dimensions having surface area values lower than 15m2, were found to fit the natural resource waste criteria of the label, as the threshold required 25% of minimum recovery [1]. Due to the relativity of production costs, together with the Eco-Label threshold, the research recommends the selection of the blocks with a surface area value between 6m2 and 14m2.
Resumo:
The aim of this thesis project is to automatically localize HCC tumors in the human liver and subsequently predict if the tumor will undergo microvascular infiltration (MVI), the initial stage of metastasis development. The input data for the work have been partially supplied by Sant'Orsola Hospital and partially downloaded from online medical databases. Two Unet models have been implemented for the automatic segmentation of the livers and the HCC malignancies within it. The segmentation models have been evaluated with the Intersection-over-Union and the Dice Coefficient metrics. The outcomes obtained for the liver automatic segmentation are quite good (IOU = 0.82; DC = 0.35); the outcomes obtained for the tumor automatic segmentation (IOU = 0.35; DC = 0.46) are, instead, affected by some limitations: it can be state that the algorithm is almost always able to detect the location of the tumor, but it tends to underestimate its dimensions. The purpose is to achieve the CT images of the HCC tumors, necessary for features extraction. The 14 Haralick features calculated from the 3D-GLCM, the 120 Radiomic features and the patients' clinical information are collected to build a dataset of 153 features. Now, the goal is to build a model able to discriminate, based on the features given, the tumors that will undergo MVI and those that will not. This task can be seen as a classification problem: each tumor needs to be classified either as “MVI positive” or “MVI negative”. Techniques for features selection are implemented to identify the most descriptive features for the problem at hand and then, a set of classification models are trained and compared. Among all, the models with the best performances (around 80-84% ± 8-15%) result to be the XGBoost Classifier, the SDG Classifier and the Logist Regression models (without penalization and with Lasso, Ridge or Elastic Net penalization).
Resumo:
Despite the remarkable improvements in breast cancer (BC) characterization, accurate prediction of BC clinical behavior is often still difficult to achieve. Some studies have investigated the association between the molecular subtype, namely the basal-like BC and the pattern of relapse, however only few investigated the association between relapse pattern and immunohistochemical defined triple-negative breast cancers (TNBCs). The aim of this study was to evaluate the pattern of relapse in patients with TNBC, namely the primary distant relapse site. One-hundred twenty nine (129) invasive breast carcinomas with follow-up information were classified according to the molecular subtype using immunohistochemistry for ER, PgR and Her2. The association between TNBC and distant relapse primary site was analyzed by logistic regression. Using multivariate logistic regression analysis patients with TNBC displayed only 0.09 (95% CI: 0.00-0.74; p=0.02) the odds of the non-TNBC patients of developing bone primary relapse. Regarding visceral and lymph-node relapse, no differences between in this cohort were found. Though classically regarded as aggressive tumors, TNBCs rarely development primary relapse in bone when compared to non-TNBC, a clinical relevant fact when investigating a metastasis of an occult or non-sampled primary BC.
Resumo:
Lipidic mixtures present a particular phase change profile highly affected by their unique crystalline structure. However, classical solid-liquid equilibrium (SLE) thermodynamic modeling approaches, which assume the solid phase to be a pure component, sometimes fail in the correct description of the phase behavior. In addition, their inability increases with the complexity of the system. To overcome some of these problems, this study describes a new procedure to depict the SLE of fatty binary mixtures presenting solid solutions, namely the Crystal-T algorithm. Considering the non-ideality of both liquid and solid phases, this algorithm is aimed at the determination of the temperature in which the first and last crystal of the mixture melts. The evaluation is focused on experimental data measured and reported in this work for systems composed of triacylglycerols and fatty alcohols. The liquidus and solidus lines of the SLE phase diagrams were described by using excess Gibbs energy based equations, and the group contribution UNIFAC model for the calculation of the activity coefficients of both liquid and solid phases. Very low deviations of theoretical and experimental data evidenced the strength of the algorithm, contributing to the enlargement of the scope of the SLE modeling.
Resumo:
We assessed associations between steroid receptors including: estrogen-alpha, estrogen-beta, androgen receptor, progesterone receptor, the HER2 status and triple-negative epithelial ovarian cancer (ERα-/PR-/HER2-; TNEOC) status and survival in women with epithelial ovarian cancer. The study included 152 women with primary epithelial ovarian cancer. The status of steroid receptor and HER2 was determined by immunohistochemistry. Disease-free and overall survival were calculated and compared with steroid receptor and HER2 status as well as clinicopathological features using the Cox Proportional Hazards model. A mean follow-up period of 43.6 months (interquartile range=41.4 months) was achieved where 44% of patients had serous tumor, followed by mucinous (23%), endometrioid (9%), mixed (9%), undifferentiated (8.5%) and clear cell tumors (5.3%). ER-alpha staining was associated with grade II-III tumors. Progesterone receptor staining was positively associated with a Body Mass Index≥25. Androgen receptor positivity was higher in serous tumors. In stand-alone analysis of receptor contribution to survival, estrogen-alpha positivity was associated with greater disease-free survival. However, there was no significant association between steroid receptor expression, HER2 status, or TNEOC status, and overall survival. Although estrogen-alpha, androgen receptor, progesterone receptor and the HER2 status were associated with key clinical features of the women and pathological characteristics of the tumors, these associations were not implicated in survival. Interestingly, women with TNEOC seem to fare the same way as their counterparts with non-TNEOC.
Resumo:
The genera Cochliomyia and Chrysomya contain both obligate and saprophagous flies, which allows the comparison of different feeding habits between closely related species. Among the different strategies for comparing these habits is the use of qPCR to investigate the expression levels of candidate genes involved in feeding behavior. To ensure an accurate measure of the levels of gene expression, it is necessary to normalize the amount of the target gene with the amount of a reference gene having a stable expression across the compared species. Since there is no universal gene that can be used as a reference in functional studies, candidate genes for qPCR data normalization were selected and validated in three Calliphoridae (Diptera) species, Cochliomyia hominivorax Coquerel, Cochliomyia macellaria Fabricius, and Chrysomya albiceps Wiedemann . The expression stability of six genes ( Actin, Gapdh, Rp49, Rps17, α -tubulin, and GstD1) was evaluated among species within the same life stage and between life stages within each species. The expression levels of Actin, Gapdh, and Rp49 were the most stable among the selected genes. These genes can be used as reliable reference genes for functional studies in Calliphoridae using similar experimental settings.
Resumo:
Although MRI is utilized for planning the resection of soft-tissue tumors, it is not always capable of differentiating benign from malignant lesions. The risk of local recurrence of soft-tissue sarcomas is increased when biopsies are performed before resection and by inadequate resections. PET associated with computed tomography using fluorodeoxyglucose labeled with fluorine-18 ((18)F-FDG PET/CT) may help differentiate between benign and malignant tumors, thus avoiding inadequate resections and making prior biopsies unnecessary. The purpose of this study was to evaluate the usefulness of (18)F-FDG PET/CT in differentiating benign from malignant solid soft-tissue lesions. Patients with solid lesions of the limbs or abdominal wall detected by MRI were submitted to (18)F-FDG PET/CT. The maximum standardized uptake value (SUVmax) cutoff was determined to differentiate malignant from benign tumors. Regardless of the (18)F-FDG PET/CT results all patients underwent biopsy and surgery. MRI was performed in 54 patients, and 10 patients were excluded because of purely lipomatose or cystic lesions. (18)F-FDG PET/CT was performed in the remaining 44 patients. Histopathology revealed 26 (59%) benign and 18 (41%) malignant soft-tissue lesions. A significant difference in SUVmax was observed between benign and malignant soft-tissue lesions. The SUVmax cutoff of 3.0 differentiated malignant from benign lesions with 100% sensitivity, 83.3% specificity, 89.6% accuracy, 78.3% positive predictive value, and 100% negative predictive value. (18)F-FDG PET/CT seems to be able to differentiate benign from malignant soft-tissue lesions with good accuracy and very high negative predictive value. Incorporating (18)F-FDG PET/CT into the diagnostic algorithm of these patients may prevent inadequate resections and unnecessary biopsies.
Resumo:
To evaluate whether dyspareunia is associated with HIV status in menopausal women and also to assess which factors are associated with dyspareunia in a group of HIV-positive menopausal women. A cross-sectional study was conducted with 178 HIV-negative and 128 HIV-positive women aged 40-60 years. The Short Personal Experiences Questionnaire (SPEQ) was used to collect data. Sociodemographic, clinical, behavioural and reproductive factors were evaluated, as well as factors related to the HIV infection. Dyspareunia was defined as pain during intercourse. A bivariate analysis and Poisson multiple regression analysis were performed. Overall, 41.4% of the HIV-positive women reported dyspareunia compared with 34.8% of the HIV-negative women (p=0.242). In the HIV-positive women, bivariate analysis revealed an association between dyspareunia and having a steady partner (p=0.047); the woman's partner having undergone HIV testing (p=0.020); vaginal dryness (p<0.001); muscle/joint pain (p=0.021); physical/emotional violence (p=0.049); urinary incontinence (p=0.004); and the use of lamivudine/zidovudine (p=0.048). The Poisson multiple regression analysis found an association between dyspareunia and vaginal dryness (prevalence ratio (PR)=1.96, 95% CI 1.10 to 3.50, p=0.023) and urinary incontinence (PR=1.86, 95% CI 1.06 to 3.27, p=0.031). Dyspareunia was common in this group of HIV-positive women and was associated principally with vaginal dryness and urinary incontinence. The importance of treating dyspareunia within the context of sexual health in this group of women should be emphasised and appropriate management of this issue may reduce the likelihood of lesions on the vaginal wall, which may act as a portal of entry for other infections.
Resumo:
El Niño South Oscillation (ENSO) is one climatic phenomenon related to the inter-annual variability of global meteorological patterns influencing sea surface temperature and rainfall variability. It influences human health indirectly through extreme temperature and moisture conditions that may accelerate the spread of some vector-borne viral diseases, like dengue fever (DF). This work examines the spatial distribution of association between ENSO and DF in the countries of the Americas during 1995-2004, which includes the 1997-1998 El Niño, one of the most important climatic events of 20(th) century. Data regarding the South Oscillation index (SOI), indicating El Niño-La Niña activity, were obtained from Australian Bureau of Meteorology. The annual DF incidence (AIy) by country was computed using Pan-American Health Association data. SOI and AIy values were standardised as deviations from the mean and plotted in bars-line graphics. The regression coefficient values between SOI and AIy (rSOI,AI) were calculated and spatially interpolated by an inverse distance weighted algorithm. The results indicate that among the five years registering high number of cases (1998, 2002, 2001, 2003 and 1997), four had El Niño activity. In the southern hemisphere, the annual spatial weighted mean centre of epidemics moved southward, from 6° 31' S in 1995 to 21° 12' S in 1999 and the rSOI,AI values were negative in Cuba, Belize, Guyana and Costa Rica, indicating a synchrony between higher DF incidence rates and a higher El Niño activity. The rSOI,AI map allows visualisation of a graded surface with higher values of ENSO-DF associations for Mexico, Central America, northern Caribbean islands and the extreme north-northwest of South America.
Resumo:
Although Brazil is the third largest fruit producer in the world, several specimens consumed are not well studied from the chemical viewpoint, especially for quantitative analysis. For this reason and the crescent employment of mass spectrometry (MS) techniques in food science we selected twenty-two phenolic compounds with important biological activities and developed an ultra-high performance liquid chromatography tandem mass spectrometry (UHPLC-MS/MS) method using electrospray (ESI) in negative ion mode aiming their quantification in largely consumed Brazilian fruits (açaí-do-Amazonas, acerola, cashew apple, camu-camu, pineapple and taperebá). Multiple reaction monitoring (MRM) was applied and the selection of proper product ions for each transition assured high selectivity. Linearity (0.995