337 resultados para outliers


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Pós-graduação em Biometria - IBB

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Pós-graduação em Biociências - FCLAS

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Brucella species are Gram-negative bacteria that infect mammals. Recently, two unusual strains (Brucella inopinata BO1(T) and B. inopinata-like BO2) have been isolated from human patients, and their similarity to some atypical brucellae isolated from Australian native rodent species was noted. Here we present a phylogenomic analysis of the draft genome sequences of BO1(T) and BO2 and of the Australian rodent strains 83-13 and NF2653 that shows that they form two groups well separated from the other sequenced Brucella spp. Several important differences were noted. Both BO1(T) and BO2 did not agglutinate significantly when live or inactivated cells were exposed to monospecific A and Mantisera against O-side chain sugars composed of N-formyl-perosamine. While BO1(T) maintained the genes required to synthesize a typical Brucella O-antigen, BO2 lacked many of these genes but still produced a smooth LPS (lipopolysaccharide). Most missing genes were found in the wbk region involved in O-antigen synthesis in classic smooth Brucella spp. In their place, BO2 carries four genes that other bacteria use for making a rhamnose-based O-antigen. Electrophoretic, immunoblot, and chemical analyses showed that BO2 carries an antigenically different O-antigen made of repeating hexose-rich oligosaccharide units that made the LPS water-soluble, which contrasts with the homopolymeric O-antigen of other smooth brucellae that have a phenol-soluble LPS. The results demonstrate the existence of a group of early-diverging brucellae with traits that depart significantly from those of the Brucella species described thus far. IMPORTANCE This report examines differences between genomes from four new Brucella strains and those from the classic Brucella spp. Our results show that the four new strains are outliers with respect to the previously known Brucella strains and yet are part of the genus, forming two new clades. The analysis revealed important information about the evolution and survival mechanisms of Brucella species, helping reshape our knowledge of this important zoonotic pathogen. One discovery of special importance is that one of the strains, BO2, produces an O-antigen distinct from any that has been seen in any other Brucella isolates to date.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We report a morphology-based approach for the automatic identification of outlier neurons, as well as its application to the NeuroMorpho.org database, with more than 5,000 neurons. Each neuron in a given analysis is represented by a feature vector composed of 20 measurements, which are then projected into a two-dimensional space by applying principal component analysis. Bivariate kernel density estimation is then used to obtain the probability distribution for the group of cells, so that the cells with highest probabilities are understood as archetypes while those with the smallest probabilities are classified as outliers. The potential of the methodology is illustrated in several cases involving uniform cell types as well as cell types for specific animal species. The results provide insights regarding the distribution of cells, yielding single and multi-variate clusters, and they suggest that outlier cells tend to be more planar and tortuous. The proposed methodology can be used in several situations involving one or more categories of cells, as well as for detection of new categories and possible artifacts.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Model diagnostics is an integral part of model determination and an important part of the model diagnostics is residual analysis. We adapt and implement residuals considered in the literature for the probit, logistic and skew-probit links under binary regression. New latent residuals for the skew-probit link are proposed here. We have detected the presence of outliers using the residuals proposed here for different models in a simulated dataset and a real medical dataset.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Context. Lithium abundances in open clusters are a very effective probe of mixing processes, and their study can help us to understand the large depletion of lithium that occurs in the Sun. Owing to its age and metallicity, the open cluster M 67 is especially interesting on this respect. Many studies of lithium abundances in M 67 have been performed, but a homogeneous global analysis of lithium in stars from subsolar masses and extending to the most massive members, has yet to be accomplished for a large sample based on high-quality spectra. Aims. We test our non-standard models, which were calibrated using the Sun with observational data. Methods. We collect literature data to analyze, for the first time in a homogeneous way, the non-local thermal equilibrium lithium abundances of all observed single stars in M 67 more massive than similar to 0.9 M-circle dot. Our grid of evolutionary models is computed assuming a non-standard mixing at metallicity [Fe/H] = 0.01, using the Toulouse-Geneva evolution code. Our analysis starts from the entrance into the zero-age main-sequence. Results. Lithium in M 67 is a tight function of mass for stars more massive than the Sun, apart from a few outliers. A plateau in lithium abundances is observed for turn-off stars. Both less massive (M >= 1.10 M-circle dot) and more massive (M >= 1.28 M-circle dot) stars are more depleted than those in the plateau. There is a significant scatter in lithium abundances for any given mass M <= 1.1 M-circle dot. Conclusions. Our models qualitatively reproduce most of the features described above, although the predicted depletion of lithium is 0.45 dex smaller than observed for masses in the plateau region, i.e. between 1.1 and 1.28 solar masses. More work is clearly needed to accurately reproduce the observations. Despite hints that chromospheric activity and rotation play a role in lithium depletion, no firm conclusion can be drawn with the presently available data.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

To understand the regulatory dynamics of transcription factors (TFs) and their interplay with other cellular components we have integrated transcriptional, protein-protein and the allosteric or equivalent interactions which mediate the physiological activity of TFs in Escherichia coli. To study this integrated network we computed a set of network measurements followed by principal component analysis (PCA), investigated the correlations between network structure and dynamics, and carried out a procedure for motif detection. In particular, we show that outliers identified in the integrated network based on their network properties correspond to previously characterized global transcriptional regulators. Furthermore, outliers are highly and widely expressed across conditions, thus supporting their global nature in controlling many genes in the cell. Motifs revealed that TFs not only interact physically with each other but also obtain feedback from signals delivered by signaling proteins supporting the extensive cross-talk between different types of networks. Our analysis can lead to the development of a general framework for detecting and understanding global regulatory factors in regulatory networks and reinforces the importance of integrating multiple types of interactions in underpinning the interrelationships between them.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The choice of an appropriate family of linear models for the analysis of longitudinal data is often a matter of concern for practitioners. To attenuate such difficulties, we discuss some issues that emerge when analyzing this type of data via a practical example involving pretestposttest longitudinal data. In particular, we consider log-normal linear mixed models (LNLMM), generalized linear mixed models (GLMM), and models based on generalized estimating equations (GEE). We show how some special features of the data, like a nonconstant coefficient of variation, may be handled in the three approaches and evaluate their performance with respect to the magnitude of standard errors of interpretable and comparable parameters. We also show how different diagnostic tools may be employed to identify outliers and comment on available software. We conclude by noting that the results are similar, but that GEE-based models may be preferable when the goal is to compare the marginal expected responses.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Abstract Background With the development of DNA hybridization microarray technologies, nowadays it is possible to simultaneously assess the expression levels of thousands to tens of thousands of genes. Quantitative comparison of microarrays uncovers distinct patterns of gene expression, which define different cellular phenotypes or cellular responses to drugs. Due to technical biases, normalization of the intensity levels is a pre-requisite to performing further statistical analyses. Therefore, choosing a suitable approach for normalization can be critical, deserving judicious consideration. Results Here, we considered three commonly used normalization approaches, namely: Loess, Splines and Wavelets, and two non-parametric regression methods, which have yet to be used for normalization, namely, the Kernel smoothing and Support Vector Regression. The results obtained were compared using artificial microarray data and benchmark studies. The results indicate that the Support Vector Regression is the most robust to outliers and that Kernel is the worst normalization technique, while no practical differences were observed between Loess, Splines and Wavelets. Conclusion In face of our results, the Support Vector Regression is favored for microarray normalization due to its superiority when compared to the other methods for its robustness in estimating the normalization curve.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Abstract Background Sugarcane is an increasingly economically and environmentally important C4 grass, used for the production of sugar and bioethanol, a low-carbon emission fuel. Sugarcane originated from crosses of Saccharum species and is noted for its unique capacity to accumulate high amounts of sucrose in its stems. Environmental stresses limit enormously sugarcane productivity worldwide. To investigate transcriptome changes in response to environmental inputs that alter yield we used cDNA microarrays to profile expression of 1,545 genes in plants submitted to drought, phosphate starvation, herbivory and N2-fixing endophytic bacteria. We also investigated the response to phytohormones (abscisic acid and methyl jasmonate). The arrayed elements correspond mostly to genes involved in signal transduction, hormone biosynthesis, transcription factors, novel genes and genes corresponding to unknown proteins. Results Adopting an outliers searching method 179 genes with strikingly different expression levels were identified as differentially expressed in at least one of the treatments analysed. Self Organizing Maps were used to cluster the expression profiles of 695 genes that showed a highly correlated expression pattern among replicates. The expression data for 22 genes was evaluated for 36 experimental data points by quantitative RT-PCR indicating a validation rate of 80.5% using three biological experimental replicates. The SUCAST Database was created that provides public access to the data described in this work, linked to tissue expression profiling and the SUCAST gene category and sequence analysis. The SUCAST database also includes a categorization of the sugarcane kinome based on a phylogenetic grouping that included 182 undefined kinases. Conclusion An extensive study on the sugarcane transcriptome was performed. Sugarcane genes responsive to phytohormones and to challenges sugarcane commonly deals with in the field were identified. Additionally, the protein kinases were annotated based on a phylogenetic approach. The experimental design and statistical analysis applied proved robust to unravel genes associated with a diverse array of conditions attributing novel functions to previously unknown or undefined genes. The data consolidated in the SUCAST database resource can guide further studies and be useful for the development of improved sugarcane varieties.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

[EN] In this work, we describe an implementation of the variational method proposed by Brox et al. in 2004, which yields accurate optical flows with low running times. It has several benefits with respect to the method of Horn and Schunck: it is more robust to the presence of outliers, produces piecewise-smooth flow fields and can cope with constant brightness changes. This method relies on the brightness and gradient constancy assumptions, using the information of the image intensities and the image gradients to find correspondences. It also generalizes the use of continuous L1 functionals, which help mitigate the efect of outliers and create a Total Variation (TV) regularization. Additionally, it introduces a simple temporal regularization scheme that enforces a continuous temporal coherence of the flow fields.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The purpose of this Thesis is to develop a robust and powerful method to classify galaxies from large surveys, in order to establish and confirm the connections between the principal observational parameters of the galaxies (spectral features, colours, morphological indices), and help unveil the evolution of these parameters from $z \sim 1$ to the local Universe. Within the framework of zCOSMOS-bright survey, and making use of its large database of objects ($\sim 10\,000$ galaxies in the redshift range $0 < z \lesssim 1.2$) and its great reliability in redshift and spectral properties determinations, first we adopt and extend the \emph{classification cube method}, as developed by Mignoli et al. (2009), to exploit the bimodal properties of galaxies (spectral, photometric and morphologic) separately, and then combining together these three subclassifications. We use this classification method as a test for a newly devised statistical classification, based on Principal Component Analysis and Unsupervised Fuzzy Partition clustering method (PCA+UFP), which is able to define the galaxy population exploiting their natural global bimodality, considering simultaneously up to 8 different properties. The PCA+UFP analysis is a very powerful and robust tool to probe the nature and the evolution of galaxies in a survey. It allows to define with less uncertainties the classification of galaxies, adding the flexibility to be adapted to different parameters: being a fuzzy classification it avoids the problems due to a hard classification, such as the classification cube presented in the first part of the article. The PCA+UFP method can be easily applied to different datasets: it does not rely on the nature of the data and for this reason it can be successfully employed with others observables (magnitudes, colours) or derived properties (masses, luminosities, SFRs, etc.). The agreement between the two classification cluster definitions is very high. ``Early'' and ``late'' type galaxies are well defined by the spectral, photometric and morphological properties, both considering them in a separate way and then combining the classifications (classification cube) and treating them as a whole (PCA+UFP cluster analysis). Differences arise in the definition of outliers: the classification cube is much more sensitive to single measurement errors or misclassifications in one property than the PCA+UFP cluster analysis, in which errors are ``averaged out'' during the process. This method allowed us to behold the \emph{downsizing} effect taking place in the PC spaces: the migration between the blue cloud towards the red clump happens at higher redshifts for galaxies of larger mass. The determination of $M_{\mathrm{cross}}$ the transition mass is in significant agreement with others values in literature.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Miglioramento delle prestazioni del modello mono-compartimentale del maximum slope dovuto all'introduzione di sistemi per l'eliminazione degli outliers.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Während der Glazialphasen kam es in den europäischen Mittelgebirgen bedingt durch extensive solifluidale Massenbewegungen zur Bildung von Deckschichten. Diese Deckschichten repräsentieren eine Mischung verschiedener Substrate, wie anstehendes Ausgangsgestein, äolische Depositionen und lokale Erzgänge. Die räumliche Ausdehnung der Metallkontaminationen verursacht durch kleinräumige Erzgänge wird durch die periglaziale Solifluktion verstärkt. Das Ziel der vorliegenden Untersuchung war a) den Zusammenhang zwischen den Reliefeigenschaften und den Ausprägungen der solifluidalen Deckschichten und Böden aufzuklären, sowie b) mittels Spurenelementgehalte und Blei-Isotopen-Verhältnisse als Eingangsdaten für Mischungsmodelle die Beitrage der einzelnen Substrate zum Ausgangsmaterial der Bodenbildung zu identifizieren und quantifizieren und c) die räumliche Verteilung von Blei (Pb) in Deckschichten, die über Bleierzgänge gewandert sind, untersucht, die Transportweite des erzbürtigen Bleis berechnet und die kontrollierenden Faktoren der Transportweite bestimmt werden. Sechs Transekte im südöstlichen Rheinischen Schiefergebirge, einschließlich der durch periglaziale Solifluktion entwickelten Böden, wurden untersucht. Die bodenkundliche Geländeaufnahme erfolgte nach AG Boden (2005). O, A, B und C-Horizontproben wurden auf ihre Spurenelementgehalte und teilweise auf ihre 206Pb/207Pb-Isotopenverhältnisse analysiert. Die steuernden Faktoren der Verteilung und Eigenschaften periglazialer Deckschichten sind neben der Petrographie, Reliefeigenschaften wie Exposition, Hangneigung, Hangposition und Krümmung. Die Reliefanalyse zeigt geringmächtige Deckschichten in divergenten, konvexen Hangbereichen bei gleichzeitig hohem Skelettgehalt. In konvergent, konkaven Hangbereichen nimmt die Deckschichtenmächtigkeit deutlich zu, bei gleichzeitig zunehmendem Lösslehm- und abnehmendem Skelettgehalt. Abhängig von den Reliefeigenschaften und -positionen reichen die ausgeprägten Bodentypen von sauren Braunerden bis hin zu Pseudogley-Parabraunerden. Des Weiteren kommen holozäne Kolluvien in eher untypischen Reliefpositionen wie langgestreckten, kaum geneigten Hangbereichen oder Mittelhangbereichen vor. Außer für Pb bewegen sich die Spurenelementgehalte im Rahmen niedriger Hintergrundgehalte. Die Pb-Gehalte liegen zwischen 20-135 mg kg-1. Abnehmende Spurenelementgehalte und Isotopensignaturen (206Pb/207Pb-Isotopenverhältnisse) von Pb zeigen, dass nahezu kein Pb aus atmosphärischen Depositionen in die B-Horizonte verlagert wurde. Eine Hauptkomponentenanalyse (PCA) der Spurenelementgehalte hat vier Hauptsubstratquellen der untersuchten B-Horizonte identifiziert (Tonschiefer, Löss, Laacher-See-Tephra [LST] und lokale Pb-Erzgänge). Mittels 3-Komponenten-Mischungsmodell, das Tonschiefer, Löss und LST einschloss, konnten, bis auf 10 Ausreißer, die Spurenelementgehalte aller 120 B-Horizontproben erklärt werden. Der Massenbeitrag des Pb-Erzes zur Substratmischung liegt bei <0,1%. Die räumliche Pb-Verteilung zeigt Bereiche lokaler Pb-Gehaltsmaxima hangaufwärtiger Pb-Erzgänge. Mittels eines 206Pb/207Pb-Isotopenverhältnis-Mischungsmodells konnten 14 Bereiche erhöhter lokaler Pb-Gehaltsmaxima ausgewiesen werden, die 76-100% erzbürtigen Bleis enthalten. Mit Hilfe eines Geographischen Informationssystems wurden die Transportweiten des erzbürtigen Bleis mit 30 bis 110 m bestimmt. Die steuerenden Faktoren der Transportweite sind dabei die Schluffkonzentration und die Vertikalkrümmung. Diese Untersuchung zeigt, dass Reliefeigenschaften und Reliefposition einen entscheidenden Einfluss auf die Ausprägung der Deckschichten und Böden im europäischen Mittelgebirgsbereich haben. Mischungsmodelle in Kombination mit Spurenelementanalysen und Isotopenverhältnissen stellen ein wichtiges Werkzeug zur Bestimmung der Beiträge der einzelnen Glieder in Bodensubstratmischungen dar. Außerdem können lokale Bleierzgänge die natürlichen Pb-Gehalte in Böden, entwickelt in periglazialen Deckschichten der letzten Vereisungsphase (Würm), bis über 100 m Entfernung erhöhen.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Seventeen bones (sixteen cadaveric bones and one plastic bone) were used to validate a method for reconstructing a surface model of the proximal femur from 2D X-ray radiographs and a statistical shape model that was constructed from thirty training surface models. Unlike previously introduced validation studies, where surface-based distance errors were used to evaluate the reconstruction accuracy, here we propose to use errors measured based on clinically relevant morphometric parameters. For this purpose, a program was developed to robustly extract those morphometric parameters from the thirty training surface models (training population), from the seventeen surface models reconstructed from X-ray radiographs, and from the seventeen ground truth surface models obtained either by a CT-scan reconstruction method or by a laser-scan reconstruction method. A statistical analysis was then performed to classify the seventeen test bones into two categories: normal cases and outliers. This classification step depends on the measured parameters of the particular test bone. In case all parameters of a test bone were covered by the training population's parameter ranges, this bone is classified as normal bone, otherwise as outlier bone. Our experimental results showed that statistically there was no significant difference between the morphometric parameters extracted from the reconstructed surface models of the normal cases and those extracted from the reconstructed surface models of the outliers. Therefore, our statistical shape model based reconstruction technique can be used to reconstruct not only the surface model of a normal bone but also that of an outlier bone.