Biblioteca Digital

988 resultados para clustered binary data

Advances in the stochastic modelling of satellite-derived rainfall estimates using a sparse calibration dataset

Relevância:

80.00% 80.00%

Publicador:

Resumo:

As satellite technology develops, satellite rainfall estimates are likely to become ever more important in the world of food security. It is therefore vital to be able to identify the uncertainty of such estimates and for end users to be able to use this information in a meaningful way. This paper presents new developments in the methodology of simulating satellite rainfall ensembles from thermal infrared satellite data. Although the basic sequential simulation methodology has been developed in previous studies, it was not suitable for use in regions with more complex terrain and limited calibration data. Developments in this work include the creation of a multithreshold, multizone calibration procedure, plus investigations into the causes of an overestimation of low rainfall amounts and the best way to take into account clustered calibration data. A case study of the Ethiopian highlands has been used as an illustration.

Electric load forecasting using a fuzzy ART&ARTMAP neural network

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This work presents a neural network based on the ART architecture ( adaptive resonance theory), named fuzzy ART& ARTMAP neural network, applied to the electric load-forecasting problem. The neural networks based on the ARTarchitecture have two fundamental characteristics that are extremely important for the network performance ( stability and plasticity), which allow the implementation of continuous training. The fuzzy ART& ARTMAP neural network aims to reduce the imprecision of the forecasting results by a mechanism that separate the analog and binary data, processing them separately. Therefore, this represents a reduction on the processing time and improved quality of the results, when compared to the Back-Propagation neural network, and better to the classical forecasting techniques (ARIMA of Box and Jenkins methods). Finished the training, the fuzzy ART& ARTMAP neural network is capable to forecast electrical loads 24 h in advance. To validate the methodology, data from a Brazilian electric company is used. (C) 2004 Elsevier B.V. All rights reserved.

Variabilidade genética intrapopulacional em Myracrodruon urundeuva Fr. All. por marcador AFLP

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Levels of genetic variability for in situ and ex situ genetic conservation were estimated in a population of Myracrodruon urundeuva using the PCR (polymerase chain reaction) technique with the AFLP (Amplified fragment-length polymorphism) genetic marker. Seeds for progeny tests were collected from 30 open-pollination trees (matrices) at Paulo de Faria Ecological Station - SP. From this genetic material, three progeny tests were installed on the Teaching and Research Farm of Ilha Solteira Faculty of Engineering - University of São Paulo State (UNESP), which is located in Selvlria - MS, Brazil. The analysis by genetic marker was conducted with three combinations of different starters EcoRl-Msel, resulting in a total number of 137 polymorphic bands, thus forming a table of binary data. These data were used for the analysis of genetic divergence and distance between progenies. High levels of genetic divergence were observed among families. Based on the Analysis of Molecular Variance (AMOVA), it was shown that 16.2% of genetic diversity is found among progenies and 83.8% within progenies, which suggests deviances of random matings. The grouping of progenies, based on genetic distances, suggests that progenies deriving from trees which are close to each other tend to be more similar. This, in turn, indicates that the population originating the seeds may be genetically structured.

Detection of cavitated approximal surfaces using cone beam CT and intraoral receptors

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objectives: The aim of this study was to compare cone beam CT (CBCT) in a small field of view (FOV) with a solid-state sensor and a photostimulable phosphor plate system for detection of cavitated approximal surfaces. Methods: 257 non-filled approximal surfaces from human permanent premolars and molars were recorded by two intraoral digital receptors, a storage phosphor plate (Digora Optime, Soredex) and a solid-state CMOS sensor (Digora Toto, Soredex), and scanned in a cone beam CT unit (3D Accuitomo FPD80, Morita) with a FOV of 4 cm and a voxel size of 0.08 mm. Image sections were carried out in the axial and mesiodistal tooth planes. Six observers recorded surface cavitation in all images. Validation of the true absence or presence of surface cavitation was performed by inspecting the surfaces under strong light with the naked eye. Differences in sensitivity, specificity and agreement were estimated by analysing the binary data in a generalized linear model using an identity link function. Results: A significantly higher sensitivity was obtained by all observers with CBCT (p,0.001), which was not compromised by a lower specificity. Therefore, a significantly higher overall agreement was obtained with CBCT (p,0.001). There were no significant differences between the Digora Optime phosphor plate system and the Digora Toto CMOS sensor for any parameter. Conclusions: CBCT was much more accurate in the detection of surface cavitation in approximal surfaces than intraoral receptors. The differences are interpreted as clinically significant. A CBCT examination performed for other reasons should also be assessed for approximal surface cavities in teeth without restorations. © 2013 The British Institute of Radiology.

Resistência da seringueira ao mal das folhas e modelagem no patossistema Hevea sp. – Microcyclus ulei através dos parâmetros monocíclicos

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Pós-graduação em Ciência Florestal - FCA

Análise dinâmica de contingências de sistemas de energia elétrica por redes neurais baseadas na teoria da ressonância adaptativa

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Pós-graduação em Engenharia Elétrica - FEIS

Caracterização da diversidade genética de Stryphnodendron adstringens (Mart.) Coville por marcador molecular AFLP e transferência de microssatélites

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Pós-graduação em Agronomia (Horticultura) - FCA

Modelos lineares generalizados mistos e equações de estimação generalizadas para dados binário aplicados em anestesiologia veterinária

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Modelos lineares generalizados bayesianos para dados longitudinais

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Genetic diversity and population structure of Musa accessions in ex situ conservation

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Abstract Background Banana cultivars are mostly derived from hybridization between wild diploid subspecies of Musa acuminata (A genome) and M. balbisiana (B genome), and they exhibit various levels of ploidy and genomic constitution. The Embrapa ex situ Musa collection contains over 220 accessions, of which only a few have been genetically characterized. Knowledge regarding the genetic relationships and diversity between modern cultivars and wild relatives would assist in conservation and breeding strategies. Our objectives were to determine the genomic constitution based on Internal Transcribed Spacer (ITS) regions polymorphism and the ploidy of all accessions by flow cytometry and to investigate the population structure of the collection using Simple Sequence Repeat (SSR) loci as co-dominant markers based on Structure software, not previously performed in Musa. Results From the 221 accessions analyzed by flow cytometry, the correct ploidy was confirmed or established for 212 (95.9%), whereas digestion of the ITS region confirmed the genomic constitution of 209 (94.6%). Neighbor-joining clustering analysis derived from SSR binary data allowed the detection of two major groups, essentially distinguished by the presence or absence of the B genome, while subgroups were formed according to the genomic composition and commercial classification. The co-dominant nature of SSR was explored to analyze the structure of the population based on a Bayesian approach, detecting 21 subpopulations. Most of the subpopulations were in agreement with the clustering analysis. Conclusions The data generated by flow cytometry, ITS and SSR supported the hypothesis about the occurrence of homeologue recombination between A and B genomes, leading to discrepancies in the number of sets or portions from each parental genome. These phenomenons have been largely disregarded in the evolution of banana, as the “single-step domestication” hypothesis had long predominated. These findings will have an impact in future breeding approaches. Structure analysis enabled the efficient detection of ancestry of recently developed tetraploid hybrids by breeding programs, and for some triploids. However, for the main commercial subgroups, Structure appeared to be less efficient to detect the ancestry in diploid groups, possibly due to sampling restrictions. The possibility of inferring the membership among accessions to correct the effects of genetic structure opens possibilities for its use in marker-assisted selection by association mapping.

Indicatori di correlazione e di disordine basati sul concetto di entropia

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The aim of this work is to carry out an applicative, comparative and exhaustive study between several entropy based indicators of independence and correlation. We considered some indicators characterized by a wide and consolidate literature, like mutual information, joint entropy, relative entropy or Kullback Leibler distance, and others, more recently introduced, like Granger, Maasoumi and racine entropy, also called Sρ, or utilized in more restricted domains, like Pincus approximate entropy or ApEn. We studied the behaviour of such indicators applying them to binary series. The series was designed to simulate a wide range of situations in order to characterize indicators limit and capability and to identify, case by case, the more useful and trustworthy ones. Our target was not only to study if such indicators were able to discriminate between dependence and independence because, especially for mutual information and Granger, Maasoumi and Racine, that was already demonstrated and reported in literature, but also to verify if and how they were able to provide information about structure, complexity and disorder of the series they were applied to. Special attention was paid on Pincus approximate entropy, that is said by the author to be able to provide information regarding the level of randomness, regularity and complexity of a series. By means of a focused and extensive research, we furthermore tried to clear the meaning of ApEn applied to a couple of different series. In such situation the indicator is named in literature as cross-ApEn. The cross-ApEn meaning and the interpretation of its results is often not simple nor univocal and the matter is scarcely delved into by literature, thereby users can easily leaded up to a misleading conclusion, especially if the indicator is employed, as often unfortunately it happens, in uncritical manner. In order to plug some cross-ApEn gaps and limits clearly brought out during the experimentation, we developed and applied to the already considered cases a further indicator we called “correspondence index”. The correspondence index is perfectly integrated into the cross-ApEn computational algorithm and it is able to provide, at least for binary data, accurate information about the intensity and the direction of an eventual correlation, even not linear, existing between two different series allowing, in the meanwhile, to detect an eventual condition of independence between the series themselves.

Computational Techniques for Spatial Logistic Regression with Large Datasets

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In epidemiological work, outcomes are frequently non-normal, sample sizes may be large, and effects are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. We focus on binary outcomes, with the risk surface a smooth function of space. We compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation. A Bayesian model using a spectral basis representation of the spatial surface provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial features while limiting overfitting and being more efficient computationally than other Bayesian approaches. One of the contributions of this work is further development of this underused representation. The spectral basis model outperforms the penalized likelihood methods, which are prone to overfitting, but is slower to fit and not as easily implemented. Conclusions based on a real dataset of cancer cases in Taiwan are similar albeit less conclusive with respect to comparing the approaches. The success of the spectral basis with binary data and similar results with count data suggest that it may be generally useful in spatial models and more complicated hierarchical models.

Semiparametric Estimation in General Repeated Measures Problems

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper considers a wide class of semiparametric problems with a parametric part for some covariate effects and repeated evaluations of a nonparametric function. Special cases in our approach include marginal models for longitudinal/clustered data, conditional logistic regression for matched case-control studies, multivariate measurement error models, generalized linear mixed models with a semiparametric component, and many others. We propose profile-kernel and backfitting estimation methods for these problems, derive their asymptotic distributions, and show that in likelihood problems the methods are semiparametric efficient. While generally not true, with our methods profiling and backfitting are asymptotically equivalent. We also consider pseudolikelihood methods where some nuisance parameters are estimated from a different algorithm. The proposed methods are evaluated using simulation studies and applied to the Kenya hemoglobin data.

USING PROBABILISTIC GRAPHICAL MODELS TO DRAW INFERENCES IN SENSOR NETWORKS WITH TRACKING APPLICATIONS

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Sensor networks have been an active research area in the past decade due to the variety of their applications. Many research studies have been conducted to solve the problems underlying the middleware services of sensor networks, such as self-deployment, self-localization, and synchronization. With the provided middleware services, sensor networks have grown into a mature technology to be used as a detection and surveillance paradigm for many real-world applications. The individual sensors are small in size. Thus, they can be deployed in areas with limited space to make unobstructed measurements in locations where the traditional centralized systems would have trouble to reach. However, there are a few physical limitations to sensor networks, which can prevent sensors from performing at their maximum potential. Individual sensors have limited power supply, the wireless band can get very cluttered when multiple sensors try to transmit at the same time. Furthermore, the individual sensors have limited communication range, so the network may not have a 1-hop communication topology and routing can be a problem in many cases. Carefully designed algorithms can alleviate the physical limitations of sensor networks, and allow them to be utilized to their full potential. Graphical models are an intuitive choice for designing sensor network algorithms. This thesis focuses on a classic application in sensor networks, detecting and tracking of targets. It develops feasible inference techniques for sensor networks using statistical graphical model inference, binary sensor detection, events isolation and dynamic clustering. The main strategy is to use only binary data for rough global inferences, and then dynamically form small scale clusters around the target for detailed computations. This framework is then extended to network topology manipulation, so that the framework developed can be applied to tracking in different network topology settings. Finally the system was tested in both simulation and real-world environments. The simulations were performed on various network topologies, from regularly distributed networks to randomly distributed networks. The results show that the algorithm performs well in randomly distributed networks, and hence requires minimum deployment effort. The experiments were carried out in both corridor and open space settings. A in-home falling detection system was simulated with real-world settings, it was setup with 30 bumblebee radars and 30 ultrasonic sensors driven by TI EZ430-RF2500 boards scanning a typical 800 sqft apartment. Bumblebee radars are calibrated to detect the falling of human body, and the two-tier tracking algorithm is used on the ultrasonic sensors to track the location of the elderly people.

Distancias genéticas entre perfiles moleculares obtenidos desde marcadores multilocus multialélicos

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Para expresar la magnitud de la identidad genética (similaridad) o su complemento (distancia) entre dos individuos caracterizados molecularmente a través de marcadores del tipo microsatélites (SSR), que son multilocusmultialélicos, es necesario elegir una métrica acorde con la naturaleza multivariada de los datos. Comúnmente, las métricas de distancias genéticas son diseñadas para expresar, en un único número, la diferencia genética entre dos poblaciones y son expresadas como función de la frecuencia alélica poblacional. Dichas métricas pueden también ser utilizadas para calcular la distancia entre perfiles individuales, pero las frecuencias alélicas no son continuas en este caso. Alternativamente, se pueden usar distancias geométricas obtenidas como el complemento del índice de similaridad para datos binarios que indican la presencia/ ausencia de cada alelo en un individuo. El objetivo de este trabajo fue evaluar simultáneamente el desempeño de ambos tipos de métricas para ordenar y clasificar individuos en una base de datos generadas a partir de loci de marcadores microsatélites SSR. Se calcularon 11 métricas de distancias a partir de 17 loci SSR obtenidos desde 17 introducciones de un banco de germoplasma de soja [Glycine max (L.) Merr.]. Se evaluó el consenso de los resultados obtenidos para la clasificación de los 17 perfiles moleculares desde varias métricas. Los resultados sugieren que los diferentes tipos de métricas producen información similar para comparar individuos. No obstante, se realizó una clasificación de las métricas que responden a diferencias entre los núcleos de las expresiones de cálculo.

«
1
2
3
4
5
6
7
8
...
65
66
»