3 results for model-based clustering
in DigitalCommons@University of Nebraska - Lincoln
Abstract:
We consider a fully model-based approach for the analysis of distance sampling data. Distance sampling has been widely used to estimate abundance (or density) of animals or plants in a spatially explicit study area. There is, however, no readily available method of making statistical inference on the relationships between abundance and environmental covariates. Spatial Poisson process likelihoods can be used to simultaneously estimate detection and intensity parameters by modeling distance sampling data as a thinned spatial point process. A model-based spatial approach to distance sampling data has three main benefits: it allows complex and opportunistic transect designs to be employed, it allows estimation of abundance in small subregions, and it provides a framework to assess the effects of habitat or experimental manipulation on density. We demonstrate the model-based methodology with a small simulation study and an analysis of the Dubbo weed data set. We also propose a simple ad hoc method for handling overdispersion. The simulation study showed that the model-based approach compared favorably with conventional distance sampling methods for abundance estimation. In addition, the overdispersion correction performed adequately when the number of transects was high. Analysis of the Dubbo data set indicated a transect effect on abundance via Akaike’s information criterion model selection. Further goodness-of-fit analysis, however, indicated some potential confounding of intensity with the detection function.
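The idea of treating distance sampling data as a thinned spatial point process can be illustrated with a minimal sketch. It is not the authors' model: it assumes a homogeneous intensity `lam` (the abstract's approach lets intensity depend on covariates), a half-normal detection function with scale `sigma`, and a single straight transect of length `length` surveyed out to half-width `w` on both sides. All function and parameter names are hypothetical.

```python
import math


def effective_half_width(w, sigma):
    """Integral of the half-normal detection function exp(-x^2 / (2 sigma^2)) over [0, w]."""
    return sigma * math.sqrt(math.pi / 2.0) * math.erf(w / (sigma * math.sqrt(2.0)))


def log_likelihood(distances, lam, sigma, w, length):
    """Log-likelihood of a Poisson process with intensity lam, thinned by detection.

    The thinned intensity at perpendicular distance x is lam * g(x), so the
    log-likelihood is sum_i log(lam * g(x_i)) minus the expected number of
    detections in the surveyed strip (both sides of the transect).
    """
    n = len(distances)
    log_detection = sum(-x * x / (2.0 * sigma * sigma) for x in distances)
    expected_count = lam * 2.0 * length * effective_half_width(w, sigma)
    return n * math.log(lam) + log_detection - expected_count


def mle_intensity(distances, sigma, w, length):
    """Closed-form maximizer over lam for a fixed sigma (assumed known here)."""
    return len(distances) / (2.0 * length * effective_half_width(w, sigma))
```

In practice `sigma` and the intensity parameters would be estimated jointly by numerical optimization of this likelihood; the closed form above holds only when `sigma` is fixed.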
Abstract:
Raccoons are the reservoir for the raccoon rabies virus variant in the United States. To combat this threat, oral rabies vaccination (ORV) programs are conducted in many eastern states. To aid in these efforts, the genetic structure of raccoons (Procyon lotor) was assessed in southwestern Pennsylvania to determine if select geographic features (i.e., ridges and valleys) serve as corridors or hindrances to raccoon gene flow (e.g., movement) and, therefore, rabies virus trafficking in this physiographic region. Raccoon DNA samples (n = 185) were collected from one ridge site and two adjacent valleys in southwestern Pennsylvania (Westmoreland, Cambria, Fayette, and Somerset counties). Raccoon genetic structure within and among these study sites was characterized at nine microsatellite loci. Results indicated that there was little population subdivision among any sites sampled. Furthermore, analyses using a model-based clustering approach indicated one essentially panmictic population was present among all the raccoons sampled over a reasonably broad geographic area (e.g., sites up to 36 km apart). However, a signature of isolation by distance was detected, suggesting that widths of ORV zones are critical for success. Combined, these data indicate that geographic features within this landscape influence raccoon gene flow only to a limited extent, suggesting that ridges of this physiographic system will not provide substantial long-term natural barriers to rabies virus trafficking. These results may be of value for future ORV efforts in Pennsylvania and other eastern states with similar landscapes.
Abstract:
We explore the problem of budgeted machine learning, in which the learning algorithm has free access to the training examples’ labels but has to pay for each attribute that is specified. This learning model is appropriate in many areas, including medical applications. We present new algorithms, based on algorithms for the multi-armed bandit problem, for choosing which attributes of which examples to purchase in the budgeted learning model. All of our approaches outperformed the current state of the art. Furthermore, we present a new means for selecting an example to purchase after the attribute is selected, rather than selecting an example uniformly at random, as is typically done. Our new example selection method improved the performance of all the algorithms we tested, both ours and those in the literature.
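To make the budgeted setting concrete, here is a minimal sketch that treats each attribute as a bandit arm and spends a budget on (attribute, example) purchases. It is not one of the paper's algorithms: it swaps in the standard UCB1 rule for attribute choice, picks the example uniformly at random (the baseline the abstract says is typical), and takes the reward from a caller-supplied `reward_fn`, which is a hypothetical stand-in since the abstract does not define a reward. All names are assumptions.

```python
import math
import random


def ucb1_pick(candidates, counts, totals):
    """Pick an arm by UCB1 among candidates; untried arms are played first."""
    for arm in candidates:
        if counts[arm] == 0:
            return arm
    t = sum(counts)  # total purchases made so far
    return max(
        candidates,
        key=lambda a: totals[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
    )


def spend_budget(n_examples, costs, budget, reward_fn, rng):
    """Buy attribute values one at a time until nothing affordable is left.

    costs[a] is the price of attribute a; reward_fn(a, example) is a
    hypothetical measure of how useful the purchased value turned out to be.
    Returns the list of (attribute, example) purchases and the unspent budget.
    """
    n_attrs = len(costs)
    unpurchased = [set(range(n_examples)) for _ in range(n_attrs)]
    counts = [0] * n_attrs
    totals = [0.0] * n_attrs
    purchases = []
    while True:
        affordable = [a for a in range(n_attrs) if costs[a] <= budget and unpurchased[a]]
        if not affordable:
            break
        attr = ucb1_pick(affordable, counts, totals)
        # Baseline example choice: uniform over examples still missing this attribute.
        example = rng.choice(sorted(unpurchased[attr]))
        unpurchased[attr].remove(example)
        budget -= costs[attr]
        counts[attr] += 1
        totals[attr] += reward_fn(attr, example)
        purchases.append((attr, example))
    return purchases, budget
```

A call such as `spend_budget(4, [1, 1, 1], 5, my_reward, random.Random(0))` would make five purchases; replacing the uniform `rng.choice` step is where a smarter example selection rule would plug in.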