922 resultados para Supervised and Unsupervised Classification
Resumo:
Supplements issued bimonthly, April 1961-June 1963.
Resumo:
Projected air and ground temperatures are expected to be higher in Arctic and sub-Arcticlatitudes and with temperatures already close to the limit where permafrost can exist,resistance against degradation is low. With thawing permafrost, the landscape is modifiedwith depression in which thermokarst lakes emerge. In permafrost soils a considerableamount of soil organic carbon is stored, with the potential of altering climate even furtherif expansion and formation of new thermokarst lakes emerge, as decay releasesgreenhouse gases (C02 and CH4) to the atmosphere. Analyzing the spatial distribution andmorphometry over time of thermokarst lakes and other water bodies, is of importance inaccurately predict carbon budget and feedback mechanisms, as well as to assess futurelandscape layout and these features interaction. Different types of high-spatial resolutionaerial and satellite imageries from 1963, 1975, 2003, 2010 and 2015, were used in bothpre- and post-classification change detection analyses. Using object oriented segmentationin eCognition combined with manual adjustments, resulted in digitalized water bodies>28m2 from which direction of change and morphometric values were extracted. Thequantity of thermokarst lakes and other water bodies was in 1963 n=92, with succeedingyears as a trend decreased in numbers, until 2010-2015 when eleven water bodies wereadded in 2015 (n=74 to n=85). In 1963-2003, area of these water bodies decreased with50 651m2 (189 446-138 795m2) and continued to decrease in 2003-2015 ending at 129337m2. Limnicity decreased from 19.9% in 1963 to 14.6% in 2003 (-5.3%). In 2010 and2015 13.7-13.6%. The late increase in water bodies differs from an earlier hypothesis thatsporadic permafrost regions experience decrease in both area and quantity of thermokarstlakes and water bodies. During 1963-2015, land gain has been in dominance of the ratiobetween the two competing processes of expansion and drainage. In 1963-1975, 55/45%,followed by 90/10% in 1975-2003. After major drainage events, land loss increased to62/38% in 2010-2015. Drainage and infilling rates, calculated for 15 shorelines werevaried across both landscape and parts of shorelines, with in average 0.17/0.15/0.14m/yr.Except for 1963-1975 when rate of change in average was in opposite direction (-0.09m/yr.), likely due to evident expansion of a large thermokarst lake. Using a squaregrid, distribution of water bodies was determined, with an indistinct cluster located in NEand central parts. Especially for water bodies <250m2, which is the dominant area classthroughout 1963-2015 ranging from n=39-51. With a heterogeneous composition of bothsmall and large thermokarst lakes, and with both expansion and drainage altering thelandscape in Tavvavuoma, both positive and negative climate feedback mechanisms are inplay - given that sporadic permafrost still exist.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-06
Resumo:
Thesis (Master's)--University of Washington, 2016-06
Resumo:
A 16S rRNA gene database (http://greengenes.bl.gov) addresses limitations of public repositories by providing chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies. It was found that there is incongruent taxonomic nomenclature among curators even at the phylum level. Putative chimeras were identified in 3% of environmental sequences and in 0.2% of records derived from isolates. Environmental sequences were classified into 100 phylum-level lineages in the Archaea and Bacteria.
Resumo:
Document classification is a supervised machine learning process, where predefined category labels are assigned to documents based on the hypothesis derived from training set of labelled documents. Documents cannot be directly interpreted by a computer system unless they have been modelled as a collection of computable features. Rogati and Yang [M. Rogati and Y. Yang, Resource selection for domain-specific cross-lingual IR, in SIGIR 2004: Proceedings of the 27th annual international conference on Research and Development in Information Retrieval, ACM Press, Sheffied: United Kingdom, pp. 154-161.] pointed out that the effectiveness of document classification system may vary in different domains. This implies that the quality of document model contributes to the effectiveness of document classification. Conventionally, model evaluation is accomplished by comparing the effectiveness scores of classifiers on model candidates. However, this kind of evaluation methods may encounter either under-fitting or over-fitting problems, because the effectiveness scores are restricted by the learning capacities of classifiers. We propose a model fitness evaluation method to determine whether a model is sufficient to distinguish positive and negative instances while still competent to provide satisfactory effectiveness with a small feature subset. Our experiments demonstrated how the fitness of models are assessed. The results of our work contribute to the researches of feature selection, dimensionality reduction and document classification.
Resumo:
Fast Classification (FC) networks were inspired by a biologically plausible mechanism for short term memory where learning occurs instantaneously. Both weights and the topology for an FC network are mapped directly from the training samples by using a prescriptive training scheme. Only two presentations of the training data are required to train an FC network. Compared with iterative learning algorithms such as Back-propagation (which may require many hundreds of presentations of the training data), the training of FC networks is extremely fast and learning convergence is always guaranteed. Thus FC networks may be suitable for applications where real-time classification is needed. In this paper, the FC networks are applied for the real-time extraction of gene expressions for Chlamydia microarray data. Both the classification performance and learning time of the FC networks are compared with the Multi-Layer Proceptron (MLP) networks and support-vector-machines (SVM) in the same classification task. The FC networks are shown to have extremely fast learning time and comparable classification accuracy.
Resumo:
Conventionally, document classification researches focus on improving the learning capabilities of classifiers. Nevertheless, according to our observation, the effectiveness of classification is limited by the suitability of document representation. Intuitively, the more features that are used in representation, the more comprehensive that documents are represented. However, if a representation contains too many irrelevant features, the classifier would suffer from not only the curse of high dimensionality, but also overfitting. To address this problem of suitableness of document representations, we present a classifier-independent approach to measure the effectiveness of document representations. Our approach utilises a labelled document corpus to estimate the distribution of documents in the feature space. By looking through documents in this way, we can clearly identify the contributions made by different features toward the document classification. Some experiments have been performed to show how the effectiveness is evaluated. Our approach can be used as a tool to assist feature selection, dimensionality reduction and document classification.
Resumo:
The Java programming language supports concurrency. Concurrent programs are hard to test due to their inherent non-determinism. This paper presents a classification of concurrency failures that is based on a model of Java concurrency. The model and failure classification is used to justify coverage of synchronization primitives of concurrent components. This is achieved by constructing concurrency flow graphs for each method call. A producer-consumer monitor is used to demonstrate how the approach can be used to measure coverage of concurrency primitives and thereby assist in determining test sequences for deterministic execution.
Resumo:
* Supported by INTAS 00-626 and TIC 2003-09319-c03-03.
Resumo:
The IUPHAR database (IUPHAR-DB) integrates peer-reviewed pharmacological, chemical, genetic, functional and anatomical information on the 354 nonsensory G protein-coupled receptors (GPCRs), 71 ligand-gated ion channel subunits and 141 voltage-gated-like ion channel subunits encoded by the human, rat and mouse genomes. These genes represent the targets of approximately one-third of currently approved drugs and are a major focus of drug discovery and development programs in the pharmaceutical industry. IUPHAR-DB provides a comprehensive description of the genes and their functions, with information on protein structure and interactions, ligands, expression patterns, signaling mechanisms, functional assays and biologically important receptor variants (e.g. single nucleotide polymorphisms and splice variants). In addition, the phenotypes resulting from altered gene expression (e.g. in genetically altered animals or in human genetic disorders) are described. The content of the database is peer reviewed by members of the International Union of Basic and Clinical Pharmacology Committee on Receptor Nomenclature and Drug Classification (NC-IUPHAR); the data are provided through manual curation of the primary literature by a network of over 60 subcommittees of NC-IUPHAR. Links to other bioinformatics resources, such as NCBI, Uniprot, HGNC and the rat and mouse genome databases are provided. IUPHAR-DB is freely available at http://www.iuphar-db.org. © 2008 The Author(s).
Resumo:
This study surveys the occurrence of nodulation in woody legume species in Panamá and Costa Rica, describes nodule and root characteristics, and researches host-bacteria specificity, nodulation potential of soils, and the effects of light, added nitrogen, and rhizobia and VA mycorrhizal fungi inoculation on seedling growth. I examined 83 species in 37 genera and found 80% to be nodulated. Percent nodulated species in the Caesalpinioideae, Mimosoideae, and Papilionoideae was 17, 95, and 86, respectively, with no correlation between nodule morphology and tribal classification. Nodules formed mainly at root branch points which supports epidermal breaks as an important rhizobia infection route. More non-nodulated than nodulated species had root hairs. Several species emitted volatile sulfur-containing compounds, including the toxic compound ethylmercaptan, from roots, germinating seeds, and other tissues. These emissions may have an allelopathic action against pathogens, predators, or other plants. In contrast to the general non-specificity of most legumes for rhizobia, Mimosa pigra L. was highly specific and only nodulated in flooded soils. This species' specificity, combined with a limited occurrence of its root nodule bacteria may limit its natural distribution, but its spread as an invasive weed is facilitated when fill material from rivers is deposited in other areas. ^ An experimental light level of 1.5% of full sun completely inhibited seedling nodulation, as do similar naturally low levels in forest understory. In the forest, trees and seedlings were not nodulated. in some soils with suspected high N content. For six experimental species, added N progressively increased seedling growth while decreasing nodule biomass; at the highest level of added N nodulation was completely suppressed. Species and individuals showed variation in nodule biomass at high N applications which may indicate an opportunity for genetic selection for optimal N acquisition. Rhizobia inoculation had a small positive effect on seedling shoot growth, but VA mycorrhiza inoculation overwhelmingly increased seedling size, biomass, and leaf mineral concentration. In lowland tropical forest, VA mycorrhizal colonization appears indispensable for legume nodulation because of the fungus' ability to supply P in deficient soils. This requirement makes the legume-rhizobia-mycorrhiza association obligately tripartite. ^
Resumo:
Minimal educational requirements for Registered Dietitians (RDs) include a bachelor's degree and practice program. Recently, a master's degree was recommended. Studies have not established whether education affects employment. A secondary analysis of 2005 Dietetics Practice Audit data determined whether job responsibility, individuals supervised, and activities differed between 1,626 bachelor's RDs (B-RDs) and 767 master's (M-RDs) RDs, registered .5 years. Chi-square and ANOVA analyzed differences between B-RDs and M-RDs, at entry-level (0-3 years experience) and beyond-entry-level (3+-5 years experience). Beyond-entry-level B-RDs (31.8%) and entry-level M-RDs (31.9%) reported “supervisor/executive” responsibility more than entry-level B-RDs (26.5%; p=0.01). A higher percentage of M-RDs supervised (29.2%) than B-RDs (24.7%; p=0.02); however, B-RDs supervised more individuals (7.38 ± 4.89) than M-RDs (6.25 ± 4.87; t=2.32; p=0.021). A master's degree has limited benefits; experience may affect responsibility, individuals supervised, and activities more than education.
Resumo:
Flow Cytometry analyzers have become trusted companions due to their ability to perform fast and accurate analyses of human blood. The aim of these analyses is to determine the possible existence of abnormalities in the blood that have been correlated with serious disease states, such as infectious mononucleosis, leukemia, and various cancers. Though these analyzers provide important feedback, it is always desired to improve the accuracy of the results. This is evidenced by the occurrences of misclassifications reported by some users of these devices. It is advantageous to provide a pattern interpretation framework that is able to provide better classification ability than is currently available. Toward this end, the purpose of this dissertation was to establish a feature extraction and pattern classification framework capable of providing improved accuracy for detecting specific hematological abnormalities in flow cytometric blood data. ^ This involved extracting a unique and powerful set of shift-invariant statistical features from the multi-dimensional flow cytometry data and then using these features as inputs to a pattern classification engine composed of an artificial neural network (ANN). The contribution of this method consisted of developing a descriptor matrix that can be used to reliably assess if a donor’s blood pattern exhibits a clinically abnormal level of variant lymphocytes, which are blood cells that are potentially indicative of disorders such as leukemia and infectious mononucleosis. ^ This study showed that the set of shift-and-rotation-invariant statistical features extracted from the eigensystem of the flow cytometric data pattern performs better than other commonly-used features in this type of disease detection, exhibiting an accuracy of 80.7%, a sensitivity of 72.3%, and a specificity of 89.2%. This performance represents a major improvement for this type of hematological classifier, which has historically been plagued by poor performance, with accuracies as low as 60% in some cases. This research ultimately shows that an improved feature space was developed that can deliver improved performance for the detection of variant lymphocytes in human blood, thus providing significant utility in the realm of suspect flagging algorithms for the detection of blood-related diseases.^
Resumo:
Global air surface temperatures and precipitation have increased over the last several decades resulting in a trend of greening across the Circumpolar Arctic. The spatial variability of warming and the inherent effects on plant communities has not proven to be uniform or homogeneous on global or local scales. We can apply remote sensing vegetation indices such as the Normalized Difference Vegetation Index (NDVI) to map and monitor vegetation change (e.g., phenology, greening, percent cover, and biomass) over time. It is important to document how Arctic vegetation is changing, as it will have large implications related to global carbon and surface energy budgets. The research reported here examined vegetation greening across different spatial and temporal scales at two disparate Arctic sites: Apex River Watershed (ARW), Baffin Island, and Cape Bounty Arctic Watershed Observatory (CBAWO), Melville Island, NU. To characterize the vegetation in the ARW, high spatial resolution WorldView-2 data were processed to create a supervised land-cover classification and model percent vegetation cover (PVC) (a similar process had been completed in a previous study for the CBAWO). Meanwhile, NDVI data spanning the past 30 years were derived from intermediate resolution Landsat data at the two Arctic sites. The land-cover classifications at both sites were used to examine the Landsat NDVI time series by vegetation class. Climate variables (i.e., temperature, precipitation and growing season length (GSL) were examined to explore the potential relationships of NDVI to climate warming. PVC was successfully modeled using high resolution data in the ARW. PVC and plant communities appear to reside along a moisture and altitudinal gradient. The NDVI time series demonstrated an overall significant increase in greening at the CBAWO (High Arctic site), specifically in the dry and mesic vegetation type. However, similar overall greening was not observed for the ARW (Low Arctic site). The overall increase in NDVI at the CBAWO was attributed to a significant increase in July temperatures, precipitation and GSL.