942 results for categorical and mix datasets


Relevance: 100.00%

Abstract:

It is generally accepted that two major gene pools exist in cultivated common bean (Phaseolus vulgaris L.), a Middle American and an Andean one. Some evidence, based on unique phaseolin morphotypes and AFLP analysis, suggests that at least one more gene pool exists in cultivated common bean. To investigate this hypothesis, 1072 accessions from a common bean core collection from the primary centres of origin, held at CIAT, were investigated. Various agronomic and morphological attributes (14 categorical and 11 quantitative) were measured. Multivariate analyses, consisting of homogeneity analysis and clustering for categorical data, clustering and ordination techniques for quantitative data and nonlinear principal component analysis for mixed data, were undertaken. The results of most analyses supported the existence of the two major gene pools. However, the analysis of categorical data of protein types showed an additional minor gene pool. The minor gene pool is designated North Andean and includes phaseolin types CH, S and T; lectin types 312, Pr, B and K; and mostly A5, A6 and A4 types alpha-amylase inhibitor. Analysis of the combined categorical data of protein types and some plant categorical data also suggested that some other germplasm with C type phaseolin are distinguished from the major gene pools.

Relevance: 100.00%

Abstract:

The ability to generate enormous random libraries of DNA probes via split-and-mix synthesis on solid supports is an important biotechnological application of colloids that has not been fully utilized to date. To discriminate between colloid-based DNA probes, each colloidal particle must be 'encoded' so it is distinguishable from all other particles. To this end, we have used novel particle synthesis strategies to produce large numbers of optically encoded particles suitable for DNA library synthesis. Multifluorescent particles with unique and reproducible optical signatures (i.e., fluorescence and light-scattering attributes) suitable for high-throughput flow cytometry have been produced. In the spectroscopic study presented here, we investigated the optical characteristics of multifluorescent particles that were synthesized by coating silica 'core' particles with up to six different fluorescent dye shells alternated with non-fluorescent silica 'spacer' shells. It was observed that the diameter of the particles increased by up to 20% as a result of the addition of twelve concentric shells and that there was a significant reduction in fluorescence emission intensities from inner shells as an increasing number of shells were deposited.

Relevance: 100.00%

Abstract:

For modern consumer cameras, approximate calibration data is often available, making applications such as 3D reconstruction or photo registration easier than in the purely uncalibrated setting. In this paper we address the setting with calibrated-uncalibrated image pairs: for one image the intrinsic parameters are assumed to be known, whereas the second view has unknown distortion and calibration parameters. This situation arises, e.g., when one would like to register archive imagery to recently taken photos. A commonly adopted strategy for determining epipolar geometry is based on feature matching and minimal solvers inside a RANSAC framework. However, only very few existing solutions apply to the calibrated-uncalibrated setting. We propose a simple and numerically stable two-step scheme to first estimate radial distortion parameters and subsequently the focal length using novel solvers. We demonstrate the performance on synthetic and real datasets.

Relevance: 100.00%

Abstract:

Master's dissertation submitted to the Instituto de Contabilidade e Administração do Porto for the degree of Master in Accounting and Finance, under the supervision of Professor Ana Maria Alves Bandeira.

Relevance: 100.00%

Abstract:

The parallel hyperspectral unmixing problem is considered in this paper. A semisupervised approach is developed under the linear mixture model, where the abundances' physical constraints are taken into account. The proposed approach relies on the increasing availability of spectral libraries of materials measured on the ground instead of resorting to endmember extraction methods. Since libraries are potentially very large and hyperspectral datasets are of high dimensionality, a parallel implementation in a pixel-by-pixel fashion is derived to properly exploit the graphics processing unit (GPU) architecture at a low level, thus taking full advantage of the computational power of GPUs. Experimental results obtained for real hyperspectral datasets reveal significant speedup factors, up to 164 times, with respect to an optimized serial implementation.
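
To make the linear mixture model concrete, the following is a minimal serial sketch in Python, not the GPU implementation described in the abstract. It assumes that the physical constraints are the usual non-negativity and sum-to-one conditions on the abundances, and it swaps in a plain projected gradient solver; the function names, the step-size rule, and the toy library are all hypothetical.

```python
def project_to_simplex(values):
    """Euclidean projection onto {a : a_k >= 0, sum(a) = 1}."""
    ordered = sorted(values, reverse=True)
    running_sum = 0.0
    shift = 0.0
    for count, value in enumerate(ordered, start=1):
        running_sum += value
        candidate = (running_sum - 1.0) / count
        if value - candidate > 0:
            shift = candidate
    return [max(v - shift, 0.0) for v in values]


def unmix_pixel(pixel, library, iterations=500):
    """Estimate abundances for one pixel by projected gradient descent.

    library[k][b] is the value of library spectrum k in band b, so the
    model is pixel[b] ~ sum_k abundances[k] * library[k][b].
    """
    n_spectra = len(library)
    n_bands = len(pixel)
    # 1 / (squared Frobenius norm) is a conservative, always-safe step size.
    step = 1.0 / sum(x * x for spectrum in library for x in spectrum)
    abundances = [1.0 / n_spectra] * n_spectra
    for _ in range(iterations):
        modelled = [sum(abundances[k] * library[k][b] for k in range(n_spectra))
                    for b in range(n_bands)]
        residual = [modelled[b] - pixel[b] for b in range(n_bands)]
        gradient = [sum(library[k][b] * residual[b] for b in range(n_bands))
                    for k in range(n_spectra)]
        abundances = project_to_simplex(
            [abundances[k] - step * gradient[k] for k in range(n_spectra)])
    return abundances


def unmix_image(pixels, library):
    # Every pixel is solved independently of the others, which is the
    # property a pixel-by-pixel parallel implementation relies on.
    return [unmix_pixel(pixel, library) for pixel in pixels]


# Hypothetical two-spectrum library with three bands (illustration only).
library = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
pixels = [[0.3, 0.7, 0.5], [1.0, 0.0, 0.5]]
abundance_maps = unmix_image(pixels, library)
```

Because `unmix_image` is just a map over pixels, the loop body is the natural unit of work to hand to one thread in a parallel version.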

Relevance: 100.00%

Abstract:

The increasing number of television channels, on-demand services and online content is expected to contribute to a better quality of experience for a customer of such a service. However, the lack of efficient methods for finding the right content, adapted to personal interests, may lead to a progressive loss of clients. In such a scenario, recommendation systems are seen as a tool that can fill this gap and contribute to the loyalty of users. Multimedia content, namely films and television programmes, is usually described using a set of metadata elements that include the title, a genre, the date of production, and the list of directors and actors. This paper provides an in-depth study of how the use of different metadata elements can contribute to increasing the quality of the recommendations suggested. The analysis is conducted using Netflix and Movielens datasets, and aspects such as the granularity of the descriptions, the accuracy metric used and the sparsity of the data are taken into account. Comparisons with collaborative approaches are also presented.

Relevance: 100.00%

Abstract:

Geological research on the Mediterranean region is presently characterized by the transition from disciplinary to multidisciplinary research, as well as from national to international investigations. In order to synthesize and integrate the vast disciplinary and national datasets which are available, it is necessary to implement maximum interaction among geoscientists of different backgrounds. The creation of project-oriented task forces in universities and other research institutions, as well as the development of large international cooperation programs, is instrumental in pursuing such a multidisciplinary and supranational approach. The TRANSMED Atlas, an official publication of the 32nd International Geological Congress (Florence 2004), is the result of an international scientific cooperation program which brought together for over two years sixty-three structural geologists, geophysicists, marine geologists, petrologists, sedimentologists, stratigraphers, paleogeographers, and petroleum geologists coming from eighteen countries, and working for the petroleum industry, academia, and other institutions, both public and private. The TRANSMED Atlas provides an updated, synthetic, and coherent portrayal of the overall geological-geophysical structure of the Mediterranean domain and the surrounding areas. The initial stimulus for the Atlas came from the realization of the extremely heterogeneous nature of the existing geological-geophysical data about this domain. These data have been gathered by universities, oil companies, geological surveys and other institutions in several countries, often using different procedures and standards. In addition, much of these data are written in languages and published in outlets that are not readily accessible to the general international reader.
By synthesizing and integrating a wealth of preexisting and new data derived from surficial geology, seismic sections at various scales, and mantle tomographies, the TRANSMED Atlas provides for the first time a coherent geological overview of the Mediterranean region and represents an ideal springboard for future studies.

Relevance: 100.00%

Abstract:

In CoDaWork’05, we presented an application of discriminant function analysis (DFA) to 4 different compositional datasets and modelled the first canonical variable using a segmented regression model solely based on an observation about the scatter plots. In this paper, multiple linear regressions are applied to different datasets to confirm the validity of our proposed model. In addition to dating the unknown tephras by calibration as discussed previously, another method of mapping the unknown tephras into samples of the reference set or missing samples in between consecutive reference samples is proposed. The application of these methodologies is demonstrated with both simulated and real datasets. This new proposed methodology provides an alternative, more acceptable approach for geologists as their focus is on mapping the unknown tephra with relevant eruptive events rather than estimating the age of unknown tephra.

Key words: Tephrochronology; Segmented regression
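
As an illustration of what a segmented regression model looks like in practice, here is a minimal Python sketch. The abstract does not say how many segments were used, how the breakpoint was chosen, or whether continuity was enforced, so this sketch assumes two independent straight-line segments and picks the split that minimizes the total squared error; the function names and the toy data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept, sse)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # assumes at least two distinct x values
    intercept = mean_y - slope * mean_x
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse


def fit_two_segments(xs, ys, min_points=2):
    """Try every split of the sorted data and keep the lowest total SSE."""
    pairs = sorted(zip(xs, ys))
    xs = [p[0] for p in pairs]
    ys = [p[1] for p in pairs]
    best = None
    for split in range(min_points, len(xs) - min_points + 1):
        left = fit_line(xs[:split], ys[:split])
        right = fit_line(xs[split:], ys[split:])
        total = left[2] + right[2]
        if best is None or total < best["sse"]:
            best = {"sse": total, "split": split,
                    "left": left[:2], "right": right[:2]}
    return best


# Toy data (not from the paper): slope +1 up to x = 4, slope -2 afterwards.
xs = list(range(10))
ys = [float(x) if x < 5 else 20.0 - 2.0 * x for x in xs]
model = fit_two_segments(xs, ys)
```

The returned `split` is the index of the first point in the second segment; a continuity constraint at the breakpoint, if required, would need a joint fit instead of two independent ones.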

Relevance: 100.00%

Abstract:

PURPOSE: Self-administered questionnaires continue to be the most widely used type of physical activity assessment in epidemiological studies. However, test-retest reliability and validity of physical activity questionnaires have to be determined. In this study, three short physical activity questionnaires already used in Switzerland and the International Physical Activity Questionnaire (IPAQ) were validated. METHODS: Test-retest reliability was assessed by repeated administration of all questionnaires within 3 wk in 178 volunteers (77 women, 46.1 ± 14.8 yr; 101 men, 46.8 ± 13.2 yr). Validity of categorical and continuous data was studied in a subsample of 35 persons in relation to 7-d accelerometer readings, percent body fat, and cardiorespiratory fitness. RESULTS: Reliability was fair to good with a Spearman correlation coefficient range of 0.43-0.68 for measures of continuous data and moderate to fair with Kappa values between 0.32 and 0.46 for dichotomous measures active/inactive. Total physical activity reported in the IPAQ and the Office in Motion Questionnaire (OIMQ) correlated with accelerometry readings (r=0.39 and 0.44, respectively). In contrast, correlations of self-reported physical data with percent body fat and cardiorespiratory fitness were low (r=-0.26-0.29). Participants categorized as active by the Swiss HEPA Survey 1999 instrument (HEPA99) accumulated significantly more days of the recommended physical activities than their inactive counterparts (4.4 and 2.7 d.wk, respectively, P<0.05). However, compared with accelerometer data, vigorous physical activities were overreported in investigated questionnaires. CONCLUSION: Collecting valid data on physical activity remains a challenging issue for questionnaire surveys. The IPAQ and the three other questionnaires are characterized to inform decisions about their appropriate use.
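
The Kappa values reported for the dichotomous active/inactive measures can be illustrated with a short Python sketch of Cohen's kappa, which corrects the observed test-retest agreement for the agreement expected by chance. The function name and the ten example answers are hypothetical and are not data from the study.

```python
def cohens_kappa(first, second):
    """Cohen's kappa for two categorical ratings of the same subjects."""
    n = len(first)
    categories = set(first) | set(second)
    # Proportion of subjects classified identically on both occasions.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected if the two ratings were statistically independent.
    expected = sum((first.count(c) / n) * (second.count(c) / n)
                   for c in categories)
    # Undefined (division by zero) when expected agreement equals 1.
    return (observed - expected) / (1.0 - expected)


# Made-up test and retest answers: 1 = active, 0 = inactive.
test_answers = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
retest_answers = [1, 1, 0, 0, 0, 1, 1, 0, 1, 0]
kappa = cohens_kappa(test_answers, retest_answers)
```

In this toy example the two administrations agree on 8 of 10 participants, chance agreement is 0.5, and kappa is therefore (0.8 - 0.5) / (1 - 0.5), or about 0.6.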

Relevance: 100.00%

Abstract:

Mammals are characterized by specific phenotypic traits that include lactation, hair, and relatively large brains with unique structures. Individual mammalian lineages have, in turn, evolved characteristic traits that distinguish them from others. These include obvious anatomical differences but also differences related to reproduction, life span, cognitive abilities, behavior, and disease susceptibility. However, the molecular basis of the diverse mammalian phenotypes and the selective pressures that shaped their evolution remain largely unknown. In the first part of my thesis, I analyzed the genetic factors associated with the origin of a unique mammalian phenotype, lactation, and I studied the selective pressures that forged the transition from oviparity to viviparity. Using a comparative genomics approach and evolutionary simulations, I showed that the emergence of lactation, as well as the appearance of the casein gene family, significantly reduced selective pressure on the major egg-yolk proteins (the vitellogenin family). This led to a progressive loss of vitellogenins, which, in oviparous species, act as storage proteins for lipids, amino acids, phosphorus and calcium in the isolated egg. The passage to internal fertilization and placentation in therian mammals rendered vitellogenins completely dispensable, which ended in the loss of the whole gene family in this lineage. As illustrated by the vitellogenin study, changes in gene content are one possible underlying factor for the evolution of mammalian-specific phenotypes. However, more subtle genomic changes, such as mutations in protein-coding sequences, can also greatly affect the phenotypes. In particular, it was proposed that changes at the level of gene regulation could underlie many (or even most) phenotypic differences between species.
In the second part of my thesis, I participated in a major comparative study of mammalian tissue transcriptomes, with the goal of understanding how evolutionary forces affected expression patterns in the past 200 million years of mammalian evolution. I showed that, while comparisons of gene expression are in agreement with the known species phylogeny, the rate of expression evolution varies greatly among lineages. Species with low effective population size, such as monotremes and hominoids, showed significantly accelerated rates of gene expression evolution. The most likely explanation for the high rate of gene expression evolution in these lineages is the accumulation of mildly deleterious mutations in regulatory regions, due to the low efficiency of purifying selection. Thus, our observations are in agreement with the nearly neutral theory of molecular evolution. I also describe substantial differences in evolutionary rates between tissues, with brain being the most constrained (especially in primates) and testis significantly accelerated. The rate of gene expression evolution also varies significantly between chromosomes. In particular, I observed an acceleration of gene expression changes on the X chromosome, probably as a result of adaptive processes associated with the origin of therian sex chromosomes. Lastly, I identified several individual genes as well as co-regulated expression modules that have undergone lineage-specific expression changes and likely underlie various phenotypic innovations in mammals. The methods developed during my thesis, as well as the comprehensive gene content analyses and transcriptomics datasets made available by our group, will likely prove to be useful for further exploratory analyses of the diverse mammalian phenotypes.

Relevance: 100.00%

Abstract:

Density is an important component of hot-mix asphalt (HMA) pavement quality and long-term performance. Insufficient density of an in-place HMA pavement is the most frequently cited construction-related performance problem. This study evaluated the use of electromagnetic gauges to nondestructively determine densities. Field and laboratory measurements were taken with two electromagnetic gauges—a PaveTracker and a Pavement Quality Indicator (PQI). Test data were collected in the field during and after paving operations and also in a laboratory on field mixes compacted in the lab. This study revealed that several mix- and project-specific factors affect electromagnetic gauge readings. Consequently, the implementation of these gauges will likely need to be done utilizing a test strip on a project- and mix-specific basis to appropriately identify an adjustment factor for the specific electromagnetic gauge being used for quality control and quality assurance (QC/QA) testing. The substantial reduction in testing time that results from employing electromagnetic gauges rather than coring makes it possible for more readings to be used in the QC/QA process with real-time information without increasing the testing costs.

Relevance: 100.00%

Abstract:

Ontic structural realism is the view that structures are what is real in the first place in the domain of fundamental physics. The structures are usually conceived as including a primitive modality. However, it has not been spelled out as yet what exactly that modality amounts to. This paper proposes to fill this lacuna by arguing that the fundamental physical structures possess a causal essence, being powers. Applying the debate about causal vs. categorical properties in analytic metaphysics to ontic structural realism, I show that the standard argument against categorical and for causal properties holds for structures as well. Structural realism, as a position in the metaphysics of science that is a form of scientific realism, is committed to causal structures. The metaphysics of causal structures is supported by physics, and it can provide for a complete and coherent view of the world that includes all domains of empirical science.

Relevance: 100.00%

Abstract:

I am pleased to present the performance report for the Iowa Department for the Blind for fiscal year 2005. This report is provided in compliance with sections 8E.210 and 216B.7 of the Code of Iowa. It contains valuable information about the services the Department and its partners provided for Iowans during the past fiscal year in the areas of vocational rehabilitation, library services, and resource management. Major accomplishments of the year included new food service opportunities in the Randolph-Sheppard program, extensive remodeling of the Adult Orientation and Adjustment Center, and continued national prominence in vocational rehabilitation as measured by the U.S. Rehabilitation Services Administration, which on June 13, 2005 released data on federal standards and indicators for the year ended September 30, 2004. Earnings ratios and the percentage of employment for vocational rehabilitation clients of the Department remain among the best in the nation. This is corroborated by a report released in September, 2005 by the U.S. Government Accountability Office, which tested and summarized datasets compiled by the U.S. Department of Education for the nation’s 80 vocational rehabilitation agencies. Overall, we met or exceeded 26 of 32 results targets included in this report. Key strategic challenges, developments, and trends are also discussed in the "Department Overview" that follows. Sincerely, Allen C. Harris Director, Iowa Department for the Blind

Relevance: 100.00%

Abstract:

Many classifiers achieve high levels of accuracy but have limited applicability in real-world situations because they do not lead to a greater understanding of, or insight into, the way features influence the classification. In areas such as health informatics, a classifier that clearly identifies the influences on classification can be used to direct research and formulate interventions. This research investigates the practical applications of Automated Weighted Sum (AWSum), a classifier that provides accuracy comparable to other techniques whilst providing insight into the data. This is achieved by calculating a weight for each feature value that represents its influence on the class value. The merits of this approach in classification and insight are evaluated on Cystic Fibrosis and Diabetes datasets with positive results.
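
The idea of one weight per feature value can be sketched in a few lines of Python. The abstract does not give the AWSum formula, so the weighting below is an assumption made for illustration only: each feature value gets P(positive | value) minus P(negative | value), a number between -1 and 1, and a record is classified by summing the weights of its values. The function names, the zero threshold, and the toy records are hypothetical.

```python
from collections import defaultdict


def fit_value_weights(rows, labels, positive):
    """Map each (feature, value) pair to a weight in [-1, 1].

    Assumed weighting: P(positive | value) - P(negative | value),
    which for two classes equals 2 * P(positive | value) - 1.
    """
    counts = defaultdict(lambda: [0, 0])  # [positive count, total count]
    for row, label in zip(rows, labels):
        for feature, value in row.items():
            entry = counts[(feature, value)]
            entry[0] += int(label == positive)
            entry[1] += 1
    return {key: 2.0 * pos / total - 1.0
            for key, (pos, total) in counts.items()}


def score(row, weights):
    # Unseen feature values contribute nothing to the sum.
    return sum(weights.get((f, v), 0.0) for f, v in row.items())


def predict(row, weights, positive, negative, threshold=0.0):
    return positive if score(row, weights) > threshold else negative


# Toy records (not from the Cystic Fibrosis or Diabetes datasets).
rows = [
    {"smoker": "yes", "bmi": "high"},
    {"smoker": "yes", "bmi": "low"},
    {"smoker": "no", "bmi": "high"},
    {"smoker": "no", "bmi": "low"},
]
labels = ["sick", "sick", "well", "well"]
weights = fit_value_weights(rows, labels, positive="sick")
prediction = predict({"smoker": "yes", "bmi": "low"}, weights, "sick", "well")
```

The insight comes from reading `weights` directly: in this toy data the smoker values carry weights of 1 and -1, while both bmi values carry a weight of 0 and so have no influence on the class.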

Relevance: 100.00%

Abstract:

The Quadrennial Needs Study was developed to assist in the identification of highway needs and the distribution of road funds in Iowa among the various highway entities. During the period 1978 to 1990, the process has seen large shifts in needs and associated funding distribution in individual counties with no apparent reason. This study investigated the reasons for such shifts. The study identified program inputs that can result in major shifts in needs, either up or down, from minor changes in the input values. The areas of concern were identified as the condition ratings for roads and structures, traffic volume and mix counts, and the assignment of construction cost areas. Eight counties exhibiting large shifts (greater than 30%) in needs over time were used to test the sensitivity of the variables. A ninth county was used as the baseline for the study. Recommendations are identified for improvements in the process of data collection in the areas of road and structure condition rating, traffic, and the assignment of construction cost areas. Advice is also offered on how to account for changes in jurisdiction between successive studies. Maintenance cost area assignment and levels of maintenance service are identified as requiring additional detailed research.