11 resultados para heterogeneous data sources

em Cambridge University Engineering Department Publications Database


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Compared with construction data sources that are usually stored and analyzed in spreadsheets and single data tables, data sources with more complicated structures, such as text documents, site images, web pages, and project schedules have been less intensively studied due to additional challenges in data preparation, representation, and analysis. In this paper, our definition and vision for advanced data analysis addressing such challenges are presented, together with related research results from previous work, as well as our recent developments of data analysis on text-based, image-based, web-based, and network-based construction sources. It is shown in this paper that particular data preparation, representation, and analysis operations should be identified, and integrated with careful problem investigations and scientific validation measures in order to provide general frameworks in support of information search and knowledge discovery from such information-abundant data sources.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cluster analysis of ranking data, which occurs in consumer questionnaires, voting forms or other inquiries of preferences, attempts to identify typical groups of rank choices. Empirically measured rankings are often incomplete, i.e. different numbers of filled rank positions cause heterogeneity in the data. We propose a mixture approach for clustering of heterogeneous rank data. Rankings of different lengths can be described and compared by means of a single probabilistic model. A maximum entropy approach avoids hidden assumptions about missing rank positions. Parameter estimators and an efficient EM algorithm for unsupervised inference are derived for the ranking mixture model. Experiments on both synthetic data and real-world data demonstrate significantly improved parameter estimates on heterogeneous data when the incomplete rankings are included in the inference process.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In the Climate Change Act of 2008 the UK Government pledged to reduce carbon emissions by 80% by 2050. As one step towards this, regulations are being introduced requiring all new buildings to be ‘zero carbon’ by 2019. These are defined as buildings which emit net zero carbon during their operational lifetime. However, in order to meet the 80% target it is necessary to reduce the carbon emitted during the whole life-cycle of buildings, including that emitted during the processes of construction. These elements make up the ‘embodied carbon’ of the building. While there are no regulations yet in place to restrict embodied carbon, a number of different approaches have been made. There are several existing databases of embodied carbon and embodied energy. Most provide data for the material extraction and manufacturing only, the ‘cradle to factory gate’ phase. In addition to the databases, various software tools have been developed to calculate embodied energy and carbon of individual buildings. A third source of data comes from the research literature, in which individual life cycle analyses of buildings are reported. This paper provides a comprehensive review, comparing and assessing data sources, boundaries and methodologies. The paper concludes that the wide variations in these aspects produce incomparable results. It highlights the areas where existing data is reliable, and where new data and more precise methods are needed. This comprehensive review will guide the future development of a consistent and transparent database and software tool to calculate the embodied energy and carbon of buildings.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Compared with structured data sources that are usually stored and analyzed in spreadsheets, relational databases, and single data tables, unstructured construction data sources such as text documents, site images, web pages, and project schedules have been less intensively studied due to additional challenges in data preparation, representation, and analysis. In this paper, our vision for data management and mining addressing such challenges are presented, together with related research results from previous work, as well as our recent developments of data mining on text-based, web-based, image-based, and network-based construction databases.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In the Climate Change Act of 2008 the UK Government pledged to reduce carbon emissions by 80% by 2050. As one step towards this, regulations are being introduced requiring all new buildings to be ‘zero carbon’ by 2019. These are defined as buildingswhichemitnetzerocarbonduringtheiroperationallifetime.However,inordertomeetthe80%targetitisnecessary to reduce the carbon emitted during the whole life-cycle of buildings, including that emitted during the processes of construction. These elements make up the ‘embodied carbon’ of the building. While there are no regulations yet in place to restrictembodiedcarbon,anumberofdifferentapproacheshavebeenmade.Thereareseveralexistingdatabasesofembodied carbonandembodiedenergy.Mostprovidedataforthematerialextractionandmanufacturingonly,the‘cradletofactorygate’ phase. In addition to the databases, various software tools have been developed to calculate embodied energy and carbon of individual buildings. A third source of data comes from the research literature, in which individual life cycle analyses of buildings are reported. This paper provides a comprehensive review, comparing and assessing data sources, boundaries and methodologies. The paper concludes that the wide variations in these aspects produce incomparable results. It highlights the areas where existing data is reliable, and where new data and more precise methods are needed. This comprehensive review will guide the future development of a consistent and transparent database and software tool to calculate the embodied energy and carbon of buildings.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We present a map of the transformation of energy in China as a Sankey diagram. After a review of previous work, and a statement of methodology, our main work has been the identification, evaluation, and treatment of appropriate data sources. This data is used to construct the Sankey diagram, in which flows of energy are traced from energy sources through end-use conversion devices, passive systems and final services to demand drivers. The resulting diagram provides a convenient and clear snapshot of existing energy transformations in China which can usefully be compared with a similar global analysis and which emphasises the potential for improvements in energy efficiency in 'passive systems'. More broadly, it gives a basis for examining and communicating future energy scenarios, including changes to demand, changes to the supply mix, changes in efficiency and alternative provision of existing services. © 2012 Elsevier Ltd.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The fundamental aim of clustering algorithms is to partition data points. We consider tasks where the discovered partition is allowed to vary with some covariate such as space or time. One approach would be to use fragmentation-coagulation processes, but these, being Markov processes, are restricted to linear or tree structured covariate spaces. We define a partition-valued process on an arbitrary covariate space using Gaussian processes. We use the process to construct a multitask clustering model which partitions datapoints in a similar way across multiple data sources, and a time series model of network data which allows cluster assignments to vary over time. We describe sampling algorithms for inference and apply our method to defining cancer subtypes based on different types of cellular characteristics, finding regulatory modules from gene expression data from multiple human populations, and discovering time varying community structure in a social network.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This article explores risk management in global industrial investment by identifying linkages and gaps between theories and practices. It identifies opportunities for further development of the field. Three related bodies of literature have been reviewed: risk management, global manufacturing and investment. The review suggests that risk management in global manufacturing is overlooked in the literature; that existing theoretical risk management processes are not well developed in the global manufacturing context and that the investment literature applies mainly to financial risk assessment rather than investment risk management structures. Further, there appears to be a serious lack of systematic industrial risk management in investment decision making. This article highlights the opportunities to deploy current good practices more effectively as well as the need to develop more robust theories of industrial investment risk management. The approach adopted to investigate this multidisciplinary topic included a historical review of literature to understand the diverse background of theoretical development. A case study research approach was adopted to collect data, involving four global manufacturing companies and one risk management advisory company to observe the patterns and rationale of current practices. Supporting arguments from secondary data sources reinforced the findings. The research focuses risk management in global industrial investment. It links theories with practice to understand the existing knowledge gap and proposes key research themes for further research. © 2013 Macmillan Publishers Ltd. 1460-3799 Risk Management.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Usually, firms that produce innovative global products are discussed within the context of developed countries. New ventures in developing countries are typically viewed as low-cost product providers that generate technologically similar products to those produced by developed economies. However, this paper argues that some Chinese university spin-outs (USOs), although rare, have adopted a novel 'catch-up' strategy to build global products on the basis of indigenous platform technologies. This paper attempts to develop a conceptual framework to address the question: how do these specific Chinese USOs develop their innovation capabilities to build global products? In order to explore the idiosyncrasies of the specific USOs, this paper uses the multiple case studies method. The primary data sources are accessed through semi-structured interviews. In addition, archival data and other materials are used as secondary sources. The study analyses the configuration of capabilities that are needed for idiosyncratic growth, and maps them to the globalisation processes. This paper provides a strategic 'roadmap' as an explanatory guide to entrepreneurs, policy makers and investors to better understand the phenomena. © 2014 Inderscience Enterprises Ltd.