49 resultados para test data generation
em Cambridge University Engineering Department Publications Database
Resumo:
Data in an organisation often contains business secrets that organisations do not want to release. However, there are occasions when it is necessary for an organisation to release its data such as when outsourcing work or using the cloud for Data Quality (DQ) related tasks like data cleansing. Currently, there is no mechanism that allows organisations to release their data for DQ tasks while ensuring that it is suitably protected from releasing business related secrets. The aim of this paper is therefore to present our current progress on determining which methods are able to modify secret data and retain DQ problems. So far we have identified the ways in which data swapping and the SHA-2 hash function alterations methods can be used to preserve missing data, incorrectly formatted values, and domain violations DQ problems while minimising the risk of disclosing secrets. © (2012) by the AIS/ICIS Administrative Office All rights reserved.
Resumo:
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions. For some sources close-caption data is available. This allows the use of lightly-supervised training techniques. However, for some sources and languages close-caption is not available. In these cases unsupervised training techniques must be used. This paper examines the use of unsupervised techniques for discriminative training. In unsupervised training automatic transcriptions from a recognition system are used for training. As these transcriptions may be errorful data selection may be useful. Two forms of selection are described, one to remove non-target language shows, the other to remove segments with low confidence. Experiments were carried out on a Mandarin transcriptions task. Two types of test data were considered, Broadcast News (BN) and Broadcast Conversations (BC). Results show that the gains from unsupervised discriminative training are highly dependent on the accuracy of the automatic transcriptions. © 2007 IEEE.
Resumo:
Soil liquefaction following large earthquakes is a major contributor to damage to infrastructure and economic loss, as borne out by the earthquakes in Japan and New Zealand in 2011. While extensive research has been conducted on soil liquefaction and our understanding of liquefaction has been advancing, several uncertainties remain. In this paper the basic premise that liquefaction is an 'undrained' event will be challenged. Evidence will be offered based on dynamic centrifuge tests to show that rapid settlements occur both in level ground and for shallow foundations. It will also be shown that the definition of liquefaction based on excess pore pressure generation and the subsequent classification of sites as liquefiable and non-liquefiable is not satisfactory, as centrifuge test data shows that both loose and dense sand sites produce significant excess pore pressure. Experimental evidence will be presented that shows that the permeability of sands increases rapidly at very low effective stresses to allow for rapid drainage to take place from liquefied soil. Based on these observations a micro-mechanical view of soil liquefaction that brings together the Critical State view of soil liquefaction and the importance of dynamic loading will be presented. © 2012 Indian Geotechnical Society.
Resumo:
Focused laser micromachining in an optical microscope system is used to prototype packages for optoelectronic devices and to investigate new materials with potential applications in packaging. Micromachined thin films are proposed as mechanical components to locate fibres and other optical and electrical components on opto-assemblies. This paper reports prototype structures which are micromachined in silicon carbide to produce beams 5 μm thick by (i) laser cutting a track in a SiC coated Si wafer, (ii) undercutting by anisotropic silicon etching using KOH in water, and (iii) trimming if necessary with the laser system. This approach has the advantage of fast turn around and proof of concept. Mechanical test data are obtained from the prototype SiC beam package structures by testing with a stylus profilometer. The Youngs modulus obtained for chemical vapour deposited silicon carbide is 360 +/- 50 GPa indicating that it is a promising material for packaging applications.
Resumo:
The heat dissipation capability of highly porous cellular metal foams with open cells subject to forced air convection is studied using a combined experimental and analytical approach. The cellular morphologies of six FeCrAlY (an iron-based alloy) foams and six copper alloy foams with a range of pore sizes and porosities are quantified with the scanning electronic microscope and image analysis. Experimental measurements on pressure drop and heat transfer for copper foams are carried out. A numerical model for forced convection across open-celled metal foams is subsequently developed, and the predictions are compared with those measured. Reasonably good agreement with test data is obtained, given the complexity of the cellular foam morphology and the associated momentum/energy transport. The results show that cell size has a more significant effect on the overall heat transfer than porosity. An optimal porosity is obtained based on the balance between pressure drop and overall heat transfer, which decreases as the Reynolds number is increased.
Resumo:
Piles passing through sloping liquefiable deposits are prone to lateral loading if these deposits liquefy and flow during earthquakes. These lateral loads caused by the relative soil-pile movement will induce bending in the piles and may result in failure of the piles or excessive pile-head displacement. Whilst the weak nature of the flowing liquefied soil would suggest that only small loads would be exerted on the piles, it is known from case histories that piles do fail owing to the influence of laterally spreading soils. It will be shown, based on dynamic centrifuge test data, that dilatant behaviour of soil close to the pile is the major cause of these considerable transient lateral loads which are transferred to the pile. This paper reports the results of geotechnical centrifuge tests in which models of gently sloping liquefiable sand with pile foundations passing through them were subjected to earthquake excitation. The soil close to the pile was instrumented with pore-pressure transducers and contact stress cells in order to monitor the interaction between soil and pile and to track the soil stress state both upslope and downslope of the pile. The presence of instrumentation measuring pore-pressure and lateral stress close to the pile in the research described in this paper gives the opportunity to better study the soil stress state close to the pile and to compare the loads measured as being applied to the piles by the laterally spreading soils with those suggested by the JRA design code. This test data shows that lateral stresses much greater than one might expect from calculations based on the residual strength of liquefied soil may be applied to piles in flowing liquefied slopes owing to the dilative behaviour of the liquefied soil. It is shown at least for the particular geometry studied that the current JRA design code can be un-conservative by a factor of three for these dilation-affected transient lateral loads.
Resumo:
In recent years, the use of morphological decomposition strategies for Arabic Automatic Speech Recognition (ASR) has become increasingly popular. Systems trained on morphologically decomposed data are often used in combination with standard word-based approaches, and they have been found to yield consistent performance improvements. The present article contributes to this ongoing research endeavour by exploring the use of the 'Morphological Analysis and Disambiguation for Arabic' (MADA) tools for this purpose. System integration issues concerning language modelling and dictionary construction, as well as the estimation of pronunciation probabilities, are discussed. In particular, a novel solution for morpheme-to-word conversion is presented which makes use of an N-gram Statistical Machine Translation (SMT) approach. System performance is investigated within a multi-pass adaptation/combination framework. All the systems described in this paper are evaluated on an Arabic large vocabulary speech recognition task which includes both Broadcast News and Broadcast Conversation test data. It is shown that the use of MADA-based systems, in combination with word-based systems, can reduce the Word Error Rates by up to 8.1 relative. © 2012 Elsevier Ltd. All rights reserved.
Resumo:
This letter presents data from triaxial tests conducted as part of a research programme into the stress-strain behaviour of clays and silts at Cambridge University. To support findings from earlier research using databases of soil tests, eighteen CIU triaxial tests on speswhite kaolin were performed to confirm an assumed link between mobilisation strain (γ M=2) and overconsolidation ratio (OCR). In the moderate shear stress range (0.2c u to 0.8c u) the test data are essentially linear on log-log plots. Both the slopes and intercepts of these lines are simple functions of OCR.
Resumo:
The paper presents centrifuge test data of the problem of tunnelling effects on buried pipelines and compares them to predictions made using DEM simulations. The paper focuses on the examination of pipeline bending moments, their distribution along the pipe, and their development with tunnel volume loss. Centrifuge results are obtained by PIV analysis and compared to results obtained using the DEM model. The DEM model was built to replicate the centrifuge model as closely as possible and included numerical features formulated specially for this task, such as structural elements to replicate the tunnel and pipeline. Results are extremely encouraging, with deviations between DEM and centrifuge test bending moment results being very small. © 2010 Taylor & Francis Group, London.
Resumo:
Soil liquefaction following strong earthquakes causes extensive damage to civil engineering structures. Foundations of buildings, bridges etc can suffer excessive rotation/settlement due to liquefaction. Many of the recent earthquakes bear testimony for such damage. In this article a hypothesis that "Superstructure stiffness can determine the type of liquefaction-induced failure mechanism suffered by the foundations" is proposed. As a rider to this hypothesis, it will be argued that liquefaction will cause failure of a foundation system in a mode of failure that offers least resistance. Evidence will be offered in terms of field observations during the 921 Ji-Ji earthquake in 1999 in Taiwan and Bhuj earthquake of 2001 in India. Dynamic centrifuge test data and finite element analyses results are presented to illustrate the traditional failure mechanisms. Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Resumo:
In order to minimize the number of iterations to a turbine design, reasonable choices of the key parameters must be made at the earliest possible opportunity. The choice of blade loading is of particular concern in the low pressure (LP) turbine of civil aero engines, where the use of high-lift blades is widespread. This paper presents an analytical mean-line design study for a repeating-stage, axial-flow Low Pressure (LP) turbine. The problem of how to measure blade loading is first addressed. The analysis demonstrates that the Zweifel coefficient [1] is not a reasonable gauge of blade loading because it inherently depends on the flow angles. A more appropriate coefficient based on blade circulation is proposed. Without a large set of turbine test data it is not possible to directly evaluate the accuracy of a particular loss correlation. The analysis therefore focuses on the efficiency trends with respect to flow coefficient, stage loading, lift coefficient and Reynolds number. Of the various loss correlations examined, those based on Ainley and Mathieson ([2], [3], [4]) do not produce realistic trends. The profile loss model of Coull and Hodson [5] and the secondary loss models of Craig and Cox [6] and Traupel [7] gave the most reasonable results. The analysis suggests that designs with the highest flow turning are the least sensitive to increases in blade loading. The increase in Reynolds number lapse with loading is also captured, achieving reasonable agreement with experiments. Copyright © 2011 by ASME.
Resumo:
This paper is aimed at enabling the confident use of existing model test facilities for ultra deepwater application without having to compromise on the widely accepted range of scales currently used by the floating production industry. Passive line truncation has traditionally been the preferred method of creating an equivalent numerical model at reduced depth; however, these techniques tend to suffer in capturing accurately line dynamic response and so reproducing peak tensions. In an attempt to improve credibility of model test data the proposed truncation procedure sets up the truncated model, based on line dynamic response rather than quasi-static system stiffness. The upper sections of each line are modeled in detail, capturing the wave action zone and all coupling effects with the vessel. These terminate to an approximate analytical model that aims to simulate the remainder of the line. Stages 1 & 2 are used to derive a water depth truncation ratio. Here vibration decay of transverse elastic waves is assessed and it is found that below a certain length criterion, the transverse vibrational characteristics for each line are inertia driven, hence with respect to these motions the truncated model can assume a linear damper whose coefficient depends on the local line properties and vibration frequency. Stage 3 endeavors to match the individual line stiffness between the full depth and truncated models. In deepwater it is likely that taut polyester moorings will be used which are predominantly straight and have high axial stiffness that provides the principal restoring force to static and low frequency vessel motions. Consequently, it means that the natural frequencies of axial vibrations are above the typical wave frequency range allowing for a quasi-static solution. In cases of exceptionally large wave frequency vessel motions, localized curvature at the chain seabed segment and tangential skin drag on the polyester rope can increase dynamic peak tensions considerably. The focus of this paper is to develop an efficient scheme based on analytic formulation, for replicating these forces at the truncation. The paper will close with an example case study of a single mooring under extreme conditions that replicates exactly the static and dynamic characteristics of the full depth line. Copyright © 2012 by the International Society of Offshore and Polar Engineers (ISOPE).
Resumo:
Language models (LMs) are often constructed by building multiple individual component models that are combined using context independent interpolation weights. By tuning these weights, using either perplexity or discriminative approaches, it is possible to adapt LMs to a particular task. This paper investigates the use of context dependent weighting in both interpolation and test-time adaptation of language models. Depending on the previous word contexts, a discrete history weighting function is used to adjust the contribution from each component model. As this dramatically increases the number of parameters to estimate, robust weight estimation schemes are required. Several approaches are described in this paper. The first approach is based on MAP estimation where interpolation weights of lower order contexts are used as smoothing priors. The second approach uses training data to ensure robust estimation of LM interpolation weights. This can also serve as a smoothing prior for MAP adaptation. A normalized perplexity metric is proposed to handle the bias of the standard perplexity criterion to corpus size. A range of schemes to combine weight information obtained from training data and test data hypotheses are also proposed to improve robustness during context dependent LM adaptation. In addition, a minimum Bayes' risk (MBR) based discriminative training scheme is also proposed. An efficient weighted finite state transducer (WFST) decoding algorithm for context dependent interpolation is also presented. The proposed technique was evaluated using a state-of-the-art Mandarin Chinese broadcast speech transcription task. Character error rate (CER) reductions up to 7.3 relative were obtained as well as consistent perplexity improvements. © 2012 Elsevier Ltd. All rights reserved.