869 results for text to scene conversion
Abstract:
In this paper, we present syllable-based duration modelling in the context of a prosody model for Standard Yorùbá (SY) text-to-speech (TTS) synthesis applications. Our prosody model is conceptualised around a modular holistic framework. This framework is implemented using the Relational Tree (R-Tree) techniques. An important feature of our R-Tree framework is its flexibility in that it facilitates the independent implementation of the different dimensions of prosody, i.e. duration, intonation, and intensity, using different techniques and their subsequent integration. We applied the Fuzzy Decision Tree (FDT) technique to model the duration dimension. In order to evaluate the effectiveness of FDT in duration modelling, we have also developed a Classification And Regression Tree (CART) based duration model using the same speech data. Each of these models was integrated into our R-Tree based prosody model. We performed both quantitative (i.e. Root Mean Square Error (RMSE) and Correlation (Corr)) and qualitative (i.e. intelligibility and naturalness) evaluations on the two duration models. The results show that CART models the training data more accurately than FDT. The FDT model, however, shows a better ability to extrapolate from the training data since it achieved a better accuracy for the test data set. Our qualitative evaluation results show that our FDT model produces synthesised speech that is perceived to be more natural than our CART model. In addition, we also observed that the expressiveness of FDT is much better than that of CART. That is because the representation in FDT is not restricted to a set of piece-wise or discrete constant approximation. We, therefore, conclude that the FDT approach is a practical approach for duration modelling in SY TTS applications. © 2006 Elsevier Ltd. All rights reserved.
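The two quantitative measures named in the abstract above, RMSE and correlation, can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names and the duration values are hypothetical, and Pearson correlation is assumed as the meaning of "Corr".

```python
import math


def rmse(predicted, observed):
    """Root mean square error between two equal-length sequences."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


def pearson_corr(predicted, observed):
    """Pearson correlation coefficient (assumed meaning of 'Corr')."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / math.sqrt(var_p * var_o)


# Hypothetical syllable durations in milliseconds (not taken from the paper).
observed_ms = [120.0, 95.0, 210.0, 150.0]
predicted_ms = [110.0, 100.0, 200.0, 160.0]
print(rmse(predicted_ms, observed_ms), pearson_corr(predicted_ms, observed_ms))
```

A lower RMSE and a correlation closer to 1 on held-out test data would indicate the better generalisation that the abstract attributes to the FDT model.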
Abstract:
This paper presents a novel intonation modelling approach and demonstrates its applicability using the Standard Yorùbá language. Our approach is motivated by the theory that abstract and realised forms of intonation and other dimensions of prosody should be modelled within a modular and unified framework. In our model, this framework is implemented using the Relational Tree (R-Tree) technique. The R-Tree is a sophisticated data structure for representing a multi-dimensional waveform in the form of a tree. Our R-Tree for an utterance is generated in two steps. First, the abstract structure of the waveform, called the Skeletal Tree (S-Tree), is generated using tone phonological rules for the target language. Second, the numerical values of the perceptually significant peaks and valleys on the S-Tree are computed using a fuzzy logic based model. The resulting points are then joined by applying interpolation techniques. The actual intonation contour is synthesised by the Pitch Synchronous Overlap and Add (PSOLA) technique using the Praat software. We performed both quantitative and qualitative evaluations of our model. The preliminary results suggest that, although the model does not predict the numerical speech data as accurately as contemporary data-driven approaches, it produces synthetic speech with comparable intelligibility and naturalness. Furthermore, our model is easy to implement, interpret and adapt to other tone languages.
Abstract:
In this paper we present the design and analysis of an intonation model for text-to-speech (TTS) synthesis applications using a combination of Relational Tree (RT) and Fuzzy Logic (FL) technologies. The model is demonstrated using the Standard Yorùbá (SY) language. In the proposed intonation model, phonological information extracted from text is converted into an RT. An RT is a sophisticated data structure that represents the peaks and valleys as well as the spatial structure of a waveform symbolically in the form of trees. An initial approximation to the RT, called the Skeletal Tree (ST), is first generated algorithmically. The exact numerical values of the peaks and valleys on the ST are then computed using FL. Quantitative analysis of the result gives an RMSE of 0.56 and 0.71 for peaks and valleys respectively. Mean Opinion Scores (MOS) of 9.5 and 6.8, on a scale of 1 to 10, were obtained for intelligibility and naturalness respectively.
Abstract:
Bio energy is a renewable energy and a solution to the depletion of fossil fuels. Bio energy such as heat, power and bio fuel is generated by conversion technologies using biomass, for example domestic waste, root crops, forest residue and animal slurry. Pyrolysis, anaerobic digestion and combined heat and power engines are some examples of the technologies. Depending on its nature, a biomass can be treated with various technologies to yield products, which can be further treated with other technologies and eventually converted into the final products as bio energy. The pathway followed by the biomass, technologies, intermediate products and bio energy in the conversion process is referred to as a bio energy pathway. Identification of appropriate pathways optimizes the conversion process. Although there are various approaches to generating the pathways, there is still a need for a semantic approach, which allows checking the consistency of the knowledge and sharing and extending the knowledge efficiently. This paper presents an ontology-based approach to automatic generation of the pathways for biomass to bio energy conversion, which exploits the definition and hierarchical structure of the biomass and technologies, their relationships and associated properties, and infers appropriate pathways. A case study has been carried out in a real-life scenario, the bio energy project for the North West of Europe (Bioen NW), which showed promising results.
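The idea of a pathway as a chain of biomass, technologies, intermediate products and final bio energy can be illustrated with a small sketch. It is only a plain depth-first search over a hand-written table, not the ontology-based inference the paper describes; the table entries, the names `TECHNOLOGIES` and `FINAL_PRODUCTS`, and the function `find_pathways` are all hypothetical.

```python
# Hypothetical toy knowledge base: technology -> (accepted input, produced output).
TECHNOLOGIES = {
    "anaerobic_digestion": ("animal_slurry", "biogas"),
    "pyrolysis": ("forest_residue", "bio_oil"),
    "chp_engine": ("biogas", "heat_and_power"),
}

# Hypothetical set of products counted as final bio energy.
FINAL_PRODUCTS = {"heat_and_power", "bio_oil"}


def find_pathways(material, path=None):
    """Return every chain material -> technology -> product ending in a final product."""
    path = path or [material]
    if material in FINAL_PRODUCTS:
        return [path]
    pathways = []
    for tech, (accepted, produced) in TECHNOLOGIES.items():
        # Skip products already on the path so that cycles cannot recurse forever.
        if accepted == material and produced not in path:
            pathways.extend(find_pathways(produced, path + [tech, produced]))
    return pathways


for pathway in find_pathways("animal_slurry"):
    print(" -> ".join(pathway))
```

An ontology would replace the exact string match on `accepted` with class-hierarchy reasoning, so that a technology declared for a general biomass class also applies to its subclasses.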
Abstract:
This study investigated the effects of word prediction and text-to-speech on the narrative composition writing skills of six fifth-grade Hispanic boys with specific learning disabilities (SLD). A multiple baseline design across subjects was used to explore the efficacy of word prediction and text-to-speech alone and in combination on four dependent variables: writing fluency (words per minute), syntax (T-units), spelling accuracy, and overall organization (holistic scoring rubric). Data were collected and analyzed during baseline, assistive technology interventions, and at 2-, 4-, and 6-week maintenance probes. Participants were equally divided into Cohorts A and B, and two separate but related studies were conducted. Throughout all phases of the study, participants wrote narrative compositions for 15-minute sessions. During baseline, participants used word processing only. During the assistive technology intervention condition, Cohort A participants used word prediction followed by word prediction with text-to-speech. Concurrently, Cohort B participants used text-to-speech followed by text-to-speech with word prediction. The results of this study indicate that word prediction alone or in combination with text-to-speech has a positive effect on the narrative writing compositions of students with SLD. Overall, participants in Cohorts A and B wrote more words, more T-units, and spelled more words correctly. A sign test indicated that these perceived effects were not likely due to chance. Additionally, the quality of writing improved as measured by holistic rubric scores. When participants in Cohort B used text-to-speech alone, with the exception of spelling accuracy, inconsequential results were observed on all dependent variables. This study demonstrated that word prediction alone or in combination assists students with SLD to write longer, improved-quality, narrative compositions.
These results suggest that word prediction or word prediction with text-to-speech be considered as a writing support to facilitate the production of a first draft of a narrative composition. However, caution should be given to the use of text-to-speech alone as its effectiveness has not been established. Recommendations for future research include investigating the use of these technologies in other phases of the writing process, with other student populations, and with other writing styles. Further, these technologies should be investigated while integrated into classroom composition instruction.
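The sign test mentioned in this abstract can be sketched as an exact two-sided binomial test on the number of participants who improved versus declined. The abstract does not report the underlying counts, so the figures below are hypothetical, and the two-sided form is an assumption.

```python
from math import comb


def sign_test_p_value(n_positive, n_negative):
    """Exact two-sided sign test; ties are assumed to have been dropped beforehand."""
    n = n_positive + n_negative
    if n == 0:
        raise ValueError("at least one non-tied pair is required")
    k = min(n_positive, n_negative)
    # Probability of a split at least this lopsided when each sign has chance 0.5.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical case: all six participants improved over baseline.
print(sign_test_p_value(6, 0))
```

With only six participants, the test can fall below the conventional 0.05 threshold only when every participant changes in the same direction.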
Abstract:
Stirling engines with a parabolic dish for thermal-to-electric conversion of solar energy are one of the most promising renewable energy technologies for reducing the dependence on fossil fuels in electricity generation. This paper addresses the modelling and simulation of a solar-powered Stirling engine system with a parabolic dish and electric generator, aiming to determine its energy production and efficiency. The model includes the solar radiation concentration system, the heat transfer in the thermal receiver, the thermal cycle and the mechanical and electric energy conversion. The thermodynamic and energy transfer processes in the engine are modelled in detail, including all the main processes occurring in the compression, expansion and regenerator spaces. Starting from a particular configuration, an optimization of the concentration factor is also carried out and the results for both the transient and steady state regimes are presented. It was found that using a directly illuminated thermal receiver without cavity the engine efficiency is close to 23.8%, corresponding to a global efficiency of 10.4%. The components to be optimized are identified in order to increase the global efficiency of the system and the trade-off between system complexity and efficiency is discussed.
Abstract:
This paper describes an interactive installation work set in a large dome space. The installation is an audio and physical re-rendition of an interactive writing work. In the original work, the user interacted via keyboard and screen while online. This rendition of the work retains the online interaction, but also places the interaction within a physical space, where the main 'conversation' takes place by the participant-audience speaking through microphones and listening through headphones. The work now also includes voice and SMS input, using speech-to-text and text-to-speech conversion technologies, and audio and displayed text for output. These additions allow the participant-audience to co-author the work while they participate in audible conversation with keyword-triggering characters (bots). Communication in the space can be person-to-computer via microphone, keyboard, and phone; person-to-person via machine and within the physical space; computer-to-computer; and computer-to-person via audio and projected text.
Abstract:
In the last two decades, there has been an important increase in research on speech technology in Spain, mainly due to a higher level of funding from European, Spanish and local institutions and also due to a growing interest in these technologies for developing new services and applications. This paper provides a review of the main areas of speech technology addressed by research groups in Spain, their main contributions in recent years and their current main focus of interest. This description is classified into five main areas: audio processing including speech, speaker characterization, speech and language processing, text to speech conversion and spoken language applications. This paper also introduces the Spanish Network of Speech Technologies (RTTH, Red Temática en Tecnologías del Habla) as the research network that includes almost all the researchers working in this area, presenting some figures, its objectives and its main activities in recent years.
Abstract:
Reviews the books, Lessons From the Northern Ireland Peace Process edited by Timothy J. White (2013) and Human Rights as War by Other Means by Jennifer Curtis (2014). Edited by a U.S.-based academic with an enduring interest in Ireland, the first book draws together an interdisciplinary group of academics from across North America and the U.K. (though notably not Northern Ireland itself) to cover such topics as third party intervention, nationalism, grassroots change, and community development. The second text to be reviewed may be seen as a thorough analysis of this particular point: what is the role played by human rights in Northern Ireland’s peace process?
Abstract:
Transmissible spongiform encephalopathies (TSEs) are lethal, infectious disorders of the mammalian nervous system. A TSE hallmark is the conversion of the cellular protein PrPC to disease-associated PrPSc (named for scrapie, the first known TSE). PrPC is protease-sensitive, monomeric, detergent soluble, and primarily α-helical; PrPSc is protease-resistant, polymerized, detergent insoluble, and rich in β-sheet. The “protein-only” hypothesis posits that PrPSc is the infectious TSE agent that directly converts host-encoded PrPC to fresh PrPSc, harming neurons and creating new agents of infection. To gain insight into the conformational transitions of PrP, we tested the ability of several protein chaperones, which supervise the conformational transitions of proteins in diverse ways, to affect conversion of PrPC to its protease-resistant state. None affected conversion in the absence of pre-existing PrPSc. In its presence, only two, GroEL and Hsp104 (heat shock protein 104), significantly affected conversion. Both promoted it, but the reaction characteristics of conversions with the two chaperones were distinct. In contrast, chemical chaperones inhibited conversion. Our findings provide new mechanistic insights into the nature of PrP conversions, and provide a new set of tools for studying the process underlying TSE pathogenesis.
Abstract:
Scrapie is a transmissible neurodegenerative disease that appears to result from an accumulation in the brain of an abnormal protease-resistant isoform of prion protein (PrP) called PrPsc. Conversion of the normal, protease-sensitive form of PrP (PrPc) to protease-resistant forms like PrPsc has been demonstrated in a cell-free reaction composed largely of hamster PrPc and PrPsc. We now report studies of the species specificity of this cell-free reaction using mouse, hamster, and chimeric PrP molecules. Combinations of hamster PrPc with hamster PrPsc and mouse PrPc with mouse PrPsc resulted in the conversion of PrPc to protease-resistant forms. Protease-resistant PrP species were also generated in the nonhomologous reaction of hamster PrPc with mouse PrPsc, but little conversion was observed in the reciprocal reaction. Glycosylation of the PrPc precursors was not required for species specificity in the conversion reaction. The relative conversion efficiencies correlated with the relative transmissibilities of these strains of scrapie between mice and hamsters. Conversion experiments performed with chimeric mouse/hamster PrPc precursors indicated that differences between PrPc and PrPsc at residues 139, 155, and 170 affected the conversion efficiency and the size of the resultant protease-resistant PrP species. We conclude that there is species specificity in the cell-free interactions that lead to the conversion of PrPc to protease-resistant forms. This specificity may be the molecular basis for the barriers to interspecies transmission of scrapie and other transmissible spongiform encephalopathies in vivo.
Abstract:
Introduction: The motivation for developing megavoltage (and kilovoltage) cone beam CT (MV CBCT) capabilities in the radiotherapy treatment room was primarily based on the need to improve patient set-up accuracy. There has recently been an interest in using the cone beam CT data for treatment planning. Accurate treatment planning, however, requires knowledge of the electron density (ED) of the tissues receiving radiation in order to calculate dose distributions. This is obtained from CT, utilising a conversion between CT number and electron density of various tissues. The use of MV CBCT has particular advantages compared to treatment planning with kilovoltage CT in the presence of high atomic number materials and requires the conversion of pixel values from the image sets to electron density. Therefore, a study was undertaken to characterise the pixel value to electron density relationship for the Siemens MV CBCT system, MVision, and determine the effect, if any, of varying the number of monitor units used for acquisition. If a significant difference with number of monitor units was seen, then pixel value to ED conversions may be required for each of the clinical settings. The calibration of the MV CT images for electron density offers the possibility for a daily recalculation of the dose distribution and the introduction of new adaptive radiotherapy treatment strategies. Methods: A Gammex Electron Density CT Phantom was imaged with the MV CBCT system. The pixel value for each of the sixteen inserts, which ranged from 0.292 to 1.707 relative electron density to the background solid water, was determined by taking the mean value from within a region of interest centred on the insert, over 5 slices within the centre of the phantom. These results were averaged and plotted against the relative electron densities of each insert, and a linear least squares fit was performed. This procedure was performed for images acquired with 5, 8, 15 and 60 monitor units.
Results: The linear relationship between MVCT pixel value and ED was demonstrated for all monitor unit settings and over a range of electron densities. The number of monitor units utilised was found to have no significant impact on this relationship. Discussion: It was found that the number of MU utilised does not significantly alter the pixel value obtained for different ED materials. However, to ensure the most accurate and reproducible MV to ED calibration, one MU setting should be chosen and used routinely. To ensure accuracy for the clinical situation this MU setting should correspond to that which is used clinically. If more than one MU setting is used clinically then an average of the CT values acquired with different numbers of MU could be utilized without loss in accuracy. Conclusions: No significant differences have been shown between the pixel value to ED conversion for the Siemens MV CT cone beam unit with change in monitor units. Thus a single conversion curve could be utilised for MV CT treatment planning. To fully utilise MV CT imaging for radiotherapy treatment planning further work will be undertaken to ensure all corrections have been made and dose calculations verified. These dose calculations may be either for treatment planning purposes or for reconstructing the delivered dose distribution from transit dosimetry measurements made using electronic portal imaging devices. This will potentially allow the cumulative dose distribution to be determined through the patient’s multi-fraction treatment and adaptive treatment strategies developed to optimize the tumour response.
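The calibration step described in the Methods, a linear least squares fit of mean pixel value against relative electron density, can be sketched as below. The data points are synthetic and exactly linear, chosen only to exercise the code; they are not measurements from the study, and the function names are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def pixel_to_ed(pixel_value, slope, intercept):
    """Invert the calibration line to estimate relative electron density."""
    return (pixel_value - intercept) / slope


# Synthetic calibration points (hypothetical): pixel = 1000 * ED + 20.
relative_ed = [0.292, 0.5, 1.0, 1.707]
pixel_values = [1000 * ed + 20 for ed in relative_ed]

slope, intercept = fit_line(relative_ed, pixel_values)
print(pixel_to_ed(1020.0, slope, intercept))
```

Repeating the fit for each monitor unit setting and comparing the resulting slopes and intercepts would be one way to check whether a single conversion curve is sufficient, as the abstract concludes.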
Abstract:
Both [C4CO]−· and [C2COC2]−· are formed in the ion source of a VG ZAB 2HF mass spectrometer by the respective processes HO− + Me3Si–CC–CC–CO–CMe3 → [C4CO]−· + Me3SiOH + Me3C·, and Me3Si–CC–CO–CC–SiMe3 + SF6 + e → [C2COC2]−· + 2Me3SiF + SF4. The second synthetic pathway involves a double desilylation reaction similar to that first reported by Squires. The two radical anion isomers produce different and characteristic charge reversal spectra upon collisional activation. In contrast, following collision induced charge stripping, both radical anions produce neutral C4CO as evidenced by the identical neutralisation reionisation (−NR+) spectra. The exclusive rearrangement of C2¹³COC2 to C4¹³CO indicates that ¹²C–O bond formation is not involved in the reaction. Ab initio calculations (at the RCCSD(T)/aug-cc-pVDZ//B3LYP/6-31G∗ level of theory) have been used to investigate the reaction coordinates on the potential surfaces for both singlet and triplet rearrangements of neutral C2COC2. Singlet C2COC2 is less stable than singlet C4CO by 78.8 kcal mol−1 and requires only 8.5 kcal mol−1 of additional energy to effect conversion to C4CO by a rearrangement sequence involving three C–C ring opening/cyclisation steps.
Abstract:
Free charge generation in donor-acceptor (D-A) based organic photovoltaic diodes (OPV) progresses through formation of charge-transfer (CT) and charge-separated (CS) states, and excitation decay to the triplet level is considered a terminal loss. On the other hand, a direct excitation decay to the triplet state is beneficial for multiexciton harvesting in singlet fission photovoltaics (SF-PV), and the formation of the CT state is considered a limiting factor for multiple triplet harvesting. These two extremes, when present in a D-A system, are expected to provide important insights into the mechanism of free charge generation and the spin character of bimolecular recombination in OPVs. Herein, we present the complete cycle of events linked to spin conversion in the model OPV system of rubrene/C60. By tracking the spectral evolution of photocurrent generation at short-circuit and close to open-circuit conditions, we are able to capture spectral changes to photocurrent that reveal the triplet character of the CT state. Furthermore, we unveil an energy up-conversion effect that sets in as a consequence of triplet population build-up, where the triplet-triplet annihilation (TTA) process effectively regenerates the singlet excitation. This detailed balance is shown to enable a rare event of photon emission just above the open-circuit voltage (VOC) in OPVs.