993 resultados para galaxies: star clusters: general
Resumo:
Early-Type galaxies (ETGs) are embedded in hot (10^6-10^7 K), X-ray emitting gaseous haloes, produced mainly by stellar winds and heated by Type Ia supernovae explosions, by the thermalization of stellar motions and occasionally by the central super-massive black hole (SMBH). In particular, the thermalization of the stellar motions is due to the interaction between the stellar and the SNIa ejecta and the hot interstellar medium (ISM) already residing in the ETG. A number of different astrophysical phenomena determine the X-ray properties of the hot ISM, such as stellar population formation and evolution, galaxy structure and internal kinematics, Active Galactic Nuclei (AGN) presence, and environmental effects. With the aid of high-resolution hydrodynamical simulations performed on state-of-the-art galaxy models, in this Thesis we focus on the effects of galaxy shape, stellar kinematics and star formation on the evolution of the X-ray coronae of ETGs. Numerical simulations show that the relative importance of flattening and rotation are functions of the galaxy mass: at low galaxy masses, adding flattening and rotation induces a galactic wind, thus lowering the X-ray luminosity; at high galaxy masses the angular momentum conservation keeps the central regions of rotating galaxies at low density, whereas in non-rotating models a denser and brighter atmosphere is formed. The same dependence from the galaxy mass is present in the effects of star formation (SF): in light galaxies SF contributes to increase the spread in Lx, while at high galaxy masses the halo X-ray properties are marginally sensitive to SF effects. In every case, the star formation rate at the present epoch quite agrees with observations, and the massive, cold gaseous discs are partially or completely consumed by SF on a time-scale of few Gyr, excluding the presence of young stellar discs at the present epoch.
Resumo:
In this thesis, I have investigated the evolution of the high-redshift (z > 3) AGN population by collecting data from some of the major Chandra and XMM-Newton surveys. The final sample (141 sources) is one of the largest selected at z> 3 in the X- rays and it is characterised by a very high redshift completeness (98%). I derived the spectral slopes and obscurations through a spectral anaysis and I assessed the high-z evolution by deriving the luminosity function and the number counts of the sample. The best representation of the AGN evolution is a pure density evolution (PDE) model: the AGN space density is found to decrease by a factor of 10 from z=3 to z=5. I also found that about 50% of AGN are obscured by large column densities (logNH > 23). By comparing these data with those in the Local Universe, I found a positive evolution of the obscured AGN fraction with redshift, especially for luminous (logLx > 44) AGN. I also studied the gas content of z < 1 AGN-hosting galaxies and compared it with that of inactive galaxies. For the first time, I applied to AGN a method to derive the gas mass previously used for inactive galaxies only. AGN are found to live preferentially in gas-rich galaxies. This result on the one hand can help us in understanding the AGN triggering mechanisms, on the other hand explains why AGN are preferentially hosted by star-forming galaxies.
Resumo:
Holding the major share of stellar mass in galaxies and being also old and passively evolving, early-type galaxies (ETGs) are the primary probes in investigating these various evolution scenarios, as well as being useful means to provide insights on cosmological parameters. In this thesis work I focused specifically on ETGs and on their capability in constraining galaxy formation and evolution; in particular, the principal aims were to derive some of the ETGs evolutionary parameters, such as age, metallicity and star formation history (SFH) and to study their age-redshift and mass-age relations. In order to infer galaxy physical parameters, I used the public code STARLIGHT: this program provides a best fit to the observed spectrum from a combination of many theoretical models defined in user-made libraries. the comparison between the output and input light-weighted ages shows a good agreement starting from SNRs of ∼ 10, with a bias of ∼ 2.2% and a dispersion 3%. Furthermore, also metallicities and SFHs are well reproduced. In the second part of the thesis I performed an analysis on real data, starting from Sloan Digital Sky Survey (SDSS) spectra. I found that galaxies get older with cosmic time and with increasing mass (for a fixed redshift bin); absolute light-weighted ages, instead, result independent from the fitting parameters or the synthetic models used. Metallicities, instead, are very similar from each other and clearly consistent with the ones derived from the Lick indices. The predicted SFH indicates the presence of a double burst of star formation. Velocity dispersions and extinctiona are also well constrained, following the expected behaviours. As a further step, I also fitted single SDSS spectra (with SNR∼ 20), to verify that stacked spectra gave the same results without introducing any bias: this is an important check, if one wants to apply the method at higher z, where stacked spectra are necessary to increase the SNR. Our upcoming aim is to adopt this approach also on galaxy spectra obtained from higher redshift Surveys, such as BOSS (z ∼ 0.5), zCOSMOS (z 1), K20 (z ∼ 1), GMASS (z ∼ 1.5) and, eventually, Euclid (z 2). Indeed, I am currently carrying on a preliminary study to estabilish the applicability of the method to lower resolution, as well as higher redshift (z 2) spectra, just like the Euclid ones.
Resumo:
The mass estimation of galaxy clusters is a crucial point for modern cosmology, and can be obtained by several different techniques. In this work we discuss a new method to measure the mass of galaxy clusters connecting the gravitational potential of the cluster with the kinematical properties of its surroundings. We explore the dynamics of the structures located in the region outside virialized cluster, We identify groups of galaxies, as sheets or filaments, in the cluster outer region, and model how the cluster gravitational potential perturbs the motion of these structures from the Hubble fow. This identification is done in the redshift space where we look for overdensities with a filamentary shape. Then we use a radial mean velocity profile that has been found as a quite universal trend in simulations, and we fit the radial infall velocity profile of the overdensities found. The method has been tested on several cluster-size haloes from cosmological N-body simulations giving results in very good agreement with the true values of virial masses of the haloes and orientation of the sheets. We then applied the method to the Coma cluster and even in this case we found a good correspondence with previous. It is possible to notice a mass discrepancy between sheets with different alignments respect to the center of the cluster. This difference can be used to reproduce the shape of the cluster, and to demonstrate that the spherical symmetry is not always a valid assumption. In fact, if the cluster is not spherical, sheets oriented along different axes should feel a slightly different gravitational potential, and so give different masses as result of the analysis described before. Even this estimation has been tested on cosmological simulations and then applied to Coma, showing the actual non-sphericity of this cluster.
Resumo:
We have used high-resolution spectra, acquired with UVES@ESO-VLT, to determine the chemical abundances of different samples of AGB and RGB stars in 4 Galactic globular clusters, namely 47Tuc, NGC3201, M22 and M62. For almost all the analyzed AGB stars we found a clear discrepancy between the iron abundance measured from neutral lines and that obtained from single ionized lines, while this discrepancy is not obtained for the RGB samples observed in the same clusters and analyzed with the same procedure. Such a behavior exactly corresponds to what expected in the case of Non-Local Thermodynamical Equilibrium (NLTE) in the star atmosphere. These results have a huge impact on the proper determination of GC chemistry. In fact, one of the most intriguing consequences is that, at odds with previous claims, no iron spread is found in NGC3201 and M22 if the iron abundance is obtained from ionized lines only.
Resumo:
In this Thesis, we study the accretion of mass and angular momentum onto the disc of spiral galaxies from a global and a local perspective and comparing theory predictions with several observational data. First, we propose a method to measure the specific mass and radial growth rates of stellar discs, based on their star formation rate density profiles and we apply it to a sample of nearby spiral galaxies. We find a positive radial growth rate for almost all galaxies in our sample. Our galaxies grow in size, on average, at one third of the rate at which they grow in mass. Our results are in agreement with theoretical expectations if known scaling relations of disc galaxies are not evolving with time. We also propose a novel method to reconstruct accretion profiles and the local angular momentum of the accreting material from the observed structural and chemical properties of spiral galaxies. Applied to the Milky Way and to one external galaxy, our analysis indicates that accretion occurs at relatively large radii and has a local deficit of angular momentum with respect to the disc. Finally, we show how structure and kinematics of hot gaseous coronae, which are believed to be the source of mass and angular momentum of massive spiral galaxies, can be reconstructed from their angular momentum and entropy distributions. We find that isothermal models with cosmologically motivated angular momentum distributions are compatible with several independent observational constraints. We also consider more complex baroclinic equilibria: we describe a new parametrization for these states, a new self-similar family of solution and a method for reconstructing structure and kinematics from the joint angular momentum/entropy distribution.
Resumo:
The width of the 21 cm line (HI) emitted by spiral galaxies depends on the physical processes that release energy in the Interstellar Medium (ISM). This quantity is called velocity dispersion (σ) and it is proportional first of all to the thermal kinetic energy of the gas. The accepted theoretical picture predicts that the neutral hydrogen component (HI) exists in the ISM in two stable phases: a cold one (CNM, with σ~0.8 km/s) and a warm one (WNM, with σ~8 km/s). However, this is called into question by the observation that the HI gas has usually larger velocity dispersions. This suggests the presence of turbulence in the ISM, although the energy sources remain unknown. In this thesis we want to shed new light on this topic. We have studied the HI line emission of two nearby galaxies: NGC6946 and M101. For the latter we used new deep observations obtained with the Westerbork radio interferometer. Through a gaussian fitting procedure, we produced dispersion maps of the two galaxies. For both of them, we compared the σ values measured in the spiral arms with those in the interarms. In NGC6946 we found that, in both arms and interarms, σ grows with the column density, while we obtained the opposite for M 101. Using a statistical analysis we did not find a significant difference between arm and interarm dispersion distributions. Producing star formation rate density maps (SFRD) of the galaxies, we studied their global and local relations with the HI kinetic energy, as inferred from the measured dispersions. For NGC6946 we obtained a good log-log correlation, in agreement with a simple model of supernova feedback driven turbulence. This shows that in this galaxy turbulent motions are mainly induced by the stellar activity. For M 101 we did not find an analogous correlation, since the gas kinetic energy appears constant with the SFRD. We think that this may indicate that in this galaxy turbulence is driven also by accretion of extragalactic material.
Resumo:
Extended cluster radio galaxies show different morphologies com- pared to those found isolated in the field. Indeed, symmetric double radio galaxies are only a small percentage of the total content of ra- dio loud cluster galaxies, which show mainly tailed morphologies (e.g. O’Dea & Owen, 1985). Moreover, cluster mergers can deeply affect the statistical properties of their radio activity. In order to better understand the morphological and radio activity differences of the radio galaxies in major mergeing and non/tidal-merging clusters, we performed a multifrequency study of extended radio galax- ies inside two cluster complexes, A3528 and A3558. They belong to the innermost region of the Shapley Concentration, the most massive con- centration of galaxy clusters (termed supercluster) in the local Universe, at average redshift z ≈ 0.043. We analysed low frequency radio data performed at 235 and 610 MHz with Giant Metrewave Radio Telescope (GMRT) and we combined them with proprietary and literature observations, in order to have a wide frequency range (150 MHz to 8.4 GHz) to perform the spectral analysis. The low frequency images allowed us to carry out a detailed study of the radio tails and diffuse emission found in some cases. The results in the radio band were also qualitatively compared with the X-ray information coming from XMM-Newton observations, in order to test the interaction between radio galaxies and cluster weather. We found that the brightest central galaxies (BCGs) in the A3528 cluster complex are powerful and present substantial emission from old relativistic plasma characterized by a steep spectrum (α > 2). In the light of observational pieces of evidence, we suggest they are possible re-started radio galaxies. On the other hand, the tailed radio galaxies trace the host galaxy motion with respect to the ICM, and our find- ings is consistent with the dynamical interpretation of a tidal interaction (Gastaldello et al. 2003). On the contrary, the BCGs in the A3558 clus- ter complex are either quiet or very faint radio galaxies, supporting the hypothesis that clusters mergers quench the radio emission from AGN.
Resumo:
Using latent class analysis (LCA), a previous study on patients attending primary care identified four courses of low back pain (LBP) over the subsequent 6 months. To date, no studies have used longitudinal pain recordings to examine the "natural" course of recurrent and chronic LBP in a population-based sample of individuals. This study examines the course of LBP in the general population and elaborates on the stability and criterion-related validity of the clusters derived. A random sample of 400 individuals reporting LBP in a population-based study was asked to complete a comprehensive questionnaire at the start and end of the year's survey, and 52 weekly pain diaries in between. The latter were analyzed using LCA. 305 individuals returned more than 50% of the diaries. Four clusters were identified (severe persistent, moderate persistent, mild persistent, and fluctuating). The clusters differed significantly with regards to pain and disability. Assessment of cluster stability showed that a considerable proportion of patients in the "fluctuating" group changed their classification over time. Three of the four clusters describing the typical course of pain matched the clusters described previously for patients in primary care. Due to the population-based design, this study achieves, for the first time, a close insight into the "natural" course of chronic and recurrent low back pain, including individuals that did not necessarily visit the general practitioner. The findings will help to understand better the nature of this pain in the general population.
Resumo:
PLATO 2.0 has recently been selected for ESA’s M3 launch opportunity (2022/24). Providing accurate key planet parameters (radius, mass, density and age) in statistical numbers, it addresses fundamental questions such as: How do planetary systems form and evolve? Are there other systems with planets like ours, including potentially habitable planets? The PLATO 2.0 instrument consists of 34 small aperture telescopes (32 with 25 s readout cadence and 2 with 2.5 s candence) providing a wide field-of-view (2232 deg 2) and a large photometric magnitude range (4–16 mag). It focusses on bright (4–11 mag) stars in wide fields to detect and characterize planets down to Earth-size by photometric transits, whose masses can then be determined by ground-based radial-velocity follow-up measurements. Asteroseismology will be performed for these bright stars to obtain highly accurate stellar parameters, including masses and ages. The combination of bright targets and asteroseismology results in high accuracy for the bulk planet parameters: 2 %, 4–10 % and 10 % for planet radii, masses and ages, respectively. The planned baseline observing strategy includes two long pointings (2–3 years) to detect and bulk characterize planets reaching into the habitable zone (HZ) of solar-like stars and an additional step-and-stare phase to cover in total about 50 % of the sky. PLATO 2.0 will observe up to 1,000,000 stars and detect and characterize hundreds of small planets, and thousands of planets in the Neptune to gas giant regime out to the HZ. It will therefore provide the first large-scale catalogue of bulk characterized planets with accurate radii, masses, mean densities and ages. This catalogue will include terrestrial planets at intermediate orbital distances, where surface temperatures are moderate. Coverage of this parameter range with statistical numbers of bulk characterized planets is unique to PLATO 2.0. The PLATO 2.0 catalogue allows us to e.g.: - complete our knowledge of planet diversity for low-mass objects, - correlate the planet mean density-orbital distance distribution with predictions from planet formation theories,- constrain the influence of planet migration and scattering on the architecture of multiple systems, and - specify how planet and system parameters change with host star characteristics, such as type, metallicity and age. The catalogue will allow us to study planets and planetary systems at different evolutionary phases. It will further provide a census for small, low-mass planets. This will serve to identify objects which retained their primordial hydrogen atmosphere and in general the typical characteristics of planets in such low-mass, low-density range. Planets detected by PLATO 2.0 will orbit bright stars and many of them will be targets for future atmosphere spectroscopy exploring their atmosphere. Furthermore, the mission has the potential to detect exomoons, planetary rings, binary and Trojan planets. The planetary science possible with PLATO 2.0 is complemented by its impact on stellar and galactic science via asteroseismology as well as light curves of all kinds of variable stars, together with observations of stellar clusters of different ages. This will allow us to improve stellar models and study stellar activity. A large number of well-known ages from red giant stars will probe the structure and evolution of our Galaxy. Asteroseismic ages of bright stars for different phases of stellar evolution allow calibrating stellar age-rotation relationships. Together with the results of ESA’s Gaia mission, the results of PLATO 2.0 will provide a huge legacy to planetary, stellar and galactic science.
Resumo:
The importance of race as a factor in mental health status has been a topic of controversy. This study reviews the history of research in this area and examines racial variances in the relationship between selected socio-demographic variables and general well-being. The study also examines the appropriateness of an additive versus an interactive statistical model for this investigation.^ The sample consists of 6,913 persons who completed the General Well-Being Schedules as administered in the detailed component of the first National Health and Nutrition Examination Survey (NHANES I) conducted by the National Center for Health Statistics between April, 1971 and October, 1975. The sampling design is a multistage, probability sample of clusters of persons in area based segments. Of the 6,913 persons, 873 are Black.^ Unlike other recent community based mental health studies, this study revealed significant differences between the general well-being of Blacks and Whites. Blacks continued to exhibit significantly lower levels of well-being even after adjustments were made for income, education, marital status, sex, age and place of residence. Statistical interaction was found between race and sex with Black females reporting lower levels of well-being than either Black or White males or their White female counterparts.^ The study includes a detailed review of the NHANES I sample design. It is shown that selected aspects of the design make it difficult to render appropriate national comparisons of Black-White differences. As a result conclusions pertaining to these differences based on NHANES I may be of questionable validity. ^
Resumo:
The purpose of this report is to use information provided by a questionnaire survey to analyze the factors and processes underlying the formation of industrial clusters in Japan. The study, based on questionnaire surveys, forms part of an "Industrial Cluster Project". The Japanese government has implemented policies for industrial clusters so as to enable Japanese industries to maintain competitive power in global markets, and to aid the self-sufficient expansion of local industries. The government's project goes under the heading "Industry Agglomeration for the Recovery of Local Industries with respect to so-called "Industry Clusters." The authors aim to identify what expectations are held of government by the enterprises that make up industrial clusters. As part of our investigation, we used the results of a survey conducted by UNDP in 2004. Tsuji's study, published by the Osaka School of International Public Policy, surveyed 1198 small or medium sized manufacturing companies located in O ward, Tokyo and Higashi Osaka city, Osaka prefecture. The outcome of the present study, together with data from Tsuji's work on IT usage by SMEs in Japan, is meant to form the basis for policy design and implementation.
Resumo:
La última década ha sido testigo de importantes avances en el campo de la tecnología de reconocimiento de voz. Los sistemas comerciales existentes actualmente poseen la capacidad de reconocer habla continua de múltiples locutores, consiguiendo valores aceptables de error, y sin la necesidad de realizar procedimientos explícitos de adaptación. A pesar del buen momento que vive esta tecnología, el reconocimiento de voz dista de ser un problema resuelto. La mayoría de estos sistemas de reconocimiento se ajustan a dominios particulares y su eficacia depende de manera significativa, entre otros muchos aspectos, de la similitud que exista entre el modelo de lenguaje utilizado y la tarea específica para la cual se está empleando. Esta dependencia cobra aún más importancia en aquellos escenarios en los cuales las propiedades estadísticas del lenguaje varían a lo largo del tiempo, como por ejemplo, en dominios de aplicación que involucren habla espontánea y múltiples temáticas. En los últimos años se ha evidenciado un constante esfuerzo por mejorar los sistemas de reconocimiento para tales dominios. Esto se ha hecho, entre otros muchos enfoques, a través de técnicas automáticas de adaptación. Estas técnicas son aplicadas a sistemas ya existentes, dado que exportar el sistema a una nueva tarea o dominio puede requerir tiempo a la vez que resultar costoso. Las técnicas de adaptación requieren fuentes adicionales de información, y en este sentido, el lenguaje hablado puede aportar algunas de ellas. El habla no sólo transmite un mensaje, también transmite información acerca del contexto en el cual se desarrolla la comunicación hablada (e.g. acerca del tema sobre el cual se está hablando). Por tanto, cuando nos comunicamos a través del habla, es posible identificar los elementos del lenguaje que caracterizan el contexto, y al mismo tiempo, rastrear los cambios que ocurren en estos elementos a lo largo del tiempo. Esta información podría ser capturada y aprovechada por medio de técnicas de recuperación de información (information retrieval) y de aprendizaje de máquina (machine learning). Esto podría permitirnos, dentro del desarrollo de mejores sistemas automáticos de reconocimiento de voz, mejorar la adaptación de modelos del lenguaje a las condiciones del contexto, y por tanto, robustecer al sistema de reconocimiento en dominios con condiciones variables (tales como variaciones potenciales en el vocabulario, el estilo y la temática). En este sentido, la principal contribución de esta Tesis es la propuesta y evaluación de un marco de contextualización motivado por el análisis temático y basado en la adaptación dinámica y no supervisada de modelos de lenguaje para el robustecimiento de un sistema automático de reconocimiento de voz. Esta adaptación toma como base distintos enfoque de los sistemas mencionados (de recuperación de información y aprendizaje de máquina) mediante los cuales buscamos identificar las temáticas sobre las cuales se está hablando en una grabación de audio. Dicha identificación, por lo tanto, permite realizar una adaptación del modelo de lenguaje de acuerdo a las condiciones del contexto. El marco de contextualización propuesto se puede dividir en dos sistemas principales: un sistema de identificación de temática y un sistema de adaptación dinámica de modelos de lenguaje. Esta Tesis puede describirse en detalle desde la perspectiva de las contribuciones particulares realizadas en cada uno de los campos que componen el marco propuesto: _ En lo referente al sistema de identificación de temática, nos hemos enfocado en aportar mejoras a las técnicas de pre-procesamiento de documentos, asimismo en contribuir a la definición de criterios más robustos para la selección de index-terms. – La eficiencia de los sistemas basados tanto en técnicas de recuperación de información como en técnicas de aprendizaje de máquina, y específicamente de aquellos sistemas que particularizan en la tarea de identificación de temática, depende, en gran medida, de los mecanismos de preprocesamiento que se aplican a los documentos. Entre las múltiples operaciones que hacen parte de un esquema de preprocesamiento, la selección adecuada de los términos de indexado (index-terms) es crucial para establecer relaciones semánticas y conceptuales entre los términos y los documentos. Este proceso también puede verse afectado, o bien por una mala elección de stopwords, o bien por la falta de precisión en la definición de reglas de lematización. En este sentido, en este trabajo comparamos y evaluamos diferentes criterios para el preprocesamiento de los documentos, así como también distintas estrategias para la selección de los index-terms. Esto nos permite no sólo reducir el tamaño de la estructura de indexación, sino también mejorar el proceso de identificación de temática. – Uno de los aspectos más importantes en cuanto al rendimiento de los sistemas de identificación de temática es la asignación de diferentes pesos a los términos de acuerdo a su contribución al contenido del documento. En este trabajo evaluamos y proponemos enfoques alternativos a los esquemas tradicionales de ponderado de términos (tales como tf-idf ) que nos permitan mejorar la especificidad de los términos, así como también discriminar mejor las temáticas de los documentos. _ Respecto a la adaptación dinámica de modelos de lenguaje, hemos dividimos el proceso de contextualización en varios pasos. – Para la generación de modelos de lenguaje basados en temática, proponemos dos tipos de enfoques: un enfoque supervisado y un enfoque no supervisado. En el primero de ellos nos basamos en las etiquetas de temática que originalmente acompañan a los documentos del corpus que empleamos. A partir de estas, agrupamos los documentos que forman parte de la misma temática y generamos modelos de lenguaje a partir de dichos grupos. Sin embargo, uno de los objetivos que se persigue en esta Tesis es evaluar si el uso de estas etiquetas para la generación de modelos es óptimo en términos del rendimiento del reconocedor. Por esta razón, nosotros proponemos un segundo enfoque, un enfoque no supervisado, en el cual el objetivo es agrupar, automáticamente, los documentos en clusters temáticos, basándonos en la similaridad semántica existente entre los documentos. Por medio de enfoques de agrupamiento conseguimos mejorar la cohesión conceptual y semántica en cada uno de los clusters, lo que a su vez nos permitió refinar los modelos de lenguaje basados en temática y mejorar el rendimiento del sistema de reconocimiento. – Desarrollamos diversas estrategias para generar un modelo de lenguaje dependiente del contexto. Nuestro objetivo es que este modelo refleje el contexto semántico del habla, i.e. las temáticas más relevantes que se están discutiendo. Este modelo es generado por medio de la interpolación lineal entre aquellos modelos de lenguaje basados en temática que estén relacionados con las temáticas más relevantes. La estimación de los pesos de interpolación está basada principalmente en el resultado del proceso de identificación de temática. – Finalmente, proponemos una metodología para la adaptación dinámica de un modelo de lenguaje general. El proceso de adaptación tiene en cuenta no sólo al modelo dependiente del contexto sino también a la información entregada por el proceso de identificación de temática. El esquema usado para la adaptación es una interpolación lineal entre el modelo general y el modelo dependiente de contexto. Estudiamos también diferentes enfoques para determinar los pesos de interpolación entre ambos modelos. Una vez definida la base teórica de nuestro marco de contextualización, proponemos su aplicación dentro de un sistema automático de reconocimiento de voz. Para esto, nos enfocamos en dos aspectos: la contextualización de los modelos de lenguaje empleados por el sistema y la incorporación de información semántica en el proceso de adaptación basado en temática. En esta Tesis proponemos un marco experimental basado en una arquitectura de reconocimiento en ‘dos etapas’. En la primera etapa, empleamos sistemas basados en técnicas de recuperación de información y aprendizaje de máquina para identificar las temáticas sobre las cuales se habla en una transcripción de un segmento de audio. Esta transcripción es generada por el sistema de reconocimiento empleando un modelo de lenguaje general. De acuerdo con la relevancia de las temáticas que han sido identificadas, se lleva a cabo la adaptación dinámica del modelo de lenguaje. En la segunda etapa de la arquitectura de reconocimiento, usamos este modelo adaptado para realizar de nuevo el reconocimiento del segmento de audio. Para determinar los beneficios del marco de trabajo propuesto, llevamos a cabo la evaluación de cada uno de los sistemas principales previamente mencionados. Esta evaluación es realizada sobre discursos en el dominio de la política usando la base de datos EPPS (European Parliamentary Plenary Sessions - Sesiones Plenarias del Parlamento Europeo) del proyecto europeo TC-STAR. Analizamos distintas métricas acerca del rendimiento de los sistemas y evaluamos las mejoras propuestas con respecto a los sistemas de referencia. ABSTRACT The last decade has witnessed major advances in speech recognition technology. Today’s commercial systems are able to recognize continuous speech from numerous speakers, with acceptable levels of error and without the need for an explicit adaptation procedure. Despite this progress, speech recognition is far from being a solved problem. Most of these systems are adjusted to a particular domain and their efficacy depends significantly, among many other aspects, on the similarity between the language model used and the task that is being addressed. This dependence is even more important in scenarios where the statistical properties of the language fluctuates throughout the time, for example, in application domains involving spontaneous and multitopic speech. Over the last years there has been an increasing effort in enhancing the speech recognition systems for such domains. This has been done, among other approaches, by means of techniques of automatic adaptation. These techniques are applied to the existing systems, specially since exporting the system to a new task or domain may be both time-consuming and expensive. Adaptation techniques require additional sources of information, and the spoken language could provide some of them. It must be considered that speech not only conveys a message, it also provides information on the context in which the spoken communication takes place (e.g. on the subject on which it is being talked about). Therefore, when we communicate through speech, it could be feasible to identify the elements of the language that characterize the context, and at the same time, to track the changes that occur in those elements over time. This information can be extracted and exploited through techniques of information retrieval and machine learning. This allows us, within the development of more robust speech recognition systems, to enhance the adaptation of language models to the conditions of the context, thus strengthening the recognition system for domains under changing conditions (such as potential variations in vocabulary, style and topic). In this sense, the main contribution of this Thesis is the proposal and evaluation of a framework of topic-motivated contextualization based on the dynamic and non-supervised adaptation of language models for the enhancement of an automatic speech recognition system. This adaptation is based on an combined approach (from the perspective of both information retrieval and machine learning fields) whereby we identify the topics that are being discussed in an audio recording. The topic identification, therefore, enables the system to perform an adaptation of the language model according to the contextual conditions. The proposed framework can be divided in two major systems: a topic identification system and a dynamic language model adaptation system. This Thesis can be outlined from the perspective of the particular contributions made in each of the fields that composes the proposed framework: _ Regarding the topic identification system, we have focused on the enhancement of the document preprocessing techniques in addition to contributing in the definition of more robust criteria for the selection of index-terms. – Within both information retrieval and machine learning based approaches, the efficiency of topic identification systems, depends, to a large extent, on the mechanisms of preprocessing applied to the documents. Among the many operations that encloses the preprocessing procedures, an adequate selection of index-terms is critical to establish conceptual and semantic relationships between terms and documents. This process might also be weakened by a poor choice of stopwords or lack of precision in defining stemming rules. In this regard we compare and evaluate different criteria for preprocessing the documents, as well as for improving the selection of the index-terms. This allows us to not only reduce the size of the indexing structure but also to strengthen the topic identification process. – One of the most crucial aspects, in relation to the performance of topic identification systems, is to assign different weights to different terms depending on their contribution to the content of the document. In this sense we evaluate and propose alternative approaches to traditional weighting schemes (such as tf-idf ) that allow us to improve the specificity of terms, and to better identify the topics that are related to documents. _ Regarding the dynamic language model adaptation, we divide the contextualization process into different steps. – We propose supervised and unsupervised approaches for the generation of topic-based language models. The first of them is intended to generate topic-based language models by grouping the documents, in the training set, according to the original topic labels of the corpus. Nevertheless, a goal of this Thesis is to evaluate whether or not the use of these labels to generate language models is optimal in terms of recognition accuracy. For this reason, we propose a second approach, an unsupervised one, in which the objective is to group the data in the training set into automatic topic clusters based on the semantic similarity between the documents. By means of clustering approaches we expect to obtain a more cohesive association of the documents that are related by similar concepts, thus improving the coverage of the topic-based language models and enhancing the performance of the recognition system. – We develop various strategies in order to create a context-dependent language model. Our aim is that this model reflects the semantic context of the current utterance, i.e. the most relevant topics that are being discussed. This model is generated by means of a linear interpolation between the topic-based language models related to the most relevant topics. The estimation of the interpolation weights is based mainly on the outcome of the topic identification process. – Finally, we propose a methodology for the dynamic adaptation of a background language model. The adaptation process takes into account the context-dependent model as well as the information provided by the topic identification process. The scheme used for the adaptation is a linear interpolation between the background model and the context-dependent one. We also study different approaches to determine the interpolation weights used in this adaptation scheme. Once we defined the basis of our topic-motivated contextualization framework, we propose its application into an automatic speech recognition system. We focus on two aspects: the contextualization of the language models used by the system, and the incorporation of semantic-related information into a topic-based adaptation process. To achieve this, we propose an experimental framework based in ‘a two stages’ recognition architecture. In the first stage of the architecture, Information Retrieval and Machine Learning techniques are used to identify the topics in a transcription of an audio segment. This transcription is generated by the recognition system using a background language model. According to the confidence on the topics that have been identified, the dynamic language model adaptation is carried out. In the second stage of the recognition architecture, an adapted language model is used to re-decode the utterance. To test the benefits of the proposed framework, we carry out the evaluation of each of the major systems aforementioned. The evaluation is conducted on speeches of political domain using the EPPS (European Parliamentary Plenary Sessions) database from the European TC-STAR project. We analyse several performance metrics that allow us to compare the improvements of the proposed systems against the baseline ones.
Resumo:
Observers have found a small number of lithium-depleted halo stars in the temperature range of the Spite plateau. The current status of the mass-loss hypothesis for producing the observed lithium dip in Population (Pop) I stars is briefly discussed and extended to Pop II stars as a possible explanation for these halo objects. Based on detections of F-type main-sequence variables, mass loss is assumed to occur in a narrow temperature region corresponding to this “instability strip.” As Pop II main-sequence stars evolve to the blue, they enter this narrow temperature region, then move back through the lower temperature area of the Spite plateau. If 0.05 M⊙ (solar mass) or more have been lost, they will show lithium depletion. This hypothesis affects the lithium-to- beryllium abundance, the ratio of high- to low-lithium stars, and the luminosity function. Constraints on the mass-loss hypothesis due to these effects are discussed. Finally, mass loss in this temperature range would operate in stars near the turnoff of metal-poor globular clusters, resulting in apparent ages 2 to 3 Gyr (gigayears) older than they actually are.
Resumo:
It is now straightforward to assemble large samples of very high redshift (z ∼ 3) field galaxies selected by their pronounced spectral discontinuity at the rest frame Lyman limit of hydrogen (at 912 Å). This makes possible both statistical analyses of the properties of the galaxies and the first direct glimpse of the progression of the growth of their large-scale distribution at such an early epoch. Here I present a summary of the progress made in these areas to date and some preliminary results of and future plans for a targeted redshift survey at z = 2.7–3.4. Also discussed is how the same discovery method may be used to obtain a “census” of star formation in the high redshift Universe, and the current implications for the history of galaxy formation as a function of cosmic epoch.