31 results for estimation des provisions
in Helda - Digital Repository of University of Helsinki
Abstract:
The work integrates research in the language and terminology of various fields with lexicography, etymology, semantics, word formation, and pragmatics. Additionally, the examination of German and Finnish gives the work the perspective of contrastive linguistics and of the translation of texts in specialized fields. The work is an attempt to chart the language, vocabulary, different textual types, and essential communication-related features of this special field. The study is primarily concerned with internal communication within the field of ecology, but it also provides a comparison of the public discussion of environmental issues in Germany and Finland. The work attempts to use textual signs to provide a picture of the literary communication used on the different vertical levels in the central text types within the field. The dictionaries in the fields of environmental issues and ecology for the individual text types are examined primarily from the perspective of their quantity and diversity. One central point of the work is to identify and collect all of the dictionaries in the field compiled thus far in which German and/or Finnish are included. Ecology and environmental protection are closely linked not only to each other but also to many other scientific fields. Consequently, the language of the environmental field has acquired an abundance of influences and vocabulary from the language of the special fields close to it as well as from that of politics and various areas of public administration. The work also demonstrates how the popularization of environmental terminology often leads to semantic distortion. Traditionally, scientific texts have used the fewest expressions whose purpose is to appeal to the text recipient or to influence the recipient's behavior.
Particularly in Germany, those who support or oppose measures to protect the environment have long been making concerted efforts to represent their own views in the language that they use. When controversial issues are discussed, competing designations for the same referent or concept are used in accordance with the interest group to which the speaker belongs. One of the objectives of the study is to sensitize recipients of texts to the euphemistic expressions that occur in German and Finnish texts dealing with issues that are sensitive from the standpoint of environmental policy. One particular feature of the field is the large number of variants designating the same entry or concept. The terminological doublets formed by words of foreign origin and their German or Finnish language equivalents are quite typical of the field. Methods of corpus linguistics are used to determine the reasons for the large number of variant designations as well as their functionality.
Abstract:
Valency Realization in Short Excerpts of News Text. A pragmatically grounded analysis. This dissertation is a study of so-called pragmatic valency. The aim of the study is to examine the phenomenon both theoretically, by discussing the research literature, and empirically, based on evidence from a text corpus consisting of 218 short excerpts of news text from the German newspaper Frankfurter Allgemeine Zeitung. In the theoretical part of the study, the central concepts of valency and pragmatic valency are discussed. In the research literature, valency denotes the relation between the verb and its obligatory and optional complements. Pragmatic valency can be defined as modification of the so-called system valency in the parole, including non-realization of an obligatory complement, non-realization of an optional complement and realization of an optional complement. Furthermore, the investigation of pragmatic valency includes the role of adjuncts, elements that are not defined by the valency, in the concrete valency realization. The corpus study investigates the valency behaviour of German verbs in a corpus of about 1500 sentences, combining the methodology and concepts of valency theory, semantics and text linguistics. The analysis focuses on the roughly 600 sentences that show deviations from the system valency, providing over 800 examples of the modification of the system valency as codified in the (valency) dictionaries. The study attempts to answer the following primary question: Why is the system valency modified in the parole? To answer the question, the concept of modification types is introduced. The modification types are recognized using distinctive feature bundles in which each feature, with a negative or a positive value, refers to one reason for the modification treated in the research literature.
For example, the features of irrelevance and relevance, focus, world and text type knowledge, text theme, theme-rheme structure and cohesive chains are applied. The valency approach appears in a new light when explored through corpus-based investigation; both the optionality of complements and the distinction between complements and adjuncts as defined in the present valency approach seem in some respects defective. Furthermore, the analysis indicates that the adjuncts outside the valency domain play a central role in the concrete realization of the valency. Finally, the study suggests a definition of pragmatic valency, based on the modification types introduced in the study and tested in the corpus analysis.
Abstract:
"Contextualization of radio discourse by prosodic means. The example of five great French philosophers of the 20th century" The dissertation deals with the contextualization of speech by prosodic means. In other words, it examines how the prosodic features of speech (such as pitch contour, intensity, pauses, duration and rhythm) guide the interpretation of speech alongside word and sentence meanings, which have traditionally received more study. The work focuses on seven prosodically marked patterns that consist of conspicuous changes in one or several parameters. The phenomena are examined from the standpoint of their acoustic forms as well as their typical contexts of occurrence and discursive functions. The data consist of radio programmes in which five great French philosophers of the 20th century speak: Gaston Bachelard, Albert Camus, Michel Foucault, Maurice Merleau-Ponty and Jean-Paul Sartre. The programmes were broadcast on various radio channels in France between 1948 and 1973. The results of the dissertation show that prosodically marked patterns are multidimensional speech phenomena that play a central role in contextualizing what is said: they can, for example, raise or lower the information value of what is said, express the speaker's strong or weak commitment to what he or she says, or signal the continuation or completion of a structural unit. The dissertation also contains contrastive sections in which the phenomena are compared with a melodic pattern occurring in classical piano music and with a prosodic phenomenon of Finnish. The results suggest that a certain kind of melodic pattern is used as a similar structuring device in both speech and classical music. In addition, the results indicate that certain melodic forms are used to create similar implications in two languages as different as Finnish and French. One part of the dissertation deals with the prosodic marking of the full stop and the comma in speech.
According to the results, the full stop and the comma each have their own oral prototype: the full stop is typically marked by a fall in pitch and a pause, and the comma by a rise in pitch and a pause. The most significant results, however, concern cases in which a punctuation mark is rendered prosodically in an atypical way: both the full stop and the comma appear to have several different oral counterparts, and the functions of punctuation marks can turn out very different depending on their prosodic rendering.
Abstract:
This master's thesis examines and compares reviews of translated literature in France and Finland. The empirical data consist of all reviews published by Helsingin Sanomat and Le Monde in 2003. The two newspapers reviewed a total of 2691 books, of which 845 were translations. The main aim has been to determine, by studying the reviews in these leading newspapers, in which country the translator and the translation are more visible and what the published translation reviews are like. A further aim has been to study which of the newspapers is more open to foreign-language books and translations. The groundwork for a closer study of the book reviews is laid by examining factors related to the visibility of the translator and the translation. This draws on the study by Koskinen (2000). The thesis also considers what a good translation is according to various translators and researchers, as well as other studies on the same topic and the prevailing discussion of it. In addition, it examines why high-quality translation reviews are rare. The quantitative part of the analysis looks at the share of translations both in the data and in the total publishing output of France and Finland. The translation reviews in the data are classified by content. The model developed by Gullin (1998) has been applied here where applicable. The content analysis of the translation reviews pays attention to their comments on the translation and the translator. The discussion of the quality of the reviews and of the evaluation criteria used by critics relies on earlier theoretical and empirical studies on the subject. The thesis also contains a separate survey of reviews of translated children's books and of articles, other than reviews, concerning translators and translation. Throughout the study the approach is a comparison of Le Monde and Helsingin Sanomat. The study shows that Le Monde publishes considerably more book reviews.
Both newspapers nevertheless contain proportionally the same amount of reviews of translated books. Helsingin Sanomat has more reviews of foreign-language works, and translations and translators are considerably more visible in its reviews. Most of Le Monde's translation reviews contain only the translator's name in the bibliographic details. In Helsingin Sanomat, fewer than half of the translated books are reviewed in this way. Reviews that do not mention the translator's name at all are also more common in the French newspaper. A relatively small share of the translation reviews assess the quality of the translation. These assessments typically lack justification and analysis, and they are often short. They are mostly positive or neutral in tone. It is also very common for the critic to conflate two different things: the translator's style and the author's style. The most common evaluation criterion used by critics is the relationship between the translation and the target language or between the translation and the source text. Often the evaluation criterion remains entirely unclear. In Helsingin Sanomat's reviews, translators are also referred to considerably more often in the review text itself. The newspaper also gives translators prominence in articles other than its book reviews. By contrast, Le Monde has no texts devoted solely to translators.
Abstract:
There is an increasing need to compare the results obtained with different methods of estimating tree biomass in order to reduce the uncertainty in the assessment of forest biomass carbon. In this study, tree biomass was investigated in a 30-year-old Scots pine (Pinus sylvestris) stand (Young-Stand) and a 130-year-old mixed Norway spruce (Picea abies)-Scots pine stand (Mature-Stand) located in southern Finland (61°50' N, 24°22' E). In particular, the results of different estimation methods were compared to assess the reliability and suitability of their applications. For the trees in Mature-Stand, annual stem biomass increment fluctuated following a sigmoid equation, and the fitted curves reached a maximum level (from about 1 kg/yr for understorey spruce to 7 kg/yr for dominant pine) when the trees were 100 years old. Tree biomass was estimated to be about 70 Mg/ha in Young-Stand and about 220 Mg/ha in Mature-Stand. In the region (58.00-62.13° N, 14-34° E, ≤ 300 m a.s.l.) surrounding the study stands, the tree biomass accumulation in Norway spruce and Scots pine stands followed a sigmoid equation with stand age, with a maximum of 230 Mg/ha at the age of 140 years. In Mature-Stand, lichen biomass on the trees was 1.63 Mg/ha, with more than half of the biomass occurring on dead branches, and the standing crop of litter lichen on the ground was about 0.09 Mg/ha. There were substantial differences among the results estimated by different methods in the stands. These results imply that a possible estimation error should be taken into account when calculating tree biomass in a stand with an indirect approach.
Abstract:
This thesis examines the feasibility of a forest inventory method based on two-phase sampling in estimating forest attributes at the stand or substand levels for forest management purposes. The method is based on multi-source forest inventory combining auxiliary data consisting of remote sensing imagery or other geographic information and field measurements. Auxiliary data are utilized as first-phase data for covering all inventory units. Various methods were examined for improving the accuracy of the forest estimates. Pre-processing of auxiliary data in the form of correcting the spectral properties of aerial imagery was examined (I), as was the selection of aerial image features for estimating forest attributes (II). Various spatial units were compared for extracting image features in a remote sensing aided forest inventory utilizing very high resolution imagery (III). A number of data sources were combined and different weighting procedures were tested in estimating forest attributes (IV, V). Correction of the spectral properties of aerial images proved to be a straightforward and advantageous method for improving the correlation between the image features and the measured forest attributes. Testing different image features that can be extracted from aerial photographs (and other very high resolution images) showed that the images contain a wealth of relevant information that can be extracted only by utilizing the spatial organization of the image pixel values. Furthermore, careful selection of image features for the inventory task generally gives better results than inputting all extractable features to the estimation procedure. When the spatial units for extracting very high resolution image features were examined, an approach based on image segmentation generally showed advantages compared with a traditional sample plot-based approach. Combining several data sources resulted in more accurate estimates than any of the individual data sources alone. 
The best combined estimate can be derived by weighting the estimates produced by the individual data sources by the inverse values of their mean square errors. Despite the fact that the plot-level estimation accuracy in two-phase sampling inventory can be improved in many ways, the accuracy of forest estimates based mainly on single-view satellite and aerial imagery is a relatively poor basis for making stand-level management decisions.
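The inverse-MSE weighting described in this abstract can be sketched in a few lines of Python. This is only an illustration of the general idea, not the procedure used in the thesis: the function name `combine_inverse_mse` and the example numbers are hypothetical, and the sketch assumes each data source supplies one estimate together with a known, strictly positive mean square error.

```python
from typing import Sequence


def combine_inverse_mse(estimates: Sequence[float], mses: Sequence[float]) -> float:
    """Combine estimates with weights proportional to 1 / MSE (hypothetical helper)."""
    if len(estimates) != len(mses) or len(estimates) == 0:
        raise ValueError("need one MSE per estimate and at least one estimate")
    if any(m <= 0 for m in mses):
        raise ValueError("mean square errors must be strictly positive")
    # A source with a smaller error gets a proportionally larger weight.
    weights = [1.0 / m for m in mses]
    total_weight = sum(weights)
    # Normalize so that the weights sum to one.
    return sum(w * e for w, e in zip(weights, estimates)) / total_weight


# Made-up example: two sources estimating stand volume (m3/ha).
# The first source has a quarter of the MSE of the second, so it dominates.
combined = combine_inverse_mse([200.0, 240.0], [100.0, 400.0])
print(combined)
```

With these made-up inputs the normalized weights are 0.8 and 0.2, so the combined value lies much closer to the more accurate source.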
Abstract:
Remote sensing provides methods to infer land cover information over large geographical areas at a variety of spatial and temporal resolutions. Land cover is input data for a range of environmental models, and information on land cover dynamics is required for monitoring the implications of global change. Such data are also essential in support of environmental management and policymaking. Boreal forests are a key component of the global climate and a major sink of carbon. The northern latitudes are expected to experience a disproportionate and rapid warming, which can have a major impact on vegetation at forest limits. This thesis examines the use of optical remote sensing for estimating aboveground biomass, leaf area index (LAI), tree cover and tree height in the boreal forests and the tundra-taiga transition zone in Finland. Continuous fields of forest attributes are required, for example, to improve the mapping of forest extent. The thesis focuses on studying the feasibility of satellite data at multiple spatial resolutions, assessing the potential of multispectral, -angular and -temporal information, and providing a regional evaluation of global land cover data. Preprocessed ASTER, MISR and MODIS products are the principal satellite data. The reference data consist of field measurements, forest inventory data and fine resolution land cover maps. Fine resolution studies demonstrate that statistical relationships between biomass and satellite data are relatively strong in single-species, low-biomass mountain birch forests in comparison to higher-biomass coniferous stands. The combination of forest stand data and fine resolution ASTER images provides a method for biomass estimation using medium resolution MODIS data. The multiangular data improve the accuracy of land cover mapping in the sparsely forested tundra-taiga transition zone, particularly in mires.
Similarly, multitemporal data improve the accuracy of coarse resolution tree cover estimates in comparison to single date data. Furthermore, the peak of the growing season is not necessarily the optimal time for land cover mapping in the northern boreal regions. The evaluated coarse resolution land cover data sets have considerable shortcomings in northernmost Finland and should be used with caution in similar regions. The quantitative reference data and upscaling methods for integrating multiresolution data are required for calibration of statistical models and evaluation of land cover data sets. The preprocessed image products have potential for wider use as they can considerably reduce the time and effort used for data processing.
Abstract:
The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization of statistical modeling. The most important notion of MDL is the stochastic complexity, which can be interpreted as the shortest description length of a given sample of data relative to a model class. The exact definition of the stochastic complexity has gone through several evolutionary steps. The latest instantiation is based on the so-called Normalized Maximum Likelihood (NML) distribution, which has been shown to possess several important theoretical properties. However, applications of this modern version of MDL have been quite rare because of computational complexity problems: for discrete data, the definition of NML involves an exponential sum, and in the case of continuous data, a multi-dimensional integral that is usually infeasible to evaluate or even approximate accurately. In this doctoral dissertation, we present mathematical techniques for computing NML efficiently for some model families involving discrete data. We also show how these techniques can be used to apply MDL in two practical applications: histogram density estimation and clustering of multi-dimensional data.
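To make the NML normalizing sum concrete, the following sketch evaluates it for the simplest discrete case, the Bernoulli model class. This is an illustrative assumption, not one of the techniques developed in the dissertation: for binary sequences the sum over all 2^n sequences can be grouped by the number of ones, which leaves only n + 1 terms. The function names are hypothetical.

```python
import math


def bernoulli_max_likelihood(k: int, n: int) -> float:
    """Maximized likelihood of one binary sequence of length n containing k ones."""
    if n < 1 or not 0 <= k <= n:
        raise ValueError("need n >= 1 and 0 <= k <= n")
    p = k / n  # maximum-likelihood estimate of the success probability
    # Python evaluates 0.0 ** 0 as 1.0, matching the usual 0^0 = 1 convention.
    return (p ** k) * ((1.0 - p) ** (n - k))


def bernoulli_nml_normalizer(n: int) -> float:
    """Sum of maximized likelihoods over all 2^n sequences, grouped by the count k."""
    return sum(math.comb(n, k) * bernoulli_max_likelihood(k, n) for k in range(n + 1))


def stochastic_complexity_bits(k: int, n: int) -> float:
    """NML code length in bits for a sequence of length n containing k ones."""
    return -math.log2(bernoulli_max_likelihood(k, n)) + math.log2(bernoulli_nml_normalizer(n))


print(bernoulli_nml_normalizer(2))  # 1 + 2 * 0.25 + 1 = 2.5
print(stochastic_complexity_bits(1, 2))
```

The grouping trick works here because the maximized likelihood depends on the data only through the count of ones. For richer model families the sum does not collapse so easily, which is the computational difficulty the abstract refers to.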
Abstract:
This study examines the properties of Generalised Regression (GREG) estimators for domain class frequencies and proportions. The family of GREG estimators forms the class of design-based model-assisted estimators. All GREG estimators utilise auxiliary information via modelling. The classic GREG estimator with a linear fixed-effects assisting model (GREG-lin) is one example. But when estimating class frequencies, the study variable is binary or polytomous. Therefore logistic-type assisting models (e.g. a logistic or probit model) should be preferred over the linear one. However, GREG estimators other than GREG-lin are rarely used, and knowledge about their properties is limited. This study examines the properties of L-GREG estimators, which are GREG estimators with fixed-effects logistic-type models. Three research questions are addressed. First, I study whether and when L-GREG estimators are more accurate than GREG-lin. Theoretical results and Monte Carlo experiments, which cover both equal and unequal probability sampling designs and a wide variety of model formulations, show that in standard situations the difference between L-GREG and GREG-lin is small. But in the case of a strong assisting model, two interesting situations arise: if the domain sample size is reasonably large, L-GREG is more accurate than GREG-lin, and if the domain sample size is very small, estimation of the assisting model parameters may be inaccurate, resulting in bias for L-GREG. Second, I study variance estimation for the L-GREG estimators. The standard variance estimator (S) for all GREG estimators resembles the Sen-Yates-Grundy variance estimator, but it is a double sum of prediction errors, not of the observed values of the study variable. Monte Carlo experiments show that S underestimates the variance of L-GREG, especially if the domain sample size is small or if the assisting model is strong.
Third, since the standard variance estimator S often fails for the L-GREG estimators, I propose a new augmented variance estimator (A). The difference between S and the new estimator A is that the latter takes into account the difference between the sample fit model and the census fit model. In Monte Carlo experiments, the new estimator A outperformed the standard estimator S in terms of bias, root mean square error and coverage rate. Thus the new estimator provides a good alternative to the standard estimator.
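For orientation, the textbook form of a GREG estimator of a domain total is given below. The notation is an assumption made for illustration and is not taken from the abstract: U_d is the population of domain d, s_d the part of the sample falling in that domain, π_k the inclusion probability of unit k, x_k its auxiliary vector, and ŷ_k the prediction from the assisting model. The second case shows only the binary logistic variant of a logistic-type model. The `cases` environment requires the `amsmath` package.

```latex
\hat{t}_{d}
  \;=\; \sum_{k \in U_d} \hat{y}_k
  \;+\; \sum_{k \in s_d} \frac{y_k - \hat{y}_k}{\pi_k},
\qquad
\hat{y}_k =
\begin{cases}
  \mathbf{x}_k^{\top}\hat{\boldsymbol{\beta}}
    & \text{(GREG-lin)} \\[4pt]
  \dfrac{\exp\!\bigl(\mathbf{x}_k^{\top}\hat{\boldsymbol{\beta}}\bigr)}
        {1 + \exp\!\bigl(\mathbf{x}_k^{\top}\hat{\boldsymbol{\beta}}\bigr)}
    & \text{(L-GREG, binary logistic case)}
\end{cases}
```

The first sum adds up model predictions over the whole domain, and the second sum is a design-weighted correction built from the prediction errors in the sample. These are the same prediction errors that enter the double sum in the standard variance estimator S mentioned above.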
Abstract:
The main object of the investigation was the syntactic functions of adjectives. The interest in these functions stems from the different modes of use in which an adjective can occur. Altogether, an adjective can take three modes of use: attributive (e.g. a fast car), predicative (e.g. the car is fast) and adverbial (e.g. the car drives fast). Since an adjective cannot always take every function, some dictionaries (especially learner's dictionaries) give information about any restrictions within the lexical entry. The purpose of the research was to compare the lexical entries of adjectives in four selected monolingual German dictionaries. The syntactic data on adjectives were compared in order to work out the differences and commonalities of the lexical entries with regard to the different modes of use, and to analyse and assess them. The focus, however, was on the differences in the syntactic information. For those differences it had to be determined which entry is grammatically correct, or whether an entry is in fact wrong. This required an empirical analysis of how an adjective is used in context in cases where the dictionaries do not agree. Correct and consistent lexical entries in German dictionaries are very important for supporting learners of German and for ensuring that dictionaries are user-friendly. The investigation showed that in almost half of the cases (over 40%) the syntactic information on adjectives differs between the dictionaries. These differences naturally make it very difficult for non-native speakers to understand the correct usage of an adjective.
Thus the main aim of the doctoral thesis was to establish and demonstrate the clear syntactic usage of a certain number of adjectives.
Abstract:
This paper estimates the extent of income underreporting by the self-employed in Finland using the expenditure-based approach developed by Pissarides & Weber (1989). Household spending data are for the years 1994 to 1996. The results suggest that self-employment income in Finland is underreported by some 27% on average. Since income for the self-employed is about 8% of all incomes in Finland, the size of this part of the black economy in Finland is estimated to be about 2.3% of GDP.