882 resultados para Methods : Data Analysis


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Rapid growth in the global population requires expansion of building stock, which in turn calls for increased energy demand. This demand varies in time and also between different buildings, yet, conventional methods are only able to provide mean energy levels per zone and are unable to capture this inhomogeneity, which is important to conserve energy. An additional challenge is that some of the attempts to conserve energy, through for example lowering of ventilation rates, have been shown to exacerbate another problem, which is unacceptable indoor air quality (IAQ). The rise of sensing technology over the past decade has shown potential to address both these issues simultaneously by providing high–resolution tempo–spatial data to systematically analyse the energy demand and its consumption as well as the impacts of measures taken to control energy consumption on IAQ. However, challenges remain in the development of affordable services for data analysis, deployment of large–scale real–time sensing network and responding through Building Energy Management Systems. This article presents the fundamental drivers behind the rise of sensing technology for the management of energy and IAQ in urban built environments, highlights major challenges for their large–scale deployment and identifies the research gaps that should be closed by future investigations.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In recent years there has been growing interest in selecting suitable wood raw material to increase end product quality and to increase the efficiency of industrial processes. Genetic background and growing conditions are known to affect properties of growing trees, but only a few parameters reflecting wood quality, such as volume and density can be measured on an industrial scale. Therefore research on cellular level structures of trees grown in different conditions is needed to increase understanding of the growth process of trees leading to desired wood properties. In this work the cellular and cell wall structures of wood were studied. Parameters, such as the mean microfibril angle (MFA), the spiral grain angles, the fibre length, the tracheid cell wall thickness and the cross-sectional shape of the tracheid, were determined as a function of distance from the pith towards the bark and mutual dependencies of these parameters were discussed. Samples from fast-grown trees, which belong to a same clone, grown in fertile soil and also from fertilised trees were measured. It was found that in fast-grown trees the mean MFA decreased more gradually from the pith to the bark than in reference stems. In fast-grown samples cells were shorter, more thin-walled and their cross-sections were rounder than in slower-grown reference trees. Increased growth rate was found to cause an increase in spiral grain variation both within and between annual rings. Furthermore, methods for determination of the mean MFA using x-ray diffraction were evaluated. Several experimental arrangements including the synchrotron radiation based microdiffraction were compared. For evaluation of the data analysis procedures a general form for diffraction conditions in terms of angles describing the fibre orientation and the shape of the cell was derived. The effects of these parameters on the obtained microfibril angles were discussed. The use of symmetrical transmission geometry and tangentially cut samples gave the most reliable MFA values.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Protein conformations and dynamics can be studied by nuclear magnetic resonance spectroscopy using dilute liquid crystalline samples. This work clarifies the interpretation of residual dipolar coupling data yielded by the experiments. It was discovered that unfolded proteins without any additional structure beyond that of a mere polypeptide chain exhibit residual dipolar couplings. Also, it was found that molecular dynamics induce fluctuations in the molecular alignment and doing so affect residual dipolar couplings. The finding clarified the origins of low order parameter values observed earlier. The work required the development of new analytical and computational methods for the prediction of intrinsic residual dipolar coupling profiles for unfolded proteins. The presented characteristic chain model is able to reproduce the general trend of experimental residual dipolar couplings for denatured proteins. The details of experimental residual dipolar coupling profiles are beyond the analytical model, but improvements are proposed to achieve greater accuracy. A computational method for rapid prediction of unfolded protein residual dipolar couplings was also developed. Protein dynamics were shown to modulate the effective molecular alignment in a dilute liquid crystalline medium. The effects were investigated from experimental and molecular dynamics generated conformational ensembles of folded proteins. It was noted that dynamics induced alignment is significant especially for the interpretation of molecular dynamics in small, globular proteins. A method of correction was presented. Residual dipolar couplings offer an attractive possibility for the direct observation of protein conformational preferences and dynamics. The presented models and methods of analysis provide significant advances in the interpretation of residual dipolar coupling data from proteins.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This thesis deals with theoretical modeling of the electrodynamics of auroral ionospheres. In the five research articles forming the main part of the thesis we have concentrated on two main themes: Development of new data-analysis techniques and study of inductive phenomena in the ionospheric electrodynamics. The introductory part of the thesis provides a background for these new results and places them in the wider context of ionospheric research. In this thesis we have developed a new tool (called 1D SECS) for analysing ground based magnetic measurements from a 1-dimensional magnetometer chain (usually aligned in the North-South direction) and a new method for obtaining ionospheric electric field from combined ground based magnetic measurements and estimated ionospheric electric conductance. Both these methods are based on earlier work, but contain important new features: 1D SECS respects the spherical geometry of large scale ionospheric electrojet systems and due to an innovative way of implementing boundary conditions the new method for obtaining electric fields can be applied also at local scale studies. These new calculation methods have been tested using both simulated and real data. The tests indicate that the new methods are more reliable than the previous techniques. Inductive phenomena are intimately related to temporal changes in electric currents. As the large scale ionospheric current systems change relatively slowly, in time scales of several minutes or hours, inductive effects are usually assumed to be negligible. However, during the past ten years, it has been realised that induction can play an important part in some ionospheric phenomena. In this thesis we have studied the role of inductive electric fields and currents in ionospheric electrodynamics. We have formulated the induction problem so that only ionospheric electric parameters are used in the calculations. This is in contrast to previous studies, which require knowledge of the magnetospheric-ionosphere coupling. We have applied our technique to several realistic models of typical auroral phenomena. The results indicate that inductive electric fields and currents are locally important during the most dynamical phenomena (like the westward travelling surge, WTS). In these situations induction may locally contribute up to 20-30% of the total ionospheric electric field and currents. Inductive phenomena do also change the field-aligned currents flowing between the ionosphere and magnetosphere, thus modifying the coupling between the two regions.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The first observations of solar X-rays date back to late 1940 s. In order to observe solar X-rays the instruments have to be lifted above the Earth s atmosphere, since all high energy radiation from the space is almost totally attenuated by it. This is a good thing for all living creatures, but bad for X-ray astronomers. Detectors observing X-ray emission from space must be placed on-board satellites, which makes this particular discipline of astronomy technologically and operationally demanding, as well as very expensive. In this thesis, I have focused on detectors dedicated to observing solar X-rays in the energy range 1-20 keV. The purpose of these detectors was to measure solar X-rays simultaneously with another X-ray spectrometer measuring fluorescence X-ray emission from the Moon surface. The X-ray fluorescence emission is induced by the primary solar X-rays. If the elemental abundances on the Moon were to be determined with fluorescence analysis methods, the shape and intensity of the simultaneous solar X-ray spectrum must be known. The aim of this thesis is to describe the characterization and operation of our X-ray instruments on-board two Moon missions, SMART-1 and Chandrayaan-1. Also the independent solar science performance of these two almost similar X-ray spectrometers is described. These detectors have the following two features in common. Firstly, the primary detection element is made of a single crystal silicon diode. Secondly, the field of view is circular and very large. The data obtained from these detectors are spectra with a 16 second time resolution. Before launching an instrument into space, its performance must be characterized by ground calibrations. The basic operation of these detectors and their ground calibrations are described in detail. Two C-flares are analyzed as examples for introducing the spectral fitting process. The first flare analysis shows the fit of a single spectrum of the C1-flare obtained during the peak phase. The other analysis example shows how to derive the time evolution of fluxes, emission measures (EM) and temperatures through the whole single C4 flare with the time resolution of 16 s. The preparatory data analysis procedures are also introduced in detail. These are required in spectral fittings of the data. A new solar monitor design equipped with a concentrator optics and a moderate size of field of view is also introduced.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This article reports on a 6-year study that examined the association between pre-admission variables and field placement performance in an Australian bachelor of social work program (N=463). Very few of the pre-admission variables were found to be significantly associated with performance. These findings and the role of the admissions process are discussed. In addition to the usual academic criteria, the authors urge schools to include a focus on nonacademic criteria during the admissions process and the ongoing educational program.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Hole-doped perovskites such as La1-xCaxMnO3 present special magnetic and magnetotransport properties, and it is commonly accepted that the local atomic structure around Mn ions plays a crucial role in determining these peculiar features. Therefore experimental techniques directly probing the local atomic structure, like x-ray absorption spectroscopy (XAS), have been widely exploited to deeply understand the physics of these compounds. Quantitative XAS analysis usually concerns the extended region [extended x-ray absorption fine structure (EXAFS)] of the absorption spectra. The near-edge region [x-ray absorption near-edge spectroscopy (XANES)] of XAS spectra can provide detailed complementary information on the electronic structure and local atomic topology around the absorber. However, the complexity of the XANES analysis usually prevents a quantitative understanding of the data. This work exploits the recently developed MXAN code to achieve a quantitative structural refinement of the Mn K-edge XANES of LaMnO3 and CaMnO3 compounds; they are the end compounds of the doped manganite series LaxCa1-xMnO3. The results derived from the EXAFS and XANES analyses are in good agreement, demonstrating that a quantitative picture of the local structure can be obtained from XANES in these crystalline compounds. Moreover, the quantitative XANES analysis provides topological information not directly achievable from EXAFS data analysis. This work demonstrates that combining the analysis of extended and near-edge regions of Mn K-edge XAS spectra could provide a complete and accurate description of Mn local atomic environment in these compounds.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The starting point of this study was to find out how the historical consciousness manifest in conceptions and experiences of Chilean refugees and their descendants. The previous research of historical consciousness has shown that powerful experiences such as the revolution and being a refugee may have an effect on historical consciousness. The purpose of this study is to solve how those experiences in the past have influenced Chilean refugees and their descendant s interpretations of the present and expectations for the future. The research material was collected by interviewing four Chilean refugees that escaped to Finland in years 1973 1976 and four young adults who represent the second generation. All second generation interviewees were born in Finland and their other parent or both parents were Chilean refugees. The two groups were not in a family relation to each other. The empirical part of the research was made by qualitative methods. The research material was collected by the method of focused interview and it was analysed by the qualitative data analysis software Atlas.ti 6.0. Content analysis was the main research tool. The previous theory of historical consciousness and the study questions was used to create the seven categories that manifest historical consciousness. The seven categories were biographical memory, collective memory, experiences of living between two cultures, idea of man, the essence of history and the reason for living, value conceptions and expectations of the future. Content analysis was based on those categories. Subcategories were based on the research material and were created during the analysis. The results of this study were made up of categories. The study revealed that experiences of revolution and of being a refugee has a significant role in the historical consciousness of the Chilean refugees. It became evident in their biographical memory being separated in three parts, in their values and in the belief of possibility of an individual to govern her own life. The second generation was also exposed to their parent s experiences in the past. The collective trauma in their parent s past has been part of their life indirectly and has affected the way they think of themselves, their concepts and their place in the present world. The active and regular retrospection in Finland by Chilean adults and special Gabriela Mistral club activities has played a big part in the construction of their historical consciousness.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Bangladesh, often better known to the outside world as a country of natural calamities, is one of the most densely populated countries in the world. Despite rapid urbanization, more than 75% of the people still live in rural areas. The density of the rural population is also one of the highest in the world. Being a poor and low-income country, its main challenge is to eradicate poverty through increasing equitable income. Since its independence in 1971, Bangladesh has experienced many ups and downs, but over the past three decades, its gross domestic product (GDP) has grown at an impressive rate. Consequently, the country s economy is developing and the country has outperformed many low-income countries in terms of several social indicators. Bangladesh has achieved the Millennium Development Goal (MDG) of eliminating gender disparity in primary and secondary school enrollment. A sharp decline in child and infant mortality rates, increased per capita income, and improved food security have placed Bangladesh on the track to achieving in the near future the status of a middle-income country. All these developments have influenced the consumption pattern of the country. This study explores the consumption scenario of rural Bangladesh, its changing consumption patterns, the relationship between technology and consumption in rural Bangladesh, cultural consumption in rural Bangladesh, and the myriad reasons why consumers nevertheless feel compelled to consume chemically treated foods. Data were collected in two phases in the summers of 2006 and 2008. In 2006, the empirical data were collected from the following three sources: interviews with consumers, producers/sellers, and doctors and pharmacists; observations of sellers/producers; and reviews of articles published in the national English and Bengali (the national language of Bangladesh) daily newspapers. A total of 110 consumers, 25 sellers/producers, 7 doctors, and 7 pharmacists were interviewed and observed. In 2008, data were collected through semi-structured in-depth qualitative interviews, ethnography, and unstructured conversations substantiated by secondary sources and photographs; the total number of persons interviewed was 22. -- Data were also collected on the consumption of food, clothing, housing, education, medical facilities, marriage and dowry, the division of labor, household decision making, different festivals such as Eid (for Muslims), the Bengali New Year, and Durga puja (for Hindus), and leisure. Qualitative methods were applied to the data analysis and were supported by secondary quantitative data. The findings of this study suggest that the consumption patterns of rural Bangladeshis are changing over time along with economic and social development, and that technology has rendered aspects of daily life more convenient. This study identified the perceptions and experiences of rural people regarding technologies in use and explored how culture is associated with consumption. This study identified the reasons behind the use of hazardous chemicals (e.g. calcium carbide, sodium cyclamate, cyanide and formalin, etc.) in foods as well as the extent to which food producers/sellers used such chemicals. In addition, this study assessed consumer perceptions of and attitudes toward these contaminated food items and explored how adulterated foods and food stuffs affect consumer health. This study also showed that consumers were aware that various foods and food stuffs contained hazardous chemicals, and that these adulterated foods and food stuffs were harmful to their health.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Aerosol particles have effect on climate, visibility, air quality and human health. However, the strength of which aerosol particles affect our everyday life is not well described or entirely understood. Therefore, investigations of different processes and phenomena including e.g. primary particle sources, initial steps of secondary particle formation and growth, significance of charged particles in particle formation, as well as redistribution mechanisms in the atmosphere are required. In this work sources, sinks and concentrations of air ions (charged molecules, cluster and particles) were investigated directly by measuring air molecule ionising components (i.e. radon activity concentrations and external radiation dose rates) and charged particle size distributions, as well as based on literature review. The obtained results gave comprehensive and valuable picture of the spatial and temporal variation of the air ion sources, sinks and concentrations to use as input parameters in local and global scale climate models. Newly developed air ion spectrometers (Airel Ltd.) offered a possibility to investigate atmospheric (charged) particle formation and growth at sub-3 nm sizes. Therefore, new visual classification schemes for charged particle formation events were developed, and a newly developed particle growth rate method was tested with over one year dataset. These data analysis methods have been widely utilised by other researchers since introducing them. This thesis resulted interesting characteristics of atmospheric particle formation and growth: e.g. particle growth may sometimes be suppressed before detection limit (~ 3 nm) of traditional aerosol instruments, particle formation may take place during daytime as well as in the evening, growth rates of sub-3 nm particles were quite constant throughout the year while growth rates of larger particles (3-20 nm in diameter) were higher during summer compared to winter. These observations were thought to be a consequence of availability of condensing vapours. The observations of this thesis offered new understanding of the particle formation in the atmosphere. However, the role of ions in particle formation, which is not well understood with current knowledge, requires further research in future.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Tiivistelmä ReferatAbstract Metabolomics is a rapidly growing research field that studies the response of biological systems to environmental factors, disease states and genetic modifications. It aims at measuring the complete set of endogenous metabolites, i.e. the metabolome, in a biological sample such as plasma or cells. Because metabolites are the intermediates and end products of biochemical reactions, metabolite compositions and metabolite levels in biological samples can provide a wealth of information on on-going processes in a living system. Due to the complexity of the metabolome, metabolomic analysis poses a challenge to analytical chemistry. Adequate sample preparation is critical to accurate and reproducible analysis, and the analytical techniques must have high resolution and sensitivity to allow detection of as many metabolites as possible. Furthermore, as the information contained in the metabolome is immense, the data set collected from metabolomic studies is very large. In order to extract the relevant information from such large data sets, efficient data processing and multivariate data analysis methods are needed. In the research presented in this thesis, metabolomics was used to study mechanisms of polymeric gene delivery to retinal pigment epithelial (RPE) cells. The aim of the study was to detect differences in metabolomic fingerprints between transfected cells and non-transfected controls, and thereafter to identify metabolites responsible for the discrimination. The plasmid pCMV-β was introduced into RPE cells using the vector polyethyleneimine (PEI). The samples were analyzed using high performance liquid chromatography (HPLC) and ultra performance liquid chromatography (UPLC) coupled to a triple quadrupole (QqQ) mass spectrometer (MS). The software MZmine was used for raw data processing and principal component analysis (PCA) was used in statistical data analysis. The results revealed differences in metabolomic fingerprints between transfected cells and non-transfected controls. However, reliable fingerprinting data could not be obtained because of low analysis repeatability. Therefore, no attempts were made to identify metabolites responsible for discrimination between sample groups. Repeatability and accuracy of analyses can be influenced by protocol optimization. However, in this study, optimization of analytical methods was hindered by the very small number of samples available for analysis. In conclusion, this study demonstrates that obtaining reliable fingerprinting data is technically demanding, and the protocols need to be thoroughly optimized in order to approach the goals of gaining information on mechanisms of gene delivery.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Reorganizing a dataset so that its hidden structure can be observed is useful in any data analysis task. For example, detecting a regularity in a dataset helps us to interpret the data, compress the data, and explain the processes behind the data. We study datasets that come in the form of binary matrices (tables with 0s and 1s). Our goal is to develop automatic methods that bring out certain patterns by permuting the rows and columns. We concentrate on the following patterns in binary matrices: consecutive-ones (C1P), simultaneous consecutive-ones (SC1P), nestedness, k-nestedness, and bandedness. These patterns reflect specific types of interplay and variation between the rows and columns, such as continuity and hierarchies. Furthermore, their combinatorial properties are interlinked, which helps us to develop the theory of binary matrices and efficient algorithms. Indeed, we can detect all these patterns in a binary matrix efficiently, that is, in polynomial time in the size of the matrix. Since real-world datasets often contain noise and errors, we rarely witness perfect patterns. Therefore we also need to assess how far an input matrix is from a pattern: we count the number of flips (from 0s to 1s or vice versa) needed to bring out the perfect pattern in the matrix. Unfortunately, for most patterns it is an NP-complete problem to find the minimum distance to a matrix that has the perfect pattern, which means that the existence of a polynomial-time algorithm is unlikely. To find patterns in datasets with noise, we need methods that are noise-tolerant and work in practical time with large datasets. The theory of binary matrices gives rise to robust heuristics that have good performance with synthetic data and discover easily interpretable structures in real-world datasets: dialectical variation in the spoken Finnish language, division of European locations by the hierarchies found in mammal occurrences, and co-occuring groups in network data. In addition to determining the distance from a dataset to a pattern, we need to determine whether the pattern is significant or a mere occurrence of a random chance. To this end, we use significance testing: we deem a dataset significant if it appears exceptional when compared to datasets generated from a certain null hypothesis. After detecting a significant pattern in a dataset, it is up to domain experts to interpret the results in the terms of the application.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This thesis report attempts to improve the models for predicting forest stand structure for practical use, e.g. forest management planning (FMP) purposes in Finland. Comparisons were made between Weibull and Johnson s SB distribution and alternative regression estimation methods. Data used for preliminary studies was local but the final models were based on representative data. Models were validated mainly in terms of bias and RMSE in the main stand characteristics (e.g. volume) using independent data. The bivariate SBB distribution model was used to mimic realistic variations in tree dimensions by including within-diameter-class height variation. Using the traditional method, diameter distribution with the expected height resulted in reduced height variation, whereas the alternative bivariate method utilized the error-term of the height model. The lack of models for FMP was covered to some extent by the models for peatland and juvenile stands. The validation of these models showed that the more sophisticated regression estimation methods provided slightly improved accuracy. A flexible prediction and application for stand structure consisted of seemingly unrelated regression models for eight stand characteristics, the parameters of three optional distributions and Näslund s height curve. The cross-model covariance structure was used for linear prediction application, in which the expected values of the models were calibrated with the known stand characteristics. This provided a framework to validate the optional distributions and the optional set of stand characteristics. Height distribution is recommended for the earliest state of stands because of its continuous feature. From the mean height of about 4 m, Weibull dbh-frequency distribution is recommended in young stands if the input variables consist of arithmetic stand characteristics. In advanced stands, basal area-dbh distribution models are recommended. Näslund s height curve proved useful. Some efficient transformations of stand characteristics are introduced, e.g. the shape index, which combined the basal area, the stem number and the median diameter. Shape index enabled SB model for peatland stands to detect large variation in stand densities. This model also demonstrated reasonable behaviour for stands in mineral soils.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This study examines the diaconia work of the Finnish Evangelical Lutheran Church from the standpoint of clients. The role of diaconia work has grown since the early 1990s recession, and since it established itself as one of the actors along with other social organizations. Previous studies have described the changing role of diaconal work, especially from the standpoint of diaconia workers and co-operators. This research goes back to examine, beyond the activities of the diaconia work of everyday practices, its relations of ruling which are determining practices. The theoretical and methodological framework rises from the thinking of Dorothy E. Smith, the creator of institutional ethnography. Its origins are in feminism, Marxism, phenomenology, etnomethodology, and symbolic interactionism. However, it does not represent any school. Unlike the objectivity-based traditional sociology, institutional ethnography has its starting point in everyday life, and people s subjective experience of it. Everyday life is just a starting point, and is used to examine everyday life s experiences of hidden relations of ruling, linking people and organizations. The level of generalization is just on the relations of ruling. The research task is to examine those meanings of diaconia work which are embedded in its clients experiences. The research task is investigated with two questions: how diaconia work among its clients takes shape and what kinds of relations of ruling exist in diaconia work. The meanings of diaconia work come through an examination of the relations of ruling, which create new forms of diaconal work compared with previous studies. For the study, two kinds of data were collected: a questionnaire and ethnographic fieldwork. The first data set was collected from diaconal workers using the questionnaire. It gives background information of the diaconia work process from the standpoint of the clients. In the ethnographic study there were two phases. The first ethnographic material was collected from one local parish by observing, interviewing clients and diaconal workers and gathering documents. The number of observations was 36 customer appointments, and 29 interviews. The second ethnographic material was included as a part of the analysis, in which ruling relations in people s experiences were collected from the transcribed data. Close reading and narrative analysis are used as analysing methods. The analysis has three phases. First, the experiences are identified with close reading; the following step is to select some of the institutional processes that are shaping those experiences and are relevant for the research. At the third stage, those processes are investigated in order to describe analytically how they determine people s experience. The analysis produces another narrative about diaconia work, which provides tools for examining the diaconal work from a new perspective. Through the analysis it is possible to see diaconia as an exchange ratio, in which the exchange takes place between a client and a diaconia worker, but also more broadly with other actors, such as social workers, shop clerks, or with other parishioners. The exchange ratio is examined from the perspective of power which is embedded in the client s experiences. The analysis reveals that the most important relations of ruling are humiliation and randomness in the exchange ratio of diaconia work; valuating spirituality above the bodily being; and replacing official social work. The results give a map about the relations of ruling of diaconia work which gives tools to look at diaconia work s meanings to the clients. The hidden element of humiliation in the exchange ratio breaks the current picture of diaconia work. The ethos of the holistic encounters and empathic practices are shown to be of another kind when spirituality is preferred to the bodily being. Nevertheless, diaconia appears to be a place for a respectful encounter, especially in situations where the public sector s actors are retreating on liability or clients are in a life crisis. The collapse of the welfare state structures imposes on diaconia work tasks that have not previously belonged to it. At the local level, clients receive partners from diaconia workers in order to advocate them in the welfare system. Actions to influence the wider societal structures are not reached because of lacking resources. An awareness of the oppressive practices of diaconia work and their critical reviewing are the keys to the development of diaconia work, since there are such practices even in holistic and respectful diaconia work. While the research raises new information for the development of diaconia work, it also opens up new aspects for developing other kinds of social work by emphasizing the importance of taking people s experiences seriously. Keywords: diaconia work, institutional ethnography, Dorothy E. Smith, experience, customer, relations of ruling.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This work is a case study of applying nonparametric statistical methods to corpus data. We show how to use ideas from permutation testing to answer linguistic questions related to morphological productivity and type richness. In particular, we study the use of the suffixes -ity and -ness in the 17th-century part of the Corpus of Early English Correspondence within the framework of historical sociolinguistics. Our hypothesis is that the productivity of -ity, as measured by type counts, is significantly low in letters written by women. To test such hypotheses, and to facilitate exploratory data analysis, we take the approach of computing accumulation curves for types and hapax legomena. We have developed an open source computer program which uses Monte Carlo sampling to compute the upper and lower bounds of these curves for one or more levels of statistical significance. By comparing the type accumulation from women’s letters with the bounds, we are able to confirm our hypothesis.