996 resultados para Statistical maps.
Resumo:
In this paper, we tackle the problem of unsupervised domain adaptation for classification. In the unsupervised scenario where no labeled samples from the target domain are provided, a popular approach consists in transforming the data such that the source and target distributions be- come similar. To compare the two distributions, existing approaches make use of the Maximum Mean Discrepancy (MMD). However, this does not exploit the fact that prob- ability distributions lie on a Riemannian manifold. Here, we propose to make better use of the structure of this man- ifold and rely on the distance on the manifold to compare the source and target distributions. In this framework, we introduce a sample selection method and a subspace-based method for unsupervised domain adaptation, and show that both these manifold-based techniques outperform the cor- responding approaches based on the MMD. Furthermore, we show that our subspace-based approach yields state-of- the-art results on a standard object recognition benchmark.
Resumo:
The genus Corymbia is closely related to the genus Eucalyptus, and like Eucalyptus contains tree species that are important for sub-tropical forestry. Corymbia's close relationship with Eucalyptus suggests genetic studies in Corymbia should benefit from transfer of genetic information from its more intensively studied relatives. Here we report a genetic map for Corymbia spp. based on microsatellite markers identified de novo in Corymbia sp or transferred from Eucalyptus. A framework consensus map was generated from an outbred F 2 population (n = 90) created by crossing two unrelated Corymbia torelliana x C. citriodora subsp. variegata F1 trees. The map had a total length of 367 cM (Kosambi) and was composed of 46 microsatellite markers distributed across 13 linkage groups (LOD 3). A high proportion of Eucalyptus microsatellites (90%) transferred to Corymbia. Comparative analysis between the Corymbia map and a published Eucalyptus map identified eight homeologous linkage groups in Corymbia with 13 markers mapping on one or both maps. Further comparative analysis was limited by low power to detect linkage due to low genome coverage in Corymbia, however, there was no convincing evidence for chromosomal structural differences because instances of non-synteny were associated with large distances on the Eucalyptus map. Segregation distortion was primarily restricted to a single linkage group and due to a deficit of hybrid genotypes, suggesting that hybrid inviability was one factor shaping the genetic composition of the F2 population in this inter-subgeneric hybrid. The conservation of microsatellite loci and synteny between Corymbia and Eucalyptus suggests there will be substantial value in exchanging information between the two groups.
Resumo:
To facilitate marketing and export, the Australian macadamia industry requires accurate crop forecasts. Each year, two levels of crop predictions are produced for this industry. The first is an overall longer-term forecast based on tree census data of growers in the Australian Macadamia Society (AMS). This data set currently accounts for around 70% of total production, and is supplemented by our best estimates of non-AMS orchards. Given these total tree numbers, average yields per tree are needed to complete the long-term forecasts. Yields from regional variety trials were initially used, but were found to be consistently higher than the average yields that growers were obtaining. Hence, a statistical model was developed using growers' historical yields, also taken from the AMS database. This model accounted for the effects of tree age, variety, year, region and tree spacing, and explained 65% of the total variation in the yield per tree data. The second level of crop prediction is an annual climate adjustment of these overall long-term estimates, taking into account the expected effects on production of the previous year's climate. This adjustment is based on relative historical yields, measured as the percentage deviance between expected and actual production. The dominant climatic variables are observed temperature, evaporation, solar radiation and modelled water stress. Initially, a number of alternate statistical models showed good agreement within the historical data, with jack-knife cross-validation R2 values of 96% or better. However, forecasts varied quite widely between these alternate models. Exploratory multivariate analyses and nearest-neighbour methods were used to investigate these differences. For 2001-2003, the overall forecasts were in the right direction (when compared with the long-term expected values), but were over-estimates. In 2004 the forecast was well under the observed production, and in 2005 the revised models produced a forecast within 5.1% of the actual production. Over the first five years of forecasting, the absolute deviance for the climate-adjustment models averaged 10.1%, just outside the targeted objective of 10%.
Resumo:
The recently introduced generalized pencil of Sudarshan which gives an exact ray picture of wave optics is analysed in some situations of interest to wave optics. A relationship between ray dispersion and statistical inhomogeneity of the field is obtained. A paraxial approximation which preserves the rectilinear propagation character of the generalized pencils is presented. Under this approximation the pencils can be computed directly from the field conditions on a plane, without the necessity to compute the cross-spectral density function in the entire space as an intermediate quantity. The paraxial results are illustrated with examples. The pencils are shown to exhibit an interesting scaling behaviour in the far-zone. This scaling leads to a natural generalization of the Fraunhofer range criterion and of the classical van Cittert-Zernike theorem to planar sources of arbitrary state of coherence. The recently derived results of radiometry with partially coherent sources are shown to be simple consequences of this scaling.
Resumo:
Background: Sorghum genome mapping based on DNA markers began in the early 1990s and numerous genetic linkage maps of sorghum have been published in the last decade, based initially on RFLP markers with more recent maps including AFLPs and SSRs and very recently, Diversity Array Technology (DArT) markers. It is essential to integrate the rapidly growing body of genetic linkage data produced through DArT with the multiple genetic linkage maps for sorghum generated through other marker technologies. Here, we report on the colinearity of six independent sorghum component maps and on the integration of these component maps into a single reference resource that contains commonly utilized SSRs, AFLPs, and high-throughput DArT markers. Results: The six component maps were constructed using the MultiPoint software. The lengths of the resulting maps varied between 910 and 1528 cM. The order of the 498 markers that segregated in more than one population was highly consistent between the six individual mapping data sets. The framework consensus map was constructed using a "Neighbours" approach and contained 251 integrated bridge markers on the 10 sorghum chromosomes spanning 1355.4 cM with an average density of one marker every 5.4 cM, and were used for the projection of the remaining markers. In total, the sorghum consensus map consisted of a total of 1997 markers mapped to 2029 unique loci ( 1190 DArT loci and 839 other loci) spanning 1603.5 cM and with an average marker density of 1 marker/0.79 cM. In addition, 35 multicopy markers were identified. On average, each chromosome on the consensus map contained 203 markers of which 58.6% were DArT markers. Non-random patterns of DNA marker distribution were observed, with some clear marker-dense regions and some marker-rare regions. Conclusion: The final consensus map has allowed us to map a larger number of markers than possible in any individual map, to obtain a more complete coverage of the sorghum genome and to fill a number of gaps on individual maps. In addition to overall general consistency of marker order across individual component maps, good agreement in overall distances between common marker pairs across the component maps used in this study was determined, using a difference ratio calculation. The obtained consensus map can be used as a reference resource for genetic studies in different genetic backgrounds, in addition to providing a framework for transferring genetic information between different marker technologies and for integrating DArT markers with other genomic resources. DArT markers represent an affordable, high throughput marker system with great utility in molecular breeding programs, especially in crops such as sorghum where SNP arrays are not publicly available.
Resumo:
It is demanding for children with visual impairment to become aware of the world beyond their immediate experience. They need to learn to control spatial experiences as a whole and understand the relationships between objects, surfaces and themselves. Tactile maps can be an excellent source of information for depicting space and environment. By means of tactile maps children can develop their spatial understanding more efficiently than through direct travel experiences supplemented with verbal explanations. Tactile maps can help children when they are learning to understand environmental, spatial, and directional concepts. The ability to read tactile maps is not self-evident; it is a skill, which must be learned. The main research question was: can children who are visually impaired learn to read tactile maps at the preschool age if they receive structural teaching? The purpose of this study was to develop an educational program for preschool children with visual impairment, the aim of which was to teach them to read tactile maps in order to strengthen their orientation skills and to encourage them to explore the world beyond their immediate experience. The study is a multiple case study describing the development of the map program consisting of eight learning tasks. The program was developed with one preschooler who was blind, and subsequently the program was implemented with three other children. Two of the children were blind from birth, one child had lost her vision at the age of two, and one child had low vision. The program was implemented in a normal preschool. Another objective of the pre-map program was to teach the preschooler with visual impairment to understand the concept of a map. The teaching tools were simple, map-like representations called pre-maps. Before a child with visual impairment can read a comprehensive tactile map, it is important to learn to understand map symbols, and how a three-dimensional model changes to a two-dimensional tactile map. All teaching sessions were videotaped; the results are based on the analysis of the videotapes. Two of the children completed the program successfully, and learned to read a tactile map. The two other children felt happy during the sessions, but it was problematic for them to engage fully in the instruction. One of the two eventually completed the program, while the other developed predominantly emerging skills. The results of the children's performances and the positive feedback from the teachers, assistants and the parents proved that this pre-map program is appropriate teaching material for preschool children who are visually impaired. The program does not demand high-level expertise; also parents, preschool teachers, and school assistants can carry out the program.
Resumo:
The purpose of the present study was to investigate the possibilities and interconnec-tions that exist concerning the relationship between the University of Applied Sci-ences and the Learning by Developing action model (LbD), on the one hand, and education for sustainable development and high-quality learning as a part of profes-sional competence development on the other. The research and learning environment was the Coping at Home research project and its Caring TV project, which provided the context of the Physiotherapy for Elderly People professional study unit. The re-searcher was a teacher and an evaluator of her own students learning. The aims of the study were to monitor and evaluate learning at the individual and group level using tools of high-quality learning − improved concept maps − related to understanding the projects core concept of successful ageing. Conceptions were evaluated through aspects of sustainable development and a conceptual basis of physiotherapy. As edu-cational research this was a multi-method case study design experiment. The three research questions were as follows. 1. What kind of individual conceptions and conceptual structures do students build concerning the concept of successful ageing? How many and what kind of concepts and propositions do they have a) before the study unit, b) after the study unit, c) after the social-knowledge building? 2. What kind of social-knowledge building exists? a) What kind of social learn-ing process exists? b) What kind of socially created concepts, propositions and conceptual structures do the students possess after the project? c) What kind of meaning does the social-knowledge building have at an individual level? 3. How do physiotherapy competences develop according to the results of the first and second research questions? The subjects were 22 female, third-year Bachelor of Physiotherapy students in Laurea University of Applied Sciences in Finland. Individual learning was evaluated in 12 of the 22 students. The data was collected as a part of the learning exercises of the Physiotherapy for Elderly People study unit, with improved concept maps both at individual and group levels. The students were divided into two social-knowledge building groups: the first group had 15 members and second 7 members. Each group created a group-level concept map on the theme of successful ageing. These face-to-face interactions were recorded with CMapTools and videotaped. The data consists of both individually produced concept maps and group-produced concept maps of the two groups and the videotaped material of these processes. The data analysis was carried out at the intersection of various research traditions. Individually produced data was analysed based on content analysis. Group-produced data was analysed based on content analysis and dialogue analysis. The data was also analysed by simple statistical analysis. In the individually produced improved concept maps the students conceptions were comprehensive, and the first concept maps were found to have many concepts unrelated to each other. The conceptual structures were between spoke structures and chain structures. Only a few professional concepts were evident. In the second indi-vidual improved concept maps the conception was more professional than earlier, particulary from the functional point of view. The conceptual structures mostly re-sembled spoke structures. After the second individual concept mapping social map-ping interventions were made in the two groups. After this, multidisciplinary concrete links were established between all concepts in almost all individual concept maps, and the interconnectedness of the concepts in different subject areas was thus understood. The conceptual structures were mainly net structures. The concepts in these individual concept maps were also found to be more professional and concrete than in the previ-ous concept maps of these subjects. In addition, the wider context dependency of the concepts was recognized in many individual concept maps. This implies a conceptual framework for specialists. The social-knowledge building was similar to a social learning process. Both socio-cultural processes and cognitive processes were found to develop students conceptual awareness and the ability to engage in intentional learning. In the knowl-edge-building process two aspects were found: knowledge creation and pedagogical action. The discussion during the concept-mapping process was similar to a shared thinking process. In visualising the process with CMapTools, students easily comple-mented each others thoughts and words, as if mutually telepathic . Synthesizing, supporting, asking and answering, peer teaching and counselling, tutoring, evaluating and arguing took place, and students were very active, self-directed and creative. It took hundreds of conversations before a common understanding could be found. The use of concept mapping in particular was very effective. The concepts in these group-produced concept maps were found to be professional, and values of sustainable development were observed. The results show the importance of developing the contents and objectives of the European Qualification Framework as well as education for sustainable development, especially in terms of the need for knowledge creation, global responsibility and systemic, holistic and critical thinking in order to develop clinical practice. Keywords: education for sustainable development, learning, knowledge building, improved concept map, conceptual structure, competence, successful ageing
Resumo:
In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.
Resumo:
We present an introductory overview of several challenging problems in the statistical characterization of turbulence. We provide examples from fluid turbulence in three and two dimensions, from the turbulent advection of passive scalars, turbulence in the one-dimensional Burgers equation, and fluid turbulence in the presence of polymer additives.
Resumo:
A method is developed for demonstrating how solitons with some internal periodic motion may emerge as elementary excitations in the statistical mechanics of field systems. The procedure is demonstrated in the context of complex scalar fields which can, for appropriate choices of the Lagrangian, yield charge-carrying solitons with such internal motion. The derivation uses the techniques of the steepest-descent method for functional integrals. It is shown that, despite the constraint of some fixed total charge, a gaslike excitation of such charged solitons does emerge.
Resumo:
The past decade has brought a proliferation of statistical genetic (linkage) analysis techniques, incorporating new methodology and/or improvement of existing methodology in gene mapping, specifically targeted towards the localization of genes underlying complex disorders. Most of these techniques have been implemented in user-friendly programs and made freely available to the genetics community. Although certain packages may be more 'popular' than others, a common question asked by genetic researchers is 'which program is best for me?'. To help researchers answer this question, the following software review aims to summarize the main advantages and disadvantages of the popular GENEHUNTER package.
Resumo:
Digital image
Resumo:
Remote sensing provides methods to infer land cover information over large geographical areas at a variety of spatial and temporal resolutions. Land cover is input data for a range of environmental models and information on land cover dynamics is required for monitoring the implications of global change. Such data are also essential in support of environmental management and policymaking. Boreal forests are a key component of the global climate and a major sink of carbon. The northern latitudes are expected to experience a disproportionate and rapid warming, which can have a major impact on vegetation at forest limits. This thesis examines the use of optical remote sensing for estimating aboveground biomass, leaf area index (LAI), tree cover and tree height in the boreal forests and tundra taiga transition zone in Finland. The continuous fields of forest attributes are required, for example, to improve the mapping of forest extent. The thesis focus on studying the feasibility of satellite data at multiple spatial resolutions, assessing the potential of multispectral, -angular and -temporal information, and provides regional evaluation for global land cover data. Preprocessed ASTER, MISR and MODIS products are the principal satellite data. The reference data consist of field measurements, forest inventory data and fine resolution land cover maps. Fine resolution studies demonstrate how statistical relationships between biomass and satellite data are relatively strong in single species and low biomass mountain birch forests in comparison to higher biomass coniferous stands. The combination of forest stand data and fine resolution ASTER images provides a method for biomass estimation using medium resolution MODIS data. The multiangular data improve the accuracy of land cover mapping in the sparsely forested tundra taiga transition zone, particularly in mires. Similarly, multitemporal data improve the accuracy of coarse resolution tree cover estimates in comparison to single date data. Furthermore, the peak of the growing season is not necessarily the optimal time for land cover mapping in the northern boreal regions. The evaluated coarse resolution land cover data sets have considerable shortcomings in northernmost Finland and should be used with caution in similar regions. The quantitative reference data and upscaling methods for integrating multiresolution data are required for calibration of statistical models and evaluation of land cover data sets. The preprocessed image products have potential for wider use as they can considerably reduce the time and effort used for data processing.
Resumo:
Determination of the environmental factors controlling earth surface processes and landform patterns is one of the central themes in physical geography. However, the identification of the main drivers of the geomorphological phenomena is often challenging. Novel spatial analysis and modelling methods could provide new insights into the process-environment relationships. The objective of this research was to map and quantitatively analyse the occurrence of cryogenic phenomena in subarctic Finland. More precisely, utilising a grid-based approach the distribution and abundance of periglacial landforms were modelled to identify important landscape scale environmental factors. The study was performed using a comprehensive empirical data set of periglacial landforms from an area of 600 km2 at a 25-ha resolution. The utilised statistical methods were generalized linear modelling (GLM) and hierarchical partitioning (HP). GLMs were used to produce distribution and abundance models and HP to reveal independently the most likely causal variables. The GLM models were assessed utilising statistical evaluation measures, prediction maps, field observations and the results of HP analyses. A total of 40 different landform types and subtypes were identified. Topographical, soil property and vegetation variables were the primary correlates for the occurrence and cover of active periglacial landforms on the landscape scale. In the model evaluation, most of the GLMs were shown to be robust although the explanation power, prediction ability as well as the selected explanatory variables varied between the models. The great potential of the combination of a spatial grid system, terrain data and novel statistical techniques to map the occurrence of periglacial landforms was demonstrated in this study. GLM proved to be a useful modelling framework for testing the shapes of the response functions and significances of the environmental variables and the HP method helped to make better deductions of the important factors of earth surface processes. Hence, the numerical approach presented in this study can be a useful addition to the current range of techniques available to researchers to map and monitor different geographical phenomena.
Resumo:
Sequential firings with fixed time delays are frequently observed in simultaneous recordings from multiple neurons. Such temporal patterns are potentially indicative of underlying microcircuits and it is important to know when a repeatedly occurring pattern is statistically significant. These sequences are typically identified through correlation counts. In this paper we present a method for assessing the significance of such correlations. We specify the null hypothesis in terms of a bound on the conditional probabilities that characterize the influence of one neuron on another. This method of testing significance is more general than the currently available methods since under our null hypothesis we do not assume that the spiking processes of different neurons are independent. The structure of our null hypothesis also allows us to rank order the detected patterns. We demonstrate our method on simulated spike trains.