953 resultados para Semi-supervised classification
Resumo:
The purpose of this thesis is to present a new approach to the lossy compression of multispectral images. Proposed algorithm is based on combination of quantization and clustering. Clustering was investigated for compression of the spatial dimension and the vector quantization was applied for spectral dimension compression. Presenting algo¬rithms proposes to compress multispectral images in two stages. During the first stage we define the classes' etalons, another words to each uniform areas are located inside the image the number of class is given. And if there are the pixels are not yet assigned to some of the clusters then it doing during the second; pass and assign to the closest eta¬lons. Finally a compressed image is represented with a flat index image pointing to a codebook with etalons. The decompression stage is instant too. The proposed method described in this paper has been tested on different satellite multispectral images from different resources. The numerical results and illustrative examples of the method are represented too.
Resumo:
Tärkeä tehtävä ympäristön tarkkailussa on arvioida ympäristön nykyinen tila ja ihmisen siihen aiheuttamat muutokset sekä analysoida ja etsiä näiden yhtenäiset suhteet. Ympäristön muuttumista voidaan hallita keräämällä ja analysoimalla tietoa. Tässä diplomityössä on tutkittu vesikasvillisuudessa hai vainuja muutoksia käyttäen etäältä hankittua mittausdataa ja kuvan analysointimenetelmiä. Ympäristön tarkkailuun on käytetty Suomen suurimmasta järvestä Saimaasta vuosina 1996 ja 1999 otettuja ilmakuvia. Ensimmäinen kuva-analyysin vaihe on geometrinen korjaus, jonka tarkoituksena on kohdistaa ja suhteuttaa otetut kuvat samaan koordinaattijärjestelmään. Toinen vaihe on kohdistaa vastaavat paikalliset alueet ja tunnistaa kasvillisuuden muuttuminen. Kasvillisuuden tunnistamiseen on käytetty erilaisia lähestymistapoja sisältäen valvottuja ja valvomattomia tunnistustapoja. Tutkimuksessa käytettiin aitoa, kohinoista mittausdataa, minkä perusteella tehdyt kokeet antoivat hyviä tuloksia tutkimuksen onnistumisesta.
Resumo:
The increase of publicly available sequencing data has allowed for rapid progress in our understanding of genome composition. As new information becomes available we should constantly be updating and reanalyzing existing and newly acquired data. In this report we focus on transposable elements (TEs) which make up a significant portion of nearly all sequenced genomes. Our ability to accurately identify and classify these sequences is critical to understanding their impact on host genomes. At the same time, as we demonstrate in this report, problems with existing classification schemes have led to significant misunderstandings of the evolution of both TE sequences and their host genomes. In a pioneering publication Finnegan (1989) proposed classifying all TE sequences into two classes based on transposition mechanisms and structural features: the retrotransposons (class I) and the DNA transposons (class II). We have retraced how ideas regarding TE classification and annotation in both prokaryotic and eukaryotic scientific communities have changed over time. This has led us to observe that: (1) a number of TEs have convergent structural features and/or transposition mechanisms that have led to misleading conclusions regarding their classification, (2) the evolution of TEs is similar to that of viruses by having several unrelated origins, (3) there might be at least 8 classes and 12 orders of TEs including 10 novel orders. In an effort to address these classification issues we propose: (1) the outline of a universal TE classification, (2) a set of methods and classification rules that could be used by all scientific communities involved in the study of TEs, and (3) a 5-year schedule for the establishment of an International Committee for Taxonomy of Transposable Elements (ICTTE).
Resumo:
This study aimed to evaluate the growth characteristics of irrigated Vitória pineapple plants grown in semi-arid conditions and determine its developmental stages based on those characteristics. It was used a randomized block design with four replicates. The experimental treatments were: plant harvest at 270, 330, 390, 450, 510, 570, 690, 750, and 810 days after planting (DAP). The following variables were determined: plant height, stem diameter, D-leaf length, D-leaf fresh and dry mass, biomass production of plants and plant parts (organs), and vegetative biomass. Five phenological stages are proposed based on vegetative biomass production: < 20% biomass production (V1); 21-40% (V2); 41-60% (V3); 61-80% (V4); and > 80% (V5). The maximum growth rate for plant height, D-leaf length, and stem diameter was observed at the end of the phenological stage V1 (390-411 DAP), and at the end of stage V5 these plant traits had average values of 106, 82, and 7 cm, respectively. The maximum biomass accumulation rates were observed at stages V4 and V5, resulting in a final fruit yield and total fresh biomass of 72 t ha-1 and 326 t ha-1, respectively. Finally, we estimated that 80% of the accumulated biomass may remain in the field after fruit and slip harvest, and could be incorporated as plant residue into the soil.
Resumo:
Internet on elektronisen postin perusrakenne ja ollut tärkeä tiedonlähde akateemisille käyttäjille jo pitkään. Siitä on tullut merkittävä tietolähde kaupallisille yrityksille niiden pyrkiessä pitämään yhteyttä asiakkaisiinsa ja seuraamaan kilpailijoitansa. WWW:n kasvu sekä määrällisesti että sen moninaisuus on luonut kasvavan kysynnän kehittyneille tiedonhallintapalveluille. Tällaisia palveluja ovet ryhmittely ja luokittelu, tiedon löytäminen ja suodattaminen sekä lähteiden käytön personointi ja seuranta. Vaikka WWW:stä saatavan tieteellisen ja kaupallisesti arvokkaan tiedon määrä on huomattavasti kasvanut viime vuosina sen etsiminen ja löytyminen on edelleen tavanomaisen Internet hakukoneen varassa. Tietojen hakuun kohdistuvien kasvavien ja muuttuvien tarpeiden tyydyttämisestä on tullut monimutkainen tehtävä Internet hakukoneille. Luokittelu ja indeksointi ovat merkittävä osa luotettavan ja täsmällisen tiedon etsimisessä ja löytämisessä. Tämä diplomityö esittelee luokittelussa ja indeksoinnissa käytettävät yleisimmät menetelmät ja niitä käyttäviä sovelluksia ja projekteja, joissa tiedon hakuun liittyvät ongelmat on pyritty ratkaisemaan.
Resumo:
Tutkielman tavoitteena oli selvittää millaisia työ- ja toimintatapoja suomalaisilla CMC-konsulteilla (Certified Management Consultant) on ja miten työ- ja toimintatavat on jäsennetty konsultointia varten. Tutkielman kirjallisuuskatsauksen tavoitteena oli lisätä tietämystä liikkeenjohdon konsultoinnista ja konsultointitapojen jaotteluista. Empiriaosan tavoitteena oli kerätä CMC-konsulttien kertomuksia omasta toiminnastaan ja muodostaa näiden kertomusten pohjalta käsitys CMC-konsulttien konsultointitavoista. Tutkimuksessa on käytetty laadullista tutkimusotetta. Tämä tutkimusote sopii hyvin konsultointitapojen erojen ja yhtäläisyyksien selvittämiseen. Tutkimuksen empiirinen aineisto kerättiin käyttämällä puolistrukturoitua haastattelua. Aineisto sisältää 21 konsultin haastattelut sekä yhdet kirjalliset vastaukset. Empiirisen aineiston analysoinnissa käytettiin tyypittelyä. Tyypittelyssä aineisto ryhmitellään perustyypeiksi etsimällä aineistosta eroja ja samankaltaisuuksia. Tutkimuksen tuloksena muodostettiin konsulttien kertomusten perusteella viisi konsulttityyppiä eli konsultointitapaa. Konsulttityyppien ryhmittelyssä kriteereinä käytettiin toiminnan tuotteistusastetta sekä keskittymistä eri asiakaskuntiin. Tulosten perusteella voidaan todeta, että suomalaisilla CMC-konsulteilla on erilaisia konsultointitapoja, jotka kaikki ovat käytännössä toimivia.
Resumo:
In this paper we propose the use of the independent component analysis (ICA) [1] technique for improving the classification rate of decision trees and multilayer perceptrons [2], [3]. The use of an ICA for the preprocessing stage, makes the structure of both classifiers simpler, and therefore improves the generalization properties. The hypothesis behind the proposed preprocessing is that an ICA analysis will transform the feature space into a space where the components are independent, and aligned to the axes and therefore will be more adapted to the way that a decision tree is constructed. Also the inference of the weights of a multilayer perceptron will be much easier because the gradient search in the weight space will follow independent trajectories. The result is that classifiers are less complex and on some databases the error rate is lower. This idea is also applicable to regression
Resumo:
Diagnosis of community acquired legionella pneumonia (CALP) is currently performed by means of laboratory techniques which may delay diagnosis several hours. To determine whether ANN can categorize CALP and non-legionella community-acquired pneumonia (NLCAP) and be standard for use by clinicians, we prospectively studied 203 patients with community-acquired pneumonia (CAP) diagnosed by laboratory tests. Twenty one clinical and analytical variables were recorded to train a neural net with two classes (LCAP or NLCAP class). In this paper we deal with the problem of diagnosis, feature selection, and ranking of the features as a function of their classification importance, and the design of a classifier the criteria of maximizing the ROC (Receiving operating characteristics) area, which gives a good trade-off between true positives and false negatives. In order to guarantee the validity of the statistics; the train-validation-test databases were rotated by the jackknife technique, and a multistarting procedure was done in order to make the system insensitive to local maxima.
Resumo:
This paper considers an alternative perspective to China's exchange rate policy. It studies a semi-open economy where the private sector has no access to international capital markets but the central bank has full access. Moreover, it assumes limited financial development generating a large demand for saving instruments by the private sector. The paper analyzes the optimal exchange rate policy by modeling the central bank as a Ramsey planner. Its main result is that in a growth acceleration episode it is optimal to have an initial real depreciation of the currency combined with an accumulation of reserves, which is consistent with the Chinese experience. This depreciation is followed by an appreciation in the long run. The paper also shows that the optimal exchange rate path is close to the one that would result in an economy with full capital mobility and no central bank intervention.
Resumo:
The Commission on Classification and Terminology and the Commission on Epidemiology of the International League Against Epilepsy (ILAE) have charged a Task Force to revise concepts, definition, and classification of status epilepticus (SE). The proposed new definition of SE is as follows: Status epilepticus is a condition resulting either from the failure of the mechanisms responsible for seizure termination or from the initiation of mechanisms, which lead to abnormally, prolonged seizures (after time point t1 ). It is a condition, which can have long-term consequences (after time point t2 ), including neuronal death, neuronal injury, and alteration of neuronal networks, depending on the type and duration of seizures. This definition is conceptual, with two operational dimensions: the first is the length of the seizure and the time point (t1 ) beyond which the seizure should be regarded as "continuous seizure activity." The second time point (t2 ) is the time of ongoing seizure activity after which there is a risk of long-term consequences. In the case of convulsive (tonic-clonic) SE, both time points (t1 at 5 min and t2 at 30 min) are based on animal experiments and clinical research. This evidence is incomplete, and there is furthermore considerable variation, so these time points should be considered as the best estimates currently available. Data are not yet available for other forms of SE, but as knowledge and understanding increase, time points can be defined for specific forms of SE based on scientific evidence and incorporated into the definition, without changing the underlying concepts. A new diagnostic classification system of SE is proposed, which will provide a framework for clinical diagnosis, investigation, and therapeutic approaches for each patient. There are four axes: (1) semiology; (2) etiology; (3) electroencephalography (EEG) correlates; and (4) age. Axis 1 (semiology) lists different forms of SE divided into those with prominent motor systems, those without prominent motor systems, and currently indeterminate conditions (such as acute confusional states with epileptiform EEG patterns). Axis 2 (etiology) is divided into subcategories of known and unknown causes. Axis 3 (EEG correlates) adopts the latest recommendations by consensus panels to use the following descriptors for the EEG: name of pattern, morphology, location, time-related features, modulation, and effect of intervention. Finally, axis 4 divides age groups into neonatal, infancy, childhood, adolescent and adulthood, and elderly.
Resumo:
Background: Some countries have recently extended smoke-free policies to particular outdoor settings; however, there is controversy regarding whether this is scientifically and ethically justifiable. Objectives: The objective of the present study was to review research on secondhand smoke (SHS) exposure in outdoor settings. Data sources: We conducted different searches in PubMed for the period prior to September 2012. We checked the references of the identified papers, and conducted a similar search in Google Scholar. Study selection: Our search terms included combinations of"secondhand smoke,""environmental tobacco smoke,""passive smoking" OR"tobacco smoke pollution" AND"outdoors" AND"PM" (particulate matter),"PM2.5" (PM with diameter ≤ 2.5 µm),"respirable suspended particles,""particulate matter,""nicotine,""CO" (carbon monoxide),"cotinine,""marker,""biomarker" OR"airborne marker." In total, 18 articles and reports met the inclusion criteria. Results: Almost all studies used PM2.5 concentration as an SHS marker. Mean PM2.5 concentrations reported for outdoor smoking areas when smokers were present ranged from 8.32 to 124 µg/m3 at hospitality venues, and 4.60 to 17.80 µg/m3 at other locations. Mean PM2.5 concentrations in smoke-free indoor settings near outdoor smoking areas ranged from 4 to 120.51 µg/m3. SHS levels increased when smokers were present, and outdoor and indoor SHS levels were related. Most studies reported a positive association between SHS measures and smoker density, enclosure of outdoor locations, wind conditions, and proximity to smokers. Conclusions: The available evidence indicates high SHS levels at some outdoor smoking areas and at adjacent smoke-free indoor areas. Further research and standardization of methodology is needed to determine whether smoke-free legislation should be extended to outdoor settings.