992 results for Database accession number


Relevance:

30.00%

Abstract:

Sequential pattern mining is an important subject in data mining with broad applications in many areas. However, previous sequential mining algorithms mostly aimed to count the number of occurrences (the support) without regard to the relative importance of different data items. In this paper, we propose to explore the search space of subsequences with normalized weights. We are interested not only in the number of occurrences of the sequences (their supports) but also in their importance (their weights). When generating subsequence candidates, we use both the support and the weight of the candidates while maintaining the downward closure property of these patterns, which accelerates candidate generation.
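The idea of combining support and weight can be illustrated with a minimal sketch. It is not the authors' algorithm: the function names, the use of the average item weight as the pattern weight, and the pruning bound based on the maximum item weight are all assumptions made for illustration. The bound works because support never increases when a pattern is extended, so support times a constant maximum weight keeps the downward closure property.

```python
from typing import Dict, List, Sequence


def is_subsequence(pattern: Sequence[str], sequence: Sequence[str]) -> bool:
    """True if the items of pattern occur in sequence in the same order."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def support(pattern: Sequence[str], database: List[Sequence[str]]) -> float:
    """Fraction of sequences in the database that contain the pattern."""
    return sum(is_subsequence(pattern, s) for s in database) / len(database)


def weighted_support(pattern: Sequence[str],
                     database: List[Sequence[str]],
                     weights: Dict[str, float]) -> float:
    """Support scaled by the average normalized weight of the pattern's items
    (assumed definition of the pattern weight)."""
    avg_weight = sum(weights[i] for i in pattern) / len(pattern)
    return support(pattern, database) * avg_weight


def may_extend(pattern: Sequence[str],
               database: List[Sequence[str]],
               weights: Dict[str, float],
               min_wsup: float) -> bool:
    """Anti-monotone pruning test: support * max item weight is an upper
    bound on the weighted support of every extension of the pattern."""
    return support(pattern, database) * max(weights.values()) >= min_wsup


# Hypothetical toy data, not taken from the paper.
database = [["a", "b", "c"], ["a", "c"], ["b", "c"]]
weights = {"a": 0.9, "b": 0.5, "c": 0.2}
print(weighted_support(["a", "c"], database, weights))
```

A candidate whose `may_extend` test fails can be discarded together with all of its extensions, which is where the speed-up in candidate generation would come from under these assumptions.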

Relevance:

30.00%

Abstract:

Database design is a difficult problem for non-expert designers. It is desirable to assist such designers during the problem solving process by means of a knowledge based (KB) system. A number of prototype KB systems have been proposed; however, there are many shortcomings. Few have incorporated sufficient expertise in modeling relationships, particularly higher order relationships. There has been no empirical study that experimentally tested the effectiveness of any of these KB tools. Problem solving behavior of non-experts, whom the systems were intended to assist, has not been one of the bases for system design. In this project a consulting system for conceptual database design that addresses the above shortcomings was developed and empirically validated. The system incorporates (a) findings on why non-experts commit errors and (b) heuristics for modeling relationships. Two approaches to knowledge base implementation, system restrictiveness and decisional guidance, were used and compared in this project. The Restrictive approach is proscriptive and limits the designer's choices at various design phases by forcing him/her to follow a specific design path. The Guidance approach, which is less restrictive, provides context specific, informative and suggestive guidance throughout the design process. The main objectives of the study are to evaluate (1) whether the knowledge-based system is more effective than a system without the knowledge base and (2) which knowledge implementation strategy, restrictive or guidance, is more effective. To evaluate the effectiveness of the knowledge base itself, the two systems were compared with a system that does not incorporate the expertise (Control). The experimental procedure involved the student subjects solving a task without using the system (pre-treatment task) and another task using one of the three systems (experimental task).
The experimental task scores of those subjects who performed satisfactorily in the pre-treatment task were analyzed. The results are: (1) the knowledge based approach to database design support led to more accurate solutions than the control system; (2) there was no significant difference between the two KB approaches; (3) the Guidance approach led to the best performance; and (4) the subjects perceived the Restrictive system as easier to use than the Guidance system.

Relevance:

30.00%

Abstract:

Database design is a difficult problem for non-expert designers. It is desirable to assist such designers during the problem solving process by means of a knowledge based (KB) system. Although a number of prototype KB systems have been proposed, there are many shortcomings. Firstly, few have incorporated sufficient expertise in modeling relationships, particularly higher order relationships. Secondly, there does not seem to be any published empirical study that experimentally tested the effectiveness of any of these KB tools. Thirdly, problem solving behavior of non-experts, whom the systems were intended to assist, has not been one of the bases for system design. In this project, a consulting system for conceptual database design, called CODA, that addresses the above shortcomings was developed and empirically validated. More specifically, the CODA system incorporates (a) findings on why non-experts commit errors and (b) heuristics for modeling relationships. Two approaches to knowledge base implementation were used and compared in this project, namely system restrictiveness and decisional guidance (Silver 1990). The Restrictive system uses a proscriptive approach and limits the designer's choices at various design phases by forcing him/her to follow a specific design path. The Guidance approach, which is less restrictive, provides context specific, informative and suggestive guidance throughout the design process. Both approaches would prevent erroneous design decisions. The main objectives of the study are to evaluate (1) whether the knowledge-based system is more effective than the system without a knowledge base and (2) which approach to knowledge implementation, Restrictive or Guidance, is more effective. To evaluate the effectiveness of the knowledge base itself, the systems were compared with a system that does not incorporate the expertise (Control).
An experimental procedure using student subjects was used to test the effectiveness of the systems. The subjects solved a task without using the system (pre-treatment task) and another task using one of the three systems, viz. Control, Guidance or Restrictive (experimental task). Analysis of experimental task scores of those subjects who performed satisfactorily in the pre-treatment task revealed that the knowledge based approach to database design support led to more accurate solutions than the control system. Of the two KB approaches, the Guidance approach was found to lead to better performance when compared to the Control system. It was found that the subjects perceived the Restrictive system as easier to use than the Guidance system.

Relevance:

30.00%

Abstract:

We present a database of representative BRDFs and BPDFs derived from the POLDER measurements. From the huge amount of data acquired by the spaceborne instrument over a period of 7 years, we selected a set of targets with high quality observations. The selection aimed at a large number of observations, free of cloud or aerosol contamination, acquired in diverse observation geometries, with a focus on the backscatter direction that shows the specific Hot-Spot signature. The targets are sorted according to the 16-class IGBP land cover classification system, and the target selection aims at spatial representativeness within each class. The database thus provides a set of high quality BRDF and BPDF samples that can be used to assess the typical variability of natural surface reflectances or to evaluate models.

Relevance:

30.00%

Abstract:

In this study we review a global set of alkenone- and foraminiferal Mg/Ca-derived sea surface temperature (SST) records from the Holocene and compare them with a suite of published Eemian SST records based on the same approach. For the Holocene, the alkenone SST records belong to the updated GHOST database (Kim, J.-H., Schneider R.R., 2004). The updated GHOST database not only confirms the SST changes previously described but also documents the Holocene temperature evolution in new oceanic regions such as the Northwestern Atlantic, the eastern equatorial Pacific, and the Southern Ocean. A comparison of Holocene SST records stemming from the two commonly applied paleothermometry methods reveals contrasting, sometimes divergent, SST evolution, particularly at low latitudes where SST records are abundant enough to infer systematic discrepancies at a regional scale. Opposite SST trends at particular locations could be explained by out-of-phase trends in seasonal insolation during the Holocene. This hypothesis assumes that a strong contrast in the ecological responses of coccolithophores and planktonic foraminifera to winter and summer oceanographic conditions is the ultimate reason for seasonal differences in the origin of the temperature signal provided by these organisms. As a simple test of this hypothesis, Eemian SST records are considered because the Holocene and Eemian time periods experienced comparable changes in orbital configurations, but with a higher magnitude of insolation variance during the Eemian. For several regions, SST changes during both interglacials were of a similar sign, but with higher magnitudes during the Eemian than during the Holocene. This observation suggests that the ecological mechanism shaping SST trends during the Holocene was comparable during the penultimate interglacial period. 
Although this "ecology hypothesis" fails to explain all of the available results, we argue that any other mechanism would fail to satisfactorily explain the observed SST discrepancies among proxies.

Relevance:

30.00%

Abstract:

Two years of harmonized aerosol number size distribution data from 24 European field monitoring sites have been analysed. The results give a comprehensive overview of the European near surface aerosol particle number concentrations and number size distributions between 30 and 500 nm of dry particle diameter. Spatial and temporal distributions of aerosols in the particle sizes most important for climate applications are presented. We also analyse the annual, weekly and diurnal cycles of the aerosol number concentrations, provide log-normal fitting parameters for median number size distributions, and give guidance notes for data users. Emphasis is placed on the usability of results within the aerosol modelling community. We also show that the aerosol number concentrations of Aitken and accumulation mode particles (with 100 nm dry diameter as a cut-off between modes) are related, although there is significant variation in the ratios of the modal number concentrations. Different aerosol and station types are distinguished from these data, and this methodology has potential for further categorization of stations' aerosol number size distribution types. The European submicron aerosol was divided into characteristic types: Central European aerosol, characterized by single mode median size distributions, unimodal number concentration histograms and low variability in CCN-sized aerosol number concentrations; Nordic aerosol with low number concentrations, although showing pronounced seasonal variation, especially of Aitken mode particles; Mountain sites (altitude over 1000 m a.s.l.) with a strong seasonal cycle in aerosol number concentrations, high variability, and very low median number concentrations. Southern and Western European regions had fewer stations, which decreases the regional coverage of these results.
Aerosol number concentrations over Britain and Ireland had very high variance and there are indications of mixed air masses from several source regions; the Mediterranean aerosol exhibits high seasonality and a strong accumulation mode in the summer. The greatest concentrations were observed at the Ispra station in Northern Italy, with high accumulation mode number concentrations in the winter. The aerosol number concentrations at the Arctic station Zeppelin in Ny-Ålesund in Svalbard also have a strong seasonal cycle, with greater concentrations of accumulation mode particles in winter and a dominating summer Aitken mode indicating more recently formed particles. Observed particles did not show any statistically significant regional work-week or weekday related variation in the number concentrations studied. Analysis products are openly available to the research community on a freely accessible website. The results give the modelling community a reliable, easy-to-use and freely available comparison dataset of aerosol size distributions.
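For data users, log-normal fitting parameters of the kind mentioned above are usually evaluated with the standard multi-modal log-normal form of the number size distribution. The sketch below assumes that each mode is described by a total number concentration, a geometric mean diameter and a geometric standard deviation; the function names and the example parameter values are hypothetical and are not taken from the dataset.

```python
import math
from typing import Iterable, Tuple

# One mode: (number concentration in cm^-3, geometric mean diameter in nm,
# geometric standard deviation, dimensionless and > 1).
Mode = Tuple[float, float, float]


def lognormal_mode(dp_nm: float, n_total: float,
                   dpg_nm: float, sigma_g: float) -> float:
    """dN/dlog10(Dp) of a single log-normal mode at diameter dp_nm."""
    log_sigma = math.log10(sigma_g)
    z = (math.log10(dp_nm) - math.log10(dpg_nm)) / log_sigma
    return n_total / (math.sqrt(2.0 * math.pi) * log_sigma) * math.exp(-0.5 * z * z)


def size_distribution(dp_nm: float, modes: Iterable[Mode]) -> float:
    """Sum of all modes, e.g. an Aitken plus an accumulation mode."""
    return sum(lognormal_mode(dp_nm, n, dpg, sg) for n, dpg, sg in modes)


# Illustrative two-mode parameter set (made-up values).
example_modes = [(1500.0, 50.0, 1.7), (800.0, 180.0, 1.6)]
print(size_distribution(100.0, example_modes))
```

Integrating a mode over log10(Dp) recovers its number concentration, which is a convenient check when fitted parameters are transcribed from a table.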

Relevance:

30.00%

Abstract:

Five years of SMOS L-band brightness temperature data intercepting a large number of tropical cyclones (TCs) are analyzed. The storm-induced half-power radio-brightness contrast (ΔI) is defined as the difference between the brightness observed at a specific wind force and that for a smooth water surface with the same physical parameters. ΔI can be related to surface wind speed and has been estimated for ~300 TCs intercepted by SMOS measurements. ΔI, expressed in a common storm-centric coordinate system, shows that the mean brightness contrast increases monotonically with storm intensity, ranging from ~5 K for strong storms to ~24 K for the most intense Category 5 TCs. A remarkable feature of the 2D mean ΔI fields and their variability is that maxima are systematically found in the right quadrants of the storms in the storm-centered coordinate frame, consistent with the reported asymmetric structure of the wind and wave fields in hurricanes. These results highlight the strong potential of SMOS measurements to improve monitoring of TC intensification and evolution. An improved empirical geophysical model function (GMF) was derived using a large ensemble of co-located SMOS ΔI, aircraft and H*WIND (a multi-measurement analysis) surface wind speed data. The GMF reveals a quadratic relationship between ΔI and the surface wind speed at a height of 10 m (U10). ECMWF and NCEP analysis products and SMOS derived wind speed estimates are compared to a large ensemble of H*WIND 2D fields. This analysis confirms that the surface wind speed in TCs can effectively be retrieved from SMOS data with an RMS error on the order of 10 kt up to 100 kt. SMOS wind speed products above hurricane force (64 kt) are found to be more accurate than those derived from NWP analysis products, which systematically underestimate the surface wind speed in these extreme conditions. 
Using co-located estimates of rain rate, we show that the L-band radio-brightness contrasts could be weakly affected by rain or ice-phase clouds and further work is required to refine the GMF in this context.
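A quadratic GMF of this kind can be inverted in closed form to retrieve wind speed from a measured contrast. The sketch below assumes the generic form ΔI = a·U10² + b·U10 + c with a > 0; the abstract does not give the coefficients, so `a`, `b` and `c` are placeholders, and the numbers in the usage lines are arbitrary values chosen only to exercise the functions.

```python
import math


def delta_i_from_wind(u10: float, a: float, b: float, c: float) -> float:
    """Forward model: brightness contrast (K) for a 10 m wind speed,
    assuming a quadratic GMF with placeholder coefficients a, b, c."""
    return a * u10 ** 2 + b * u10 + c


def wind_from_delta_i(delta_i: float, a: float, b: float, c: float) -> float:
    """Inverse model: the larger root of a*U^2 + b*U + (c - delta_i) = 0."""
    if a <= 0:
        raise ValueError("this sketch assumes a > 0")
    disc = b * b - 4.0 * a * (c - delta_i)
    if disc < 0:
        raise ValueError("contrast is below the minimum of the quadratic")
    return (-b + math.sqrt(disc)) / (2.0 * a)


# Arbitrary placeholder coefficients, not the published GMF.
a, b, c = 0.002, 0.05, 0.0
contrast = delta_i_from_wind(40.0, a, b, c)
print(wind_from_delta_i(contrast, a, b, c))
```

Taking the larger root keeps the retrieval on the branch where the contrast increases with wind speed, which matches the monotonic increase described in the abstract.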

Relevance:

20.00%

Abstract:

Negative-ion mode electrospray ionization, ESI(-), with Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR MS) was coupled to Partial Least Squares (PLS) regression and variable selection methods to estimate the total acid number (TAN) of Brazilian crude oil samples. Generally, ESI(-)-FT-ICR mass spectra have a resolving power of ca. 500,000 and a mass accuracy better than 1 ppm, producing a data matrix containing over 5700 variables per sample. These variables correspond to heteroatom-containing species detected as deprotonated molecules, [M − H]⁻ ions, which are identified primarily as naphthenic acids, phenols and carbazole analog species. The TAN values for all samples ranged from 0.06 to 3.61 mg KOH g⁻¹. To facilitate the spectral interpretation, three methods of variable selection were studied: variable importance in the projection (VIP), interval partial least squares (iPLS) and elimination of uninformative variables (UVE). The UVE method seems to be the most appropriate for selecting important variables, reducing the number of variables to 183 and producing a root mean square error of prediction of 0.32 mg KOH g⁻¹. By reducing the size of the data, it was possible to relate the selected variables to their corresponding molecular formulas, thus identifying the main chemical species responsible for the TAN values.
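The root mean square error of prediction quoted above is conventionally computed on samples held out from calibration. A minimal sketch of that metric follows; the function name and the example values are hypothetical and do not reproduce the study's data or its PLS model.

```python
import math
from typing import Sequence


def rmsep(reference: Sequence[float], predicted: Sequence[float]) -> float:
    """Root mean square error of prediction over a set of test samples."""
    if len(reference) != len(predicted) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - r) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up reference and predicted TAN values (mg KOH per g), for illustration.
tan_reference = [0.10, 0.85, 1.90, 3.20]
tan_predicted = [0.25, 0.70, 2.10, 3.05]
print(rmsep(tan_reference, tan_predicted))
```

Because the metric is in the units of the measured property, it can be compared directly with the range of TAN values in the sample set.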

Relevance:

20.00%

Abstract:

The basic reproduction number is a key parameter in mathematical modelling of transmissible diseases. From the stability analysis of the disease-free equilibrium, by applying the Routh-Hurwitz criteria, a threshold is obtained, which is called the basic reproduction number. However, the application of spectral radius theory to the next generation matrix provides a different expression for the basic reproduction number, namely the square root of the previously found formula. If the spectral radius of the next generation matrix is defined as the geometric mean of partial reproduction numbers, and the product of these partial numbers is taken as the basic reproduction number, then both methods provide the same expression. To demonstrate this statement, dengue transmission modelling with and without transovarian transmission is considered as a case study. Tuberculosis transmission and sexually transmitted infection models are taken as further examples.
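The square-root relationship can be seen in the simplest two-stage case. The following worked example is a sketch under assumed notation: $R_{hv}$ and $R_{vh}$ denote the two partial reproduction numbers of a generic host-vector cycle without transovarian transmission, and $K$ is the corresponding next generation matrix; none of these symbols are taken from the paper.

```latex
K = \begin{pmatrix} 0 & R_{vh} \\ R_{hv} & 0 \end{pmatrix},
\qquad
\det(K - \lambda I) = \lambda^{2} - R_{hv} R_{vh} = 0
\;\Longrightarrow\;
\rho(K) = \sqrt{R_{hv} R_{vh}} .

% Defining the basic reproduction number as the product of the partial numbers:
R_{0} = R_{hv} R_{vh} = \rho(K)^{2},
\qquad
R_{0} > 1 \iff \rho(K) > 1 .
```

Under these assumptions the spectral radius is the geometric mean of the two partial numbers, and both quantities cross the threshold value 1 at the same point, so the two methods agree on the stability condition.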

Relevance:

20.00%

Abstract:

Different types of water bodies, including lakes, streams, and coastal marine waters, are often susceptible to fecal contamination from a range of point and nonpoint sources, and have been evaluated using fecal indicator microorganisms. The most commonly used fecal indicator is Escherichia coli, but traditional cultivation methods do not allow discrimination of the source of pollution. The use of triplex PCR offers an approach that is fast and inexpensive, and here enabled the identification of phylogroups. The phylogenetic distribution of E. coli subgroups isolated from water samples revealed higher frequencies of subgroups A1 and B23 in rivers impacted by human pollution sources, while subgroups D1 and D2 were associated with pristine sites, and subgroup B1 with domesticated animal sources, suggesting their use as a first screening for pollution source identification. A simple classification is also proposed based on phylogenetic subgroup distribution using the w-clique metric, enabling differentiation of polluted and unpolluted sites.

Relevance:

20.00%

Abstract:

Metastasizing pleomorphic adenoma (MPA) is a rare tumour, and its mechanism of metastasis is still unknown. To date, there has been no study on MPA genomics. We analysed primary and secondary MPAs with array comparative genomic hybridization to identify somatic copy number alterations and affected genes. Tumour DNA samples from primary (parotid salivary gland) and secondary (scalp skin) MPAs were subjected to array comparative genomic hybridization, and the data were analysed with NEXUS COPY NUMBER DISCOVERY. The primary MPA showed copy number losses affecting 3p22.2p14.3 and 19p13.3p123, and a complex pattern of four different deletions at chromosome 6. The 3p deletion encompassed several genes: CTNNB1, SETD2, BAP1, and PBRM1, among others. The secondary MPA showed a genomic profile similar to that of the primary MPA, with acquisition of additional copy number changes affecting 9p24.3p13.1 (loss), 19q11q13.43 (gain), and 22q11.1q13.33 (gain). Our findings indicated a clonal origin of the secondary MPA, as both tumours shared a common profile of genomic copy number alterations. Furthermore, we were able to detect in the primary tumour a specific pattern of copy number alterations that could explain its metastasizing behaviour, whereas the secondary MPA showed a more unbalanced genome.

Relevance:

20.00%

Abstract:

American visceral leishmaniasis (AVL) is an emerging disease in the state of São Paulo, Brazil. Its geographical expansion and the increase in the number of human cases have been linked to the dispersion of Lutzomyia longipalpis into urban areas. To produce more accurate risk maps, we investigated the geographic distribution and routes of expansion of the disease as well as the chemotype populations of the vector. A database containing the annual records of municipalities that had notified human and canine AVL cases, as well as the presence of the vector, was compiled. The chemotypes of L. longipalpis populations from municipalities in different regions of São Paulo State were determined by coupled gas chromatography-mass spectrometry. From 1997 to June 2014, L. longipalpis was reported in 166 municipalities, 148 of them in the Western region. A total of 106 municipalities were identified with transmission, and 99 were located in the Western region, where all 2,204 autochthonous human cases occurred. Both the vector and the occurrence of human cases have expanded in a south-easterly direction, from the Western to the central region, and from there further to the North and the South. The (S)-9-methylgermacrene-B population of L. longipalpis is widely distributed in the Western region, and the cembrene-1 population is restricted to the Eastern region. The maps in the present study show that there are two distinct epidemiological patterns of AVL in São Paulo State and that the expansion of human and canine AVL cases through the Western region has followed the dispersion route of only one of the two species of the L. longipalpis complex, (S)-9-methylgermacrene-B. Entomological surveillance based on the routes of dispersion and identification of the chemotype population could be used to identify at-risk areas and consequently define the priorities for control measures.

Relevance:

20.00%

Abstract:

Typical orofacial clefts (OFCs) comprise cleft lip, cleft palate and cleft lip and palate. The complex etiology has been postulated to involve chromosome rearrangements, gene mutations and environmental factors. A group of genes including IRF6, FOXE1, GLI2, MSX2, SKI, SATB2, MSX1 and FGF has been implicated in the etiology of OFCs. Recently, the role of copy number variations (CNVs) has been studied in genetic defects and diseases. CNVs act by modifying gene expression, disrupting gene sequence or altering gene dosage. The aims of this study were to screen the above-mentioned genes and to investigate CNVs in patients with OFCs. The sample was composed of 23 unrelated individuals who were grouped according to phenotype (associated with other anomalies or isolated) and familial recurrence. New sequence variants in GLI2, MSX1 and FGF8 were detected in patients, but not in their parents or in 200 control chromosomes, indicating that these were rare variants. CNV screening identified new genes that may influence OFC pathogenesis, particularly TCEB3 and KIF7, which could be further analyzed. The findings of the present study suggest that CNVs associated with sequence variants may play a role in the etiology of OFC.

Relevance:

20.00%

Abstract:

Owing to current environmental problems and the need for environmental preservation, both the territorial management of hydrographic basins and the conservation of natural resources have become remarkably important. The main goal of this research is therefore to collect and analyze socio-economic and technological data from the Mogi Guaçu River Hydrographic Basin (São Paulo, Brazil). The aim is to group municipalities with similar characteristics according to the collected data, which may direct joint actions in the management of the basin. Both factor analysis and automatic hierarchical classification were used. Additionally, a Geographical Information System will be applied to represent the outcomes of these methods through the development of a geo-referenced database, which will provide categorically distributed information, including thematic maps of interest. The main characteristics adopted to group the municipalities were: agricultural area, sugar cane production, small farms, animal production, number of agricultural machines and equipment, and agricultural income. The methodology adopted for the Mogi Guaçu River Hydrographic Basin will be analyzed for its appropriateness to basin management, as well as for its potential to assist the studies of the São Paulo Hydrographic Basin groups on regional development.

Relevance:

20.00%

Abstract:

The family Malpighiaceae comprises species with different habits, fruit types and cytological characters. The climbing habit is considered the most derived, followed by the shrubby and arboreal habits, respectively. The present study examines the relationship between basic chromosome numbers and the derivation of climbing habit and fruit types in Malpighiaceae. A comparison of all the chromosome number reports for Malpighiaceae showed a predominance of chromosome numbers based on x=5 or 10 in the genera of sub-family Malpighioideae, mainly represented by climbers with winged fruits, whereas non-climbing species with non-winged fruits, which predominate in sub-family Byrsonimoideae, had counts based on x=6, which is considered the least derived basic number for the family. Based on these data, confirmed by statistical tests, and on the monophyletic origin of this family, we accept the hypothesis that morphological derivation of habit and fruit is correlated with variation in basic chromosome number in the family Malpighiaceae.