911 resultados para streams
Resumo:
Advances in hardware and software in the past decade allow to capture, record and process fast data streams at a large scale. The research area of data stream mining has emerged as a consequence from these advances in order to cope with the real time analysis of potentially large and changing data streams. Examples of data streams include Google searches, credit card transactions, telemetric data and data of continuous chemical production processes. In some cases the data can be processed in batches by traditional data mining approaches. However, in some applications it is required to analyse the data in real time as soon as it is being captured. Such cases are for example if the data stream is infinite, fast changing, or simply too large in size to be stored. One of the most important data mining techniques on data streams is classification. This involves training the classifier on the data stream in real time and adapting it to concept drifts. Most data stream classifiers are based on decision trees. However, it is well known in the data mining community that there is no single optimal algorithm. An algorithm may work well on one or several datasets but badly on others. This paper introduces eRules, a new rule based adaptive classifier for data streams, based on an evolving set of Rules. eRules induces a set of rules that is constantly evaluated and adapted to changes in the data stream by adding new and removing old rules. It is different from the more popular decision tree based classifiers as it tends to leave data instances rather unclassified than forcing a classification that could be wrong. The ongoing development of eRules aims to improve its accuracy further through dynamic parameter setting which will also address the problem of changing feature domain values.
Resumo:
This research has responded to the need for diagnostic reference tools explicitly linking the influence of environmental uncertainty and performance within the supply chain. Uncertainty is a key factor influencing performance and an important measure of the operating environment. We develop and demonstrate a novel reference methodology based on data envelopment analysis (DEA) for examining the performance of value streams within the supply chain with specific reference to the level of environmental uncertainty they face. In this paper, using real industrial data, 20 product supply value streams within the European automotive industry sector are evaluated. Two are found to be efficient. The peer reference groups for the underperforming value streams are identified and numerical improvement targets are derived. The paper demonstrates how DEA can be used to guide supply chain improvement efforts through role-model identification and target setting, in a way that recognises the multiple dimensions/outcomes of the supply chain process and the influence of its environmental conditions. We have facilitated the contextualisation of environmental uncertainty and its incorporation into a specific diagnostic reference tool.
Resumo:
Strong winds equatorwards and rearwards of a cyclone core have often been associated with two phenomena, the cold conveyor belt (CCB) jet and sting jets. Here, detailed observations of the mesoscale structure in this region of an intense cyclone are analysed. The {\it in-situ} and dropsonde observations were obtained during two research flights through the cyclone during the DIAMET (DIAbatic influences on Mesoscale structures in ExTratropical storms) field campaign. A numerical weather prediction model is used to link the strong wind regions with three types of ``air streams'', or coherent ensembles of trajectories: two types are identified with the CCB, hooking around the cyclone center, while the third is identified with a sting jet, descending from the cloud head to the west of the cyclone. Chemical tracer observations show for the first time that the CCB and sting jet air streams are distinct air masses even when the associated low-level wind maxima are not spatially distinct. In the model, the CCB experiences slow latent heating through weak resolved ascent and convection, while the sting jet experiences weak cooling associated with microphysics during its subsaturated descent. Diagnosis of mesoscale instabilities in the model shows that the CCB passes through largely stable regions, while the sting jet spends relatively long periods in locations characterized by conditional symmetric instability (CSI). The relation of CSI to the observed mesoscale structure of the bent-back front and its possible role in the cloud banding is discussed.
Resumo:
This paper argues that talent management and expatriation are two significantly overlapping but separate areas of research and that bringing the two together has significant and useful implications for both research and practice. We offer indications of how this bringing together might work, in particular developing the different results that will come from narrower and broader concepts of talent management. Our framework defines global talent management as a combination of high-potential development and global careers development. The goal of the paper is to lay the foundations for future research while encouraging organizations to manage expatriation strategically in a talent-management perspective.
Resumo:
An important application of Big Data Analytics is the real-time analysis of streaming data. Streaming data imposes unique challenges to data mining algorithms, such as concept drifts, the need to analyse the data on the fly due to unbounded data streams and scalable algorithms due to potentially high throughput of data. Real-time classification algorithms that are adaptive to concept drifts and fast exist, however, most approaches are not naturally parallel and are thus limited in their scalability. This paper presents work on the Micro-Cluster Nearest Neighbour (MC-NN) classifier. MC-NN is based on an adaptive statistical data summary based on Micro-Clusters. MC-NN is very fast and adaptive to concept drift whilst maintaining the parallel properties of the base KNN classifier. Also MC-NN is competitive compared with existing data stream classifiers in terms of accuracy and speed.
Resumo:
Crude enzymes produced via solid state fermentation (SSF) using wheat milling by-products have been employed for both fermentation media production using flour-rich waste (FRW) streams and lysis of Rhodosporidium toruloides yeast cells. Filter sterilization of crude hydrolysates was more beneficial than heat sterilization regarding yeast growth and microbial oil production. The initial carbon to free amino nitrogen ratio of crude hydrolysates was optimized (80.2 g/g) in fed-batch cultures of R. toruloides leading to a total dry weight of 61.2 g/L with microbial oil content of 61.8 % (w/w). Employing a feeding strategy where the glucose concentration was maintained in the range of 12.2 – 17.6 g/L led to the highest productivity (0.32 g/L∙h). The crude enzymes produced by SSF were utilised for yeast cell treatment leading to simultaneous release of around 80% of total lipids in the broth and production of a hydrolysate suitable as yeast extract replacement.
Resumo:
Flour-rich waste (FRW) and by-product streams generated by bakery, confectionery and wheat milling plants could be employed as the sole raw materials for generic fermentation media production, suitable for microbial oil synthesis. Wheat milling by-products were used in solid state fermentations (SSF) of Aspergillus awamori for the production of crude enzymes, mainly glucoamylase and protease. Enzyme-rich SSF solids were subsequently employed for hydrolysis of FRW streams into nutrient-rich fermentation media. Batch hydrolytic experiments using FRW concentrations up to 205 g/L resulted in higher than 90%(w/w) starch to glucose conversion yields and 40% (w/w) total Kjeldahl nitrogen to free amino nitro-gen conversion yields. Starch to glucose conversion yields of 98.2, 86.1 and 73.4% (w/w) were achieved when initial FRW concentrations of 235, 300 and 350 g/L were employed in fed-batch hydrolytic experiments, respectively. Crude hydrolysates were used as fermentation media in shake flask cultures with the oleaginous yeast Lipomyces starkeyi DSM 70296 reaching a total dry weight of 30.5 g/L with a microbial oil content of 40.4% (w/w), higher than that achieved in synthetic media. Fed-batch bioreactor cultures led to a total dry weight of 109.8 g/L with a microbial oil content of 57.8% (w/w) and productivity of 0.4 g/L/h.
Resumo:
In order to gain insights into events and issues that may cause errors and outages in parts of IP networks, intelligent methods that capture and express causal relationships online (in real-time) are needed. Whereas generalised rule induction has been explored for non-streaming data applications, its application and adaptation on streaming data is mostly undeveloped or based on periodic and ad-hoc training with batch algorithms. Some association rule mining approaches for streaming data do exist, however, they can only express binary causal relationships. This paper presents the ongoing work on Online Generalised Rule Induction (OGRI) in order to create expressive and adaptive rule sets real-time that can be applied to a broad range of applications, including network telemetry data streams.
Resumo:
1. Analyses of species association have major implications for selecting indicators for freshwater biomonitoring and conservation, because they allow for the elimination of redundant information and focus on taxa that can be easily handled and identified. These analyses are particularly relevant in the debate about using speciose groups (such as the Chironomidae) as indicators in the tropics, because they require difficult and time-consuming analysis, and their responses to environmental gradients, including anthropogenic stressors, are poorly known. 2. Our objective was to show whether chironomid assemblages in Neotropical streams include clear associations of taxa and, if so, how well these associations could be explained by a set of models containing information from different spatial scales. For this, we formulated a priori models that allowed for the influence of local, landscape and spatial factors on chironomid taxon associations (CTA). These models represented biological hypotheses capable of explaining associations between chironomid taxa. For instance, CTA could be best explained by local variables (e.g. pH, conductivity and water temperature) or by processes acting at wider landscape scales (e.g. percentage of forest cover). 3. Biological data were taken from 61 streams in Southeastern Brazil, 47 of which were in well-preserved regions, and 14 of which drained areas severely affected by anthropogenic activities. We adopted a model selection procedure using Akaike`s information criterion to determine the most parsimonious models for explaining CTA. 4. Applying Kendall`s coefficient of concordance, seven genera (Tanytarsus/Caladomyia, Ablabesmyia, Parametriocnemus, Pentaneura, Nanocladius, Polypedilum and Rheotanytarsus) were identified as associated taxa. The best-supported model explained 42.6% of the total variance in the abundance of associated taxa. This model combined local and landscape environmental filters and spatial variables (which were derived from eigenfunction analysis). However, the model with local filters and spatial variables also had a good chance of being selected as the best model. 5. Standardised partial regression coefficients of local and landscape filters, including spatial variables, derived from model averaging allowed an estimation of which variables were best correlated with the abundance of associated taxa. In general, the abundance of the associated genera tended to be lower in streams characterised by a high percentage of forest cover (landscape scale), lower proportion of muddy substrata and high values of pH and conductivity (local scale). 6. Overall, our main result adds to the increasing number of studies that have indicated the importance of local and landscape variables, as well as the spatial relationships among sampling sites, for explaining aquatic insect community patterns in streams. Furthermore, our findings open new possibilities for the elimination of redundant data in the assessment of anthropogenic impacts on tropical streams.
Resumo:
The pervasive and ubiquitous computing has motivated researches on multimedia adaptation which aims at matching the video quality to the user needs and device restrictions. This technique has a high computational cost which needs to be studied and estimated when designing architectures and applications. This paper presents an analytical model to quantify these video transcoding costs in a hardware independent way. The model was used to analyze the impact of transcoding delays in end-to-end live-video transmissions over LANs, MANs and WANs. Experiments confirm that the proposed model helps to define the best transcoding architecture for different scenarios.
Resumo:
http://digitalcommons.colby.edu/atlasofmaine2006/1007/thumbnail.jpg
Resumo:
Comunidades de Ephemeroptera, Plecoptera e Trichoptera (EPT) em substrato rochoso foram estudadas em dois riachos do Parque Estadual Intervales. Coletas com um amostrador de Surber (10 subamostras aleatórias, 1 m²) foram feitas mensalmente de setembro de 1999 a setembro de 2000 e trimestralmente de dezembro de 2000 a setembro de 2001 nos Ribeirões Bocaina e Água Comprida. A fauna de EPT do Ribeirão Bocaina foi mais diversificada e mais abundante do que a do Ribeirão Água Comprida. A fauna de EPT foi bastante diferente entre os dois riachos, tanto do ponto de vista da composição faunística quanto do ponto de vista funcional. Os resultados indicaram que não houve um padrão sazonal claro da variação temporal da densidade.