981 results for proposed text.


Relevance:

60.00%

Publisher:

Abstract:

The increasing diversity of the Internet has created a vast number of multilingual resources on the Web. A huge number of these documents are written in languages other than English. Consequently, the demand for searching in non-English languages is growing exponentially. It is desirable that a search engine can search for information over collections of documents in other languages. This research investigates the techniques for developing high-quality Chinese information retrieval systems. A distinctive feature of Chinese text is that a Chinese document is a sequence of Chinese characters with no space or boundary between Chinese words. This feature makes Chinese information retrieval more difficult: a retrieved document that contains the query term as a sequence of Chinese characters may not be relevant to the query, because the query term (as a sequence of Chinese characters) may not be a valid Chinese word in that document. On the other hand, a document that is actually relevant may not be retrieved because it does not contain the query sequence but contains other relevant words. In this research, we propose two approaches to deal with these problems. In the first approach, we propose a hybrid Chinese information retrieval model that combines word-based techniques with the traditional character-based techniques. The aim of this approach is to investigate the influence of Chinese segmentation on the performance of Chinese information retrieval. Two ranking methods are proposed to rank retrieved documents by their relevance to the query, calculated by combining character-based ranking and word-based ranking. Our experimental results show that Chinese segmentation can improve the performance of Chinese information retrieval, but the improvement is not significant if only Chinese segmentation is combined with the traditional character-based approach.
In the second approach, we propose a novel query expansion method which applies text mining techniques in order to find the most relevant words to extend the query. Unlike most existing query expansion methods, which generally select the highly frequent indexing terms from the retrieved documents to expand the query, our approach uses text mining techniques to find patterns in the retrieved documents that correlate highly with the query term and then uses the relevant words in the patterns to expand the original query. This research project develops and implements a Chinese information retrieval system for evaluating the proposed approaches. There are two stages in the experiments. The first stage investigates whether high-accuracy segmentation can improve Chinese information retrieval. In the second stage, a text mining based query expansion approach is implemented and a further experiment compares the performance of the standard Rocchio approach with that of the proposed text mining based query expansion method. The NTCIR5 Chinese collections are used in the experiments. The experimental results show that by incorporating the text mining based query expansion with the hybrid model, significant improvement has been achieved in both precision and recall assessments.
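
For orientation, the standard Rocchio baseline named in the abstract can be sketched as follows. This is a minimal illustration of textbook Rocchio feedback, not the authors' text mining method or their system. The function name `rocchio_expand`, the dict-based term vectors, the coefficient defaults (`alpha`, `beta`, `gamma`) and the `top_k` cut-off are all assumptions made for the example.

```python
from collections import defaultdict


def rocchio_expand(query_vec, relevant_docs, nonrelevant_docs=(),
                   alpha=1.0, beta=0.75, gamma=0.15, top_k=5):
    """Textbook Rocchio feedback on sparse term vectors (dict: term -> weight).

    The coefficient defaults are illustrative assumptions, not values
    taken from the abstract.
    """
    scores = defaultdict(float)

    # Start from the original query, scaled by alpha.
    for term, weight in query_vec.items():
        scores[term] += alpha * weight

    # Add the centroid of relevant documents, subtract that of non-relevant ones.
    for docs, coeff in ((relevant_docs, beta), (nonrelevant_docs, -gamma)):
        if not docs:
            continue
        for doc in docs:
            for term, weight in doc.items():
                scores[term] += coeff * weight / len(docs)

    # Keep every original query term, plus the top_k positively weighted new terms.
    candidates = [(t, w) for t, w in scores.items()
                  if t not in query_vec and w > 0]
    candidates.sort(key=lambda tw: (-tw[1], tw[0]))

    expanded = {term: scores[term] for term in query_vec}
    expanded.update(candidates[:top_k])
    return expanded
```

In this sketch, expansion terms come from weighted term frequencies in the feedback documents. That is the kind of frequency-driven selection the abstract contrasts with its pattern-based approach.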

Relevance:

60.00%

Publisher:

Abstract:

Despite continuing developments in information technology and the growing economic significance of the emerging Eastern European, South American and Asian economies, international financial activity remains strongly concentrated in a relatively small number of international financial centres. That concentration of financial activity requires a critical mass of office occupation and creates demand for high specification, high cost space. The demand for that space is increasingly linked to the fortunes of global capital markets. That linkage has been emphasised by developments in real estate markets, notably the development of global real estate investment, innovation in property investment vehicles and the growth of debt securitisation. The resultant interlinking of occupier, asset, debt and development markets within and across global financial centres is a source of potential volatility and risk. The paper sets out a broad conceptual model of the linkages and their implications for systemic market risk and presents preliminary empirical results that provide support for the model proposed.

Relevance:

60.00%

Publisher:

Abstract:

We model the rolling of a standard die, using a Markov matrix. Though a die may be called ‘fair’, its initial position influences a roll’s outcome. This being undesirable, a simple solution is proposed.
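
The abstract does not give its matrix, so the following is only a sketch of how such a model might look. It assumes, hypothetically, that each tumble tips the die onto one of the four faces adjacent to the current top face with equal probability, and that opposite faces sum to 7. The function names `tumble_matrix` and `distribution_after` are invented for the example.

```python
from fractions import Fraction


def tumble_matrix():
    """6x6 Markov matrix for the top face under the assumed tumble model.

    Row i / column j correspond to faces i+1 and j+1. A tumble moves the
    top face to one of its four neighbours (never itself, never the
    opposite face 7 - i), each with probability 1/4.
    """
    faces = range(1, 7)
    return [[Fraction(1, 4) if j != i and j != 7 - i else Fraction(0)
             for j in faces]
            for i in faces]


def distribution_after(start_face, n_tumbles):
    """Exact distribution of the top face after n_tumbles, starting from start_face."""
    P = tumble_matrix()
    dist = [Fraction(int(face == start_face)) for face in range(1, 7)]
    for _ in range(n_tumbles):
        # Row-vector times matrix: new_dist[j] = sum_i dist[i] * P[i][j].
        dist = [sum(dist[i] * P[i][j] for i in range(6)) for j in range(6)]
    return dist
```

Under these assumptions, a die that starts with face 1 up and tumbles twice shows 1 or 6 with probability 1/4 each and every other face with probability 1/8. The roll is therefore not uniform after a small number of tumbles, even though the uniform distribution is stationary for this matrix.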

Relevance:

60.00%

Publisher:

Abstract:

At least three ferritins are found in the bacterium Escherichia coli: the heme-containing bacterioferritin (EcBFR) and two non-heme bacterial ferritins (EcFtnA and EcFtnB). In addition to the conserved A- and B-sites of the diiron ferroxidase center, EcFtnA has a third iron-binding site (the C-site) of unknown function near the diiron site. In the present work, the complex chemistry of iron oxidation and deposition in EcFtnA has been further defined through a combination of oximetry, pH stat, stopped-flow and conventional kinetics, UV-visible, fluorescence and EPR spectroscopic measurements on the wild-type protein and site-directed variants of the A-, B- and C-sites. The data reveal that, while H2O2 is a product of dioxygen reduction in EcFtnA and oxidation occurs with a stoichiometry of Fe(II)/O2 ~ 3:1, most of the H2O2 produced is consumed in subsequent reactions with a 2:1 Fe(II)/H2O2 stoichiometry, thus suppressing hydroxyl radical formation. While the A- and B-sites are essential for rapid iron oxidation, the C-site slows oxidation and suppresses iron turnover at the ferroxidase center. A tyrosyl radical, assigned to Tyr24 near the ferroxidase center, is formed during iron oxidation and its possible significance to the function of the protein is discussed. Taken as a whole, the data indicate that there are multiple iron-oxidation pathways in EcFtnA with O2 and H2O2 as oxidants. Furthermore, the data are inconsistent with the C-site being a transit site, providing iron to the A- and B-sites, and do not support a universal mechanism for iron oxidation in all ferritins as recently proposed.

Relevance:

60.00%

Publisher:

Abstract:

A truly variance-minimizing filter is introduced and its performance is demonstrated with the Korteweg–de Vries (KdV) equation and with a multilayer quasigeostrophic model of the ocean area around South Africa. It is recalled that Kalman-like filters are not variance minimizing for nonlinear model dynamics and that four-dimensional variational data assimilation (4DVAR)-like methods relying on perfect model dynamics have difficulty with providing error estimates. The new method does not have these drawbacks. In fact, it combines advantages from both methods in that it does provide error estimates while automatically having balanced states after analysis, without extra computations. It is based on ensemble or Monte Carlo integrations to simulate the probability density of the model evolution. When observations are available, the so-called importance resampling algorithm is applied. From Bayes’s theorem it follows that each ensemble member receives a new weight dependent on its “distance” to the observations. Because the weights are strongly varying, a resampling of the ensemble is necessary. This resampling is done such that members with high weights are duplicated according to their weights, while low-weight members are largely ignored. In passing, it is noted that data assimilation is not an inverse problem by nature, although it can be formulated that way. Also, it is shown that the posterior variance can be larger than the prior if the usual Gaussian framework is set aside. However, in the examples presented here, the entropy of the probability densities is decreasing. The application to the ocean area around South Africa, governed by strongly nonlinear dynamics, shows that the method is working satisfactorily. The strong and weak points of the method are discussed and possible improvements are proposed.
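
The weighting and resampling step described above can be sketched for a scalar state as follows. This is a minimal illustration, not the paper's implementation. It assumes a directly observed scalar state with Gaussian observation error (the abstract does not specify the likelihood) and uses plain multinomial resampling. The names `importance_weights`, `resample` and `obs_std` are hypothetical.

```python
import math
import random


def importance_weights(ensemble, observation, obs_std):
    """Normalised Bayesian weights for each ensemble member.

    Assumes the state is observed directly with Gaussian error of standard
    deviation obs_std (an assumption made for this sketch).
    """
    log_w = [-0.5 * ((observation - x) / obs_std) ** 2 for x in ensemble]
    shift = max(log_w)  # subtract the maximum to avoid underflow in exp
    w = [math.exp(lw - shift) for lw in log_w]
    total = sum(w)
    return [wi / total for wi in w]


def resample(ensemble, weights, rng):
    """Draw a new equal-weight ensemble of the same size.

    Each draw picks a member with probability equal to its weight, so
    high-weight members tend to be duplicated and low-weight members
    tend to drop out.
    """
    return rng.choices(ensemble, weights=weights, k=len(ensemble))


if __name__ == "__main__":
    rng = random.Random(0)
    prior = [rng.gauss(0.0, 2.0) for _ in range(10)]
    w = importance_weights(prior, observation=1.0, obs_std=0.5)
    posterior = resample(prior, w, rng)
    print(sorted(posterior))
```

After resampling, all members carry equal weight again and the ensemble spread gives the error estimate. Because each member is an unmodified model state, no extra balancing step is needed, which matches the advantage the abstract claims.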

Relevance:

60.00%

Publisher:

Abstract:

Eddy covariance has been used in urban areas to evaluate the net exchange of CO2 between the surface and the atmosphere. Typically, only the vertical flux is measured at a height 2–3 times that of the local roughness elements; however, under conditions of relatively low instability, CO2 may accumulate in the airspace below the measurement height. This can result in inaccurate emissions estimates if the accumulated CO2 drains away or is flushed upwards during thermal expansion of the boundary layer. Some studies apply a single height storage correction; however, this requires the assumption that the response of the CO2 concentration profile to forcing is constant with height. Here a full seasonal cycle (7th June 2012 to 3rd June 2013) of single height CO2 storage data, calculated from concentrations measured at 10 Hz by an open path gas analyser, is compared with a data set calculated from a concurrent switched vertical profile measured (2 Hz, closed path gas analyser) at 10 heights within and above a street canyon in central London. The assumption required for the former storage determination is shown to be invalid. For approximately regular street canyons at least one other measurement is required. Continuous measurements at fewer locations are shown to be preferable to a spatially dense, switched profile, as temporal interpolation is ineffective. The majority of the spectral energy of the CO2 storage time series was found to be between 0.001 and 0.2 Hz (500 and 5 s respectively); however, sampling frequencies of 2 Hz and below still result in significantly lower CO2 storage values. An empirical method of correcting CO2 storage values from under-sampled time series is proposed.

Relevance:

60.00%

Publisher:

Abstract:

Carefully selected fine sand samples from the sea bottom surface were studied from two sand ribbons normal to the shore. Possible sediment transport along these sand ribbons was investigated by interpreting the sediment patterns. Simple grain size parameters were obtained and the results of heavy mineral and feldspar analyses were compared. On one ribbon offshore sediment movement is indicated, while on the other onshore movement is proposed.