936 resultados para indexing consistency
Resumo:
This dissertation research points out major challenging problems with current Knowledge Organization (KO) systems, such as subject gateways or web directories: (1) the current systems use traditional knowledge organization systems based on controlled vocabulary which is not very well suited to web resources, and (2) information is organized by professionals not by users, which means it does not reflect intuitively and instantaneously expressed users’ current needs. In order to explore users’ needs, I examined social tags which are user-generated uncontrolled vocabulary. As investment in professionally-developed subject gateways and web directories diminishes (support for both BUBL and Intute, examined in this study, is being discontinued), understanding characteristics of social tagging becomes even more critical. Several researchers have discussed social tagging behavior and its usefulness for classification or retrieval; however, further research is needed to qualitatively and quantitatively investigate social tagging in order to verify its quality and benefit. This research particularly examined the indexing consistency of social tagging in comparison to professional indexing to examine the quality and efficacy of tagging. The data analysis was divided into three phases: analysis of indexing consistency, analysis of tagging effectiveness, and analysis of tag attributes. Most indexing consistency studies have been conducted with a small number of professional indexers, and they tended to exclude users. Furthermore, the studies mainly have focused on physical library collections. This dissertation research bridged these gaps by (1) extending the scope of resources to various web documents indexed by users and (2) employing the Information Retrieval (IR) Vector Space Model (VSM) - based indexing consistency method since it is suitable for dealing with a large number of indexers. As a second phase, an analysis of tagging effectiveness with tagging exhaustivity and tag specificity was conducted to ameliorate the drawbacks of consistency analysis based on only the quantitative measures of vocabulary matching. Finally, to investigate tagging pattern and behaviors, a content analysis on tag attributes was conducted based on the FRBR model. The findings revealed that there was greater consistency over all subjects among taggers compared to that for two groups of professionals. The analysis of tagging exhaustivity and tag specificity in relation to tagging effectiveness was conducted to ameliorate difficulties associated with limitations in the analysis of indexing consistency based on only the quantitative measures of vocabulary matching. Examination of exhaustivity and specificity of social tags provided insights into particular characteristics of tagging behavior and its variation across subjects. To further investigate the quality of tags, a Latent Semantic Analysis (LSA) was conducted to determine to what extent tags are conceptually related to professionals’ keywords and it was found that tags of higher specificity tended to have a higher semantic relatedness to professionals’ keywords. This leads to the conclusion that the term’s power as a differentiator is related to its semantic relatedness to documents. The findings on tag attributes identified the important bibliographic attributes of tags beyond describing subjects or topics of a document. The findings also showed that tags have essential attributes matching those defined in FRBR. Furthermore, in terms of specific subject areas, the findings originally identified that taggers exhibited different tagging behaviors representing distinctive features and tendencies on web documents characterizing digital heterogeneous media resources. These results have led to the conclusion that there should be an increased awareness of diverse user needs by subject in order to improve metadata in practical applications. This dissertation research is the first necessary step to utilize social tagging in digital information organization by verifying the quality and efficacy of social tagging. This dissertation research combined both quantitative (statistics) and qualitative (content analysis using FRBR) approaches to vocabulary analysis of tags which provided a more complete examination of the quality of tags. Through the detailed analysis of tag properties undertaken in this dissertation, we have a clearer understanding of the extent to which social tagging can be used to replace (and in some cases to improve upon) professional indexing.
Resumo:
The aim of this paper is to evaluate the consistency indexes among 30 Brazilian university libraries from the south and south-east regions through a specific mathematical formula. It was selected a sample of 30 university libraries that, according to the information in their official sites, have a collection consisted of more than 100.000 copies and allow the search into the on-line catalog. Searches were carried out in every university by means of their sites, requesting books that contained a certain word in its title and were printed in a certain year. The response was a list of available titles in the library, from which we chose at random a title and asked to visualize the complete record to verify the existence of a given subject. This procedure was repeated until we found the same title in five libraries with the chosen subjects. The result is 10 trials, each one consisting of one figure and one table showing the selected libraries, the subjects, the documentary languages ( tools) and the consistency indexes relaxed and rigid. These trials show great discrepancy between the values of consistency indexes with intervals between 73,3% to 34,4% in the relaxed index, and between 60% and 9,6% in the rigid one. It was revealed that the coincidence in determining the subjects is not too high remaining below 39%. It is concluded that the difference between the consistency indexes may be due to factors as: incompatibility among documentary languages; lack of updating of these languages so as to follow the knowledge evolution; absence of a well-defined indexing policy with guidelines clearly established. Procedures of indexing followed by indexers could contribute to the consistency index to be bigger in percentage, since there would be parameters for the indexing process.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Monte Carlo track structures (MCTS) simulations have been recognized as useful tools for radiobiological modeling. However, the authors noticed several issues regarding the consistency of reported data. Therefore, in this work, they analyze the impact of various user defined parameters on simulated direct DNA damage yields. In addition, they draw attention to discrepancies in published literature in DNA strand break (SB) yields and selected methodologies. The MCTS code Geant4-DNA was used to compare radial dose profiles in a nanometer-scale region of interest (ROI) for photon sources of varying sizes and energies. Then, electron tracks of 0.28 keV-220 keV were superimposed on a geometric DNA model composed of 2.7 × 10(6) nucleosomes, and SBs were simulated according to four definitions based on energy deposits or energy transfers in DNA strand targets compared to a threshold energy ETH. The SB frequencies and complexities in nucleosomes as a function of incident electron energies were obtained. SBs were classified into higher order clusters such as single and double strand breaks (SSBs and DSBs) based on inter-SB distances and on the number of affected strands. Comparisons of different nonuniform dose distributions lacking charged particle equilibrium may lead to erroneous conclusions regarding the effect of energy on relative biological effectiveness. The energy transfer-based SB definitions give similar SB yields as the one based on energy deposit when ETH ≈ 10.79 eV, but deviate significantly for higher ETH values. Between 30 and 40 nucleosomes/Gy show at least one SB in the ROI. The number of nucleosomes that present a complex damage pattern of more than 2 SBs and the degree of complexity of the damage in these nucleosomes diminish as the incident electron energy increases. DNA damage classification into SSB and DSB is highly dependent on the definitions of these higher order structures and their implementations. The authors' show that, for the four studied models, different yields are expected by up to 54% for SSBs and by up to 32% for DSBs, as a function of the incident electrons energy and of the models being compared. MCTS simulations allow to compare direct DNA damage types and complexities induced by ionizing radiation. However, simulation results depend to a large degree on user-defined parameters, definitions, and algorithms such as: DNA model, dose distribution, SB definition, and the DNA damage clustering algorithm. These interdependencies should be well controlled during the simulations and explicitly reported when comparing results to experiments or calculations.
Resumo:
Due to both the widespread and multipurpose use of document images and the current availability of a high number of document images repositories, robust information retrieval mechanisms and systems have been increasingly demanded. This paper presents an approach to support the automatic generation of relationships among document images by exploiting Latent Semantic Indexing (LSI) and Optical Character Recognition (OCR). We developed the LinkDI (Linking of Document Images) service, which extracts and indexes document images content, computes its latent semantics, and defines relationships among images as hyperlinks. LinkDI was experimented with document images repositories, and its performance was evaluated by comparing the quality of the relationships created among textual documents as well as among their respective document images. Considering those same document images, we ran further experiments in order to compare the performance of LinkDI when it exploits or not the LSI technique. Experimental results showed that LSI can mitigate the effects of usual OCR misrecognition, which reinforces the feasibility of LinkDI relating OCR output with high degradation.
Resumo:
Quantum field theory with an external background can be considered as a consistent model only if backreaction is relatively small with respect to the background. To find the corresponding consistency restrictions on an external electric field and its duration in QED and QCD, we analyze the mean-energy density of quantized fields for an arbitrary constant electric field E, acting during a large but finite time T. Using the corresponding asymptotics with respect to the dimensionless parameter eET(2), one can see that the leading contributions to the energy are due to the creation of particles by the electric field. Assuming that these contributions are small in comparison with the energy density of the electric background, we establish the above-mentioned restrictions, which determine, in fact, the time scales from above of depletion of an electric field due to the backreaction.
Resumo:
The Cluster Variation Method (CVM), introduced over 50 years ago by Prof. Dr. Ryoichi Kikuchi, is applied to the thermodynamic modeling of the BCC Cr-Fe system in the irregular tetrahedron approximation, using experimental thermochemical data as initial input for accessing the model parameters. The results are checked against independent data on the low-temperature miscibility gap, using increasingly accurate thermodynamic models, first by the inclusion of the magnetic degrees of freedom of iron and then also by the inclusion of the magnetic degrees of freedom of chromium. It is shown that a reasonably accurate description of the phase diagram at the iron-rich side (i.e. the miscibility gap borders and the Curie line) is obtained, but only at expense of the agreement with the above mentioned thermochemical data. Reasons for these inconsistencies are discussed, especially with regard to the need of introducing vibrational degrees of freedom in the CVM model. (C) 2008 Elsevier Ltd. All rights reserved.
Resumo:
Introduction: Two hundred ten patients with newly diagnosed Hodgkin`s lymphoma (HL) were consecutively enrolled in this prospective trial to evaluate the cost-effectiveness of fluorine-18 ((18)F)-fluoro-2-deoxy-D-glucose-positron emission tomography (FDG-PET) scan in initial staging of patients with HL. Methods: All 210 patients were staged with conventional clinical staging (CCS) methods, including computed tomography (CT), bone marrow biopsy (BMB), and laboratory tests. Patients were also submitted to metabolic staging (MS) with whole-body FDG-PET scan before the beginning of treatment. A standard of reference for staging was determined with all staging procedures, histologic examination, and follow-up examinations. The accuracy of the CCS was compared with the MS. Local unit costs of procedures and tests were evaluated. Incremental cost-effectiveness ratio (ICER) was calculated for both strategies. Results: In the 210 patients with HL, the sensitivity for initial staging of FDG-PET was higher than that of CT and BMB in initial staging (97.9% vs. 87.3%; P < .001 and 94.2% vs. 71.4%, P < 0.003, respectively). The incorporation of FDG-PET in the staging procedure upstaged disease in 50 (24%) patients and downstaged disease in 17 (8%) patients. Changes in treatment would be seen in 32 (15%) patients. Cumulative cost for staging procedures was $3751/patient for CCS compared to $5081 for CCS + PET and $4588 for PET/CT. The ICER of PET/CT strategy was $16,215 per patient with modified treatment. PET/CT costs at the beginning and end of treatment would increase total costs of HL staging and first-line treatment by only 2%. Conclusion: FDG-PET is more accurate than CT and BMB in HL staging. Given observed probabilities, FDG-PET is highly cost-effective in the public health care program in Brazil.
Resumo:
Background & aim: Many disease outbreaks of food origin are caused by foods prepared in Food Service and Nutrition Units of hospitals, affecting hospitalized patients who, in most cases, are immunocompromised and therefore at a higher risk of severe worsening of their clinical status. The aim of this study was to determine the variations in temperature and the time-temperature factor of hospital diets. Methods: The time and temperature for the preparation of 4 diets of modified consistency were determined on 5 nonconsecutive days in a hospital Diet and Nutrition Unit at the end of preparation and during the maintenance period, portioning and distribution at 3 sites, i.e., the first, the middle and the last to receive the diets. Results and discussion: All foods reached an adequate temperature at the end of cooking, but temperature varied significantly from the maintenance period to the final distribution, characterizing critical periods for microorganism proliferation. During holding, temperatures that presented a risk were reached by 16.7% of the meats and 59% of the salads of the general diet, by 16.7% of the garnishes in the bland diet and by 20% of the meats and garnishes in the viscous diet. The same occurred at the end of distribution for 100% of the hot samples and of the salads and for 61% of the desserts. None of the preparations remained at risk temperature for a time exceeding that established by law. Conclusion: The exposure to inadequate temperature did not last long enough to pose risks to the patient.
Resumo:
The aim of this research was to examine the nature and order of recovery of orientation and memory functioning during Post-Traumatic Amnesia (PTA) in relation to injury severity and PTA duration. The Westmead PTA Scale was used across consecutive testing days to assess the recovery of orientation and memory during PTA in 113 patients. Two new indices were examined: a Consistency-of-Recovery and a Duration-to-Recovery index. a predictable order of recovery was observed during PTA: orientation-to-person recovered sooner and more consistently than the following cluster; orientation-to-time, orientation-to-place, and the ability to remember a face and name. However, the type of memory functioning required for the recall face and name task recovered more consistently than that required for memorizing three pictures. An important overall finding was that the order-of-recovery'' of orientation and memory functioning was dependent upon both the elapsed days since injury, and the consistency of recovery. The newly developed indices were shown to be a valuable means of accounting for differences between groups in the elapsed days to recovery of orientation and memory. These indices also clearly increase the clinical utility of the Westmead PTA Scale and supply an objective means of charting (and potentially predicting) patients' recovery on the different components of orientation and memory throughout their period of hospitalization.
Resumo:
The importance of disturbance and the subsequent rate and pattern of recovery has been long recognised as an important driver of community structure. Community recovery is affected by processes operating at local and regional scales yet the examination of community level responses to a standardised disturbance at regional scales (i.e. among regions under different environmental conditions) has seldom been attempted. Here, we mechanically disturbed rocky intertidal lower shore algal dominated assemblages at three locations within each of three different regions within the Lusitanian biogeographical province (Azores, northern Portugal and the Canary Islands). All organisms were cleared from experimental plots and succession followed over a period of 12 months at which time we formally compared the assemblage structure to that of unmanipulated controls. Early patterns of recovery of disturbed communities varied among regions and was positively influenced by temperature, but not by regional species richness. Different components of the assemblage responded differently to disturbance. Regional differences in the relative abundance and identity of species had a key influence on the overall assemblage recovery. This study highlights how regional-scales differences in environmental conditions and species pool are important determinants of recovery of disturbed communities.
Resumo:
ABSTRACT OBJECTIVE To assess the internal consistency of the measurements of the Self-Reporting Questionnaire (SRQ-20) in different occupational groups. METHODS A validation study was conducted with data from four surveys with groups of workers, using similar methods. A total of 9,959 workers were studied. In all surveys, the common mental disorders were assessed via SRQ-20. The internal consistency considered the items belonging to dimensions extracted by tetrachoric factor analysis for each study. Item homogeneity assessment compared estimates of Cronbach’s alpha (KD-20), the alpha applied to a tetrachoric correlation matrix and stratified Cronbach’s alpha. RESULTS The SRQ-20 dimensions showed adequate values, considering the reference parameters. The internal consistency of the instrument items, assessed by stratified Cronbach’s alpha, was high (> 0.80) in the four studies. CONCLUSIONS The SRQ-20 showed good internal consistency in the professional categories evaluated. However, there is still a need for studies using alternative methods and additional information able to refine the accuracy of latent variable measurement instruments, as in the case of common mental disorders.