3 resultados para Information retrieval interfaces
em ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha
Resumo:
Except the article forming the main content most HTML documents on the WWW contain additional contents such as navigation menus, design elements or commercial banners. In the context of several applications it is necessary to draw the distinction between main and additional content automatically. Content extraction and template detection are the two approaches to solve this task. This thesis gives an extensive overview of existing algorithms from both areas. It contributes an objective way to measure and evaluate the performance of content extraction algorithms under different aspects. These evaluation measures allow to draw the first objective comparison of existing extraction solutions. The newly introduced content code blurring algorithm overcomes several drawbacks of previous approaches and proves to be the best content extraction algorithm at the moment. An analysis of methods to cluster web documents according to their underlying templates is the third major contribution of this thesis. In combination with a localised crawling process this clustering analysis can be used to automatically create sets of training documents for template detection algorithms. As the whole process can be automated it allows to perform template detection on a single document, thereby combining the advantages of single and multi document algorithms.
Resumo:
In the early 20th century, Gouy, Chapman, and Stern developed a theory to describe the capacitance and the spatial ion distribution of diluted electrolytes near an electrode. After a century of research, considerable progress has been made in the understanding of the electrolyte/electrode interface. However, its molecular-scale structure and its variation with an applied potential is still under debate. In particular for room-temperature ionic liquids, a new class of solventless electrolytes, the classical theories for the electrical double layer are not applicable. Recently, molecular dynamics simulations and phenomenological theories have attempted to explain the capacitance of the ionic liquid/electrode interface with the molecular-scale structure and dynamics of the ionic liquid near the electrode. rnHowever, experimental evidence is very limited. rnrnIn the presented study, the ion distribution of an ionic liquid near an electrode and its response to applied potentials was examined with sub-molecular resolution. For this purpose, a new sample chamber was constructed, allowing in situ high energy X-ray reflectivity experiments under potential control, as well as impedance spectroscopy measurements. The combination of structural information and electrochmical data provided a comprehensive picture of the electric double layer in ionic liquids. Oscillatory charge density profiles were found, consisting of alternating anion- and cation-enriched layers at both, cathodic and anodic, potentials. This structure was shown to arise from the same ion-ion correlations dominating the liquid bulk structure that were observed as a distinct X-ray diffraction peak. Therefore, existing physically motivated models were refined and verified by comparison with independent measurements. rnrnThe relaxation dynamics of the interfacial structure upon potential variation were studied by time resolved X-ray reflectivity experiments with sub-millisecond resolution. The observed relaxation times during charging/discharging are consistent with the impedance spectroscopy data revealing three processes of vastly different characteristic time-scales. Initially, the ion transport normal to the interface happens on a millisecond-scale. Another 100-millisecond-scale process is associated with molecular reorientation of electrode-adsorbed cations. Further, a minute-scale relaxation was observed, which is tentatively assigned to lateral ordering within the first layer.
Resumo:
Die Molekularbiologie von Menschen ist ein hochkomplexes und vielfältiges Themengebiet, in dem in vielen Bereichen geforscht wird. Der Fokus liegt hier insbesondere auf den Bereichen der Genomik, Proteomik, Transkriptomik und Metabolomik, und Jahre der Forschung haben große Mengen an wertvollen Daten zusammengetragen. Diese Ansammlung wächst stetig und auch für die Zukunft ist keine Stagnation absehbar. Mittlerweile aber hat diese permanente Informationsflut wertvolles Wissen in unüberschaubaren, digitalen Datenbergen begraben und das Sammeln von forschungsspezifischen und zuverlässigen Informationen zu einer großen Herausforderung werden lassen. Die in dieser Dissertation präsentierte Arbeit hat ein umfassendes Kompendium von humanen Geweben für biomedizinische Analysen generiert. Es trägt den Namen medicalgenomics.org und hat diverse biomedizinische Probleme auf der Suche nach spezifischem Wissen in zahlreichen Datenbanken gelöst. Das Kompendium ist das erste seiner Art und sein gewonnenes Wissen wird Wissenschaftlern helfen, einen besseren systematischen Überblick über spezifische Gene oder funktionaler Profile, mit Sicht auf Regulation sowie pathologische und physiologische Bedingungen, zu bekommen. Darüber hinaus ermöglichen verschiedene Abfragemethoden eine effiziente Analyse von signalgebenden Ereignissen, metabolischen Stoffwechselwegen sowie das Studieren der Gene auf der Expressionsebene. Die gesamte Vielfalt dieser Abfrageoptionen ermöglicht den Wissenschaftlern hoch spezialisierte, genetische Straßenkarten zu erstellen, mit deren Hilfe zukünftige Experimente genauer geplant werden können. Infolgedessen können wertvolle Ressourcen und Zeit eingespart werden, bei steigenden Erfolgsaussichten. Des Weiteren kann das umfassende Wissen des Kompendiums genutzt werden, um biomedizinische Hypothesen zu generieren und zu überprüfen.