904 results for INFORMATION EXTRACTION FROM DOCUMENTS
Abstract:
As one of the most popular deep learning models, the convolutional neural network (CNN) has achieved great success in image information extraction. Traditionally, a CNN is trained by supervised learning with labeled data and used as a classifier by adding a classification layer at the end. Its capability for extracting image features is largely limited by the difficulty of assembling a large training dataset. In this paper, we propose a new unsupervised learning CNN model, which uses a so-called convolutional sparse auto-encoder (CSAE) algorithm to pre-train the CNN. Instead of using labeled natural images for CNN training, the CSAE algorithm can be used to train the CNN with unlabeled artificial images, which enables easy expansion of training data and unsupervised learning. The CSAE algorithm is specifically designed for extracting complex features from specific objects such as Chinese characters. After the features of artificial images are extracted by the CSAE algorithm, the learned parameters are used to initialize the first CNN convolutional layer, and the CNN model is then fine-tuned on scene image patches with a linear classifier. The new CNN model is applied to Chinese scene text detection and is evaluated on a multilingual image dataset that labels Chinese, English, and numeral text separately. A detection precision gain of more than 10% is observed over two CNN models.
Abstract:
This dissertation develops an image processing framework with unique feature extraction and similarity measurements for human face recognition in the thermal mid-wave infrared portion of the electromagnetic spectrum. The goal of this research is to design specialized algorithms that extract facial vasculature information, create a thermal facial signature, and identify the individual. The objective is to use such findings in support of a biometrics system for human identification with a high degree of accuracy and reliability. This last assertion rests on the minimal to nonexistent risk of alteration of the intrinsic physiological characteristics seen through thermal infrared imaging. The proposed thermal facial signature recognition is fully integrated and consolidates the main and critical steps of feature extraction, registration, matching through similarity measures, and validation through testing our algorithm on a database, referred to as C-X1, provided by the Computer Vision Research Laboratory at the University of Notre Dame. Feature extraction was accomplished by first registering the infrared images to a reference image using the Functional MRI of the Brain's (FMRIB's) Linear Image Registration Tool (FLIRT), modified to suit thermal infrared images. This was followed by segmentation of the facial region using an advanced localized contouring algorithm applied to anisotropically diffused thermal images. Thermal feature extraction from facial images was attained by performing morphological operations such as opening and top-hat segmentation to yield thermal signatures for each subject. Four thermal images taken over a period of six months were used to generate thermal signatures and a thermal template for each subject; the thermal template contains only the most prevalent and consistent features.
Finally, a similarity measure technique was used to match signatures to templates, and Principal Component Analysis (PCA) was used to validate the results of the matching process. Thirteen subjects were used for testing the developed technique on an in-house thermal imaging system. Matching with a Euclidean-based similarity measure showed 88% accuracy for skeletonized signatures and templates and 90% accuracy for anisotropically diffused signatures and templates. We also employed the Manhattan-based similarity measure and obtained an accuracy of 90.39% for skeletonized and diffused templates and signatures. An average 18.9% improvement in the similarity measure was obtained when using diffused templates. The Euclidean- and Manhattan-based similarity measures were also applied to skeletonized signatures and templates of 25 subjects in the C-X1 database. The highly accurate results obtained in the matching process, along with the generalized design process, clearly demonstrate that the thermal infrared system can be used with other thermal-imaging-based systems and related databases. A novel user-initialized registration of thermal facial images has been successfully implemented. Furthermore, the novel approach of developing a thermal signature template from four images taken at various times ensured that unforeseen changes in the vasculature did not affect the biometric matching process, as it relied on consistent thermal features.
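The matching step described above can be illustrated with a minimal sketch. It assumes that each signature and template has already been reduced to a fixed-length feature vector; the vectors, subject names, and the `best_match` helper below are hypothetical and are not taken from the dissertation.

```python
import math


def euclidean(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def best_match(signature, templates, distance):
    """Return the name of the template closest to the signature."""
    return min(templates, key=lambda name: distance(signature, templates[name]))


# Hypothetical binary feature vectors (1 = vessel pixel present).
templates = {
    "subject_a": [1, 1, 0, 0, 1, 0],
    "subject_b": [0, 1, 1, 1, 0, 0],
}
signature = [1, 1, 0, 1, 1, 0]

print(best_match(signature, templates, euclidean))
print(best_match(signature, templates, manhattan))
```

A smaller distance means a closer match, so the two measures can be swapped freely by passing a different `distance` function.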
Abstract:
This dissertation aims to recover the lives and careers of those Amerindians and Europeans who voluntarily or involuntarily took on the role of intercultural interpreters in the contact, conquest, and early colonial period in the Americas between 1492 and 1675. It intends to prove that these so-called “marginal” figures assumed roles that went far beyond those of linguistic and cultural translators, and often had a decisive impact on early Indian-colonial relations. In the course of my research, I consulted hundreds of published sixteenth- and seventeenth-century chronicles, narratives, and memoirs in my search for references to interpreters. I augmented these accounts with information derived from unpublished archival documents, drawn primarily from the Archivo General de Indias, in Seville, Spain. I organized my findings in theme-driven chapters that begin with a consideration of the historiography of that subject. Each chapter is further subdivided into chronologically arranged historical vignettes that focus on the interpreters who mediated between the Spanish, Portuguese, French, English, and Dutch and the various Native American polities and cultures. I found that colonial authorities and Amerindian communities alike recognized the absolute necessity of recruiting competent and loyal interpreters and go-betweens, and that both sides tried to secure their loyal service by means both fair and foul. Although pressured, pushed, and pulled in contrary directions, most interpreters recognized the pivotal position they held in cross-cultural negotiations and rarely remained passive pawns in the contests between the forces of domination and defense.
All across the Americas, interpreters used their linguistic and diplomatic skills, and their intimate knowledge of the “other,” not simply to facilitate conquest or spearhead the opposition, but to transform themselves from “culture brokers” into “power brokers.” Many of the decisive events that shaped colonial-Indian relations turned on the actions of these culturally ambiguous individuals, a fact bemoaned and begrudgingly acknowledged by most of the contemporary conquistadors, chroniclers, and colonial founders, and recognized by this author.
Abstract:
Voice communication systems such as Voice over IP (VoIP), Public Switched Telephone Networks, and Mobile Telephone Networks are an integral means of human tele-interaction. These systems pose distinctive challenges due to their unique characteristics, such as low volume, burstiness, and stringent delay/loss requirements across heterogeneous underlying network technologies. Effective quality evaluation methodologies are important for system development and refinement, particularly those that adopt measurement based on user feedback. Presently, most evaluation models are system-centric (Quality of Service or QoS-based), which prompted us to explore a user-centric (Quality of Experience or QoE-based) approach as a step towards the human-centric paradigm of system design. We investigate an affect-based QoE evaluation framework that attempts to capture users' perception while they are engaged in voice communication. Our modular approach consists of feature extraction from multiple information sources, including various affective cues, and different classification procedures such as Support Vector Machines (SVM) and k-Nearest Neighbor (kNN). The experimental study is illustrated in depth with a detailed analysis of results. The evidence collected indicates the potential feasibility of our approach for QoE evaluation and suggests that human affective attributes should be considered in modeling user experience.
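As an illustration of the classification stage, the following is a minimal k-Nearest Neighbor sketch in plain Python. The two-dimensional feature vectors, the labels "satisfied" and "annoyed", and the function name `knn_predict` are hypothetical placeholders; the abstract does not specify which affective features or class labels were used.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (feature_vector, label) pairs.
    """
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical affective-cue features, e.g. (pitch variation, speech rate),
# each scaled to the range [0, 1].
train = [
    ((0.20, 0.30), "satisfied"),
    ((0.25, 0.35), "satisfied"),
    ((0.30, 0.20), "satisfied"),
    ((0.80, 0.90), "annoyed"),
    ((0.85, 0.80), "annoyed"),
    ((0.90, 0.95), "annoyed"),
]

print(knn_predict(train, (0.75, 0.85)))
print(knn_predict(train, (0.22, 0.30)))
```

An odd `k` avoids tied votes when there are only two classes.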
Abstract:
We study work extraction from the Dicke model achieved using simple unitary cyclic transformations, taking into account both a non-optimal unitary protocol and the energetic cost of creating the initial state. By analyzing the role of entanglement, we find that highly entangled states can be inefficient for energy storage when the energetic cost of creating the state is considered. This surprising result holds even though the criticality of the model at hand can appreciably improve the extraction of work. While showing the advantages of using a many-body system for work extraction, our results demonstrate that entanglement is not necessarily advantageous for energy storage when non-optimal processes are considered. Our work shows the importance of better understanding the complex interconnections between the non-equilibrium thermodynamics of quantum systems and correlations among their subparts.
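For readers unfamiliar with the setting, the quantities involved can be written in standard notation. This is the textbook definition of the work extracted by a cyclic unitary, not a formula quoted from the paper. The symbols $\rho$ (initial state), $H$ (the Hamiltonian at the start and end of the cycle), $U$ (the unitary protocol), and $E_{\mathrm{cost}}$ (the energy spent preparing $\rho$) are introduced here for illustration, and the ratio $\eta$ is only one plausible way of accounting for the preparation cost.

```latex
W(U) = \operatorname{Tr}[\rho H] - \operatorname{Tr}\!\left[U \rho U^{\dagger} H\right],
\qquad
W_{\max} = \max_{U} W(U),
\qquad
\eta = \frac{W(U)}{E_{\mathrm{cost}}}
```

In this notation, a non-optimal protocol is any $U$ with $W(U) < W_{\max}$.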
Abstract:
Mangrove forests are among the most productive and biodiverse wetlands on earth. They generate a large amount of litter in the form of leaves, branches, twigs, inflorescences, and other debris, and provide habitat for diverse flora and fauna of marine and terrestrial origin, such as bacteria, fungi, algae, lichens, zooplankton, benthos, birds, reptiles, and mammals. These systems act as nurseries for many fishes and shellfishes. Other sources may also provide important organic carbon inputs, including allochthonous riverine or marine material, autochthonous production by benthic or epiphytic micro- or macroalgae, and local water column production by phytoplankton. Since mangrove sediments are very complex and receive both autochthonous and allochthonous organic matter inputs, the information extracted from the analysis of mangrove sediments is a fingerprint of both natural and human-induced changes.
Abstract:
Abstract Background and Problem: The changing business world and growing demands from stakeholders have led to the establishment of new reports, among them sustainability reports and Integrated Reporting. Traditional financial reports, by contrast, do not consider the significance of intangible assets in modern entities. Social and relationship capital has further been shown to be important for firms, especially healthcare and pharmaceutical companies, but it is not as developed as the other capitals within the <IR> framework and is therefore not always included in annual reports. However, too few disclosures in this area could lead to high liabilities. The IIRC launched the <IR> framework in 2013 as a solution, as it gives a more comprehensive view of the reporting entity. Within this framework there are six capitals: manufactured, human, financial, natural, intellectual, and social and relationship. Purpose: The purpose of this thesis is to find out how the International <IR> Framework has influenced the reporting of social and relationship disclosures within the healthcare industry, to compare the reporting of the six medical firms chosen, and to examine how social concerns have developed over time. Delimitations: This study is conducted over a period of three years, from year 2012 to year 2014. It only examines healthcare companies that use the International <IR> framework, and it focuses solely on the social and relationship capital. All other capitals within the <IR> framework are excluded from the study. Method: This study has a qualitative research strategy and is based on information collected from published documents in the form of annual reports. The annual reports from year 2010, 2011 and 2012 are used to find social and relationship disclosures, and a disclosure scoreboard is used to find similarities, differences, and patterns.
Empirical Results and Conclusion: The aggregated social and relationship disclosures were found to have decreased over time. The year following the release of the <IR> framework had the fewest disclosures, and the conclusion was therefore drawn that the <IR> framework had a negative influence on social and relationship disclosures. There were also differences among the companies studied, in both extent and content. The former could be linked to factors such as size and nationality, and the latter to reputation preservation and legitimacy interests.
Abstract:
Introduction: Compounds exhibiting antioxidant activity have received much interest in the food industry because of their potential health benefits. Carotenoids such as lycopene, which in the human diet mainly derives from tomatoes (Solanum lycopersicum), have attracted much attention in this respect, and the study of their extraction, processing, and storage procedures is of importance. Optical techniques potentially offer advantageous non-invasive and specific methods to monitor them. Objectives: To obtain both fluorescence and Raman information to ascertain whether ultrasound-assisted extraction from tomato pulp has a detrimental effect on lycopene. Method: Time-resolved fluorescence spectroscopy was used to monitor carotenoids in a hexane extract obtained from tomato pulp with application of ultrasound treatment (583 kHz). The resultant spectra were a combination of scattering and fluorescence. Because of their different timescales, decay-associated spectra could be used to separate fluorescence and Raman information. This simultaneous acquisition of two complementary techniques was coupled with a very high time-resolution fluorescence lifetime measurement of the lycopene. Results: Spectroscopic data showed the presence of phytofluene and chlorophyll in addition to lycopene in the tomato extract. The time-resolved spectral measurement containing both fluorescence and Raman data, coupled with high-resolution time-resolved measurements in which a lifetime of ~5 ps was attributed to lycopene, indicated that lycopene appeared unaltered by ultrasound treatment. Detrimental changes were, however, observed in both the chlorophyll and phytofluene contributions. Conclusion: Extracted lycopene appeared unaffected by ultrasound treatment, while other constituents (chlorophyll and phytofluene) were degraded.
Abstract:
Internship report submitted to the Escola Superior de Educação de Paula Frassinetti to obtain the degree of Master in Pre-School Education and Teaching in the 1st Cycle of Basic Education.
Abstract:
Coinduction is a proof rule. It is the dual of induction. It allows reasoning about non-well-founded structures such as lazy lists or streams and is of particular use for reasoning about equivalences. A central difficulty in the automation of coinductive proof is the choice of a relation (called a bisimulation). We present an automation of coinductive theorem proving. This automation is based on the idea of proof planning. Proof planning constructs the higher-level steps in a proof, using knowledge of the general structure of a family of proofs and exploiting this knowledge to control the proof search. Part of proof planning involves the use of failure information to modify the plan by means of a proof critic, which exploits the information gained from the failed proof attempt. Our approach to the problem was to develop a strategy that makes an initial simple guess at a bisimulation and then uses generalisation techniques, motivated by a critic, to refine this guess, so that a larger class of coinductive problems can be automatically verified. The implementation of this strategy has focused on the use of coinduction to prove the equivalence of programs in a small lazy functional language similar to Haskell. We have developed a proof plan for coinduction and a critic associated with this proof plan. These have been implemented in CoClam, an extended version of Clam, with encouraging results. The planner has been successfully tested on a number of theorems.
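To make the role of the bisimulation concrete, here is a minimal sketch for the simplest case: streams generated by finite-state systems, where each state has a head (the next element) and a tail (the next state). The checker starts from a one-pair guess and enlarges the relation until it is closed under the tail step. It is a plain worklist algorithm over a hypothetical representation, not the proof-planning and critic mechanism implemented in CoClam.

```python
def find_bisimulation(system_a, start_a, system_b, start_b):
    """Try to build a bisimulation relating two stream states.

    Each system maps a state name to a (head, next_state) pair.
    Returns the relation (a set of state pairs) if one exists, else None.
    """
    relation = set()
    pending = [(start_a, start_b)]  # initial guess: just the start pair
    while pending:
        pair = pending.pop()
        if pair in relation:
            continue  # already assumed related; coinduction lets us stop here
        head_a, next_a = system_a[pair[0]]
        head_b, next_b = system_b[pair[1]]
        if head_a != head_b:
            return None  # heads differ, so the streams are not equal
        relation.add(pair)
        pending.append((next_a, next_b))  # the tails must be related too
    return relation


# Two hypothetical definitions of the infinite stream 1, 1, 1, ...
ones = {"o": (1, "o")}
ones_unrolled = {"p": (1, "q"), "q": (1, "p")}
# The stream 1, 0, 1, 0, ...
alternating = {"z": (1, "w"), "w": (0, "z")}

print(find_bisimulation(ones, "o", ones_unrolled, "p"))
print(find_bisimulation(ones, "o", alternating, "z"))
```

For programs in a lazy functional language the state space is generally infinite, which is why a simple closure computation like this is not enough and the relation has to be generalised.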
Abstract:
[EU] In language processing, automatically detecting and distinguishing the relations of the causal group (CAUSE, RESULT, and PURPOSE) in coherent texts is useful when building automatic question-answering systems. To that end, we use Rhetorical Structure Theory (hereafter RST) and its relations, taking as our corpus the RST Treebank (Iruskieta et al., 2013), a corpus made up of scientific abstracts. We download the corpus in XML format and extract the most important information from it using XPath. This work has three main goals: first, to distinguish the causal-group relations from one another; second, to distinguish these causal-group relations from all other relations; and finally, to distinguish the EVALUATION and INTERPRETATION relations so that they can be applied in sentiment analysis. To carry out these tasks, we used the most significant patterns obtained with the RhetDB tool and developed two applications: a search tool that, given the patterns we want to find, searches any kind of text that has a relational structure, and a tagger that labels relations given the most significant patterns. We also developed both applications to be as parameterizable as possible, so that anyone can use them for similar tasks without modifying the code. After evaluating the taggers, we found that PURPOSE is the easiest relation to identify, and we concluded that distinguishing CAUSE from RESULT is more problematic. Likewise, we found that EVALUATION and INTERPRETATION can also be distinguished from each other.
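The XPath extraction step can be sketched with Python's standard library, whose `xml.etree.ElementTree` module supports a limited XPath subset. The XML layout, the `relname` attribute values, and the sample sentences below are simplified, hypothetical stand-ins; the actual treebank schema and the authors' queries may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified annotation file: each segment records the
# rhetorical relation that links it to its parent.
SAMPLE = """
<rst>
  <body>
    <segment id="1" parent="2" relname="cause">because the sample was small</segment>
    <segment id="2" relname="span">the results were inconclusive</segment>
    <segment id="3" parent="2" relname="purpose">to measure the effect</segment>
  </body>
</rst>
"""


def segments_with_relation(root, relname):
    """Return (id, text) for every segment annotated with the given relation."""
    query = f".//segment[@relname='{relname}']"
    return [(seg.get("id"), (seg.text or "").strip()) for seg in root.findall(query)]


root = ET.fromstring(SAMPLE)
print(segments_with_relation(root, "purpose"))
print(segments_with_relation(root, "cause"))
```

Keeping the relation name as a parameter mirrors the stated goal of running similar searches without changing the code.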
Abstract:
Organismal development, homeostasis, and pathology are rooted in inherently probabilistic events. From gene expression to cellular differentiation, rates and likelihoods shape the form and function of biology. Processes ranging from growth to cancer homeostasis to reprogramming of stem cells all require transitions between distinct phenotypic states, and these occur at defined rates. Therefore, measuring the fidelity and dynamics with which such transitions occur is central to understanding natural biological phenomena and is critical for therapeutic interventions.
While these processes may produce robust population-level behaviors, decisions are made by individual cells. In certain circumstances, these minuscule computing units effectively roll dice to determine their fate. And while the 'omics' era has provided vast amounts of data on what these populations are doing en masse, the behaviors of the underlying units of these processes get washed out in averages.
Therefore, in order to understand the behavior of a sample of cells, it is critical to reveal how its underlying components, or mixture of cells in distinct states, each contribute to the overall phenotype. As such, we must first define what states exist in the population, determine what controls the stability of these states, and measure in high dimensionality the dynamics with which these cells transition between states.
To address a specific example of this general problem, we investigate the heterogeneity and dynamics of mouse embryonic stem cells (mESCs). While a number of reports have identified particular genes in ES cells that switch between 'high' and 'low' metastable expression states in culture, it remains unclear how levels of many of these regulators combine to form states in transcriptional space. Using a method called single-molecule mRNA fluorescent in situ hybridization (smFISH), we quantitatively measure and fit distributions of core pluripotency regulators in single cells, identifying a wide range of variabilities between genes, each of which is explained by a simple model of bursty transcription. From these data, we also observed that strongly bimodal genes appear to be co-expressed, effectively limiting the occupancy of transcriptional space to two primary states across the genes studied here. However, these states also appear punctuated by the conditional expression of the most highly variable genes, potentially defining smaller substates of pluripotency.
Having defined the transcriptional states, we next asked what might control their stability or persistence. Surprisingly, we found that DNA methylation, a mark normally associated with irreversible developmental progression, was itself differentially regulated between these two primary states. Furthermore, both acute and chronic inhibition of DNA methyltransferase activity led to reduced heterogeneity among the population, suggesting that metastability can be modulated by this strong epigenetic mark.
Finally, because understanding the dynamics of state transitions is fundamental to a variety of biological problems, we sought to develop a high-throughput method for the identification of cellular trajectories without the need for cell-line engineering. We achieved this by combining cell-lineage information gathered from time-lapse microscopy with endpoint smFISH for measurements of final expression states. Applying a simple mathematical framework to these lineage-tree associated expression states enables the inference of dynamic transitions. We apply our novel approach in order to infer temporal sequences of events, quantitative switching rates, and network topology among a set of ESC states.
Taken together, we identify distinct expression states in ES cells, gain fundamental insight into how a strong epigenetic modifier enforces the stability of these states, and develop and apply a new method for the identification of cellular trajectories using scalable in situ readouts of cellular state.
Abstract:
Dissertation (master's), Universidade de Brasília, Faculdade de Direito, Programa de Pós-Graduação em Direito, 2016.
Abstract:
The Kosovo Theater of Operations lies in a region characterized by centuries of converging trade routes, cultures, ethnic groups, and religions, and which, for this and other reasons, has been ravaged by countless conflicts. In the post-Cold War period, given the escalation of violence, international forces intervened in this setting. They included Portuguese forces, among them, on several occasions, the 1º Batalhão de Infantaria Mecanizado (1st Mechanized Infantry Battalion). From the start of this intervention in the late 1990s to the present, the theater of operations has undergone various social, political, and ethnic changes that at times resulted in conflicts and violent events. Given the Portuguese intervention in a volatile and constantly changing conflict environment, as part of Peace Support Missions, this research aims to describe the changes in the employment of the 1st Mechanized Infantry Battalion made to cope with the Kosovo typology of the post-Cold War period (2000-2014). These changes are to be understood and interpreted through an analysis model based on the military decision factors, which systematizes and organizes information contained in documents produced during each of the deployments studied, as well as in the testimony of military personnel present in those same contexts during the period studied. Regarding the latter, the application of the inductive method, with the case study as the chosen methodological instrument, allows the collection of qualitative data from the answers obtained in semi-structured interviews with military personnel present in Kosovo between 2000 and 2014.
Following this research, changes in the employment of the 1st Mechanized Infantry Battalion in the Kosovo theater of operations appear to be evident. They are reflected in the factors Mission, Threat, Tasks, Vehicles, Personnel Strength, and Organization, as a consequence both of the restructurings of the Kosovo Force (KFOR) and of the fluctuations in conflict inherent to the theater itself, which went through periods of lesser and greater stability over the time span studied.
Abstract:
Gunshot residue (GSR) is the term used to describe the particles originating from different parts of the firearm and ammunition during discharge. A fast and practical field tool to detect the presence of GSR can assist law enforcement in the accurate identification of subjects. A novel field sampling device is presented for the first time for the fast detection and quantitation of volatile organic compounds (VOCs). Capillary microextraction of volatiles (CMV) is a headspace sampling technique that provides fast results (< 2 min sampling time) and is reported as a versatile and high-efficiency sampling tool. The CMV device can be coupled to a Gas Chromatography-Mass Spectrometry (GC-MS) instrument by installing a thermal separation probe in the injection port of the GC. An analytical method using the CMV device was developed for the detection of 17 compounds commonly found in polluted environments. The acceptability of the CMV as a field sampling method for the detection of VOCs is demonstrated by following the criteria established by the Environmental Protection Agency (EPA) compendium method TO-17. The CMV device was used, for the first time, for the detection of VOCs on swabs from the hands of shooters and non-shooters and on spent cartridges from different types of ammunition (i.e., pistol, rifle, and shotgun). The proposed method consists of the headspace extraction of VOCs in smokeless powders present in the propellant of ammunition. The sensitivity of this method was demonstrated with method detection limits (MDLs) of 4-26 ng for diphenylamine (DPA), nitroglycerine (NG), 2,4-dinitrotoluene (2,4-DNT), and ethyl centralite (EC). In addition, a fast method was developed for the detection of the inorganic components (i.e., Ba, Pb, and Sb) characteristic of GSR presence by Laser Induced Breakdown Spectroscopy (LIBS).
Advantages of LIBS include fast analysis (~ 12 seconds per sample) and good sensitivity, with expected MDLs in the range of 0.1-20 ng for target elements. Statistical analysis of the results using both techniques was performed to determine any correlation between the variables analyzed. This work demonstrates that the information collected from the analysis of organic components has the potential to improve the detection of GSR.
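For context, a method detection limit is commonly estimated from replicate measurements of a low-level standard as the one-sided 99% Student's t value multiplied by the sample standard deviation. The sketch below follows that convention for seven replicates. Whether the authors computed their MDLs exactly this way is an assumption, and the replicate masses shown are invented placeholders, not measurements from the study.

```python
import statistics

# One-sided 99% Student's t values, keyed by number of replicates
# (degrees of freedom = replicates - 1). Only n = 7 is tabulated here.
STUDENT_T_99 = {7: 3.143}


def method_detection_limit(replicates):
    """Estimate the MDL as t(n-1, 0.99) times the sample standard deviation."""
    n = len(replicates)
    if n not in STUDENT_T_99:
        raise ValueError(f"no t value tabulated for {n} replicates")
    return STUDENT_T_99[n] * statistics.stdev(replicates)


# Hypothetical replicate results for one analyte, in ng.
replicates_ng = [10.2, 9.8, 10.5, 9.6, 10.1, 10.4, 9.9]
mdl = method_detection_limit(replicates_ng)
print(f"Estimated MDL: {mdl:.2f} ng")
```

The estimate depends only on the spread of the replicates, so a more repeatable sampling step translates directly into a lower detection limit.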