922 resultados para IMML and Visual IMML
Resumo:
With the rise of smart phones, lifelogging devices (e.g. Google Glass) and popularity of image sharing websites (e.g. Flickr), users are capturing and sharing every aspect of their life online producing a wealth of visual content. Of these uploaded images, the majority are poorly annotated or exist in complete semantic isolation making the process of building retrieval systems difficult as one must firstly understand the meaning of an image in order to retrieve it. To alleviate this problem, many image sharing websites offer manual annotation tools which allow the user to “tag” their photos, however, these techniques are laborious and as a result have been poorly adopted; Sigurbjörnsson and van Zwol (2008) showed that 64% of images uploaded to Flickr are annotated with < 4 tags. Due to this, an entire body of research has focused on the automatic annotation of images (Hanbury, 2008; Smeulders et al., 2000; Zhang et al., 2012a) where one attempts to bridge the semantic gap between an image’s appearance and meaning e.g. the objects present. Despite two decades of research the semantic gap still largely exists and as a result automatic annotation models often offer unsatisfactory performance for industrial implementation. Further, these techniques can only annotate what they see, thus ignoring the “bigger picture” surrounding an image (e.g. its location, the event, the people present etc). Much work has therefore focused on building photo tag recommendation (PTR) methods which aid the user in the annotation process by suggesting tags related to those already present. These works have mainly focused on computing relationships between tags based on historical images e.g. that NY and timessquare co-exist in many images and are therefore highly correlated. However, tags are inherently noisy, sparse and ill-defined often resulting in poor PTR accuracy e.g. does NY refer to New York or New Year? This thesis proposes the exploitation of an image’s context which, unlike textual evidences, is always present, in order to alleviate this ambiguity in the tag recommendation process. Specifically we exploit the “what, who, where, when and how” of the image capture process in order to complement textual evidences in various photo tag recommendation and retrieval scenarios. In part II, we combine text, content-based (e.g. # of faces present) and contextual (e.g. day-of-the-week taken) signals for tag recommendation purposes, achieving up to a 75% improvement to precision@5 in comparison to a text-only TF-IDF baseline. We then consider external knowledge sources (i.e. Wikipedia & Twitter) as an alternative to (slower moving) Flickr in order to build recommendation models on, showing that similar accuracy could be achieved on these faster moving, yet entirely textual, datasets. In part II, we also highlight the merits of diversifying tag recommendation lists before discussing at length various problems with existing automatic image annotation and photo tag recommendation evaluation collections. In part III, we propose three new image retrieval scenarios, namely “visual event summarisation”, “image popularity prediction” and “lifelog summarisation”. In the first scenario, we attempt to produce a rank of relevant and diverse images for various news events by (i) removing irrelevant images such memes and visual duplicates (ii) before semantically clustering images based on the tweets in which they were originally posted. Using this approach, we were able to achieve over 50% precision for images in the top 5 ranks. In the second retrieval scenario, we show that by combining contextual and content-based features from images, we are able to predict if it will become “popular” (or not) with 74% accuracy, using an SVM classifier. Finally, in chapter 9 we employ blur detection and perceptual-hash clustering in order to remove noisy images from lifelogs, before combining visual and geo-temporal signals in order to capture a user’s “key moments” within their day. We believe that the results of this thesis show an important step towards building effective image retrieval models when there lacks sufficient textual content (i.e. a cold start).
Resumo:
This thesis, titled Governance and Community Capitals, explores the kinds of practical processes that have made governance work in three faith-based schools in the Western Highlands of Papua New Guinea (PNG). To date, the nation of PNG has been unable to meet its stated educational goals; however, some faith-based primary schools have overcome educational challenges by changing their local governance systems. What constitutes good governance in developing countries and how it can be achieved in a PNG schooling context has received very little scholarly attention. In this study, the subject of governance is approached at the nexus between the administrative sciences and asset-based community development. In this space, the researcher provides an understanding of the contribution that community capitals have made to understandings of local forms of governance in the development context. However, by and large, conceptions of governance have a history of being positioned within a Euro-centric frame and very little, if anything is known about the naming of capitals by indigenous peoples. In this thesis, six indigenous community capitals are made visible, expanding the repertoire of extant capitals published to date. The capitals identified and named in this thesis are: Story, Wisdom, Action, Blessing, Name and Unity. In-depth insights into these capitals are provided and through the theoretical idea of performativity, the researcher advances an understanding of how the habitual enactment of the practical components of the capitals made governance work in this unique setting. The study draws from a grounded and appreciative methodology and is based on a case study design incorporating a three-stage cycle of investigation. The first stage tested the application of an assets-based method to documentary sources of data including most significant change stories, community mapping and visual diaries. In the second stage, a group process method relevant to a PNG context was developed and employed. The third stage involved building theory from case study evidence using content analysis, language and metaphorical speech acts as guides for complex analysis. The thesis demonstrates the contribution that indigenous community capitals can make to understanding local forms of governance and how PNG faith-based schools meet their local governance challenges.
Resumo:
Dissertação de Mestrado apresentada ao Instituto Superior de Psicologia Aplicada para obtenção de grau de Mestre na especialidade de Psicologia Clínica.
Resumo:
This work is an ethnographic research with collectors women of Mangaba in the village of Ponta Negra in Natal - RN. This Women also known as Mangabeira's women reproduce a practice learned with their ancestors, collecting this fruit in the coastal tablelands forests and latter commercializing it in the local markets. This research uses the methodology of oral history and visual anthropology with presentation of collected images on board. It is intended to emphasize the botanical and environmental aspects of the Mangabeira plant, its ecosystem, territorial, economic and historical aspects of it, also the knowledge of this extractive practice of our immaterial culture.
Resumo:
According to a traditional rationalist proposal, it is possible to attain knowledge of certain necessary truths by means of insight—an epistemic mental act that combines the 'presentational' character of perception with the a priori status usually reserved for discursive reasoning. In this dissertation, I defend the insight proposal in relation to a specific subject matter: elementary Euclidean plane geometry, as set out in Book I of Euclid's Elements. In particular, I argue that visualizations and visual experiences of diagrams allow human subjects to grasp truths of geometry by means of visual insight. In the first two chapters, I provide an initial defense of the geometrical insight proposal, drawing on a novel interpretation of Plato's Meno to motivate the view and to reply to some objections. In the remaining three chapters, I provide an account of the psychological underpinnings of geometrical insight, a task that requires considering the psychology of visual imagery alongside the details of Euclid's geometrical system. One important challenge is to explain how basic features of human visual representations can serve to ground our intuitive grasp of Euclid's postulates and other initial assumptions. A second challenge is to explain how we are able to grasp general theorems by considering diagrams that depict only special cases. I argue that both of these challenges can be met by an account that regards geometrical insight as based in visual experiences involving the combined deployment of two varieties of 'dynamic' visual imagery: one that allows the subject to visually rehearse spatial transformations of a figure's parts, and another that allows the subject to entertain alternative ways of structurally integrating the figure as a whole. It is the interplay between these two forms of dynamic imagery that enables a visual experience of a diagram, suitably animated in visual imagination, to justify belief in the propositions of Euclid’s geometry. The upshot is a novel dynamic imagery account that explains how intuitive knowledge of elementary Euclidean plane geometry can be understood as grounded in visual insight.
Resumo:
Understanding spatial patterns of land use and land cover is essential for studies addressing biodiversity, climate change and environmental modeling as well as for the design and monitoring of land use policies. The aim of this study was to create a detailed map of land use land cover of the deforested areas of the Brazilian Legal Amazon up to 2008. Deforestation data from and uses were mapped with Landsat-5/TM images analysed with techniques, such as linear spectral mixture model, threshold slicing and visual interpretation, aided by temporal information extracted from NDVI MODIS time series. The result is a high spatial resolution of land use and land cover map of the entire Brazilian Legal Amazon for the year 2008 and corresponding calculation of area occupied by different land use classes. The results showed that the four classes of Pasture covered 62% of the deforested areas of the Brazilian Legal Amazon, followed by Secondary Vegetation with 21%. The area occupied by Annual Agriculture covered less than 5% of deforested areas; the remaining areas were distributed among six other land use classes. The maps generated from this project ? called TerraClass - are available at INPE?s web site (http://www.inpe.br/cra/projetos_pesquisas/terraclass2008.php)
Resumo:
People possess different sensory modalities to detect, interpret, and efficiently act upon various events in a complex and dynamic environment (Fetsch, DeAngelis, & Angelaki, 2013). Much empirical work has been done to understand the interplay of modalities (e.g. audio-visual interactions, see Calvert, Spence, & Stein, 2004). On the one hand, integration of multimodal input as a functional principle of the brain enables the versatile and coherent perception of the environment (Lewkowicz & Ghazanfar, 2009). On the other hand, sensory integration does not necessarily mean that input from modalities is always weighted equally (Ernst, 2008). Rather, when two or more modalities are stimulated concurrently, one often finds one modality dominating over another. Study 1 and 2 of the dissertation addressed the developmental trajectory of sensory dominance. In both studies, 6-year-olds, 9-year-olds, and adults were tested in order to examine sensory (audio-visual) dominance across different age groups. In Study 3, sensory dominance was put into an applied context by examining verbal and visual overshadowing effects among 4- to 6-year olds performing a face recognition task. The results of Study 1 and Study 2 support default auditory dominance in young children as proposed by Napolitano and Sloutsky (2004) that persists up to 6 years of age. For 9-year-olds, results on privileged modality processing were inconsistent. Whereas visual dominance was revealed in Study 1, privileged auditory processing was revealed in Study 2. Among adults, a visual dominance was observed in Study 1, which has also been demonstrated in preceding studies (see Spence, Parise, & Chen, 2012). No sensory dominance was revealed in Study 2 for adults. Potential explanations are discussed. Study 3 referred to verbal and visual overshadowing effects in 4- to 6-year-olds. The aim was to examine whether verbalization (i.e., verbally describing a previously seen face), or visualization (i.e., drawing the seen face) might affect later face recognition. No effect of visualization on recognition accuracy was revealed. As opposed to a verbal overshadowing effect, a verbal facilitation effect occurred. Moreover, verbal intelligence was a significant predictor for recognition accuracy in the verbalization group but not in the control group. This suggests that strengthening verbal intelligence in children can pay off in non-verbal domains as well, which might have educational implications.
Resumo:
This thesis examines the state of audiovisual translation (AVT) in the aftermath of the COVID-19 emergency, highlighting new trends with regards to the implementation of AI technologies as well as their strengths, constraints, and ethical implications. It starts with an overview of the current AVT landscape, focusing on future projections about its evolution and its critical aspects such as the worsening working conditions lamented by AVT professionals – especially freelancers – in recent years and how they might be affected by the advent of AI technologies in the industry. The second chapter delves into the history and development of three AI technologies which are used in combination with neural machine translation in automatic AVT tools: automatic speech recognition, speech synthesis and deepfakes (voice cloning and visual deepfakes for lip syncing), including real examples of start-up companies that utilize them – or are planning to do so – to localize audiovisual content automatically or semi-automatically. The third chapter explores the many ethical concerns around these innovative technologies, which extend far beyond the field of translation; at the same time, it attempts to revindicate their potential to bring about immense progress in terms of accessibility and international cooperation, provided that their use is properly regulated. Lastly, the fourth chapter describes two experiments, testing the efficacy of the currently available tools for automatic subtitling and automatic dubbing respectively, in order to take a closer look at their perks and limitations compared to more traditional approaches. This analysis aims to help discerning legitimate concerns from unfounded speculations with regards to the AI technologies which are entering the field of AVT; the intention behind it is to humbly suggest a constructive and optimistic view of the technological transformations that appear to be underway, whilst also acknowledging their potential risks.
Resumo:
In the framework of the energy transition, the acquisition of proper knowledge of fundamental aspects characterizing the use of alternative fuels is paramount as well as the development of optimized know-how and technologies. In this sense, the use of hydrogen has been indicated as a promising route for decarbonization at the end-users stage in the energy supply chain. However, the elevated reactivity and the low-density at atmospheric conditions of hydrogen pose new challenges. Among the others, the dilution of hydrogen with carbon dioxide from carbon capture and storage systems represents a possible route. However, the interactions between these species have been poorly studied so far. For these reasons, this thesis, in collaboration between the University of Bologna and Technische Universität Bergakademie of Freiberg in Saxony (Germany), investigates the laminar flame of hydrogen-based premixed gas with the dilution of carbon dioxide. An experimental system, called a heat flux burner, was adopted ad different operating conditions. The presence of the cellularity phenomenon, forming the so-called cellular flame, was observed and analysed. Theoretical and visual methods have allowed for the characterization of the investigated flames, opening new alternatives for sustainable energy production via hydrogen transformation.
Resumo:
We report the case of a 73-year-old female who presented facial numbness and pain in the first division of the trigeminal nerve, ptosis, diplopia and visual loss on the right side for the previous four months. The neurological, radiological and histological examination demonstrated a rare case of invasive fungal aspergillosis of the central nervous system, causing orbital apex syndrome, later transformed in temporal brain abscess. She died ten months later due to respiratory and renal failure in spite of specific antimycotic therapy.
Resumo:
We report on two epileptic patients who developed acute psychosis after the use of topiramate (TPM). One patient exhibited severe psychomotor agitation, heteroaggressiveness, auditory and visual hallucinations as well as severe paranoid and mystic delusions. The other patient had psychomotor agitation, depersonalization, derealization, severe anxiety and deluded that he was losing his memory. Both patients had to be taken to the casualty room. After interruption of TPM in one patient and reduction of dose in the other, a full remission of the psychotic symptoms was obtained without the need of antipsychotic drugs. Clinicians should be aware of the possibility of development of acute psychotic symptoms in patients undergoing TPM treatment.
Resumo:
A case of identical male twins with Cohen syndrome who present multiple ophthalmic findings is reported. The patients were identical 16 year-old twin boys who showed down slanting eyelids, mild ptosis, high-grade myopia, small cortical lens opacities, posterior subcapsular cataracts, myotic and corectopic pupils with poor dilation due to focal iris atrophy and retinochoroidal dystrophy. Ophthalmologists must be aware of the ocular and systemic findings of Cohen syndrome in the evaluation of young patients with mental retardation and visual impairment.
Resumo:
The cerebral cysticercosis can produce intracranial hypertension by inflammatory obstruction of the basal cysterns or by expansive lesion in the cerebral parenchima or ventricular cavities. In the latter and in tumor cases the clinical picture is very similar and only after surgery can the etiology be determined. We present 11 operated cases of intracranial cysticercosis which presented the clinical picture of an expansive lesion. There were 7 females and 4 males with ages between 4 and 65 years. Nine patients were admitted because of headache, vomiting and visual disturbances suggestive of intracranial hypertension. One patient was admited with lymphocytic meningitis and another with focal seizures following hemiparesis. Five patients presented focal signs and six edema of the papilla. Epileptic manifestations were present in 45.5% of the cases. A plain X-ray films of the skull failed to reveal calcificatons, however signs of chronic hypertension were present in three cases. The electroencephalogram showed slow focal waves in 8 patients The spinal fluid examination revealed lymphocytosis in 4 cases, increased protein content in another 4 and complement fixation for cysticercosis was positive in 2 cases. The expansive lesions were localized by angiograph and ventriculography. In these the location was temporal in 4, frontal in 3, parietal in 2, in the third ventricle in one and in the fourth ventricle in another. At surgery we removed a large cyst from the cerebral parenchyma in six cases. Around the cyst a thick glial reaction was present. In the other cases the cyst was small but fixed to the ventricular trigone and produced dilatation of the inferior horn of the lateral ventricle. In two cases we removed a solitary intraventricular cyst from the third and fourth ventricles. In the two children operated upon there were several small hard cysts involving the cerebral parenchyma which displayed intense gliosis. There were no postoperative complications.
Resumo:
Universidade Estadual de Campinas . Faculdade de Educação Física
Resumo:
A síndrome do X Frágil é a causa mais frequente de deficiência intelectual hereditária. A variante de Dandy-Walker trata-se de uma constelação específica de achados neurorradiológicos. Este estudo relata achados da comunicação oral e escrita de um menino de 15 anos com diagnóstico clínico e molecular da síndrome do X-Frágil e achados de neuroimagem do encéfalo compatíveis com variante de Dandy-Walker. A avaliação fonoaudiológica foi realizada por meio da Observação do Comportamento Comunicativo, aplicação do ABFW - Teste de Linguagem Infantil - Fonologia, Perfil de Habilidades Fonológicas, Teste de Desempenho Escolar, Teste Illinois de Habilidades Psicolinguísticas, avaliação do sistema estomatognático e avaliação audiológica. Observou-se: alteração de linguagem oral quanto às habilidades fonológicas, semânticas, pragmáticas e morfossintáticas; déficits nas habilidades psicolinguísticas (recepção auditiva, expressão verbal, combinação de sons, memória sequencial auditiva e visual, closura auditiva, associação auditiva e visual); e alterações morfológicas e funcionais do sistema estomatognático. Na leitura verificou-se dificuldades na decodificação dos símbolos gráficos e na escrita havia omissões, aglutinações e representações múltiplas com o uso predominante de vogais e dificuldades na organização viso-espacial. Em matemática, apesar do reconhecimento numérico, não realizou operações aritméticas. Não foram observadas alterações na avaliação audiológica periférica. A constelação de sintomas comportamentais, cognitivos, linguísticos e perceptivos, previstos na síndrome do X-Frágil, somada às alterações estruturais do sistema nervoso central, pertencentes à variante de Dandy-Walker, trouxeram interferências marcantes no desenvolvimento das habilidades comunicativas, no aprendizado da leitura e escrita e na integração social do indivíduo.