994 resultados para Document description


Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we present a new feature-based approach for mosaicing of camera-captured document images. A novel block-based scheme is employed to ensure that corners can be reliably detected over a wide range of images. 2-D discrete cosine transform is computed for image blocks defined around each of the detected corners and a small subset of the coefficients is used as a feature vector A 2-pass feature matching is performed to establish point correspondences from which the homography relating the input images could be computed. The algorithm is tested on a number of complex document images casually taken from a hand-held camera yielding convincing results.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The polyamidoamide (PAMAM) class of dendrimers was one of the first dendrimers synthesized by Tomalia and co-workers at Dow. Since its discovery the PAMAMs have stimulated many discussions on the structure and dynamics of such hyperbranched polymers. Many questions remain open because the huge conformation disorder combined with very similar local symmetries have made it difficult to characterize experimentally at the atomistic level the structure and dynamics of PAMAM dendrimers. The higher generation dendrimers have also been difficult to characterize computationally because of the large size (294852 atoms for generation 11) and the huge number of conformations. To help provide a practical means of atomistic computational studies, we have developed an atomistically informed coarse-grained description for the PAMAM dendrimer. We find that a two-bead per monomer representation retains the accuracy of atomistic simulations for predicting size and conformational complexity, while reducing the degrees of freedom by tenfold. This mesoscale description has allowed us to study the structural properties of PAMAM dendrimer up to generation 11 for time scale of up to several nanoseconds. The gross properties such as the radius of gyration compare very well with those from full atomistic simulation and with available small angle x-ray experiment and small angle neutron scattering data. The radial monomer density shows very similar behavior with those obtained from the fully atomistic simulation. Our approach to deriving the coarse-grain model is general and straightforward to apply to other classes of dendrimers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Skew correction of complex document images is a difficult task. We propose an edge-based connected component approach for robust skew correction of documents with complex layout and content. The algorithm essentially consists of two steps - an 'initialization' step to determine the image orientation from the centroids of the connected components and a 'search' step to find the actual skew of the image. During initialization, we choose two different sets of points regularly spaced across the the image, one from the left to right and the other from top to bottom. The image orientation is determined from the slope between the two succesive nearest neighbors of each of the points in the chosen set. The search step finds succesive nearest neighbors that satisfy the parameters obtained in the initialization step. The final skew is determined from the slopes obtained in the 'search' step. Unlike other connected component based methods, the proposed method does not require any binarization step that generally precedes connected component analysis. The method works well for scanned documents with complex layout of any skew with a precision of 0.5 degrees.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The document images that are fed into an Optical Character Recognition system, might be skewed. This could be due to improper feeding of the document into the scanner or may be due to a faulty scanner. In this paper, we propose a skew detection and correction method for document images. We make use of the inherent randomness in the Horizontal Projection profiles of a text block image, as the skew of the image varies. The proposed algorithm has proved to be very robust and time efficient. The entire process takes less than a second on a 2.4 GHz Pentium IV PC.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Tutkimuksen keskeinen tehtävä on selvittää, mikä on dokumentoinnin merkitys lastensuojelun sosiaalityön tiedonmuodostuksessa ja ammattikäytännöissä. Asiakirjateksteistä koostuvaa tutkimusaineistoa tarkastellaan kolmesta eri suunnasta kysymällä: 1)Miten asiakirjoja kirjoitetaan? 2) Mitä asiakirjoihin kirjoitetaan? 3) Miksi asiakirjoja kirjoitetaan niin kuin kirjoitetaan? Tutkimusaineisto muodostuu lastensuojelun sosiaalityöntekijöiden laatimista asiakastietojärjestelmään tallennetuista muistiinpanoista ja huostaanottopäätöksistä. Tutkimukseen on valittu 20 huostaanotetun eri-ikäisen lapsen ja heidän perheensä asiakirjat yhteensä 1613 asiakirjatulostussivua. Tekstit ajoittuvat vuodesta 1989 vuoteen 2000. Tutkimusmenetelmä on diskurssianalyyttinen ja tukeutuu Fairclough`n (1997)esittämään kolmiulotteiseen malliin, jossa diskurssi määritellään tekstin, käytäntöjen ja sosiokulttuurisen ympäristön suhteeksi. Diskurssianalyysi on näiden rakenteiden ja niiden välisten suhteiden kuvaamista, tulkintaa ja selittämistä. Fairclough’n mallia mukaillen tutkimuksen analyysi koostuu retoriikan ja tematiikan analyyseistä sekä pragmatiikan näkökulman sisältävästä tarkastelusta. Asiakirjatekstien pilkkominen puhujakategorioihin osoitti tekstien olevan moniäänisiä, useiden henkilöiden näkemyksiä ja mielipiteitä sisältäviä tekstipintoja. Retoriikan analyysi näytti, että lastensuojelun sosiaalityön asiakirjat sisältävät paljon dynaamisia kuvauksia työstä. Asiakirjojen kirjoittaminen moniäänisiksi tuo tekstiin uskottavuutta, ja se on myös yksi retorinen vaikuttamiskeino. Tematiikan tarkastelu osoitti,että asiakirjojen sisällölliset teemat (lapsen hoiva, arjen hallinta, yhteistyö ja päihteiden käyttö) ja kokemukselliset teemat (huoli, vastuu, yhteys ja moraali) toistuvat sisäkkäisinä ja päällekkäisinä säikeinä dynaamisesti vaihdellen. Sosiaalityöntekijät kirjaavat teksteihin monia yhtäaikaisia teemoja, joiden avulla rakentavat ammatillista ymmärrystä kyseessä olevasta tilanteesta. Asiakirjojen tutkiminen pragmatiikan suunnasta toi esiin, kirjoittamisen ja lukemisen kontekstiulottuvuudet sekä tiedonmuodostusprosessin. Asiakirjojen laatiminen on osa sosiaalityön käytäntöjä. Se on myös keskeinen alue ammattikunnan yhteisen ammatillisen ymmärryksen luomisessa ja ylläpitämisessä. Muistiinpanot, huostaanottopäätökset ja lakitekstit ovat intertekstuaalisia. Lastensuojelun sosiaalityön asiakirjojen tutkiminen on avannut uusia mahdollisuuksia ymmärtää sosiaalityön dokumentointiprosessia, merkitystä ja roolia sekä tiedonmuodostuksen dynamiikkaa. Tekstien kirjoittaminen, niiden lukeminen, tietojen siirtäminen ja asiakkaan kuuleminen samoin kuin kuulemisen kirjaaminen ovat sosiaalityön dokumentoinnin keskeisiä haasteita. Tutkimus pyrkii avaamaan ymmärrystä asiakirjatekstien monivivahteiseen ja dynaamiseen maailmaan ja siten myös sosiaalityön dokumentoinnin arkeen. Tarkastelut mahdollistavat työn kehittämisen erityisesti sosiaalityön asiakasvaikuttavuuden mittaamisen ja parantamisen suuntaan. Asiakirjoissa ilmenevä tiedonmuodostuksen dynamiikka syntyy kirjoittamiskäytäntöjen, kirjoittamisen ja lukemisen sekä toimintakäytäntöjen yhteisessä alueessa. Avainsanat: sosiaalityö, lastensuojelu, dokumentointi, asiakirja, diskurssianalyysi, tiedonmuodostus.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It is well known that the numerical accuracy of a series solution to a boundary-value problem by the direct method depends on the technique of approximate satisfaction of the boundary conditions and on the stage of truncation of the series. On the other hand, it does not appear to be generally recognized that, when the boundary conditions can be described in alternative equivalent forms, the convergence of the solution is significantly affected by the actual form in which they are stated. The importance of the last aspect is studied for three different techniques of computing the deflections of simply supported regular polygonal plates under uniform pressure. It is also shown that it is sometimes possible to modify the technique of analysis to make the accuracy independent of the description of the boundary conditions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Electronic document management (EDM) technology has the potential to enhance the information management in construction projects considerably, without radical changes to current practice. Over the past fifteen years this topic has been overshadowed by building product modelling in the construction IT research world, but at present EDM is quickly being introduced in practice, in particular in bigger projects. Often this is done in the form of third party services available over the World Wide Web. In the paper, a typology of research questions and methods is presented, which can be used to position the individual research efforts which are surveyed in the paper. Questions dealt with include: What features should EMD systems have? How much are they used? Are there benefits from use and how should these be measured? What are the barriers to wide-spread adoption? Which technical questions need to be solved? Is there scope for standardisation? How will the market for such systems evolve?

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Layering is a widely used method for structuring data in CAD-models. During the last few years national standardisation organisations, professional associations, user groups for particular CAD-systems, individual companies etc. have issued numerous standards and guidelines for the naming and structuring of layers in building design. In order to increase the integration of CAD data in the industry as a whole ISO recently decided to define an international standard for layer usage. The resulting standard proposal, ISO 13567, is a rather complex framework standard which strives to be more of a union than the least common denominator of the capabilities of existing guidelines. A number of principles have been followed in the design of the proposal. The first one is the separation of the conceptual organisation of information (semantics) from the way this information is coded (syntax). The second one is orthogonality - the fact that many ways of classifying information are independent of each other and can be applied in combinations. The third overriding principle is the reuse of existing national or international standards whenever appropriate. The fourth principle allows users to apply well-defined subsets of the overall superset of possible layernames. This article describes the semantic organisation of the standard proposal as well as its default syntax. Important information categories deal with the party responsible for the information, the type of building element shown, whether a layer contains the direct graphical description of a building part or additional information needed in an output drawing etc. Non-mandatory information categories facilitate the structuring of information in rebuilding projects, use of layers for spatial grouping in large multi-storey projects, and storing multiple representations intended for different drawing scales in the same model. Pilot testing of ISO 13567 is currently being carried out in a number of countries which have been involved in the definition of the standard. In the article two implementations, which have been carried out independently in Sweden and Finland, are described. The article concludes with a discussion of the benefits and possible drawbacks of the standard. Incremental development within the industry, (where ”best practice” can become ”common practice” via a standard such as ISO 13567), is contrasted with the more idealistic scenario of building product models. The relationship between CAD-layering, document management product modelling and building element classification is also discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Triggered by the very quick proliferation of Internet connectivity, electronic document management (EDM) systems are now rapidly being adopted for managing the documentation that is produced and exchanged in construction projects. Nevertheless there are still substantial barriers to the efficient use of such systems, mainly of a psychological nature and related to insufficient training. This paper presents the results of empirical studies carried out during 2002 concerning the current usage of EDM systems in the Finnish construction industry. The studies employed three different methods in order to provide a multifaceted view of the problem area, both on the industry and individual project level. In order to provide an accurate measurement of overall usage volume in the industry as a whole telephone interviews with key personnel from 100 randomly chosen construction projects were conducted. The interviews showed that while around 1/3 of big projects already have adopted the use of EDM, very few small projects have adopted this technology. The barriers to introduction were investigated through interviews with representatives for half a dozen of providers of systems and ASP-services. These interviews shed a lot of light on the dynamics of the market for this type of services and illustrated the diversity of business strategies adopted by vendors. In the final study log files from a project which had used an EDM system were analysed in order to determine usage patterns. The results illustrated that use is yet incomplete in coverage and that only a part of the individuals involved in the project used the system efficiently, either as information producers or consumers. The study also provided feedback on the usefulness of the log files.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Separation of printed text blocks from the non-text areas, containing signatures, handwritten text, logos and other such symbols, is a necessary first step for an OCR involving printed text recognition. In the present work, we compare the efficacy of some feature-classifier combinations to carry out this separation task. We have selected length-nomalized horizontal projection profile (HPP) as the starting point of such a separation task. This is with the assumption that the printed text blocks contain lines of text which generate HPP's with some regularity. Such an assumption is demonstrated to be valid. Our features are the HPP and its two transformed versions, namely, eigen and Fisher profiles. Four well known classifiers, namely, Nearest neighbor, Linear discriminant function, SVM's and artificial neural networks have been considered and efficiency of the combination of these classifiers with the above features is compared. A sequential floating feature selection technique has been adopted to enhance the efficiency of this separation task. The results give an average accuracy of about 96.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Extraction of text areas from the document images with complex content and layout is one of the challenging tasks. Few texture based techniques have already been proposed for extraction of such text blocks. Most of such techniques are greedy for computation time and hence are far from being realizable for real time implementation. In this work, we propose a modification to two of the existing texture based techniques to reduce the computation. This is accomplished with Harris corner detectors. The efficiency of these two textures based algorithms, one based on Gabor filters and other on log-polar wavelet signature, are compared. A combination of Gabor feature based texture classification performed on a smaller set of Harris corner detected points is observed to deliver the accuracy and efficiency.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A new fiber bundle approach to the gauge theory of a group G that involves space‐time symmetries as well as internal symmetries is presented. The ungauged group G is regarded as the group of left translations on a fiber bundle G(G/H,H), where H is a closed subgroup and G/H is space‐time. The Yang–Mills potential is the pullback of the Maurer–Cartan form and the Yang–Mills fields are zero. More general diffeomorphisms on the bundle space are then identified as the appropriate gauged generalizations of the left translations, and the Yang–Mills potential is identified as the pullback of the dual of a certain kind of vielbein on the group manifold. The Yang–Mills fields include a torsion on space‐time.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The dissolution, accompanied by chemical reaction, of monodisperse solid particles has been analysed. The resulting model, which accounts for the variation of mass transfer coefficient with the size of the dissolving particles, yields an approximate analytical form of a kinetic function. Rigorous numerical and approximate analytical solutions have been obtained for the governing system of nonlinear ordinary differential equations. The transient nature of the dissolution process as well as the accuracy of the analytical solution is brought out by the rigorous numerical solution. The analytical solution is fairly accurate for the major part of the range of operational times encountered in practice.