Biblioteca Digital

129 resultados para pacs: information technolgy applications

Feature selection environment for genomic applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Feature selection is a pattern recognition approach to choose important variables according to some criteria in order to distinguish or explain certain phenomena (i.e., for dimensionality reduction). There are many genomic and proteomic applications that rely on feature selection to answer questions such as selecting signature genes which are informative about some biological state, e. g., normal tissues and several types of cancer; or inferring a prediction network among elements such as genes, proteins and external stimuli. In these applications, a recurrent problem is the lack of samples to perform an adequate estimate of the joint probabilities between element states. A myriad of feature selection algorithms and criterion functions have been proposed, although it is difficult to point the best solution for each application. Results: The intent of this work is to provide an open-source multiplataform graphical environment for bioinformatics problems, which supports many feature selection algorithms, criterion functions and graphic visualization tools such as scatterplots, parallel coordinates and graphs. A feature selection approach for growing genetic networks from seed genes ( targets or predictors) is also implemented in the system. Conclusion: The proposed feature selection environment allows data analysis using several algorithms, criterion functions and graphic visualization tools. Our experiments have shown the software effectiveness in two distinct types of biological problems. Besides, the environment can be used in different pattern recognition applications, although the main concern regards bioinformatics tasks.

An algorithmic Friedman-Pippenger theorem on tree embeddings and applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

An (n, d)-expander is a graph G = (V, E) such that for every X subset of V with vertical bar X vertical bar <= 2n - 2 we have vertical bar Gamma(G)(X) vertical bar >= (d + 1) vertical bar X vertical bar. A tree T is small if it has at most n vertices and has maximum degree at most d. Friedman and Pippenger (1987) proved that any ( n; d)- expander contains every small tree. However, their elegant proof does not seem to yield an efficient algorithm for obtaining the tree. In this paper, we give an alternative result that does admit a polynomial time algorithm for finding the immersion of any small tree in subgraphs G of (N, D, lambda)-graphs Lambda, as long as G contains a positive fraction of the edges of Lambda and lambda/D is small enough. In several applications of the Friedman-Pippenger theorem, including the ones in the original paper of those authors, the (n, d)-expander G is a subgraph of an (N, D, lambda)-graph as above. Therefore, our result suffices to provide efficient algorithms for such previously non-constructive applications. As an example, we discuss a recent result of Alon, Krivelevich, and Sudakov (2007) concerning embedding nearly spanning bounded degree trees, the proof of which makes use of the Friedman-Pippenger theorem. We shall also show a construction inspired on Wigderson-Zuckerman expander graphs for which any sufficiently dense subgraph contains all trees of sizes and maximum degrees achieving essentially optimal parameters. Our algorithmic approach is based on a reduction of the tree embedding problem to a certain on-line matching problem for bipartite graphs, solved by Aggarwal et al. (1996).

Context tree selection: A unifying view

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Context tree models have been introduced by Rissanen in [25] as a parsimonious generalization of Markov models. Since then, they have been widely used in applied probability and statistics. The present paper investigates non-asymptotic properties of two popular procedures of context tree estimation: Rissanen's algorithm Context and penalized maximum likelihood. First showing how they are related, we prove finite horizon bounds for the probability of over- and under-estimation. Concerning overestimation, no boundedness or loss-of-memory conditions are required: the proof relies on new deviation inequalities for empirical probabilities of independent interest. The under-estimation properties rely on classical hypotheses for processes of infinite memory. These results improve on and generalize the bounds obtained in Duarte et al. (2006) [12], Galves et al. (2008) [18], Galves and Leonardi (2008) [17], Leonardi (2010) [22], refining asymptotic results of Buhlmann and Wyner (1999) [4] and Csiszar and Talata (2006) [9]. (C) 2011 Elsevier B.V. All rights reserved.

TESTING STATISTICAL HYPOTHESIS ON RANDOM TREES AND APPLICATIONS TO THE PROTEIN CLASSIFICATION PROBLEM

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.

BORON UPTAKE AND DISTRIBUTION IN FIELD GROWN CITRUS TREES

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In low fertility tropical soils, boron (B) deficiency impairs fruit production. However, little information is available on the efficiency of nutrient application and use by trees. Therefore, this work verified the effects of soil and foliar applications of boron in a commercial citrus orchard. An experiment was conducted with fertigated 4-year-old `Valencia` sweet orange trees on `Swingle` citrumelo rootstock. Boron (isotopically-enriched 10B) was supplied to trees once or twice in the growing season, either dripped in the soil or sprayed on the leaves. Trees were sampled at different periods and separated into different parts for total B contents and 10B/11B isotope ratios analyses. Soil B applied via fertigation was more efficient than foliar application for the organs grown after the B fertilization. Recovery of labeled B by fruits was 21% for fertigation and 7% for foliar application. Residual effects of nutrient application in the grove were observed in the year after labeled fertilizer application, which greater proportions derived from the soil supply.

Comparison of univariate and multivariate calibration for the determination of micronutrients in pellets of plant materials by laser induced breakdown spectrometry

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The application of laser induced breakdown spectrometry (LIBS) aiming the direct analysis of plant materials is a great challenge that still needs efforts for its development and validation. In this way, a series of experimental approaches has been carried out in order to show that LIBS can be used as an alternative method to wet acid digestions based methods for analysis of agricultural and environmental samples. The large amount of information provided by LIBS spectra for these complex samples increases the difficulties for selecting the most appropriated wavelengths for each analyte. Some applications have suggested that improvements in both accuracy and precision can be achieved by the application of multivariate calibration in LIBS data when compared to the univariate regression developed with line emission intensities. In the present work, the performance of univariate and multivariate calibration, based on partial least squares regression (PLSR), was compared for analysis of pellets of plant materials made from an appropriate mixture of cryogenically ground samples with cellulose as the binding agent. The development of a specific PLSR model for each analyte and the selection of spectral regions containing only lines of the analyte of interest were the best conditions for the analysis. In this particular application, these models showed a similar performance. but PLSR seemed to be more robust due to a lower occurrence of outliers in comparison to the univariate method. Data suggests that efforts dealing with sample presentation and fitness of standards for LIBS analysis must be done in order to fulfill the boundary conditions for matrix independent development and validation. (C) 2009 Elsevier B.V. All rights reserved.

Economic viability of doses and split-applications of nitrogen fertilization in corn crop in a eutrophic Red Latosol

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Nitrogen is the nutrient that is most absorbed by the corn crop, with the most complex management, and has the highest share on the cost of corn production. The objective of this work was to evaluate the economic viability of different rates and split-applications of nitrogen fertilization, as such as urea, in the corn crop in a eutrophic Red Latosol (Oxisol). The study was carried out in the Experimental Station of the Regional Pole of the Sao Paulo Northwest Agribusiness Development (APTA), in Votuporanga, State of Sao Paulo, Brazil. The experimental design was randomized complete blocks with nine treatments and four replications, consisting of five N rates: 0, 55, 95, 135 and 175 kg ha(-1), 15 kg ha-l applied in the seeding and the remainder in top dressing: 40 and 80 kg ha(-1) N at forty days after seeding (DAS), or 1/2 + 1/2 at 20 and 40 DAS; 120 kg ha-1 N split in 1/2 + 1/2 or 1/3 + 1/3 + 1/3 at 20, 40 or 60 DAS; 160 kg ha(-1) N split in 1/4 + 3/8 + 3/8 or 114 + 1/4 + 1/4 + 1/4 at 20, 40, 60 and 80 DAS. The application of 135 kg ha-l of N split in three times provided the best benefit/cost ratio. The non-application of N provided the lowest economic return, proving to be unviable.

Fractionation of Zn, Cd and Pb in a Tropical Soil After Nine-Year Sewage Sludge Applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A long-term field experiment was carried out in the experiment farm of the Sao Paulo State University, Brazil, to evaluate the phytoavailability of Zn, Cd and Pb in a Typic Eutrorthox soil treated with sewage sludge for nine consecutive years, using the sequential extraction and organic matter fractionation methods. During 2005-2006, maize (Zea mays L.) was used as test plants and the experimental design was in randomized complete blocks with four treatments and five replicates. The treatments consisted of four sewage sludge rates (in a dry basis): 0.0 (control, with mineral fertilization), 45.0, 90.0 and 127.5 t ha(-1), annually for nine years. Before maize sowing, the sewage sludge was manually applied to the soil and incorporated at 10 cm depth. Soil samples (0-20 cm layer) for Zn, Cd and Pb analysis were collected 60 days after sowing. The successive applications of sewage sludge to the soil did not affect heavy metal (Cd and Pb) fractions in the soil, with exception of Zn fractions. The Zn, Cd and Pb distributions in the soil were strongly associated with humin and residual fractions, which are characterized by stable chemical bonds. Zinc, Cd and Pb in the soil showed low phytoavailability after nine-year successive applications of sewage sludge to the soil.

Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches. Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public. Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. The strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies. Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve). Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.

MODELING SCIENTIFIC AGENTS FOR A BETTER SCIENCE

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Science is a fundamental human activity and we trust its results because it has several error-correcting mechanisms. It is subject to experimental tests that are replicated by independent parts. Given the huge amount of information available and the information asymetry between producers and users of knowledge, scientists have to rely on the reports of others. This makes it possible for social effects to influence the scientific community. Here, an Opinion Dynamics agent model is proposed to describe this situation. The influence of Nature through experiments is described as an external field that acts on the experimental agents. We will see that the retirement of old scientists can be fundamental in the acceptance of a new theory. We will also investigate the interplay between social influence and observations. This will allow us to gain insight in the problem of when social effects can have negligible effects in the conclusions of a scientific community and when we should worry about them.

Human multipotent adipose-derived stem cells restore dystrophin expression of Duchenne skeletal-muscle cells in vitro

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background information. DMD (Duchenne muscular dystrophy) is a devastating X-linked disorder characterized by progressive muscle degeneration and weakness. The use of cell therapy for the repair of defective muscle is being pursued as a possible treatment for DMD. Mesenchymal stem cells have the potential to differentiate and display a myogenic phenotype in vitro. Since liposuctioned human fat is available in large quantities, it may be an ideal source of stem cells for therapeutic applications. ASCs (adipose-derived stem cells) are able to restore dystrophin expression in the muscles of mdx (X-linked muscular dystrophy) mice. However, the outcome when these cells interact with human dystrophic muscle is still unknown. Results. We show here that ASCs participate in myotube formation when cultured together with differentiating human DMD myoblasts, resulting in the restoration of dystrophin expression. Similarly, dystrophin was induced when ASCs were co-cultivated with DMD myotubes. Experiments with GFP (green fluorescent protein)-positive ASCs and DAPI (4,6-diamidino-2-phenylindole)-stained DMD myoblasts indicated that ASCs participate in human myogenesis through cellular fusion. Conclusions. These results show that ASCs have the potential to interact with dystrophic muscle cells, restoring dystrophin expression of DMD cells in vitro. The possibility of using adipose tissue as a source of stem cell therapies for muscular diseases is extremely exciting.

THE IMPORTANCE OF DISAGREEING: CONTRARIANS AND EXTREMISM IN THE CODA MODEL

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we study the effects of introducing contrarians in a model of Opinion Dynamics where the agents have internal continuous opinions, but exchange information only about a binary choice that is a function of their continuous opinion, the CODA model. We observe that the hung election scenario that arises when contrarians are introduced in discrete opinion models still happens. However, it is weaker and it should not be expected in every election. Finally, we also show that the introduction of contrarians make the tendency towards extremism of the original model weaker, indicating that the existence of agents that prefer to disagree might be an important aspect and help society to diminish extremist opinions.

A new indicator for international visibility: exploring Brazilian scientific community

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Brazilian science has increased fast during the last decades. An example is the increasing in the country`s share in the world`s scientific publication within the main international databases. But what is the actual weight of international publications to the whole Brazilian productivity? In order to respond this question, we have elaborated a new indicator, the International Publication Ratio (IPR). The data source was Lattes Database, a database organized by one of the main Brazilian S&T funding agency, which encompasses publication data from 1997 to 2004 of about 51,000 Brazilian researchers. Influences of distinct parameters, such as sectors, fields, career age and gender, are analyzed. We hope the data presented may help S&T managers and other S&T interests to better understand the complexity under the concept scientific productivity, especially in peripheral countries in science, such as Brazil.

New perspectives on the processing and release of public information

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This article discusses issues related to the organization and reception of information in the context of services and public information systems driven by technology. It stems from the assumption that in a ""technologized"" society, the distance between users and information is almost always of cognitive and socio-cultural nature, a product of our effort to design communication. In this context, we favor the approach of the information sign, seeking to answer how a documentary message turns into information, i.e. a structure recognized as socially useful. Observing the structural, cognitive and communicative aspects of the documentary message, based on Documentary Linguistics, Terminology, as well as on Textual Linguistics, the policy of knowledge management and innovation of the Government of the State of Sao Paulo is analyzed, which authorizes the use of Web 2.0, also questioning to what extent this initiative represents innovation in the environment of libraries.

The notion of structure and the information records of the documentary systems

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Assuming as a starting point the acknowledge that the principles and methods used to build and manage the documentary systems are disperse and lack systematization, this study hypothesizes that the notion of structure, when assuming mutual relationships among its elements, promotes more organical systems and assures better quality and consistency in the retrieval of information concerning users` matters. Accordingly, it aims to explore the fundamentals about the records of information and documentary systems, starting from the notion of structure. In order to achieve that, it presents basic concepts and relative matters to documentary systems and information records. Next to this, it lists the theoretical subsides over the notion of structure, studied by Benveniste, Ferrater Mora, Levi-Strauss, Lopes, Penalver Simo, Saussure, apart from Ducrot, Favero and Koch. Appropriations that have already been done by Paul Otlet, Garcia Gutierrez and Moreiro Gonzalez. In Documentation come as a further topic. It concludes that the adopted notion of structure to make explicit a hypothesis of real systematization achieves more organical systems, as well as it grants pedagogical reference to the documentary tasks.

«
1
2
3
4
5
6
7
8
9
»