59 resultados para Binary Vector

em Universit


Relevância:

60.00% 60.00%

Publicador:

Resumo:

A haplotype is an m-long binary vector. The XOR-genotype of two haplotypes is the m-vector of their coordinate-wise XOR. We study the following problem: Given a set of XOR-genotypes, reconstruct their haplotypes so that the set of resulting haplotypes can be mapped onto a perfect phylogeny (PP) tree. The question is motivated by studying population evolution in human genetics, and is a variant of the perfect phylogeny haplotyping problem that has received intensive attention recently. Unlike the latter problem, in which the input is "full" genotypes, here we assume less informative input, and so may be more economical to obtain experimentally. Building on ideas of Gusfield, we show how to solve the problem in polynomial time, by a reduction to the graph realization problem. The actual haplotypes are not uniquely determined by that tree they map onto, and the tree itself may or may not be unique. We show that tree uniqueness implies uniquely determined haplotypes, up to inherent degrees of freedom, and give a sufficient condition for the uniqueness. To actually determine the haplotypes given the tree, additional information is necessary. We show that two or three full genotypes suffice to reconstruct all the haplotypes, and present a linear algorithm for identifying those genotypes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A haplotype is an m-long binary vector. The XOR-genotype of two haplotypes is the m-vector of their coordinate-wise XOR. We study the following problem: Given a set of XOR-genotypes, reconstruct their haplotypes so that the set of resulting haplotypes can be mapped onto a perfect phylogeny (PP) tree. The question is motivated by studying population evolution in human genetics and is a variant of the PP haplotyping problem that has received intensive attention recently. Unlike the latter problem, in which the input is '' full '' genotypes, here, we assume less informative input and so may be more economical to obtain experimentally. Building on ideas of Gusfield, we show how to solve the problem in polynomial time by a reduction to the graph realization problem. The actual haplotypes are not uniquely determined by the tree they map onto and the tree itself may or may not be unique. We show that tree uniqueness implies uniquely determined haplotypes, up to inherent degrees of freedom, and give a sufficient condition for the uniqueness. To actually determine the haplotypes given the tree, additional information is necessary. We show that two or three full genotypes suffice to reconstruct all the haplotypes and present a linear algorithm for identifying those genotypes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Staphylococcus aureus harbors redundant adhesins mediating tissue colonization and infection. To evaluate their intrinsic role outside of the staphylococcal background, a system was designed to express them in Lactococcus lactis subsp. cremoris 1363. This bacterium is devoid of virulence factors and has a known genetic background. A new Escherichia coli-L. lactis shuttle and expression vector was constructed for this purpose. First, the high-copy-number lactococcal plasmid pIL253 was equipped with the oriColE1 origin, generating pOri253 that could replicate in E. coli. Second, the lactococcal promoters P23 or P59 were inserted at one end of the pOri253 multicloning site. Gene expression was assessed by a luciferase reporter system. The plasmid carrying P23 (named pOri23) expressed luciferase constitutively at a level 10,000 times greater than did the P59-containing plasmid. Transcription was absent in E. coli. The staphylococcal clumping factor A (clfA) gene was cloned into pOri23 and used as a model system. Lactococci carrying pOri23-clfA produced an unaltered and functional 130-kDa ClfA protein attached to their cell walls. This was indicated both by the presence of the protein in Western blots of solubilized cell walls and by the ability of ClfA-positive lactococci to clump in the presence of plasma. ClfA-positive lactococci had clumping titers (titer of 4,112) similar to those of S. aureus Newman in soluble fibrinogen and bound equally well to solid-phase fibrinogen. These experiments provide a new way to study individual staphylococcal pathogenic factors and might complement both classical knockout mutagenesis and modern in vivo expression technology and signature tag mutagenesis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The algorithmic approach to data modelling has developed rapidly these last years, in particular methods based on data mining and machine learning have been used in a growing number of applications. These methods follow a data-driven methodology, aiming at providing the best possible generalization and predictive abilities instead of concentrating on the properties of the data model. One of the most successful groups of such methods is known as Support Vector algorithms. Following the fruitful developments in applying Support Vector algorithms to spatial data, this paper introduces a new extension of the traditional support vector regression (SVR) algorithm. This extension allows for the simultaneous modelling of environmental data at several spatial scales. The joint influence of environmental processes presenting different patterns at different scales is here learned automatically from data, providing the optimum mixture of short and large-scale models. The method is adaptive to the spatial scale of the data. With this advantage, it can provide efficient means to model local anomalies that may typically arise in situations at an early phase of an environmental emergency. However, the proposed approach still requires some prior knowledge on the possible existence of such short-scale patterns. This is a possible limitation of the method for its implementation in early warning systems. The purpose of this paper is to present the multi-scale SVR model and to illustrate its use with an application to the mapping of Cs137 activity given the measurements taken in the region of Briansk following the Chernobyl accident.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A glucocorticoid-responsive vector is described which allows for the highly inducible expression of complementary DNAs (cDNAs) in stably transfected mammalian cell lines. This vector, pLK-neo, composed of a variant mouse mammary tumor virus long terminal repeat promoter, containing a hormone regulatory element, a Geneticin resistance-encoding gene in a simian virus 40 transcription unit, and a polylinker insertion site for heterologous cDNAs, was used to express the polymeric immunoglobulin (poly-Ig) receptor and the thymocyte marker, Thy-1, in Madin-Darby canine kidney (MDCK) cells and in murine fibroblast L cells. A high level of poly-Ig receptor or Thy-1 mRNA accumulation was observed in MDCK cells in response to dexamethasone with a parallel ten- to 200-fold increase in protein synthesis depending on the recombinant protein and the transfected cell clone.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: We sought to improve upon previously published statistical modeling strategies for binary classification of dyslipidemia for general population screening purposes based on the waist-to-hip circumference ratio and body mass index anthropometric measurements. METHODS: Study subjects were participants in WHO-MONICA population-based surveys conducted in two Swiss regions. Outcome variables were based on the total serum cholesterol to high density lipoprotein cholesterol ratio. The other potential predictor variables were gender, age, current cigarette smoking, and hypertension. The models investigated were: (i) linear regression; (ii) logistic classification; (iii) regression trees; (iv) classification trees (iii and iv are collectively known as "CART"). Binary classification performance of the region-specific models was externally validated by classifying the subjects from the other region. RESULTS: Waist-to-hip circumference ratio and body mass index remained modest predictors of dyslipidemia. Correct classification rates for all models were 60-80%, with marked gender differences. Gender-specific models provided only small gains in classification. The external validations provided assurance about the stability of the models. CONCLUSIONS: There were no striking differences between either the algebraic (i, ii) vs. non-algebraic (iii, iv), or the regression (i, iii) vs. classification (ii, iv) modeling approaches. Anticipated advantages of the CART vs. simple additive linear and logistic models were less than expected in this particular application with a relatively small set of predictor variables. CART models may be more useful when considering main effects and interactions between larger sets of predictor variables.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Epidemiological studies of malaria or other vector-transmitted diseases often consider vectors as passive actors in the complex life cycle of the parasites, assuming that vector populations are homogeneous and vertebrate hosts are equally susceptible to being infected during their lifetime. However, some studies based on both human and rodent malaria systems found that mosquito vectors preferentially selected infected vertebrate hosts. This subject has been scarcely investigated in avian malaria models and even less in wild animals using natural host-parasite associations. We investigated whether the malaria infection status of wild great tits, Parus major, played a role in host selection by the mosquito vector Culex pipiens. Pairs of infected and uninfected birds were tested in a dual-choice olfactometer to assess their attractiveness to the mosquitoes. Plasmodium-infected birds attracted significantly fewer mosquitoes than the uninfected ones, which suggest that avian malaria parasites alter hosts' odours involved in vector orientation. Reaction time of the mosquitoes, that is, the time taken to select a host, and activation of mosquitoes, defined as the proportion of individuals flying towards one of the hosts, were not affected by the bird's infection status. The importance of these behavioural responses for the vector is discussed in light of recent advances in related or similar model systems.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Introduction: In normal mice, lentiviral vector (LV) shows a great efficiency to infect the RPE cells, but transduces retinal neurons more efficiently during development. Here, we investigated the tropism of LV in the degenerating retina of mice, knowing that the retina structure changes during degeneration. We postulated that the viral transduction would be increased by the alteration of the interphotoreceptor matrix (IPM). We tested two different LV-pseudotypes using the VSVG and the Mokola envelopes. Methods: Subretinal injections were performed in wild-type (C57/Bl6) and rhodopsin knockout (Rho-/-) mice. We injected LV-VSVG-EFS-GFPII into 3.3-4.9 month old mice and LV-VSVG-Rho-GFP into 1-1.4 month old mice to target the photoreceptors (PR). LV-MOK-CMV-GFP was injected into 2.4-3.3 months old mice. We sacrificed the animals one week post injection, used immunohistochemistry to identify the transduced cells, and investigated the OLM integrity. Results: Using LV-VSVG-EFS-GFPII into 3.3-4.9 months mice, we observed significant retinal and RPE transduction in Rho-/- mice. However, the retinas showed transduction mainly at the injection's site. We mostly observed GFP+ cells having a Müller cell morphology. Using LV-MOK-CMV-GFP into 2.4-3.3 months mice, we evidenced the same pattern of viral infection, but with more Müller cells targeted by the virus. Using LV-VSVG-Rho-GFP into 1-1.4 month old mice, we don't note any difference between Rho-/- and wild-type mice for transduced cells. The IPM stained with ZO1 appears irregular into the 4.9 months old Rho-/- mice; for the youngest mice (Rho-/- and C57/Bl6), there is no modification of the IPM. Conclusion: The degeneration improves retinal cells transduction due to the alteration of the IPM in old Rho-/- mice. Müller cells seem (by morphological evidences) to be the principal cells expressing the transgene. The LV with Mokola envelope can transduce Müller cells in a degenerating retina with an intact IPM. In 1 month old mice, the degeneration doesn't enhance the transduction in rod PR probably because the IPM is not yet altered. The possibility to target photoreceptors at a later stage of the degeneration is under investigation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Uncertainty quantification of petroleum reservoir models is one of the present challenges, which is usually approached with a wide range of geostatistical tools linked with statistical optimisation or/and inference algorithms. Recent advances in machine learning offer a novel approach to model spatial distribution of petrophysical properties in complex reservoirs alternative to geostatistics. The approach is based of semisupervised learning, which handles both ?labelled? observed data and ?unlabelled? data, which have no measured value but describe prior knowledge and other relevant data in forms of manifolds in the input space where the modelled property is continuous. Proposed semi-supervised Support Vector Regression (SVR) model has demonstrated its capability to represent realistic geological features and describe stochastic variability and non-uniqueness of spatial properties. On the other hand, it is able to capture and preserve key spatial dependencies such as connectivity of high permeability geo-bodies, which is often difficult in contemporary petroleum reservoir studies. Semi-supervised SVR as a data driven algorithm is designed to integrate various kind of conditioning information and learn dependences from it. The semi-supervised SVR model is able to balance signal/noise levels and control the prior belief in available data. In this work, stochastic semi-supervised SVR geomodel is integrated into Bayesian framework to quantify uncertainty of reservoir production with multiple models fitted to past dynamic observations (production history). Multiple history matched models are obtained using stochastic sampling and/or MCMC-based inference algorithms, which evaluate posterior probability distribution. Uncertainty of the model is described by posterior probability of the model parameters that represent key geological properties: spatial correlation size, continuity strength, smoothness/variability of spatial property distribution. The developed approach is illustrated with a fluvial reservoir case. The resulting probabilistic production forecasts are described by uncertainty envelopes. The paper compares the performance of the models with different combinations of unknown parameters and discusses sensitivity issues.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Due to their performance enhancing properties, use of anabolic steroids (e.g. testosterone, nandrolone, etc.) is banned in elite sports. Therefore, doping control laboratories accredited by the World Anti-Doping Agency (WADA) screen among others for these prohibited substances in urine. It is particularly challenging to detect misuse with naturally occurring anabolic steroids such as testosterone (T), which is a popular ergogenic agent in sports and society. To screen for misuse with these compounds, drug testing laboratories monitor the urinary concentrations of endogenous steroid metabolites and their ratios, which constitute the steroid profile and compare them with reference ranges to detect unnaturally high values. However, the interpretation of the steroid profile is difficult due to large inter-individual variances, various confounding factors and different endogenous steroids marketed that influence the steroid profile in various ways. A support vector machine (SVM) algorithm was developed to statistically evaluate urinary steroid profiles composed of an extended range of steroid profile metabolites. This model makes the interpretation of the analytical data in the quest for deviating steroid profiles feasible and shows its versatility towards different kinds of misused endogenous steroids. The SVM model outperforms the current biomarkers with respect to detection sensitivity and accuracy, particularly when it is coupled to individual data as stored in the Athlete Biological Passport.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We use panel data from the U. S. Health and Retirement Study, 1992-2002, to estimate the effect of self-assessed health limitations on the active labor market participation of older men. Self-assessments of health are likely to be endogenous to labor supply due to justification bias and individual-specific heterogeneity in subjective evaluations. We address both concerns. We propose a semiparametric binary choice procedure that incorporates nonadditive correlated individual-specific effects. Our estimation strategy identifies and estimates the average partial effects of health and functioning on labor market participation. The results indicate that poor health plays a major role in labor market exit decisions.