960 resultados para similarity queries
Resumo:
This study assesses the decline in second birth rates for men and women across different skill levels in transitional Russia. Changes within educational groups and occupational classes are observed over three distinct time periods: the Soviet era, economic crisis, and economic recovery. The most remarkable finding is the similarity in the extent second birth rates declined within educational groups and occupational classes during the economic crisis. Although further decline occurred in the recovery period, more variation emerged across groups.
Resumo:
Background: To enhance our understanding of complex biological systems like diseases we need to put all of the available data into context and use this to detect relations, pattern and rules which allow predictive hypotheses to be defined. Life science has become a data rich science with information about the behaviour of millions of entities like genes, chemical compounds, diseases, cell types and organs, which are organised in many different databases and/or spread throughout the literature. Existing knowledge such as genotype - phenotype relations or signal transduction pathways must be semantically integrated and dynamically organised into structured networks that are connected with clinical and experimental data. Different approaches to this challenge exist but so far none has proven entirely satisfactory. Results: To address this challenge we previously developed a generic knowledge management framework, BioXM™, which allows the dynamic, graphic generation of domain specific knowledge representation models based on specific objects and their relations supporting annotations and ontologies. Here we demonstrate the utility of BioXM for knowledge management in systems biology as part of the EU FP6 BioBridge project on translational approaches to chronic diseases. From clinical and experimental data, text-mining results and public databases we generate a chronic obstructive pulmonary disease (COPD) knowledge base and demonstrate its use by mining specific molecular networks together with integrated clinical and experimental data. Conclusions: We generate the first semantically integrated COPD specific public knowledge base and find that for the integration of clinical and experimental data with pre-existing knowledge the configuration based set-up enabled by BioXM reduced implementation time and effort for the knowledge base compared to similar systems implemented as classical software development projects. The knowledgebase enables the retrieval of sub-networks including protein-protein interaction, pathway, gene - disease and gene - compound data which are used for subsequent data analysis, modelling and simulation. Pre-structured queries and reports enhance usability; establishing their use in everyday clinical settings requires further simplification with a browser based interface which is currently under development.
Resumo:
Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.
Resumo:
One of the first useful products from the human genome will be a set of predicted genes. Besides its intrinsic scientific interest, the accuracy and completeness of this data set is of considerable importance for human health and medicine. Though progress has been made on computational gene identification in terms of both methods and accuracy evaluation measures, most of the sequence sets in which the programs are tested are short genomic sequences, and there is concern that these accuracy measures may not extrapolate well to larger, more challenging data sets. Given the absence of experimentally verified large genomic data sets, we constructed a semiartificial test set comprising a number of short single-gene genomic sequences with randomly generated intergenic regions. This test set, which should still present an easier problem than real human genomic sequence, mimics the approximately 200kb long BACs being sequenced. In our experiments with these longer genomic sequences, the accuracy of GENSCAN, one of the most accurate ab initio gene prediction programs, dropped significantly, although its sensitivity remained high. Conversely, the accuracy of similarity-based programs, such as GENEWISE, PROCRUSTES, and BLASTX was not affected significantly by the presence of random intergenic sequence, but depended on the strength of the similarity to the protein homolog. As expected, the accuracy dropped if the models were built using more distant homologs, and we were able to quantitatively estimate this decline. However, the specificities of these techniques are still rather good even when the similarity is weak, which is a desirable characteristic for driving expensive follow-up experiments. Our experiments suggest that though gene prediction will improve with every new protein that is discovered and through improvements in the current set of tools, we still have a long way to go before we can decipher the precise exonic structure of every gene in the human genome using purely computational methodology.
Resumo:
Genomic plasticity of human chromosome 8p23.1 region is highly influenced by two groups of complex segmental duplications (SDs), termed REPD and REPP, that mediate different kinds of rearrangements. Part of the difficulty to explain the wide range of phenotypes associated with 8p23.1 rearrangements is that REPP and REPD are not yet well characterized, probably due to their polymorphic status. Here, we describe a novel primate-specific gene family, named FAM90A (family with sequence similarity 90), found within these SDs. According to the current human reference sequence assembly, the FAM90A family includes 24 members along 8p23.1 region plus a single member on chromosome 12p13.31, showing copy number variation (CNV) between individuals. These genes can be classified into subfamilies I and II, which differ in their upstream and 5′-untranslated region sequences, but both share the same open reading frame and are ubiquitously expressed. Sequence analysis and comparative fluorescence in situ hybridization studies showed that FAM90A subfamily II suffered a big expansion in the hominoid lineage, whereas subfamily I members were likely generated sometime around the divergence of orangutan and African great apes by a fusion process. In addition, the analysis of the Ka/Ks ratios provides evidence of functional constraint of some FAM90A genes in all species. The characterization of the FAM90A gene family contributes to a better understanding of the structural polymorphism of the human 8p23.1 region and constitutes a good example of how SDs, CNVs and rearrangements within themselves can promote the formation of new gene sequences with potential functional consequences.
Resumo:
Background: The analysis of the promoter sequence of genes with similar expression patterns isa basic tool to annotate common regulatory elements. Multiple sequence alignments are on thebasis of most comparative approaches. The characterization of regulatory regions from coexpressedgenes at the sequence level, however, does not yield satisfactory results in manyoccasions as promoter regions of genes sharing similar expression programs often do not shownucleotide sequence conservation.Results: In a recent approach to circumvent this limitation, we proposed to align the maps ofpredicted transcription factors (referred as TF-maps) instead of the nucleotide sequence of tworelated promoters, taking into account the label of the corresponding factor and the position in theprimary sequence. We have now extended the basic algorithm to permit multiple promotercomparisons using the progressive alignment paradigm. In addition, non-collinear conservationblocks might now be identified in the resulting alignments. We have optimized the parameters ofthe algorithm in a small, but well-characterized collection of human-mouse-chicken-zebrafishorthologous gene promoters.Conclusion: Results in this dataset indicate that TF-map alignments are able to detect high-levelregulatory conservation at the promoter and the 3'UTR gene regions, which cannot be detectedby the typical sequence alignments. Three particular examples are introduced here to illustrate thepower of the multiple TF-map alignments to characterize conserved regulatory elements inabsence of sequence similarity. We consider this kind of approach can be extremely useful in thefuture to annotate potential transcription factor binding sites on sets of co-regulated genes fromhigh-throughput expression experiments.
Resumo:
Studies of large sets of SNP data have proven to be a powerful tool in the analysis of the genetic structure of human populations. In this work, we analyze genotyping data for 2,841 SNPs in 12 Sub-Saharan African populations, including a previously unsampled region of south-eastern Africa (Mozambique). We show that robust results in a world-wide perspective can be obtained when analyzing only 1,000 SNPs. Our main results both confirm the results of previous studies, and show new and interesting features in Sub-Saharan African genetic complexity. There is a strong differentiation of Nilo-Saharans, much beyond what would be expected by geography. Hunter-gatherer populations (Khoisan and Pygmies) show a clear distinctiveness with very intrinsic Pygmy (and not only Khoisan) genetic features. Populations of the West Africa present an unexpected similarity among them, possibly the result of a population expansion. Finally, we find a strong differentiation of the south-eastern Bantu population from Mozambique, which suggests an assimilation of a pre-Bantu substrate by Bantu speakers in the region.
Resumo:
This paper proposes a novel approach for the analysis of illicit tablets based on their visual characteristics. In particular, the paper concentrates on the problem of ecstasy pill seizure profiling and monitoring. The presented method extracts the visual information from pill images and builds a representation of it, i.e. it builds a pill profile based on the pill visual appearance. Different visual features are used to build different image similarity measures, which are the basis for a pill monitoring strategy based on both discriminative and clustering models. The discriminative model permits to infer whether two pills come from the same seizure, while the clustering models groups of pills that share similar visual characteristics. The resulting clustering structure allows to perform a visual identification of the relationships between different seizures. The proposed approach was evaluated using a data set of 621 Ecstasy pill pictures. The results demonstrate that this is a feasible and cost effective method for performing pill profiling and monitoring.
Resumo:
BACKGROUND: CODIS-STRs in Native Mexican groups have rarely been analysed for human identification and anthropological purposes. AIM:To analyse the genetic relationships and population structure among three Native Mexican groups from Mesoamerica.SUBJECTS AND METHODS: 531 unrelated Native individuals from Mexico were PCR-typed for 15 and 9 autosomal STRs (Identifiler™ and Profiler™ kits, respectively), including five population samples: Purépechas (Mountain, Valley and Lake), Triquis and Yucatec Mayas. Previously published STR data were included in the analyses. RESULTS:Allele frequencies and statistical parameters of forensic importance were estimated by population. The majority of Native groups were not differentiated pairwise, excepting Triquis and Purépechas, which was attributable to their relative geographic and cultural isolation. Although Mayas, Triquis and Purépechas-Mountain presented the highest number of private alleles, suggesting recurrent gene flow, the elevated differentiation of Triquis indicates a different origin of this gene flow. Interestingly, Huastecos and Mayas were not differentiated, which is in agreement with the archaeological hypothesis that Huastecos represent an ancestral Maya group. Interpopulation variability was greater in Natives than in Mestizos, both significant.CONCLUSION: Although results suggest that European admixture has increased the similarity between Native Mexican groups, the differentiation and inconsistent clustering by language or geography stresses the importance of serial founder effect and/or genetic drift in showing their present genetic relationships.
Resumo:
This paper presents a new registration algorithm, called Temporal Di eomorphic Free Form Deformation (TDFFD), and its application to motion and strain quanti cation from a sequence of 3D ultrasound (US) images. The originality of our approach resides in enforcing time consistency by representing the 4D velocity eld as the sum of continuous spatiotemporal B-Spline kernels. The spatiotemporal displacement eld is then recovered through forward Eulerian integration of the non-stationary velocity eld. The strain tensor iscomputed locally using the spatial derivatives of the reconstructed displacement eld. The energy functional considered in this paper weighs two terms: the image similarity and a regularization term. The image similarity metric is the sum of squared di erences between the intensities of each frame and a reference one. Any frame in the sequence can be chosen as reference. The regularization term is based on theincompressibility of myocardial tissue. TDFFD was compared to pairwise 3D FFD and 3D+t FFD, bothon displacement and velocity elds, on a set of synthetic 3D US images with di erent noise levels. TDFFDshowed increased robustness to noise compared to these two state-of-the-art algorithms. TDFFD also proved to be more resistant to a reduced temporal resolution when decimating this synthetic sequence. Finally, this synthetic dataset was used to determine optimal settings of the TDFFD algorithm. Subsequently, TDFFDwas applied to a database of cardiac 3D US images of the left ventricle acquired from 9 healthy volunteers and 13 patients treated by Cardiac Resynchronization Therapy (CRT). On healthy cases, uniform strain patterns were observed over all myocardial segments, as physiologically expected. On all CRT patients, theimprovement in synchrony of regional longitudinal strain correlated with CRT clinical outcome as quanti ed by the reduction of end-systolic left ventricular volume at follow-up (6 and 12 months), showing the potential of the proposed algorithm for the assessment of CRT.
Resumo:
Purpose: The objective of this study is to investigate the feasibility of detecting and quantifying 3D cerebrovascular wall motion from a single 3D rotational x-ray angiography (3DRA) acquisition within a clinically acceptable time and computing from the estimated motion field for the further biomechanical modeling of the cerebrovascular wall. Methods: The whole motion cycle of the cerebral vasculature is modeled using a 4D B-spline transformation, which is estimated from a 4D to 2D + t image registration framework. The registration is performed by optimizing a single similarity metric between the entire 2D + t measured projection sequence and the corresponding forward projections of the deformed volume at their exact time instants. The joint use of two acceleration strategies, together with their implementation on graphics processing units, is also proposed so as to reach computation times close to clinical requirements. For further characterizing vessel wall properties, an approximation of the wall thickness changes is obtained through a strain calculation. Results: Evaluation on in silico and in vitro pulsating phantom aneurysms demonstrated an accurate estimation of wall motion curves. In general, the error was below 10% of the maximum pulsation, even in the situation when substantial inhomogeneous intensity pattern was present. Experiments on in vivo data provided realistic aneurysm and vessel wall motion estimates, whereas in regions where motion was neither visible nor anatomically possible, no motion was detected. The use of the acceleration strategies enabled completing the estimation process for one entire cycle in 5-10 min without degrading the overall performance. The strain map extracted from our motion estimation provided a realistic deformation measure of the vessel wall. Conclusions: The authors' technique has demonstrated that it can provide accurate and robust 4D estimates of cerebrovascular wall motion within a clinically acceptable time, although it has to be applied to a larger patient population prior to possible wide application to routine endovascular procedures. In particular, for the first time, this feasibility study has shown that in vivo cerebrovascular motion can be obtained intraprocedurally from a 3DRA acquisition. Results have also shown the potential of performing strain analysis using this imaging modality, thus making possible for the future modeling of biomechanical properties of the vascular wall.
Resumo:
Analytical results harmonisation is investigated in this study to provide an alternative to the restrictive approach of analytical methods harmonisation which is recommended nowadays for making possible the exchange of information and then for supporting the fight against illicit drugs trafficking. Indeed, the main goal of this study is to demonstrate that a common database can be fed by a range of different analytical methods, whatever the differences in levels of analytical parameters between these latter ones. For this purpose, a methodology making possible the estimation and even the optimisation of results similarity coming from different analytical methods was then developed. In particular, the possibility to introduce chemical profiles obtained with Fast GC-FID in a GC-MS database is studied in this paper. By the use of the methodology, the similarity of results coming from different analytical methods can be objectively assessed and the utility in practice of database sharing by these methods can be evaluated, depending on profiling purposes (evidential vs. operational perspective tool). This methodology can be regarded as a relevant approach for database feeding by different analytical methods and puts in doubt the necessity to analyse all illicit drugs seizures in one single laboratory or to implement analytical methods harmonisation in each participating laboratory.
Resumo:
Cutinized and suberized cell walls form physiological important plant-environment interfaces as they act as barriers limiting water and nutrient loss and protect from radiation and invasion by pathogens. Due to the lack of protocols for the isolation and analysis of cutin and suberin in Arabidopsis, the model plant for molecular biology, mutants and transgenic plants with a defined altered cutin or suberin composition are unavailable, causing that structure and function of these apoplastic barriers are still poorly understood. Transmission electron microscopy (TEM) revealed that Arabidopsis leaf cuticle thickness ranges from only 22 nm in leaf blades to 45 nm on petioles, causing the difficulty in cuticular membrane isolation. We report the use of polysaccharide hydrolases to isolate Arabidopsis cuticular membranes, suitable for depolymerization and subsequent compositional analysis. Although cutin characteristic omega-hydroxy acids (7%) and mid-chain hydroxylated fatty acids (8%) were detected, the discovery of alpha,omega-diacids (40%) and 2-hydroxy acids (14%) as major depolymerization products reveals a so far novel monomer composition in Arabidopsis cutin, but with chemical analogy to root suberin. Histochemical and TEM analysis revealed that suberin depositions were localized to the cell walls in the endodermis of primary roots and the periderm of mature roots of Arabidopsis. Enzyme digested and solvent extracted root cell walls when subjected to suberin depolymerization conditions released omega-hydroxy acids (43%) and alpha,omega-diacids (24%) as major components together with carboxylic acids (9%), alcohols (6%) and 2-hydroxyacids (0.1%). This similarity to suberin of other species indicates that Arabidopsis roots can serve as a model for suberized tissue in general.
Resumo:
Forty-two new apatite and zircon fission track ages are presented for samples from the Western Alps in southern Switzerland, northern Italy, and southeastern France. Measured ages plotted against assumed closure temperatures yield cooling patterns for the final cooling, uplift, and exhumation of the Western Alps. Similar fission track zircon ages in the Penninic Gran Paradiso massif, Dent Blanche nappe, Sesia-Lanzo Zone, and Ivrea Zone indicate cooling of all four units to approximately 225-degrees-C by 33 Ma. Differences in apatite ages reveal differential cooling of the four blocks between 33 Ma and the present. In the Sesia-Lanzo Zone, similarity of apatite ages regardless of elevation, together with near-volcanic confined fission track length patterns suggest rapid cooling and uplift at approximately 25 Ma compared with slow cooling of other Western Alps units around 12 Ma. Uplift is thus not continuous but episodic, often over a short time interval beyond the resolution of other methods. Such episodes of uplift, as revealed here in the Sesia-Lanzo Zone, may be the rule rather than the exception.
Resumo:
Cancer progression is dependent, in part, on interactions between tumor cells and the host microenvironment. During pregnancy, physiological changes occur that include inflammation and reduced immunity, both of which can promote tumor growth. Accordingly, tumors are observed to be more aggressive and to have greater proclivity toward metastasis during pregnancy. In this work, myeloid-derived suppressor cells (MDSC), a population of heterogeneous and pluripotent cells that can down-regulate immune responses during pathological conditions, were studied in the context of mouse and human gestation. The gene expression profile of mouse MDSC has been shown to differ in pregnant and virgin mice, and the profile in pregnant animals bears similarity to that of MDSC associated with the tumor microenvironment. Common induced genes include Fibronectin1 and Olfactomedin4, which are known to be involved in extracellular matrix remodeling and tissue permissiveness to tumor cells implantation. Our observations suggest that mouse MDSC may represent a shared regulatory mechanism of tissue permissiveness that occurs during the physiological state of gestation and tumor growth. Pregnancy-associated changes in immunosuppressive myeloid cell activity have also been studied in humans. We show that CD33+ myeloid cells isolated from PBMC (peripheral blood mononuclear cells) of pregnant women are more strongly immunosuppressive on T cells than CD33+ cells removed from non-pregnant subjects. During murine gestation, decreased natural killer (NK) cell activity is responsible, at least in part, for the increase in experimental metastasis. However, although peripheral blood NK cell numbers and cytotoxicity were slightly reduced in pregnant women, neither appeared to be regulated by CD33+ cells. Nevertheless, based on its observed suppression of T cell responses, the CD33+ PBMC subset appears to be an appropriate myeloid cell population to study in order to elucidate mechanisms of immune regulation that occur during human pregnancy. Our findings regarding the immunosuppressive function of CD33+cells and the role of NK cells during human pregnancy are consistent with the notion that changes in the function of the immune system participate in the constitution of a permissive soil for tumour progression.