25 resultados para Alignment-free method, dissimilarity, distance, genome, phylogenetic analysis.
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
The genome of the bladderwort Utricularia gibba provides an unparalleled opportunity to uncover the adaptive landscape of an aquatic carnivorous plant with unique phenotypic features such as absence of roots, development of water-filled suction bladders, and a highly ramified branching pattern. Despite its tiny size, the U. gibba genome accommodates approximately as many genes as other plant genomes. To examine the relationship between the compactness of its genome and gene turnover, we compared the U. gibba genome with that of four other eudicot species, defining a total of 17,324 gene families (orthogroups). These families were further classified as either 1) lineage-specific expanded/contracted or 2) stable in size. The U. gibba-expanded families are generically related to three main phenotypic features: 1) trap physiology, 2) key plant morphogenetic/developmental pathways, and 3) response to environmental stimuli, including adaptations to life in aquatic environments. Further scans for signatures of protein functional specialization permitted identification of seven candidate genes with amino acid changes putatively fixed by positive Darwinian selection in the U. gibba lineage. The Arabidopsis orthologs of these genes (AXR, UMAMIT41, IGS, TAR2, SOL1, DEG9, and DEG10) are involved in diverse plant biological functions potentially relevant for U. gibba phenotypic diversification, including 1) auxin metabolism and signal transduction, 2) flowering induction and floral meristem transition, 3) root development, and 4) peptidases. Taken together, our results suggest numerous candidate genes and gene families as interesting targets for further experimental confirmation of their functional and adaptive roles in the U. gibba's unique lifestyle and highly specialized body plan.
Resumo:
BACKGROUND: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Resumo:
A cultivation-independent approach based on polymerase chain reaction (PCR)-amplified partial small subunit rRNA genes was used to characterize bacterial populations in the surface soil of a commercial pear orchard consisting of different pear cultivars during two consecutive growing seasons. Pyrus communis L. cvs Blanquilla, Conference, and Williams are among the most widely cultivated cultivars in Europe and account for the majority of pear production in Northeastern Spain. To assess the heterogeneity of the community structure in response to environmental variables and tree phenology, bacterial populations were examined using PCR-denaturing gradient gel electrophoresis (DGGE) followed by cluster analysis of the 16S ribosomal DNA profiles by means of the unweighted pair group method with arithmetic means. Similarity analysis of the band patterns failed to identify characteristic fingerprints associated with the pear cultivars. Both environmentally and biologically based principal-component analyses showed that the microbial communities changed significantly throughout the year depending on temperature and, to a lesser extent, on tree phenology and rainfall. Prominent DGGE bands were excised and sequenced to gain insight into the identities of the predominant bacterial populations. Most DGGE band sequences were related to bacterial phyla, such as Bacteroidetes, Cyanobacteria, Acidobacteria, Proteobacteria, Nitrospirae, and Gemmatimonadetes, previously associated with typical agronomic crop environments
Resumo:
This paper establishes a general framework for metric scaling of any distance measure between individuals based on a rectangular individuals-by-variables data matrix. The method allows visualization of both individuals and variables as well as preserving all the good properties of principal axis methods such as principal components and correspondence analysis, based on the singular-value decomposition, including the decomposition of variance into components along principal axes which provide the numerical diagnostics known as contributions. The idea is inspired from the chi-square distance in correspondence analysis which weights each coordinate by an amount calculated from the margins of the data table. In weighted metric multidimensional scaling (WMDS) we allow these weights to be unknown parameters which are estimated from the data to maximize the fit to the original distances. Once this extra weight-estimation step is accomplished, the procedure follows the classical path in decomposing a matrix and displaying its rows and columns in biplots.
Resumo:
The set covering problem is an NP-hard combinatorial optimization problemthat arises in applications ranging from crew scheduling in airlines todriver scheduling in public mass transport. In this paper we analyze searchspace characteristics of a widely used set of benchmark instances throughan analysis of the fitness-distance correlation. This analysis shows thatthere exist several classes of set covering instances that have a largelydifferent behavior. For instances with high fitness distance correlation,we propose new ways of generating core problems and analyze the performanceof algorithms exploiting these core problems.
Resumo:
Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.
Resumo:
Lipoxygenases are non-heme iron enzymes essential in eukaryotes, where they catalyze the formation of the fatty acid hydroperoxides that are required by a large diversity of biological and pathological processes. In prokaryotes, most of them totally lacking in polyunsaturated fatty acids, the possible biological roles oflipoxygenases have remained obscure. In this study, it is reported the crystallization of a lipoxygenase of Pseudomonas aeruginosa (Pa_LOX), the first from a prokaryote. High resolution data has been acquired which is expected to yield structural clues to the questions adressed. Besides, a preliminar phylogenetic analysis using 14 sequences has confirmed the existence of this subfamily of bacterial lipoxygenases, on one side, and a greater diversity than in the corresponding eukaryotic ones, on the other. Finally, an evolutionary study of bacteriallipoxygenases on the same set of lipoxygenases, show a selection pressure of a basically purifying or neutral character except for a single aminoacid, which would have been selected after a positive selection event.
Resumo:
Lipoxygenases are non-heme iron enzymes essential in eukaryotes, where they catalyze the formation of the fatty acid hydroperoxides that are required by a large diversity of biological and pathological processes. In prokaryotes, most of them totally lacking in polyunsaturated fatty acids, the possible biological roles oflipoxygenases have remained obscure. In this study, it is reported the crystallization of a lipoxygenase of Pseudomonas aeruginosa (Pa_LOX), the first from a prokaryote. High resolution data has been acquired which is expected to yield structural clues to the questions adressed. Besides, a preliminar phylogenetic analysis using 14 sequences has confirmed the existence of this subfamily of bacterial lipoxygenases, on one side, and a greater diversity than in the corresponding eukaryotic ones, on the other. Finally, an evolutionary study of bacteriallipoxygenases on the same set of lipoxygenases, show a selection pressure of a basically purifying or neutral character except for a single aminoacid, which would have been selected after a positive selection event.
Resumo:
A simple and most promising oxide-assisted catalyst-free method is used to prepare silicon nitride nanowires that give rise to high yield in a short time. After a brief analysis of the state of the art, we reveal the crucial role played by the oxygen partial pressure: when oxygen partial pressure is slightly below the threshold of passive oxidation, a high yield inhibiting the formation of any silica layer covering the nanowires occurs and thanks to the synthesis temperature one can control nanowire dimensions
Resumo:
BACKGROUND: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Resumo:
Selection of amino acid substitutions associated with resistance to nucleos(t)ide-analog (NA) therapy in the hepatitis B virus (HBV) reverse transcriptase (RT) and their combination in a single viral genome complicates treatment of chronic HBV infection and may affect the overlapping surface coding region. In this study, the variability of an overlapping polymerase-surface region, critical for NA resistance, is investigated before treatment and under antiviral therapy, with assessment of NA-resistant amino acid changes simultaneously occurring in the same genome (linkage analysis) and their influence on the surface coding region.
Resumo:
Plesiomonas shigelloides, the only species of the genus, is an emergent pathogenic bacterium associated with human diarrheal and extraintestinal disease. We present the whole-genome sequence analysis of the representative strain for the O1 serotype (strain 302-73), providing a tool for studying bacterial outbreaks, virulence factors, and accurate diagnostic methods.
Resumo:
The distribution of the genus Barbadocladius Cranston & Krosch (Diptera: Chironomidae), previously reported from Chile to Bolivia, has extended northwards. Larvae, pupae and pupal exuviae of this genus have been found in the high mountain tropical streams of Peru to 9°22′56″, but are restricted to very high altitude streams (altitudes over 3,278 m asl) compared to the lower altitude streams (below 1,100 m asl) in which the genus is reported in Chile and Argentina. Based on morphological studies, both described species in the genus, Barbadocladius andinus Cranston & Krosch and Barbadocladius limay Cranston & Krosch, have been found in Peru as pupae or pupal exuviae. Morphological analysis of the larvae and pupae revealed no differences between the two described species from Patagonia and Peru, which are of similar size and with a similar armament of hooklets and spines in pupal tergites and sternites. However, molecular analysis of larvae and pupae revealed that in Peru, there are at least two different evolutionary lines, one distributed widely and another restricted to one site. Phylogenetic analysis (using cox1 mitochondrial sequences) of all available sequences of Barbadocladius shows that the Chilean and Argentinean material differs from that of Peru. Therefore, a total of four molecular segregates are identified, although morphologically, neither larvae nor the pupae may be differentiated.
Resumo:
En el present estudi s'analitza l'origen i evolució de 2 molècules claus pera entendre la multicel·lularitat dels animals: les molècules d'adhesió integrines i els factors de transcripció T-box. S’utilitzen els genomes recentment publicats de protists unicel•lulars parents propers dels animals. S’analitza l’origen i evolució d’aquests gens mitjançant anàlisi filogènic, determinació de motius funcionals i també tècniques de biologia molecular. A més, es documenta un cas de transferència gènica horitzontal des d'un eucariota cap a un procariota, fenomen poc habitual. Les principals conclusions són que tant l’adhesoma d'integrina com els gens T-box tenen un origen molt anterior als animals, en un context unicel•lular, i que després foren cooptats pel llinatge multicel•lular dels animals.
Obtenció de nous anàlegs amb activitat brassinoesteroide mitjançant modelització molecular i síntesi
Resumo:
Els brassinoesteroides són productes naturals que actuen com a potents reguladors del creixement vegetal. Presenten aplicacions prometedores en l’agricultura degut a que, aplicats exògenament, augmenten la qualitat i la quantitat de les collites. Ara bé, el seu ús s’ha vist restringit degut a la seva costosa obtenció. Aquest fet ha motivat la recerca de nous compostos actius més assequibles. En aquest projecte es planteja el disseny i obtenció de nous anàlegs seguint diferents estratègies que impliquen tant l’ús de mètodes de modelització molecular com de síntesi orgànica. La primera d’aquestes estratègies consisteix en buscar compostos actius en bases de dades de compostos comercials a través de processos de Virtual Screening desenvolupats amb mètodes computacionals basats en Camps d’Interacció Molecular. Així, es van establir i interpretar models de Relacions Quantitatives Estructura-Activitat (QSAR) emprant descriptors independents de l’alineament (GRIND) i, amb col•laboració amb la Universitat de Perugia, aquest criteri de cerca es va ampliar amb l’aplicació de descriptors FLAP de nova generació. Una altra estratègia es va basar en intentar substituir l’esquelet esteroide dels brassinoesteroides per una estructura equivalent, fixant com a cadena lateral el grup (R)-hexahidromandelil. S’han aplicat dos criteris: mètodes computacionals basats en models QSAR establerts amb descriptors GRIND i també en la metodologia SHOP (scaffold hopping), i, per altra banda, anàlegs proposats racionalment a partir d’un estudi efectuat sobre disruptors endocrins no esteroïdals. Sobre les estructures trobades s’hi va unir la cadena lateral comercial esmentada per via sintètica, en la qual s’ha hagut de fer un èmfasi especial en grups protectors. En total, 49 estructures es proposen per a ser obtingudes sintèticament. També s’ha treballat en l’obtenció un agonista derivat de l’hipotètic antagonista KM-01. Totes les molècules candidates, ja siguin comercials o obtingudes sintèticament, estant sent avaluades en el test d’inclinació de la làmina d’arròs (RLIT).