874 resultados para Complete Genome Sequence
Resumo:
Abstract Background The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Results Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Conclusion Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.
Resumo:
Abstract Background Hepatitis B virus (HBV) can be classified into nine genotypes (A-I) defined by sequence divergence of more than 8% based on the complete genome. This study aims to identify the genotypic distribution of HBV in 40 HBsAg-positive patients from Rondônia, Brazil. A fragment of 1306 bp partially comprising surface and polymerase overlapping genes was amplified by PCR. Amplified DNA was purified and sequenced. Amplified DNA was purified and sequenced on an ABI PRISM® 377 Automatic Sequencer (Applied Biosystems, Foster City, CA, USA). The obtained sequences were aligned with reference sequences obtained from the GenBank using Clustal X software and then edited with Se-Al software. Phylogenetic analyses were conducted by the Markov Chain Monte Carlo (MCMC) approach using BEAST v.1.5.3. Results The subgenotypes distribution was A1 (37.1%), D3 (22.8%), F2a (20.0%), D4 (17.1%) and D2 (2.8%). Conclusions These results for the first HBV genotypic characterization in Rondônia state are consistent with other studies in Brazil, showing the presence of several HBV genotypes that reflects the mixed origin of the population, involving descendants from Native Americans, Europeans, and Africans.
Resumo:
Parasiten der Apicomplexa umfassen sowohl humanpathogene, als auch tierpathogene Protozoen. Beispiele für wichtige Vertreter human- und tierpathogener Parasiten sind Plasmodium falciparum und Eimeria tenella. E. tenella verursacht die Kokzidiose des Hühnchens, eine Darmerkrankung die weltweit für Verluste in einer geschätzten Höhe von bis zu 3 Milliarden US$ verantwortlich zeichnet. Eine prophylaktische Vakzinierung gegen diese Krankheit ist ökonomisch meist ineffizient, und eine Behandlung mit Kokzidiostatika wird durch häufige Resistenzbildung gegen bekannte Wirkstoffe erschwert. Diese Situation erfordert die Entwicklung neuer kostengünstiger Alternativen. Geeignete Zielproteine für die Entwicklung neuartiger Arzneistoffe zur Behandlung der Kokzidiose sind die Zyklin-abhängigen Kinasen (CDKs), zu denen auch die CDK-related Kinase 2 (EtCRK2) aus E. tenella gehört. Diese Proteine sind maßgeblich an der Regulation des Zellzyklus beteiligt. Durch chemische Validierung mit dem CDK Inhibitor Flavopiridol konnte nachgewiesen werden, dass ein Funktionsverlust von CDKs in E. tenella die Vermehrung des Parasiten in Zellkultur inhibiert. E. tenella CDKs sind daher als Zielproteine für die Entwicklung einer Chemotherapie der Kokzidiose geeignet. Mittels bioinformatischer Tiefenanalysen sollten CDK Proteine im Parasiten E. tenella identifiziert werden. Das Genom von E. tenella liegt in Rohfassung vor [ftp://ftp.sanger.ac.uk]. Jedoch waren zum Zeitpunkt dieser Arbeiten viele Sequenzen des Genoms noch nicht annotiert. Homologe CDK Proteine von E. tenella konnten durch den Vergleich von Sequenzinformationen mit anderen Organismen der Apicomplexa identifiziert und analysiert werden. Durch diese Analysen konnten neben der bereits bekannten EtCRK2, drei weitere, bislang nicht annotierte CDKs in E. tenella identifiziert werden (EtCRK1, EtCRK3 sowie EtMRK). Darüber hinaus wurde eine Analyse der entsprechenden Zykline – der Aktivatoren der CDKs – bezüglich Funktion und Struktur, sowie eine Datenbanksuche nach bisher nicht beschriebenen Zyklinen in E. tenella durchgeführt. Diese Suchen ergaben vier neue potentielle Zykline für E. tenella, wovon EtCYC3a als Aktivator der EtCRK2 von María L. Suárez Fernández (Intervet Innovation GmbH, Schwabenheim) bestätigt werden konnte. Sequenzvergleiche lassen vermuten, dass auch EtCYC1 und EtCYC3b in der Lage sind, EtCRK2 zu aktivieren. Außerdem ist anzunehmen, dass EtCYC4 als Aktivator der EtCRK1 fungiert. Ein weiterer Schwerpunkt der vorliegenden Arbeit war die Suche und Optimierung nach neuen Inhibitoren von CDKs aus E. tenella. In vorangegangenen Arbeiten konnten bereits Inhibitoren der EtCRK2 gefunden werden [BEYER, 2007]. Mittels Substruktur- und Ähnlichkeitssuchen konnten im Rahmen dieser Arbeit weitere Inhibitoren der EtCRK2 identifiziert werden. Vier dieser Strukturklassen erfüllen die Kriterien einer Leitstruktur. Eine dieser Leitstrukturen gehört zur Strukturklasse der Benzimidazol-Carbonitrile und ist bislang nicht als Inhibitor anderer Kinasen beschrieben. Diese neu identifizierte Leitstruktur konnte in silico weiter optimiert werden. Im Rahmen dieser Arbeit wurden Bindungsenergien von Vertretern dieser Strukturklasse berechnet, um einen wahrscheinlichen Bindemodus vorherzusagen. Für die weiterführende in silico Optimierung wurde eine virtuelle kombinatorische Substanzbibliothek dieser Klasse erstellt. Die Auswahl geeigneter Verbindungen für eine chemische Synthese erfolgte durch molekulares Docking unter Nutzung von Homologiemodellen der EtCRK2. Darüber hinaus wurde ein in silico Screening nach potentiellen Inhibitoren der PfMRK und EtMRK durchgeführt. Dabei konnten weitere interessante virtuelle Hit-Strukturen aus einer Substanzdatenbank kommerziell erhältlicher Verbindungen gefunden werden. Durch dieses virtuelle Screening konnten jeweils sieben Verbindungen als virtuelle Hits der PfMRK sowie der EtMRK identifiziert werden. Die Häufung von Strukturklassen mit bekannter CDK Aktivität deutet darauf hin, dass während des virtuellen Screenings eine Anreicherung von CDK Inhibitoren stattgefunden hat. Diese Ergebnisse lassen auf eine Weiterentwicklung neuer Wirkstoffe gegen Kokzidiose und Malaria hoffen.
Resumo:
The mammalian collagen, type IX, alpha 2 gene (COL9A2) encodes the alpha-2 chain of type IX collagen and is located on horse chromosome 2p16-->p14 harbouring a quantitative trait locus for osteochondrosis. We isolated a bacterial artificial chromosome (BAC) clone containing the equine COL9A2 gene and determined the complete genomic sequence of this gene. Cloning and characterization of equine COL9A2 revealed that the equine gene consists of 32 exons spanning approximately 15 kb. The COL9A2 transcript encodes a single protein of 688 amino acids. Thirty two single nucleotide polymorphisms (SNPs) equally distributed in the gene were detected in a mutation scan of eight unrelated Hanoverian warmblood stallions, including one SNP that affects the amino acid sequence of COL9A2. Comparative analyses between horse, human, mouse and rat indicate that the chromosomal location of equine COL9A2 is in agreement with known chromosomal synteny relationships. The comparison of the gene structure and transcript revealed a high degree of conservation towards the other mammalian COL9A2 genes. We chose three informative SNPs for association and linkage disequilibrium tests in three to five paternal half-sib families of Hanoverian warmblood horses consisting of 44 to 75 genotyped animals. The test statistics did not reach the significance threshold of 5% and so we could not show an association of COL9A2 with equine osteochondrosis.
Resumo:
BACKGROUND: The neuronal ceroid lipofuscinoses (NCL) are a heterogenous group of inherited progressive neurodegenerative diseases in different mammalian species. Tibetan Terrier and Polish Owczarek Nizinny (PON) dogs show rare late-onset NCL variants with autosomal recessive inheritance, which can not be explained by mutations of known human NCL genes. These dog breeds represent animal models for human late-onset NCL. In mice the chloride channel 3 gene (Clcn3) encoding an intracellular chloride channel was described to cause a phenotype similar to NCL. RESULTS: Two full-length cDNA splice variants of the canine CLCN3 gene are reported. The current canine whole genome sequence assembly was used for gene structure analyses and revealed 13 coding CLCN3 exons in 52 kb of genomic sequence. Sequence analysis of the coding exons and flanking intron regions of CLCN3 using six NCL-affected Tibetan terrier dogs and an NCL-affected Polish Owczarek Nizinny (PON) dog, as well as eight healthy Tibetan terrier dogs revealed 13 SNPs. No consistent CLCN3 haplotype was associated with NCL. CONCLUSION: For the examined animals we excluded the complete coding region and adjacent intronic regions of canine CLCN3 to harbor disease-causing mutations. Therefore it seems to be unlikely that a mutation in this gene is responsible for the late-onset NCL phenotype in these two dog breeds.
Resumo:
A multilocus sequence typing (MLST) scheme was established and evaluated for Mycoplasma hyopneumoniae, the etiologic agent of enzootic pneumonia in swine with the aim of defining strains. Putative target genes were selected by genome sequence comparisons. Out of 12 housekeeping genes chosen and experimentally validated, the 7 genes efp, metG, pgiB, recA, adk, rpoB, and tpiA were finally used to establish the MLST scheme. Their usefulness was assessed individually and in combination using a set of well-defined field samples and strains of M. hyopneumoniae. A reduction to the three targets showing highest variation (adk, rpoB, and tpiA) was possible resulting in the same number of sequence types as using the seven targets. The established MLST approach was compared with the recently described typing method using the serine-rich repeat motif-encoding region of the p146 gene. There was coherence between the two methods, but MLST resulted in a slightly higher resolution. Farms recognized to be affected by enzootic pneumonia were always associated with a single M. hyopneumoniae clone, which in most cases differed from farm to farm. However, farms in close geographic or operational contact showed identical clones as defined by MLST typing. Population analysis showed that recombination in M. hyopneumoniae occurs and that strains are very diverse with only limited clonality observed. Elaborate classical MLST schemes using multiple targets for M. hyopneumoniae might therefore be of limited value. In contrast, MLST typing of M. hyopneumoniae using the three genes adk, rpoB, and tpiA seems to be sufficient for epidemiological investigations by direct amplification of target genes from lysate of clinical material without prior cultivation.
Resumo:
A comprehensive second-generation whole genome radiation hybrid (RH II), cytogenetic and comparative map of the horse genome (2n = 64) has been developed using the 5000rad horse x hamster radiation hybrid panel and fluorescence in situ hybridization (FISH). The map contains 4,103 markers (3,816 RH; 1,144 FISH) assigned to all 31 pairs of autosomes and the X chromosome. The RH maps of individual chromosomes are anchored and oriented using 857 cytogenetic markers. The overall resolution of the map is one marker per 775 kilobase pairs (kb), which represents a more than five-fold improvement over the first-generation map. The RH II incorporates 920 markers shared jointly with the two recently reported meiotic maps. Consequently the two maps were aligned with the RH II maps of individual autosomes and the X chromosome. Additionally, a comparative map of the horse genome was generated by connecting 1,904 loci on the horse map with genome sequences available for eight diverse vertebrates to highlight regions of evolutionarily conserved syntenies, linkages, and chromosomal breakpoints. The integrated map thus obtained presents the most comprehensive information on the physical and comparative organization of the equine genome and will assist future assemblies of whole genome BAC fingerprint maps and the genome sequence. It will also serve as a tool to identify genes governing health, disease and performance traits in horses and assist us in understanding the evolution of the equine genome in relation to other species.
Resumo:
The mammalian glycinamide ribonucleotide formyltransferase (GART) genes encode a trifunctional polypeptide involved in the de novo purine biosynthesis. We isolated a bacterial artificial chromosome (BAC) clone containing the bovine GART gene and determined the complete DNA sequence of the BAC clone. Cloning and characterization of the bovine GART gene revealed that the bovine gene consists of 23 exons spanning approximately 27 kb. RT-PCR amplification of bovine GART in different organs showed the expression of two GART transcripts in cattle similar to human and mouse. The GART transcripts encode two proteins of 1010 and 433 amino acids, respectively. Eleven single nucleotide polymorphisms (SNPs) were detected in a mutation scan of 24 unrelated animals of three different cattle breeds, including one SNP that affects the amino acid sequence of GART. The chromosomal localization of the gene was determined by fluorescence in situ hybridization. Comparative genome analysis between cattle, human and mouse indicates that the chromosomal location of the bovine GART gene is in agreement with a previously published mapping report.
Resumo:
BACKGROUND A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture.
Resumo:
Rhizobium leguminosarum (Rl) es una alfa-proteobacteria capaz de establecer una simbiosis diazotrófica con distintas leguminosas. A pesar de la importancia de esta simbiosis en el balance global del ciclo del nitrógeno, muy pocos genomas de rhizobios han sido secuenciados, que aporten nuevos conocimientos relacionados con las características genéticas que contribuyen a importantes procesos simbióticos. Únicamente tres secuencias completas de Rl han sido publicadas: Rl bv. viciae 3841 y dos genomas de Rl bv. trifolii (WSM1325 y WSM2304), ambos simbiontes de trébol. La secuencia genómica de Rlv UPM791 se ha determinado por medio de secuenciación 454. Este genoma tiene un tamaño aproximado de 7.8 Mb, organizado en un cromosoma y 5 replicones extracromosómicos, que incluyen un plásmido simbiótico de 405 kb. Este nuevo genoma se ha analizado en relación a las funciones simbióticas y adaptativas en comparación con los genomas completos de Rlv 3841 y Rl bv. trifolii WSM1325 y WSM2304. Mientras que los plásmidos pUPM791a y b se encuentran conservados, el plásmido simbiótico pUPM791c exhibe un grado de conservación muy bajo comparado con aquellos descritos en las otras cepas de Rl. Uno de los factores implicados en el establecimiento de la simbiosis es el sistema de comunicación intercelular conocido como Quorum Sensing (QS). El análisis del genoma de Rlv UPM791 ha permitido la identificación de dos sistemas tipo LuxRI mediados por señales de tipo N-acyl-homoserina lactonas (AHLs). El análisis mediante HPLC-MS ha permitido asociar las señales C6-HSL, C7-HSL y C8-HSL al sistema rhiRI, codificado en el plásmido simbiótico; mientras que el sistema cinRI, localizado en el cromosoma, produce 3OH-C14:1-HSL. Se ha identificado una tercera sintasa (TraI) codificada en el plásmido simbiótico, pero su regulador correspondiente se encuentra truncado debido a un salto de fase. Adicionalmente, se han encontrado tres reguladores de tipo LuxR-orphan que no presentan una sintasa LuxI asociada. El efecto potencial de las señales tipo AHL se ha estudiado mediante una estrategia de quorum quenching, la cual interfiere con los sistemas de QS de la bacteria. Esta estrategia está basada en la introducción del gen aiiA de Bacillus subtilis, que expresa constitutivamente una enzima lactonasa degradadora de AHLs. Para llevar a cabo el análisis en condiciones simbióticas, se ha desarrollado un sistema de doble marcaje que permite la identificación basado en los marcadores gusA y celB, que codifican para una enzima β–glucuronidasa y una β–galactosidasa termoestable, respectivamente. Los resultados obtenidos indican que Rlv UPM791 predomina sobre la cepa Rlv 3841 para la formación de nódulos en plantas de guisante. La baja estabilidad del plásmido que codifica para aiiA, no ha permitido obtener una conclusión definitiva sobre el efecto de la lactonasa AiiA en competitividad. Con el fin de analizar el significado y la regulación de la producción de moléculas señal tipo AHL, se han generado mutantes defectivos en cada uno de los dos sistemas de QS. Se ha llevado a cabo un análisis detallado sobre la producción de AHLs, formación de biofilm y simbiosis con plantas de guisante, veza y lenteja. El efecto de las deleciones de los genes rhiI y rhiR en Rlv UPM791 es más drástico en ausencia del plásmido pUPM791d. Mutaciones en cinI o cinRIS muestran tanto ausencia de señales, como producción exclusivamente de las de bajo peso molecular, respectivamente, producidas por el sistema rhiRI. Estas mutaciones mostraron un efecto importante en simbiosis. El sistema rhiRI se necesita para un comportamiento simbiótico normal. Además, mutantes cinRIS generaron nódulos blancos e ineficientes, mientras que el mutante cinI fue incapaz de producir nódulos en ninguna de las leguminosas utilizadas. Dicha mutación resultó en la inestabilización del plasmido simbiótico por un mecanismo dependiente de cinI que no ha sido aclarado. En general, los resultados obtenidos indican la existencia de un modelo de regulación dependiente de QS significativamente distinto a los que se han descrito previamente en otras cepas de R. leguminosarum, en las cuales no se había observado ningún fenotipo relevante en simbiosis. La regulación de la producción de AHLs Rlv UPM791 es un proceso complejo que implica genes situados en los plásmidos UPM791c y UPM791d, además de la señal 3-OH-C14:1-HSL. Finalmente, se ha identificado un transportador de tipo RND, homologo a mexAB-oprM de P. aeruginosa e implicado en la extrusión de AHLs de cadena larga. La mutación he dicho transportador no tuvo efectos apreciables sobre la simbiosis. ABSTRACT Rhizobium leguminosarum (Rl) is a soil alpha-proteobacterium that establishes a diazotrophic symbiosis with different legumes. Despite the importance of this symbiosis to the global nitrogen cycling balance, very few rhizobial genomes have been sequenced so far which provide new insights into the genetic features contributing to symbiotically relevant processes. Only three complete sequences of Rl strains have been published: Rl bv. viciae 3841, harboring six plasmids (7.75 Mb) and two Rl bv. trifolii (WSM1325 and WSM2304), both clover symbionts, harboring 5 and 4 plasmids, respectively (7.41 and 6.87 Mb). The genomic sequence of Rlv UPM791 was undertaken by means of 454 sequencing. Illumina and Sanger reads were used to improve the assembly, leading to 17 final contigs. This genome has an estimated size of 7.8 Mb organized in one chromosome and five extrachromosomal replicons, including a 405 kb symbiotic plasmid. Four of these plasmids are already closed, whereas there are still gaps in the smallest one (pUPM791d) due to the presence of insertion elements and repeated sequences, which difficult the assembly. The annotation has been carried out thanks to the Manatee pipeline. This new genome has been analyzed as regarding symbiotic and adaptive functions in comparison to the Rlv 3841 complete genome, and to those from Rl bv. trifolii strains WSM1325 and WSM2304. While plasmids pUPM791a and b are conserved, the symbiotic plasmid pUPM791c exhibited the lowest degree of conservation as compared to those from the other Rl strains. One of the factors involved in the symbiotic process is the intercellular communication system known as Quorum Sensing (QS). This mechanism allows bacteria to carry out diverse biological processes in a coordinate way through the production and detection of extracellular signals that regulate the transcription of different target genes. Analysis of the Rlv UPM791 genome allowed the identification of two LuxRI-like systems mediated by N-acyl-homoserine lactones (AHLs). HPLC-MS analysis allowed the adscription of C6-HSL, C7-HSL and C8-HSL signals to the rhiRI system, encoded in the symbiotic plasmid, whereas the cinRI system, located in the chromosome, produces 3OH-C14:1-HSL, previously described as “bacteriocin small”. A third synthase (TraI) is encoded also in the symbiotic plasmid, but its cognate regulator TraR is not functional due to a fameshift mutation. Three additional LuxR orphans were also found which no associated LuxI-type synthase. The potential effect of AHLs has been studied by means of a quorum quenching approach to interfere with the QS systems of the bacteria. This approach is based upon the introduction into the strains Rl UPM791 and Rl 3841 of the Bacillus subtilis gene aiiA expressing constitutively an AHL-degrading lactonase enzyme which led to virtual absence of AHL even when AiiA-expressing cells were a fraction of the total population. No significant effect of AiiA-mediated AHL removal on competitiveness for growth in solid surface was observed. For analysis under symbiotic conditions we have set up a two-label system to identify nodules produced by two different strains in pea roots, based on the markers gusA and celB, encoding a β–glucuronidase and a thermostable β–galactosidase enzymes, respectively. The results obtained show that Rlv UPM791 outcompetes Rlv 3841 for nodule formation in pea plants, and that the presence of the AiiA plasmid does not significantly affect the relative competitiveness of the two Rlv strains. However, the low stability of the pME6863 plasmid, encoding aiiA, did not lead to a clear conclusion about the AiiA lactonase effect on competitiveness. In order to further analyze the significance and regulation of the production of AHL signal molecules, mutants deficient in each of the two QS systems were constructed. A detailed analysis of the effect of these mutations on AHL production, biofilm formation and symbiosis with pea, vetch and lentil plants has been carried out. The effect of deletions on Rlv UPM791 rhiI and rhiR genes is more pronounced in the absence of plasmid pUPM791d, as no signal is detected in UPM791.1, lacking this plasmid. Mutations in cinI or cinRIS show either no signals, or only the small ones produced by the rhiRI system, suggesting that cinR might be regulating the rhiRI system. These mutations had a strong effect on symbiosis. Analysis of rhi mutants revealed that rhiRI system is required for normal symbiotic performance, as a drastic reduction of symbiotic fitness is observed when rhiI is deleted, and rhiR is essential for nitrogen fixation in the absence of plasmid pUPM791d. Furthermore, cinRIS mutants resulted in white and inefficient nodules, whereas cinI mutant was unable to form nodules on any legume tested. The latter mutation is associated to the instabilization of the symbiotic plasmid through a mechanism still uncovered. Overall, the results obtained indicate the existence of a model of QS-dependent regulation significantly different to that previously described in other R. leguminosarum strains, where no relevant symbiotic phenotype had been observed. The regulation of AHL production in Rlv UPM791 is a complex process involving the symbiotic plasmid (pUPM791c) and the smallest plasmid (pUPM791d), with a key role for the 3-OH-C14:1-HSL signal. Finally, we made a search for potential AHL transporters in Rlv UPM791 genome. These signals diffuse freely across membranes, but in the case of the long-chain AHLs an active efflux system might be required, as it has been described for C12-HSL in the case of Pseudomonas aeruginosa. We have identified a putative AHL transporter of the RND family homologous to P. aeruginosa mexAB-oprM. A mutant strain deficient in this transporter has been generated, and TLC analysis shows absence of 3OH-C14:1-HSL in its supernatant. This deficiency was complemented by the reintroduction of an intact copy of the genes via plasmid transfer. The mutation in mexAB genes had no significant effects on the symbiotic performance of R. leguminosarum bv. viciae.
Resumo:
A menudo los científicos secuencian el ADN de un gran número de personas con el objetivo de determinar qué genes se asocian con determinadas enfermedades. Esto permite meóon del genoma humano. El precio de un perfil genómico completo se ha posicionado por debajo de los 200 dólares y este servicio lo ofrecen muchas compañías, la mayor parte localizadas en EEUU. Como consecuencia, en unos pocos a~nos la mayoría de las personas procedentes de los países desarrollados tendrán los medios para tener su ADN secuenciado. Alrededor del 0.5% del ADN de cada persona (que corresponde a varios millones de nucleótidos) es diferente del genoma de referencia debido a variaciones genéticas. Así que el genoma contiene información altamente sensible y personal y representa la identidad biológica óon sobre el entorno o estilo de vida de uno (a menudo facilmente obtenible de las redes sociales), sería posible inferir el fenotipo del individuo. Multiples GWAS (Genome Wide Association Studies) realizados en los últimos a~nos muestran que la susceptibilidad de un paciente a tener una enfermedad en particular, como el Alzheimer, cáncer o esquizofrenia, puede ser predicha parcialmente a partir de conjuntos de sus SNP (Single Nucleotide Polimorphism). Estos resultados pueden ser usados para medicina genómica personalizada (facilitando los tratamientos preventivos y diagnósticos), tests de paternidad genéticos y tests de compatibilidad genética para averiguar a qué enfermedades pueden ser susceptibles los descendientes. Estos son algunos de los beneficios que podemos obtener usando la información genética, pero si esta información no es protegida puede ser usada para investigaciones criminales y por compañías aseguradoras. Este hecho podría llevar a discriminaci ón genética. Por lo que podemos concluir que la privacidad genómica es fundamental por el hecho de que contiene información sobre nuestra herencia étnica, nuestra predisposición a múltiples condiciones físicas y mentales, al igual que otras características fenotópicas, ancestros, hermanos y progenitores, pues los genomas de cualquier par de individuos relacionados son idénticos al 99.9%, contrastando con el 99.5% de dos personas aleatorias. La legislación actual no proporciona suficiente información técnica sobre como almacenar y procesar de forma segura los genomas digitalizados, por lo tanto, es necesaria una legislación mas restrictiva ---ABSTRACT---Scientists typically sequence DNA from large numbers of people in order to determine genes associated with particular diseases. This allows to improve the modern healthcare and to provide a better understanding of the human genome. The price of a complete genome profile has plummeted below $200 and this service is ofered by a number of companies, most of them located in the USA. Therefore, in a few years, most individuals in developed countries will have the means of having their genomes sequenced. Around 0.5% of each person's DNA (which corresponds to several millions of nucleotides) is diferent from the reference genome, owing to genetic variations. Thus, the genome contains highly personal and sensitive information, and it represents our ultimate biological identity. By combining genomic data with information about one's environment or lifestyle (often easily obtainable from social networks), could make it possible to infer the individual's phenotype. Multiple Genome Wide Association Studies (GWAS) performed in recent years have shown that a patient's susceptibility to particular diseases, such as Alzheimer's, cancer, or schizophrenia, can be partially predicted from sets of his SNPs. This results can be used for personalized genomic medicine (facilitating preventive treatment and diagnosis), genetic paternity tests, ancestry and genealogical testing, and genetic compatibility tests in order to have knowledge about which deseases would the descendant be susceptible to. These are some of the betefts we can obtain using genoma information, but if this information is not protected it can be used for criminal investigations and insurance purposes. Such issues could lead to genetic discrimination. So we can conclude that genomic privacy is fundamental due to the fact that genome contains information about our ethnic heritage, predisposition to numerous physical and mental health conditions, as well as other phenotypic traits, and ancestors, siblings, and progeny, since genomes of any two closely related individuals are 99.9% identical, in contrast with 99.5%, for two random people. The current legislation does not ofer suficient technical information about safe and secure ways of storing and processing digitized genomes, therefore, there is need for more restrictive legislation.
Resumo:
The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50°C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83–92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.
Resumo:
Whole-genome duplication approximately 108 years ago was proposed as an explanation for the many duplicated chromosomal regions in Saccharomyces cerevisiae. Here we have used computer simulations and analytic methods to estimate some parameters describing the evolution of the yeast genome after this duplication event. Computer simulation of a model in which 8% of the original genes were retained in duplicate after genome duplication, and 70–100 reciprocal translocations occurred between chromosomes, produced arrangements of duplicated chromosomal regions very similar to the map of real duplications in yeast. An analytical method produced an independent estimate of 84 map disruptions. These results imply that many smaller duplicated chromosomal regions exist in the yeast genome in addition to the 55 originally reported. We also examined the possibility of determining the original order of chromosomal blocks in the ancestral unduplicated genome, but this cannot be done without information from one or more additional species. If the genome sequence of one other species (such as Kluyveromyces lactis) were known it should be possible to identify 150–200 paired regions covering the whole yeast genome and to reconstruct approximately two-thirds of the original order of blocks of genes in yeast. Rates of interchromosome translocation in yeast and mammals appear similar despite their very different rates of homologous recombination per kilobase.
Resumo:
Caenorhabditis elegans should soon be the first multicellular organism whose complete genomic sequence has been determined. This achievement provides a unique opportunity for a comprehensive assessment of the signal transduction molecules required for the existence of a multicellular animal. Although the worm C. elegans may not much resemble humans, the molecules that regulate signal transduction in these two organisms prove to be quite similar. We focus here on the content and diversity of protein kinases present in worms, together with an assessment of other classes of proteins that regulate protein phosphorylation. By systematic analysis of the 19,099 predicted C. elegans proteins, and thorough analysis of the finished and unfinished genomic sequences, we have identified 411 full length protein kinases and 21 partial kinase fragments. We also describe 82 additional proteins that are predicted to be structurally similar to conventional protein kinases even though they share minimal primary sequence identity. Finally, the richness of phosphorylation-dependent signaling pathways in worms is further supported with the identification of 185 protein phosphatases and 128 phosphoprotein-binding domains (SH2, PTB, STYX, SBF, 14-3-3, FHA, and WW) in the worm genome.
Resumo:
A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.