916 resultados para Genome-specific Sequence
Resumo:
Computational protein design (CPD) is a burgeoning field that uses a physical-chemical or knowledge-based scoring function to create protein variants with new or improved properties. This exciting approach has recently been used to generate proteins with entirely new functions, ones that are not observed in naturally occurring proteins. For example, several enzymes were designed to catalyze reactions that are not in the repertoire of any known natural enzyme. In these designs, novel catalytic activity was built de novo (from scratch) into a previously inert protein scaffold. In addition to de novo enzyme design, the computational design of protein-protein interactions can also be used to create novel functionality, such as neutralization of influenza. Our goal here was to design a protein that can self-assemble with DNA into nanowires. We used computational tools to homodimerize a transcription factor that binds a specific sequence of double-stranded DNA. We arranged the protein-protein and protein-DNA binding sites so that the self-assembly could occur in a linear fashion to generate nanowires. Upon mixing our designed protein homodimer with the double-stranded DNA, the molecules immediately self-assembled into nanowires. This nanowire topology was confirmed using atomic force microscopy. Co-crystal structure showed that the nanowire is assembled via the desired interactions. To the best of our knowledge, this is the first example of a protein-DNA self-assembly that does not rely on covalent interactions. We anticipate that this new material will stimulate further interest in the development of advanced biomaterials.
Resumo:
BACKGROUND: Mammalian genomes commonly harbor endogenous viral elements. Due to a lack of comparable genome-scale sequence data, far less is known about endogenous viral elements in avian species, even though their small genomes may enable important insights into the patterns and processes of endogenous viral element evolution. RESULTS: Through a systematic screening of the genomes of 48 species sampled across the avian phylogeny we reveal that birds harbor a limited number of endogenous viral elements compared to mammals, with only five viral families observed: Retroviridae, Hepadnaviridae, Bornaviridae, Circoviridae, and Parvoviridae. All nonretroviral endogenous viral elements are present at low copy numbers and in few species, with only endogenous hepadnaviruses widely distributed, although these have been purged in some cases. We also provide the first evidence for endogenous bornaviruses and circoviruses in avian genomes, although at very low copy numbers. A comparative analysis of vertebrate genomes revealed a simple linear relationship between endogenous viral element abundance and host genome size, such that the occurrence of endogenous viral elements in bird genomes is 6- to 13-fold less frequent than in mammals. CONCLUSIONS: These results reveal that avian genomes harbor relatively small numbers of endogenous viruses, particularly those derived from RNA viruses, and hence are either less susceptible to viral invasions or purge them more effectively.
Resumo:
To provide context for the diversification of archosaurs--the group that includes crocodilians, dinosaurs, and birds--we generated draft genomes of three crocodilians: Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the comparatively rapid evolution is derived in birds. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs, thereby providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs.
Resumo:
An 8-year-old girl with some features of Turner syndrome and karyotype 45X/46XY had developed a bilateral gonadoblastoma in her rudimentary ovaries. Her normal Y chromosome showed the characteristic distal fluorescence, as seen in her father's. Another mosaic, this time 45X/46XidicY, and also with some Turner features had rudimentary ovaries, but no gonadoblastoma had developed at age 14. The nature of her idicY, which showed no fluorescent distal Yq and had one of the centromeres inactivated, was confirmed by in situ hybridisation with a Yp-specific probe. Using primers from a human Yp-specific sequence, we amplified DNA extracted from paraffin-embedded ovarian tissue from both cases, and from a normal testicle and a normal ovary as controls. The finding of the expected Y-derived PCR product in the rudimentary gonads from these mosaic patients indicates the presence of their Y chromosome in both. We discuss the validity of the findings, and the possible role of sequences in or near the fluorescent part of Yq in the origin of gonadoblastoma in Y-bearing mosaic Turner syndrome.
Resumo:
Le virus de l’immunodéficience humaine de type 1 (VIH-1) est responsable du syndrome de l’immunodéficience acquise (SIDA). Il faut identifier de nouvelles cibles pour le développement d’agents anti-VIH-1, car ce virus développe une résistance aux agents présentement utilisés. Notre but est d’approfondir la caractérisation de l’étape du changement de cadre de lecture ribosomique en -1 (déphasage -1) nécessaire à la production du précurseur des enzymes du VIH-1. Ce déphasage est programmé et effectué par une minorité de ribosomes lorsqu’ils traduisent la séquence dite glissante à un endroit spécifique de l’ARN messager (ARNm) pleine-longueur du VIH-1. L’efficacité de déphasage est contrôlée par le signal stimulateur de déphasage (SSF), une tige-boucle irrégulière située en aval de la séquence glissante. La structure du SSF est déroulée lors du passage d’un ribosome, mais elle peut se reformer ensuite. Nous avons montré que des variations de l’initiation de la traduction affectent l’efficacité de déphasage. Nous avons utilisé, dans des cellules Jurkat-T et HEK 293T, un rapporteur bicistronique où les gènes codant pour les luciférases de la Renilla (Rluc) et de la luciole (Fluc) sont séparés par la région de déphasage du VIH-1. La Rluc est produite par tous les ribosomes traduisant l’ARNm rapporteur alors que la Fluc est produite uniquement par les ribosomes effectuant un déphasage. L’initiation de ce rapporteur est coiffe-dépendante, comme pour la majorité des ARNm cellulaires. Nous avons examiné l’effet de trois inhibiteurs de l’initiation et montré que leur présence augmente l’efficacité de déphasage. Nous avons ensuite étudié l’effet de la tige-boucle TAR, qui est présente à l’extrémité 5’ de tous les ARNm du VIH-1. TAR empêche la liaison de la petite sous-unité du ribosome (40S) à l’ARNm et module aussi l’activité de la protéine kinase dépendante de l’ARN double-brin (PKR). L’activation de PKR inhibe l’initiation en phosphorylant le facteur d’initiation eucaryote 2 (eIF2) alors que l’inhibition de PKR a l’effet inverse. Nous avons étudié l’effet de TAR sur la traduction et le déphasage via son effet sur PKR en utilisant TAR en trans ou en cis, mais à une certaine distance de l’extrémité 5’ afin d’éviter l’interférence avec la liaison de la 40S. Nous avons observé qu’une faible concentration de TAR, qui active PKR, augmente l’efficacité de déphasage alors qu’une concentration élevée de TAR, qui inhibe PKR, diminue cette efficacité. Nous avons proposé un modèle où des variations de l’initiation affectent l’efficacité de déphasage en modifiant la distance entre les ribosomes parcourant l’ARNm et, donc, la probabilité qu’ils rencontrent un SSF structuré. Par la suite, nous avons déterminé l’effet de la région 5’ non traduite (UTR) de l’ARNm pleine-longueur du VIH-1 sur l’efficacité de déphasage. Cette 5’UTR contient plusieurs régions structurées, dont TAR à l’extrémité 5’, qui peut interférer avec l’initiation. Cet ARNm a une coiffe permettant une initiation coiffe-dépendante ainsi qu’un site d’entrée interne des ribosomes (IRES), permettant une initiation IRES-dépendante. Nous avons introduit cette 5’UTR, complète ou en partie, comme 5’UTR de notre ARNm rapporteur bicistronique. Nos résultats démontrent que cette 5’UTR complète inhibe l’initiation coiffe dépendante et augmente l’efficacité de déphasage et que ces effets sont dus à la présence de TAR suivie de la tige-boucle Poly(A). Nous avons aussi construit un rapporteur tricistronique où les ribosomes exprimant les luciférases utilisent obligatoirement l’IRES. Nous avons observé que cette initiation par l’IRES est faible et que l’efficacité de déphasage correspondante est également faible. Nous avons formulé une hypothèse pour expliquer cette situation. Nous avons également observé que lorsque les deux modes d’initiation sont disponibles, l’initiation coiffe dépendante est prédominante. Finalement, nous avons étudié l’effet de la protéine virale Tat sur l’initiation de la traduction et sur l’efficacité de déphasage. Nous avons montré qu’elle augmente l’initiation de la traduction et que son effet est plus prononcé lorsque TAR est située à l’extrémité 5’ des ARNm. Nous proposons un modèle expliquant les effets de Tat sur l’initiation de la traduction par l’inhibition de PKR ainsi que par des changements de l’expression de protéines cellulaires déroulant TAR. Ces résultats permettent de mieux comprendre les mécanismes régissant le déphasage du VIH-1, ce qui est essentiel pour le développement d’agents anti-déphasage.
Resumo:
Fragaria vesca is a short-lived perennial with a seasonal-flowering habit. Seasonality of flowering is widespread in the Rosaceae and is also found in the majority of temperate polycarpic perennials. Genetic analysis has shown that seasonal flowering is controlled by a single gene in F. vesca, the SEASONAL FLOWERING LOCUS (SFL). Here, we report progress towards the marker-assisted selection and positional cloning of SFL, in which three ISSR markers linked to SFL were converted to locus-specific sequence-characterized amplified region (SCAR1–SCAR3) markers to allow large-scale screening of mapping progenies. We believe this is the first study describing the development of SCAR markers from ISSR profiles. The work also provides useful insight into the nature of polymorphisms generated by the ISSR marker system. Our results indicate that the ISSR polymorphisms originally detected were probably caused by point mutations in the positions targeted by primer anchors (causing differential PCR failure), by indels within the amplicon (leading to variation in amplicon size) and by internal sequence differences (leading to variation in DNA folding and so in band mobility). The cause of the original ISSR polymorphism was important in the selection of appropriate strategies for SCAR-marker development. The SCAR markers produced were mapped using a F. vesca f. vesca × F. vesca f. semperflorens testcross population. Marker SCAR2 was inseparable from the SFL, whereas SCAR1 mapped 3.0 cM to the north of the gene and SCAR3 1.7 cM to its south.
Resumo:
Objective: To identify genes specifically expressed in mammalian oocytes using an in silico subtraction, and to characterize the mRNA patterns of selected genes in oocytes, embryos, and adult tissues. Design: Comparison between oocyte groups and between early embryo stages. Setting: Laboratories of embryo manipulation and molecular biology from Departamento de Genetica (FMRP) and Departamento de Ciencias Basicas (FZEA) - University of Sao Paulo. Sample(s): Oocytes were collected from slaughtered cows for measurements, in vitro fertilization, and in vitro embryo culture. Somatic tissue, excluding gonad and uterus tissue, was collected from male and female cattle. Main Outcome Measure(s): Messenger RNA levels of poly(A)-binding protein nuclear-like 1 (Pabpnl1) and methyl-CpG-binding domain protein 3-like 2 (Mbd3l2). Result(s): Pabpnl1 mRNA was found to be expressed in oocytes, and Mbd3l2 transcripts were present in embryos. Quantification of Pabpnl1 transcripts showed no difference in levels between good-and bad-quality oocytes before in vitro maturation (IVM) or between good-quality oocytes before and after IVM. However, Pabpnl1 transcripts were not detected in bad-quality oocytes after IVM. Transcripts of the Mbd3l2 gene were found in 4-cell, 8-cell, and morula-stage embryos, with the highest level observed in 8-cell embryos. Conclusion(s): Pabpnl1 gene expression is restricted to oocytes and Mbd3l2 to embryos. Different Pabpnl1 mRNA levels in oocytes of varying viability suggest an important role in fertility involving the oocyte potential for embryo development. (Fertil Steril (R) 2010; 93: 2507-12. (C) 2010 by American Society for Reproductive Medicine.)
Resumo:
A conceptual problem that appears in different contexts of clustering analysis is that of measuring the degree of compatibility between two sequences of numbers. This problem is usually addressed by means of numerical indexes referred to as sequence correlation indexes. This paper elaborates on why some specific sequence correlation indexes may not be good choices depending on the application scenario in hand. A variant of the Product-Moment correlation coefficient and a weighted formulation for the Goodman-Kruskal and Kendall`s indexes are derived that may be more appropriate for some particular application scenarios. The proposed and existing indexes are analyzed from different perspectives, such as their sensitivity to the ranks and magnitudes of the sequences under evaluation, among other relevant aspects of the problem. The results help suggesting scenarios within the context of clustering analysis that are possibly more appropriate for the application of each index. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Epizootics of Eimeria funduli involved estuarine killifishes (Fundulus grandis, F. pulvereus, F. similis, and F. heteroclitus) in Mississippi, Alabama, and Virginia. All of more than 500 specimens examined of F. grandis from Mississippi during 1977 through 1979 had infections, regardless of age, sex, or season collected. Oocysts occurred primarily in the liver and pancreas, replacing up to 85% of both those organs. Infrequent sites of infection were fatty tissue of the body cavity, ovary, intestine, and caudal peduncle. Living fish did not discharge oocysts. Eimeria funduli is the first known eimerian to require a second host. To complete the life cycle, an infective stage in the grass shrimp Palaemonetes pugio had to be eaten. In 6-mo-old killifish reared in the laboratory at 24 C, young schizonts were first observed in hepatic and pancreatic cells 5 days post feeding, followed by first generation merozoites by day 10, differentiation of sexual stages during days 15 to 20, fertilization between days 19 and 26, sporoblasts from days 25 to 30, and sporozoites about day 60. Unique sporopodia developed on sporocysts by day 35 when still unsporulated. Temperatures of 7 to 10 C irreversibly halted schizogony. Both schizogony and sporogony progressed slower as age of host increased. When infective shrimp in doses ranging from 1 to 10% of a fish's body weight were eaten, the level of intensity of resulting infections did not differ significantly. Pathogenesis followed a specific sequence, with the host response apparently unable to contend with extensive infections as seen typically in nature and in our experiments. Premunition was indicated. When administered Monensin® orally, infected fish exhibited a reduction in oocysts by 50 to 70% within 20 days as compared with untreated fish. Furthermore, infected killifish maintained exclusively on a diet of TetraMin® for 3 mo completely lost their infections.
Resumo:
Background: Sugarcane is an important crop worldwide for sugar production and increasingly, as a renewable energy source. Modern cultivars have polyploid, large complex genomes, with highly unequal contributions from ancestral genomes. Long Terminal Repeat retrotransposons (LTR-RTs) are the single largest components of most plant genomes and can substantially impact the genome in many ways. It is therefore crucial to understand their contribution to the genome and transcriptome, however a detailed study of LTR-RTs in sugarcane has not been previously carried out. Results: Sixty complete LTR-RT elements were classified into 35 families within four Copia and three Gypsy lineages. Structurally, within lineages elements were similar, between lineages there were large size differences. FISH analysis resulted in the expected pattern of Gypsy/heterochromatin, Copia/euchromatin, but in two lineages there was localized clustering on some chromosomes. Analysis of related ESTs and RT-PCR showed transcriptional variation between tissues and families. Four distinct patterns were observed in sRNA mapping, the most unusual of which was that of Ale1, with very large numbers of 24nt sRNAs in the coding region. The results presented support the conclusion that distinct small RNA-regulated pathways in sugarcane target the lineages of LTR-RT elements. Conclusions: Individual LTR-RT sugarcane families have distinct structures, and transcriptional and regulatory signatures. Our results indicate that in sugarcane individual LTR-RT families have distinct behaviors and can potentially impact the genome in diverse ways. For instance, these transposable elements may affect nearby genes by generating a diverse set of small RNA's that trigger gene silencing mechanisms. There is also some evidence that ancestral genomes contribute significantly different element numbers from particular LTR-RT lineages to the modern sugarcane cultivar genome.
Resumo:
Self-incompatibility (SI) systems have evolved in many flowering plants to prevent self-fertilization and thus promote outbreeding. Pear and apple, as many of the species belonging to the Rosaceae, exhibit RNase-mediated gametophytic self-incompatibility, a widespread system carried also by the Solanaceae and Plantaginaceae. Pear orchards must for this reason contain at least two different cultivars that pollenize each other; to guarantee an efficient cross-pollination, they should have overlapping flowering periods and must be genetically compatible. This compatibility is determined by the S-locus, containing at least two genes encoding for a female (pistil) and a male (pollen) determinant. The female determinant in the Rosaceae, Solanaceae and Plantaginaceae system is a stylar glycoprotein with ribonuclease activity (S-RNase), that acts as a specific cytotoxin in incompatible pollen tubes degrading cellular RNAs. Since its identification, the S-RNase gene has been intensively studied and the sequences of a large number of alleles are available in online databases. On the contrary, the male determinant has been only recently identified as a pollen-expressed protein containing a F-box motif, called S-Locus F-box (abbreviated SLF or SFB). Since F-box proteins are best known for their participation to the SCF (Skp1 - Cullin - F-box) E3 ubiquitine ligase enzymatic complex, that is involved in protein degradation through the 26S proteasome pathway, the male determinant is supposed to act mediating the ubiquitination of the S-RNases, targeting them for the degradation in compatible pollen tubes. Attempts to clone SLF/SFB genes in the Pyrinae produced no results until very recently; in apple, the use of genomic libraries allowed the detection of two F-box genes linked to each S haplotype, called SFBB (S-locus F-Box Brothers). In Japanese pear, three SFBB genes linked to each haplotype were cloned from pollen cDNA. The SFBB genes exhibit S haplotype-specific sequence divergence and pollen-specific expression; their multiplicity is a feature whose interpretation is unclear: it has been hypothesized that all of them participate in the S-specific interaction with the RNase, but it is also possible that only one of them is involved in this function. Moreover, even if the S locus male and female determinants are the only responsible for the specificity of the pollen-pistil recognition, many other factors are supposed to play a role in GSI; these are not linked to the S locus and act in a S-haplotype independent manner. They can have a function in regulating the expression of S determinants (group 1 factors), modulating their activity (group 2) or acting downstream, in the accomplishment of the reaction of acceptance or rejection of the pollen tube (group 3). This study was aimed to the elucidation of the molecular mechanism of GSI in European pear (Pyrus communis) as well as in the other Pyrinae; it was divided in two parts, the first focusing on the characterization of male determinants, and the second on factors external to the S locus. The research of S locus F-box genes was primarily aimed to the identification of such genes in European pear, for which sequence data are still not available; moreover, it allowed also to investigate about the S locus structure in the Pyrinae. The analysis was carried out on a pool of varieties of the three species Pyrus communis (European pear), Pyrus pyrifolia (Japanese pear), and Malus × domestica (apple); varieties carrying S haplotypes whose RNases are highly similar were chosen, in order to check whether or not the same level of similarity is maintained also between the male determinants. A total of 82 sequences was obtained, 47 of which represent the first S-locus F-box genes sequenced from European pear. The sequence data strongly support the hypothesis that the S locus structure is conserved among the three species, and presumably among all the Pyrinae; at least five genes have homologs in the analysed S haplotypes, but the number of F-box genes surrounding the S-RNase could be even greater. The high level of sequence divergence and the similarity between alleles linked to highly conserved RNases, suggest a shared ancestral polymorphism also for the F-box genes. The F-box genes identified in European pear were mapped on a segregating population of 91 individuals from the cross 'Abbé Fétel' × 'Max Red Bartlett'. All the genes were placed on the linkage group 17, where the S locus has been placed both in pear and apple maps, and resulted strongly associated to the S-RNase gene. The linkage with the RNase was perfect for some of the F-box genes, while for others very rare single recombination events were identified. The second part of this study was focused on the research of other genes involved in the SI response in pear; it was aimed on one side to the identification of genes differentially expressed in compatible and incompatible crosses, and on the other to the cloning and characterization of the transglutaminase (TGase) gene, whose role may be crucial in pollen rejection. For the identification of differentially expressed genes, controlled pollinations were carried out in four combinations (self pollination, incompatible, half-compatible and fully compatible cross-pollination); expression profiles were compared through cDNA-AFLP. 28 fragments displaying an expression pattern related to compatibility or incompatibility were identified, cloned and sequenced; the sequence analysis allowed to assign a putative annotation to a part of them. The identified genes are involved in very different cellular processes or in defense mechanisms, suggesting a very complex change in gene expression following the pollen/pistil recognition. The pool of genes identified with this technique offers a good basis for further study toward a better understanding of how the SI response is carried out. Among the factors involved in SI response, moreover, an important role may be played by transglutaminase (TGase), an enzyme involved both in post-translational protein modification and in protein cross-linking. The TGase activity detected in pear styles was significantly higher when pollinated in incompatible combinations than in compatible ones, suggesting a role of this enzyme in the abnormal cytoskeletal reorganization observed during pollen rejection reaction. The aim of this part of the work was thus to identify and clone the pear TGase gene; the PCR amplification of fragments of this gene was achieved using primers realized on the alignment between the Arabidopsis TGase gene sequence and several apple EST fragments; the full-length coding sequence of the pear TGase gene was then cloned from cDNA, and provided a precious tool for further study of the in vitro and in vivo action of this enzyme.
Resumo:
The Myc oncoproteins belong to a family of transcription factors composed by Myc, N-Myc and L-Myc. The most studied components of this family are Myc and N-Myc because their expressions are frequently deregulated in a wide range of cancers. These oncoproteins can act both as activators or repressors of gene transcription. As activators, they heterodimerize with Max (Myc associated X-factor) and the heterodimer recognizes and binds a specific sequence elements (E-Box) onto gene promoters recruiting histone acetylase and inducing transcriptional activation. Myc-mediated transcriptional repression is a quite debated issue. One of the first mechanisms defined for the Myc-mediated transcriptional repression consisted in the interaction of Myc-Max complex Sp1 and/or Miz1 transcription factors already bound to gene promoters. This interaction may interfere with their activation functions by recruiting co-repressors such as Dnmt3 or HDACs. Moreover, in the absence of , Myc may interfere with the Sp1 activation function by direct interaction and subsequent recruitment of HDACs. More recently the Myc/Max complex was also shown to mediate transcriptional repression by direct binding to peculiar E-box. In this study we analyzed the role of Myc overexpression in Osteosarcoma and Neuroblastoma oncogenesis and the mechanisms underling to Myc function. Myc overexpression is known to correlate with chemoresistance in Osteosarcoma cells. We extended this study by demonstrating that c-Myc induces transcription of a panel of ABC drug transporter genes. ABCs are a large family trans-membrane transporter deeply involved in multi drug resistance. Furthermore expression levels of Myc, ABCC1, ABCC4 and ABCF1 were proved to be important prognostic tool to predict conventional therapy failure. N-Myc amplification/overexpression is the most important prognostic factor for Neuroblastoma. Cyclin G2 and Clusterin are two genes often down regulated in neuroblastoma cells. Cyclin G2 is an atypical member of Cyclin family and its expression is associated with terminal differentiation and apoptosis. Moreover it blocks cell cycle progression and induces cell growth arrest. Instead, CLU is a multifunctional protein involved in many physiological and pathological processes. Several lines of evidences support the view that CLU may act as a tumour suppressor in Neuroblastoma. In this thesis I showed that N-Myc represses CCNG2 and CLU transcription by different mechanisms. • N-Myc represses CCNG2 transcription by directly interacting with Sp1 bound in CCNG2 promoter and recruiting HDAC2. Importantly, reactivation of CCNG2 expression through epigenetic drugs partially reduces N-Myc and HDAC2 mediated cell proliferation. • N-Myc/Max complex represses CLU expression by direct binding to a peculiar E-box element on CLU promoter and by recruitment of HDACs and Polycomb Complexes, to the CLU promoter. Overall our findings strongly support the model in which Myc overexpression/amplification may contribute to some aspects of oncogenesis by a dual action: i) transcription activation of genes that confer a multidrug resistant phenotype to cancer cells; ii), transcription repression of genes involved in cell cycle inhibition and cellular differentiation.
Resumo:
With the advent of cheaper and faster DNA sequencing technologies, assembly methods have greatly changed. Instead of outputting reads that are thousands of base pairs long, new sequencers parallelize the task by producing read lengths between 35 and 400 base pairs. Reconstructing an organism’s genome from these millions of reads is a computationally expensive task. Our algorithm solves this problem by organizing and indexing the reads using n-grams, which are short, fixed-length DNA sequences of length n. These n-grams are used to efficiently locate putative read joins, thereby eliminating the need to perform an exhaustive search over all possible read pairs. Our goal was develop a novel n-gram method for the assembly of genomes from next-generation sequencers. Specifically, a probabilistic, iterative approach was utilized to determine the most likely reads to join through development of a new metric that models the probability of any two arbitrary reads being joined together. Tests were run using simulated short read data based on randomly created genomes ranging in lengths from 10,000 to 100,000 nucleotides with 16 to 20x coverage. We were able to successfully re-assemble entire genomes up to 100,000 nucleotides in length.
Resumo:
OBJECTIVE: Rheumatoid arthritis (RA) usually improves during pregnancy and recurs postpartum. Fetal cells and cell-free DNA reach the maternal circulation during normal pregnancy. The present study investigated dynamic changes in levels of fetal DNA in serum from women with RA and inflammatory arthritis during and after pregnancy to test the hypothesis that the levels of circulating fetal DNA correlate with arthritis improvement. METHODS: Twenty-five pregnant patients were prospectively studied. A real-time quantitative polymerase chain reaction panel targeting unshared, paternally transmitted HLA sequences, a Y chromosome-specific sequence, or an insertion sequence within the glutathione S-transferase M1 gene was used to measure cell-free fetal DNA. Results were expressed as fetal genomic equivalents per milliliter (gE/ml) of maternal serum. Physical examinations were conducted during and after pregnancy. RESULTS: Levels of fetal DNA in women with improvement in or remission of arthritis were higher than those in women with active disease, especially in the third trimester. Overall, an inverse relationship between serum fetal DNA levels and disease activity was observed (P < 0.001). Serum fetal DNA increased with advancing gestation, reaching median levels of 24 gE/ml (range 0-334), 61 gE/ml (range 0-689), and 199 gE/ml (range 0-2,576) in the first, second, and third trimesters, respectively, with fetal DNA clearance observed postpartum. Arthritis improvement was initially noted in the first trimester for most patients, increased further or was sustained with advancing gestation, and was active postpartum. CONCLUSION: Changes in serum fetal DNA levels correlated with arthritis improvement during pregnancy and recurrence postpartum. Immunologic mechanisms by which pregnancy might modulate RA activity are described.