82 resultados para flanking sequence
em CentAUR: Central Archive University of Reading - UK
Resumo:
Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10(-7). Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
Resumo:
In recent years, a large number of papers have reported the response of the cusp to solar wind variations under conditions of northward or southward Interplanetary Magnetic Field (IMF) Z-component (BZ). These studies have shown the importance of both temporal and spatial factors in determining the extent and morphology of the cusp and the changes in its location, connected to variations in the reconnection geometry. Here we present a comparative study of the cusp, focusing on an interval characterised by a series of rapid reversals in the BZ-dominated IMF, based on observations from space-borne and ground-based instrumentation. During this interval, from 08:00 to 12:00 UT on 12 February 2003, the IMF BZ component underwent four reversals, remaining for around 30 min in each orientation. The Cluster spacecraft were, at the time, on an outbound trajectory through the Northern Hemisphere magnetosphere, whilst the mainland VHF and Svalbard (ESR) radars of the EISCAT facility were operating in support of the Cluster mission. Both Cluster and the EISCAT were, on occasion during the interval, observing the cusp region. The series of IMF reversal resulted in a sequence of poleward and equatorward motions of the cusp; consequently Cluster crossed the high altitude cusp twice before finally exiting the dayside magnetopause, both times under conditions of northward IMF BZ. The first magnetospheric cusp encounter, by all four Cluster spacecraft, showed reverse ion dispersion typical of lobe reconnection; subsequently, Cluster spacecraft 1 and 3 (only) crossed the cusp for a second time. We suggest that, during this second cusp crossing, these two spacecraft were likely to have been on newly closed field lines, which were first reconnected (opened) at low latitudes and later reconnected again (re-closed) poleward of the northern cusp.
Resumo:
Monomer-sequence information in synthetic copolyimides can be recognised by tweezer-type molecules binding to adjacent triplet-sequences on the polymer chains. In the present paper different tweezer-molecules are found to have different sequence-selectivities, as demonstrated in solution by 1H NMR spectroscopy and in the solid state by single crystal X-ray analyses of tweezer-complexes with linear and macrocyclic oligo-imides. This work provides clear-cut confirmation of polyimide chain-folding and adjacent-tweezer-binding. It also reveals a new and entirely unexpected mechanism for sequence-recognition which, by analogy with a related process in biomolecular information processing, may be termed "frameshift-reading". The ability of one particular tweezer-molecule to detect, with exceptionally high sensitivity, long-range sequence-information in chain-folding aromatic copolyimides, is readily explained by this novel process.
Resumo:
Inversions breaking the 1041 bp int1h-1 or the 9.5-kb int22h-1 sequence of the F8 gene cause hemophilia A in 1/30,000 males. These inversions are due to homologous recombination between the above sequences and their inverted copies on the same DNA molecule, respectively, int1h-2 and int22h-2 or int22h-3. We find that (1) int1h and int22h duplicated more than 25 million years ago; (2) the identity of the copies (>99%) of these sequences in humans and other primates is due to gene conversion; (3) gene conversion is most frequent in the internal regions of int22h; (4) breakpoints of int22h-related inversions also tend to involve the internal regions of int22h; (5) sequence variations in a sample of human X chromosomes defined eight haplotypes of int22h-1 and 27 of int22h-2 plus int22h-3; (6) the latter two sequences, which lie, respectively, 500 and 600 kb telomeric to int22h-1 are five-fold more identical when in cis than when in trans, thus suggesting that gene conversion may be predominantly intrachromosomal; (7) int1h, int22h, and flanking sequences evolved at a rate of about 0.1% substitutions per million years during the divergence between humans and other primates, except for int1h during the human-chimpanzee divergence, when its rate of evolution was significantly lower. This is reminiscent of the slower evolution of palindrome arms in the male specific regions of the Y chromosome and we propose, as an explanation, that intrachromosomal gene conversion and cosegregation of the duplicated regions favors retention of the ancestral sequence and thus reduces the evolution rate.
Resumo:
The recently described cupin superfamily of proteins includes the germin and germinlike proteins, of which the cereal oxalate oxidase is the best characterized. This superfamily also includes seed storage proteins, in addition to several microbial enzymes and proteins with unknown function. All these proteins are characterized by the conservation of two central motifs, usually containing two or three histidine residues presumed to be involved with metal binding in the catalytic active site. The present study on the coding regions of Synechocystis PCC6803 identifies a previously unknown group of 12 related cupins, each containing the characteristic two-motif signature. This group comprises 11 single-domain proteins, ranging in length from 104 to 289 residues, and includes two phosphomannose isomerases and two epimerases involved in cell wall synthesis, a member of the pirin group of nuclear proteins, a possible transcriptional regulator, and a close relative-of a cytochrome c551 from Rhodococcus. Additionally, there is a duplicated, two-domain protein that has close similarity to an oxalate decarboxylase from the fungus Collybia velutipes and that is a putative progenitor of the storage proteins of land plants.
Resumo:
The order Fabales, including Leguminosae, Polygalaceae, Quillajaceae and Surianaceae, represents a novel hypothesis emerging from angiosperm molecular phylogenies. Despite good support for the order, molecular studies to date have suggested contradictory, poorly supported interfamilial relationships. Our reappraisal of relationships within Fabales addresses past taxon sampling deficiencies, and employs parsimony and Bayesian approaches using sequences from the plastid regions rbcL (166 spp.) and matK (78 spp.). Five alternative hypotheses for interfamilial relationships within Fabales were recovered. The Shimodaira-Hasegawa test found the likelihood of a resolved topology significantly higher than the one calculated for a polytomy, but did not favour any of the alternative hypotheses of relationship within Fabales. In the light of the morphological evidence available and the comparative behavior of rbcL and matK, the topology recovering Polygalaceae as sister to the rest of the order Fabales with Leguminosae more closely related to Quillajaceae + Surianaceae, is considered the most likely hypothesis of interfamilial relationships of the order. Dating of selected crown clades in the Fabales phylogeny using penalized likelihood suggests rapid radiation of the Leguminosae, Polygalaceae, and (Quillajaceae + Surianaceae) crown clades.
Resumo:
Diversity in the chloroplast genome of 171 accessions representing the Brassica 'C' (n = 9) genome, including domesticated and wild B. oleracea and nine inter-fertile related wild species, was investigated using six chloroplast SSR (microsatellite) markers. The lack of diversity detected among 105 cultivated and wild accessions of B. oleracea contrasted starkly with that found within its wild relatives. The vast majority of B. oleracea accessions shared a single haplotype, whereas as many as six haplotypes were detected in two wild species, B. villosa Biv. and B. cretica Lam.. The SSRs proved to be highly polymorphic across haplotypes, with calculated genetic diversity values (H) of 0.23-0.87. In total, 23 different haplotypes were detected in C genome species, with an additional five haplotypes detected in B. rapa L. (A genome n = 10) and another in B. nigra L. (B genome, n = 8). The low chloroplast diversity of B. oleracea is not suggestive of multiple domestication events. The predominant B. oleracea haplotype was also common in B. incana Ten. and present in low frequencies in B. villosa, B. macrocarpa Guss, B. rupestris Raf. and B. cretica. The chloroplast SSRs reveal a wealth of diversity within wild Brassica species that will facilitate further evolutionary and phylogeographic studies of this important crop genus.
Resumo:
Investigations were conducted during the 2003, 2004 and 2005 growing seasons in northern Greece to evaluate effects of tillage regime (mouldboard plough, chisel plough and rotary tiller), cropping sequence (continuous cotton, cotton-sugar beet rotation and continuous tobacco) and herbicide treatment on weed seedbank dynamics. Amaranthus spp. and Portulaca oleracea were the most abundant species, ranging from 76% to 89% of total weed seeds found in 0-15 and 15-30 cm soil depths during the 3 years. With the mouldboard plough, 48% and 52% of the weed seedbank was found in the 0-15 and 15-30 cm soil horizons, while approximately 60% was concentrated in the upper 15 cm soil horizon for chisel plough and rotary tillage. Mouldboard ploughing significantly buried more Echinochloa crus-galli seeds in the 15-30 cm soil horizon compared with the other tillage regimes. Total seedbank (0-30 cm) of P. oleracea was significantly reduced in cotton-sugar beet rotation compared with cotton and tobacco monocultures, while the opposite occurred for E. crus-galli. Total seed densities of most annual broad-leaved weed species (Amaranthus spp., P. oleracea, Solanum nigrum) and E. crus-galli were lower in herbicide treated than in untreated plots. The results suggest that in light textured soils, conventional tillage with herbicide use gradually reduces seed density of small seeded weed species in the top 15 cm over several years. In contrast, crop rotation with the early established sugar beet favours spring-germinating grass weed species, but also prevents establishment of summer-germinating weed species by the early developing crop canopy.
Resumo:
The monophyly of the Peltophorum group, one of nine informal groups recognized by Polhill in the Caesalpinieae, was tested using sequence data from the trnL-F, rbcL, and rps16 regions of the chloroplast genome. Exemplars were included from all 16 genera of the Peltophorum group, and from 15 genera representing seven of the other eight informal groups in the tribe. The data were analyzed separately and in combined analyses using parsimony and Bayesian methods. The analysis method had little effect on the topology of well-supported relationships. The molecular data recovered a generally well-supported phylogeny with many intergeneric relationships resolved. Results show that the Peltophorum group as currently delimited is polyphyletic, but that eight genera plus one undescribed genus form a core Peltophorum group, which is referred to here as the Peltophorum group sensu stricto. These genera are Bussea, Conzattia, Colvillea, Delonix, Heteroflorum (inedit.), Lemuropisum, Parkinsonia, Peltophorum, and Schizolobium. The remaining eight genera of the Peltophorum group s.l. are distributed across the Caesalpinieae. Morphological support for the redelimited Peltophorum group and the other recovered clades was assessed, and no unique synapomorphy was found for the Peltophorum group s.s. A proposal for the reclassification of the Peltophorum group s.l. is presented.
Resumo:
We describe a general likelihood-based 'mixture model' for inferring phylogenetic trees from gene-sequence or other character-state data. The model accommodates cases in which different sites in the alignment evolve in qualitatively distinct ways, but does not require prior knowledge of these patterns or partitioning of the data. We call this qualitative variability in the pattern of evolution across sites "pattern-heterogeneity" to distinguish it from both a homogenous process of evolution and from one characterized principally by differences in rates of evolution. We present studies to show that the model correctly retrieves the signals of pattern-heterogeneity from simulated gene-sequence data, and we apply the method to protein-coding genes and to a ribosomal 12S data set. The mixture model outperforms conventional partitioning in both these data sets. We implement the mixture model such that it can simultaneously detect rate- and pattern-heterogeneity. The model simplifies to a homogeneous model or a rate- variability model as special cases, and therefore always performs at least as well as these two approaches, and often considerably improves upon them. We make the model available within a Bayesian Markov-chain Monte Carlo framework for phylogenetic inference, as an easy-to-use computer program.