916 resultados para Genome-specific Sequence


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background The majority of introns in gene transcripts are found within the coding sequences (CDSs). A small but significant fraction of introns are also found to reside within the untranslated regions (5′UTRs and 3′UTRs) of expressed sequences. Alignment of the whole genome and expressed sequence tags (ESTs) of the model plant Arabidopsis thaliana has identified introns residing in both coding and non-coding regions of the genome. Results A bioinformatic analysis revealed some interesting observations: (1) the density of introns in 5′UTRs is similar to that in CDSs but much higher than that in 3′UTRs; (2) the 5′UTR introns are preferentially located close to the initiating ATG codon; (3) introns in the 5′UTRs are, on average, longer than introns in the CDSs and 3′UTRs; and (4) 5′UTR introns have a different nucleotide composition to that of CDs and 3′UTR introns. Furthermore, we show that the 5′UTR intron of the A. thaliana EFIα-A3 gene affects the gene expression and the size of the 5′UTR intron influences the level of gene expression. Conclusion Introns within the 5′UTR show specific features that distinguish them from introns that reside within the coding sequence and the 3′UTR. In the EFIα-A3 gene, the presence of a long intron in the 5′UTR is sufficient to enhance gene expression in plants in a size dependent manner.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria Ã- ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×-39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Plant small RNAs are a class of 19- to 25-nucleotide (nt) RNA molecules that are essential for genome stability, development and differentiation, disease, cellular communication, signaling, and adaptive responses to biotic and abiotic stress. Small RNAs comprise two major RNA classes, short interfering RNAs (siRNAs) and microRNAs (miRNAs). Efficient and reliable detection and quantification of small RNA expression has become an essential step in understanding their roles in specific cells and tissues. Here we provide protocols for the detection of miRNAs by stem-loop RT-PCR. This method enables fast and reliable miRNA expression profiling from as little as 20 pg of total RNA extracted from plant tissue and is suitable for high-throughput miRNA expression analysis. In addition, this method can be used to detect other classes of small RNAs, provided the sequence is known and their GC contents are similar to those specific for miRNAs.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We determined the nucleotide sequence of the mitochondrial genome (mtgenome) of Spilonota lechriaspis Meyrick (Lepidoptera: Tortricidae). The entire closed circular molecule is 15,368 bp and contains 37 genes with the typical gene complement and order for lepidopteran mtgenomes. All tRNAs except tRNASer(AGN) can be folded into the typical cloverleaf secondary structures. The protein-coding genes (PCGs) have typical mitochondrial start codons, with the exception of COI, which uses the unusual CGA one as is found in all other Lepidoptera sequenced to date. In addition, six of 13 PCGs harbor the incomplete termination codons, a single T. The A+T-rich region contains some conserved structures that are similar to those found in other lepidopteran mtgenomes, including a structure combining the motif 'ATAGA', a 19-bp poly(T) stretch and three microsatellite (AT)n elements which are part of larger 122+ bp macrorepeats. This is the first report of macrorepeats in a lepidopteran mtgenome.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

MicroRNAs are small non-coding RNAs that mediate post-transcriptional gene silencing. Fear-extinction learning in C57/Bl6J mice led to increased expression of the brain-specific microRNA miR-128b, which disrupted stability of several plasticity-related target genes and regulated formation of fear-extinction memory. Increased miR-128b activity may therefore facilitate the transition from retrieval of the original fear memory toward the formation of a new fear-extinction memory.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Coleoptera is the most diverse group of insects with over 360,000 described species divided into four suborders: Adephaga, Archostemata, Myxophaga, and Polyphaga. In this study, we present six new complete mitochondrial genome (mtgenome) descriptions, including a representative of each suborder, and analyze the evolution of mtgenomes from a comparative framework using all available coleopteran mtgenomes. We propose a modification of atypical cox1 start codons based on sequence alignment to better reflect the conservation observed across species as well as findings of TTG start codons in other genes. We also analyze tRNA-Ser(AGN) anticodons, usually GCU in arthropods, and report a conserved UCU anticodon as a possible synapomorphy across Polyphaga. We further analyze the secondary structure of tRNA-Ser(AGN) and present a consensus structure and an updated covariance model that allows tRNAscan-SE (via the COVE software package) to locate and fold these atypical tRNAs with much greater consistency. We also report secondary structure predictions for both rRNA genes based on conserved stems. All six species of beetle have the same gene order as the ancestral insect. We report noncoding DNA regions, including a small gap region of about 20 bp between tRNA-Ser(UCN) and nad1 that is present in all six genomes, and present results of a base composition analysis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Escherichia coli is the primary cause of urinary tract infection (UTI) in the developed world. The major factors associated with virulence of uropathogenic E. coli (UPEC) are fimbrial adhesins, which mediate specific attachment to host receptors and trigger innate host responses. Another group of adhesins is represented by the autotransporter (AT) subgroup of proteins. In this study, we identified a new AT-encoding gene, termed upaH, present in a 6.5-kb unannotated intergenic region in the genome of the prototypic UPEC strain CFT073. Cloning and sequencing of the upaH gene from CFT073 revealed an intact 8.535-kb coding region, contrary to the published genome sequence. The upaH gene was widely distributed among a large collection of UPEC isolates as well as the E. coli Reference (ECOR) strain collection. Bioinformatic analyses suggest β-helix as the predominant structure in the large N-terminal passenger (α) domain and a 12-strand β-barrel for the C-terminal β-domain of UpaH. We demonstrated that UpaH is expressed at the cell surface of CFT073 and promotes biofilm formation. In the mouse UTI model, deletion of the upaH gene in CFT073 and in two other UPEC strains did not significantly affect colonization of the bladder in single-challenge experiments. However, in competitive colonization experiments, CFT073 significantly outcompeted its upaH isogenic mutant strain in urine and the bladder.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background Increased disease resistance is a key target of cereal breeding programs, with disease outbreaks continuing to threaten global food production, particularly in Africa. Of the disease resistance gene families, the nucleotide-binding site plus leucine-rich repeat (NBS-LRR) family is the most prevalent and ancient and is also one of the largest gene families known in plants. The sequence diversity in NBS-encoding genes was explored in sorghum, a critical food staple in Africa, with comparisons to rice and maize and with comparisons to fungal pathogen resistance QTL. Results In sorghum, NBS-encoding genes had significantly higher diversity in comparison to non NBS-encoding genes and were significantly enriched in regions of the genome under purifying and balancing selection, both through domestication and improvement. Ancestral genes, pre-dating species divergence, were more abundant in regions with signatures of selection than in regions not under selection. Sorghum NBS-encoding genes were also significantly enriched in the regions of the genome containing fungal pathogen disease resistance QTL; with the diversity of the NBS-encoding genes influenced by the type of co-locating biotic stress resistance QTL. Conclusions NBS-encoding genes are under strong selection pressure in sorghum, through the contrasting evolutionary processes of purifying and balancing selection. Such contrasting evolutionary processes have impacted ancestral genes more than species-specific genes. Fungal disease resistance hot-spots in the genome, with resistance against multiple pathogens, provides further insight into the mechanisms that cereals use in the “arms race” with rapidly evolving pathogens in addition to providing plant breeders with selection targets for fast-tracking the development of high performing varieties with more durable pathogen resistance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. Results RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene. Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. Conclusions This transcriptomic dataset is a useful resource for molecular genetic studies of the koala, for evolutionary genetic studies of marsupials, for validation and annotation of the koala genome sequence, and for investigation of koala retrovirus. Annotated transcripts can be browsed and queried at http://koalagenome.org

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The O-specific polysaccharide (OPS) is a variable constituent of the lipopolysaccharide of Gram-negative bacteria. The polymorphic nature of OPSs within a species is usually first defined serologically, and the current serotyping scheme for Yersinia pseudotuberculosis consists of 21 O serotypes of which 15 have been characterized genetically and structurally. Here, we present the structure and DNA sequence of Y. pseudotuberculosis O:10 OPS. The O unit consists of one residue each of d-galactopyranose, N-acetyl-d-galactosamine (2-amino-2-deoxy-d-galactopyranose) and d-glucopyranose in the backbone, with two colitose (3,6-dideoxy-l-xylo-hexopyranose) side-branch residues. This structure is very similar to that shared by Escherichia coli O111 and Salmonella enterica O35. The gene cluster sequences of these serotypes, however, have only low levels of similarity to that of Y. pseudotuberculosis O:10, although there is significant conservation of gene order. Within Y. pseudotuberculosis, the O10 structure is most closely related to the O:6 and O:7 structures.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Lipooligosaccharide (LOS) is a complex surface structure that is linked to many pathogenic properties of Acinetobacter baumannii. In A. baumannii, the genes responsible for the synthesis of the outer core (OC) component of the LOS are located between ilvE and aspS. The content of the OC locus is usually variable within a species, and examination of 6 complete and 227 draft A. baumannii genome sequences available in GenBank non-redundant and Whole Genome Shotgun databases revealed nine distinct new types, OCL4-OCL12, in addition to the three known ones. The twelve gene clusters fell into two distinct groups, designated Group A and Group B, based on similarities in the genes present. OCL6 (Group B) was unique in that it included genes for the synthesis of L-Rhamnosep. Genetic exchange of the different configurations between strains has occurred as some OC forms were found in several different sequence types (STs). OCL1 (Group A) was the most widely distributed being present in 18 STs, and OCL6 was found in 16 STs. Variation within clones was also observed, with more than one OC locus type found in the two globally disseminated clones, GC1 and GC2, that include the majority of multiply antibiotic resistant isolates. OCL1 was the most abundant gene cluster in both GC1 and GC2 genomes but GC1 isolates also carried OCL2, OCL3 or OCL5, and OCL3 was also present in GC2. As replacement of the OC locus in the major global clones indicates the presence of sub-lineages, a PCR typing scheme was developed to rapidly distinguish Group A and Group B types, and to distinguish the specific forms found in GC1 and GC2 isolates.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Human brain connectivity is disrupted in a wide range of disorders from Alzheimer's disease to autism but little is known about which specific genes affect it. Here we conducted a genome-wide association for connectivity matrices that capture information on the density of fiber connections between 70 brain regions. We scanned a large twin cohort (N=366) with 4-Tesla high angular resolution diffusion imaging (105-gradient HARDI). Using whole brain HARDI tractography, we extracted a relatively sparse 70×70 matrix representing fiber density between all pairs of cortical regions automatically labeled in co-registered anatomical scans. Additive genetic factors accounted for 1-58% of the variance in connectivity between 90 (of 122) tested nodes. We discovered genome-wide significant associations between variants and connectivity. GWAS permutations at various levels of heritability, and split-sample replication, validated our genetic findings. The resulting genes may offer new leads for mechanisms influencing aberrant connectivity and neurodegeneration. © 2012 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer's disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, highangular resolution diffusion MRI. We adapted GWASs to screen the brain's connectivity pattern, allowing us to discover genetic variants that affect the human brain's wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer's disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The caudate is a subcortical brain structure implicated in many common neurological and psychiatric disorders. To identify specific genes associated with variations in caudate volume, structural magnetic resonance imaging and genome-wide genotypes were acquired from two large cohorts, the Alzheimer's Disease NeuroImaging Initiative (ADNI; N=734) and the Brisbane Adolescent/Young Adult Longitudinal Twin Study (BLTS; N=464). In a preliminary analysis of heritability, around 90% of the variation in caudate volume was due to genetic factors. We then conducted genome-wide association to find common variants that contribute to this relatively high heritability. Replicated genetic association was found for the right caudate volume at single-nucleotide polymorphism rs163030 in the ADNI discovery sample (P=2.36 × 10 -6) and in the BLTS replication sample (P=0.012). This genetic variation accounted for 2.79 and 1.61% of the trait variance, respectively. The peak of association was found in and around two genes, WDR41 and PDE8B, involved in dopamine signaling and development. In addition, a previously identified mutation in PDE8B causes a rare autosomal-dominant type of striatal degeneration. Searching across both samples offers a rigorous way to screen for genes consistently influencing brain structure at different stages of life. Variants identified here may be relevant to common disorders affecting the caudate.