997 resultados para Reference genes
Resumo:
Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using of NGS data at other genomic regions of high diversity.
Resumo:
Human exposure to Bisphenol A (BPA) results mainly from ingestion of food and beverages. Information regarding BPA effects on colon cancer, one of the major causes of death in developed countries, is still scarce. Likewise, little is known about BPA drug interactions although its potential role in doxorubicin (DOX) chemoresistance has been suggested. This study aims to assess potential interactions between BPA and DOX on HT29 colon cancer cells. HT29 cell response was evaluated after exposure to BPA, DOX, or co-exposure to both chemicals. Transcriptional analysis of several cancer-associated genes (c-fos, AURKA, p21, bcl-xl and CLU) shows that BPA exposure induces slight up-regulation exclusively of bcl-xl without affecting cell viability. On the other hand, a sub-therapeutic DOX concentration (40 nM) results in highly altered c-fos, bcl-xl, and CLU transcript levels, and this is not affected by co-exposure with BPA. Conversely, DOX at a therapeutic concentration (4 μM) results in distinct and very severe transcriptional alterations of c-fos, AURKA, p21 and CLU that are counteracted by co-exposure with BPA resulting in transcript levels similar to those of control. Co-exposure with BPA slightly decreases apoptosis in relation to DOX 4 μM alone without affecting DOX-induced loss of cell viability. These results suggest that BPA exposure can influence chemotherapy outcomes and therefore emphasize the necessity of a better understanding of BPA interactions with chemotherapeutic agents in the context of risk assessment.
Resumo:
2014
Resumo:
This was a longitudinal study carried out during a period over 2 years with a cohort of 946 individuals of both sexes, aged 1 year and older, from an endemic area of American visceral leishmaniasis (AVL) in Para State, Brazil. The object was to analyze the transmission dynamics of human Leishmania (Leishmania) infantum chagasi infection based principally on the prevalence and incidence. For diagnosis of the infection, the indirect fluorescent antibody test (IFAT) and leishmanin skin test (LST) were performed with amastigote and promastigote antigens of the parasite, respectively. The prevalence by LST (11.2%) was higher (p < 0.0001) than that (3.4%) by IFAT, and the combined prevalence by both tests was 12.6%. The incidences by LST were also higher (p < 0.05) than those by IFAT at 6 (4.7% A- 0.6%), 12 (4.7% A- 2.7%), and 24 months (2.9% A- 0.3%). Moreover, there were no differences (p > 0.05) between the combined incidences by both tests on the same point surveys, 5.2%, 6.3%, and 3.6%. During the study, 12 infected persons showed high IFAT IgG titers with no LST reactions: five children and two adults developed AVL (2,560-10,120), and two children and three adults developed subclinical oligosymptomatic infection (1,280-2,560). The combined tests diagnosed a total of 231 cases of infection leading to an accumulated prevalence of 24.4%.
Resumo:
The molecular karyotypes for 20 reference strais of species complexes of Leishmania were determined by contour-clamped homogeneous eletric field (CHEF) electrosphoresis. Determination of number/position of chromosome-sized bands and chromosomal DNA locations of house-keeping genes were the two criteria used for differentiating and classifying the Leishmania species. We have established two gel running conditions of optimal separation of chromosomes, wich resolved DNA molecules as large as 2,500 kilobase pairs (kb). Chromosomes were polymorphic in number (22-30) and size (200-2,500 kb) of bands among members of five complexes of Leishmania. Although each stock had a distinct karyotype, in general the differences found between strains and/or species within each complex were not clear enough for parasite identification. However, each group showed a specific number of size-concordant DNA molecules, wich allowed distinction among the Leishmania complex parasites. Clear differences between the Old and New world groups of parasites or among some New World Leishmania species were also apparent in relation to the chromosome locations of beta-tubulin genes. Based on these results as well as data from other published studies the potencial of using DNA karyotype for identifying and classifying leishmanial field isolates is discussed.
Resumo:
Projecte de recerca elaborat a partir d’una estada a l’Institut National de la Recherche Agronomique, França, entre 2007 i 2009. Saccharomyces cerevisiae ha estat el llevat utilitzat durant mil.lenis en l'elaboració de vins. Tot i així, es té poc coneixement sobre les pressions de selecció que han actuat en la modelització del genoma dels llevats vínics. S’ha seqüenciat el genoma d'una soca vínica comercial, EC1118, obtenint 31 supercontigs que cobreixen el 97% del genoma de la soca de referència, S288c. S’ha trobat que el genoma de la soca vínica es diferencia bàsicament en la possessió de 3 regions úniques que contenen 34 gens implicats en funcions claus per al procés fermentatiu. A banda, s’han dut a terme estudis de filogènia i synteny (ordre dels gens) que mostren que una d'aquestes tres regions és pròxima a una espècie relacionada amb el gènere Saccharomyces, mentre que les altres dos regions tenen un origen no-Saccharomyces. S’ha identificat mitjançant PCR i seqüenciació a Zygosaccharomyces bailii, una espècie contaminant de les fermentacions víniques, com a espècie donadora d'una de les dues regions. Les hibridacions naturals entre soques de diferents espècies dins del grup Saccharomyces sensu stricto ja han estat descrites. El treball és el primer que presenta hibridacions entre espècies Saccharomyces i no-Saccharomyces (Z. bailii, en aquest cas). També s’assenyala que les noves regions es troben freqüent i diferencialment presents entre els clades de S. cerevisiae, trobant-se de manera gairebé exclusiva en el grup de les soques víniques, suggerint que es tracta d'una adquisició recent de transferència gènica. En general, les dades demostren que el genoma de les soques víniques pateix una constant remodelació mitjançant l'adquisició de gens exògens. Els resultats suggereixen que aquests processos estan afavorits per la proximitat ecològica i estan implicats en l'adaptació molecular de les soques víniques a les condicions d'elevada concentració en sucres, poc nitrogen i elevades concentracions en etanol.
Resumo:
The molecular karyotype of nine Trypanosoma rangeli strains was analyzed by contour-clamped homogeneous electric field electrophoresis, followed by the chromosomal localization of ß-tubulin, cysteine proteinase, 70 kDa heat shock protein (hsp 70) and actin genes. The T. rangeli strains were isolated from either insects or mammals from El Salvador, Honduras, Venezuela, Colombia, Panama and southern Brazil. Also, T. cruzi CL-Brener clone was included for comparison. Despite the great similarity observed among strains from Brazil, the molecular karyotype of all T. rangeli strains analyzed revealed extensive chromosome polymorphism. In addition, it was possible to distinguish T. rangeli from T. cruzi by the chromosomal DNA electrophoresis pattern. The localization of ß-tubulin genes revealed differences among T. rangeli strains and confirmed the similarity between the isolates from Brazil. Hybridization assays using probes directed to the cysteine proteinase, hsp 70 and actin genes discriminated T. rangeli from T. cruzi, proving that these genes are useful molecular markers for the differential diagnosis between these two species. Numerical analysis based on the molecular karyotype data revealed a high degree of polymorphism among T. rangeli strains isolated from southern Brazil and strains isolated from Central and the northern South America. The T. cruzi reference strain was not clustered with any T. rangeli strain.
Resumo:
Background: Gene expression analysis has emerged as a major biological research area, with real-time quantitative reverse transcription PCR (RT-QPCR) being one of the most accurate and widely used techniques for expression profiling of selected genes. In order to obtain results that are comparable across assays, a stable normalization strategy is required. In general, the normalization of PCR measurements between different samples uses one to several control genes (e. g. housekeeping genes), from which a baseline reference level is constructed. Thus, the choice of the control genes is of utmost importance, yet there is not a generally accepted standard technique for screening a large number of candidates and identifying the best ones. Results: We propose a novel approach for scoring and ranking candidate genes for their suitability as control genes. Our approach relies on publicly available microarray data and allows the combination of multiple data sets originating from different platforms and/or representing different pathologies. The use of microarray data allows the screening of tens of thousands of genes, producing very comprehensive lists of candidates. We also provide two lists of candidate control genes: one which is breast cancer-specific and one with more general applicability. Two genes from the breast cancer list which had not been previously used as control genes are identified and validated by RT-QPCR. Open source R functions are available at http://www.isrec.isb-sib.ch/similar to vpopovic/research/ Conclusion: We proposed a new method for identifying candidate control genes for RT-QPCR which was able to rank thousands of genes according to some predefined suitability criteria and we applied it to the case of breast cancer. We also empirically showed that translating the results from microarray to PCR platform was achievable.
Resumo:
Diarrhoeal disease is still considered a major cause of morbidity and mortality among children. Among diarrhoeagenic agents, Shigella should be highlighted due to its prevalence and the severity of the associated disease. Here, we assessed Shigella prevalence, drug susceptibility and virulence factors. Faeces from 157 children with diarrhoea who sought treatment at the Children's Hospital João Paulo II, a reference children´s hospital in Belo Horizonte, state of Minas Gerais, Brazil, were cultured and drug susceptibility of the Shigella isolates was determined by the disk diffusion technique. Shigella virulence markers were identified by polymerase chain reaction. The bacterium was recovered from 10.8% of the children (88.2% Shigella sonnei). The ipaH, iuc, sen and ial genes were detected in strains isolated from all shigellosis patients; set1A was only detected in Shigella flexneri. Additionally, patients were infected by Shigella strains of different ial, sat, sen and set1A genotypes. Compared to previous studies, we observed a marked shift in the distribution of species from S. flexneri to S. sonnei and high rates of trimethoprim/sulfamethoxazole resistance.
Resumo:
BACKGROUND: Carnitine is a key molecule in energy metabolism that helps transport activated fatty acids into the mitochondria. Its homeostasis is achieved through oral intake, renal reabsorption and de novo biosynthesis. Unlike dietary intake and renal reabsorption, the importance of de novo biosynthesis pathway in carnitine homeostasis remains unclear, due to lack of animal models and description of a single patient defective in this pathway. CASE PRESENTATION: We identified by array comparative genomic hybridization a 42 months-old girl homozygote for a 221 Kb interstitial deletions at 11p14.2, that overlaps the genes encoding Fibin and butyrobetaine-gamma 2-oxoglutarate dioxygenase 1 (BBOX1), an enzyme essential for the biosynthesis of carnitine de novo. She presented microcephaly, speech delay, growth retardation and minor facial anomalies. The levels of almost all evaluated metabolites were normal. Her serum level of free carnitine was at the lower limit of the reference range, while her acylcarnitine to free carnitine ratio was normal. CONCLUSIONS: We present an individual with a completely defective carnitine de novo biosynthesis. This condition results in mildly decreased free carnitine level, but not in clinical manifestations characteristic of carnitine deficiency disorders, suggesting that dietary carnitine intake and renal reabsorption are sufficient to carnitine homeostasis. Our results also demonstrate that haploinsufficiency of BBOX1 and/or Fibin is not associated with Primrose syndrome as previously suggested.
Resumo:
Drug-resistant tuberculosis (TB) threatens global TB control and is a major public health concern in several countries. We therefore developed a multiplex assay (LINE-TB/MDR) that is able to identify the most frequent mutations related to rifampicin (RMP) and isoniazid (INH) resistance. The assay is based on multiplex polymerase chain reaction, membrane hybridisation and colorimetric detection targeting of rpoB and katG genes, as well as the inhA promoter, which are all known to carry specific mutations associated with multidrug-resistant TB (MDR-TB). The assay was validated on a reference panel of 108 M. tuberculosis isolates that were characterised by the proportion method and by DNA sequencing of the targets. When comparing the performance of LINE-TB/MDR with DNA sequencing, the sensitivity, specificity and agreement were 100%, 100% and 100%, respectively, for RMP and 77.6%, 90.6% and 88.9%, respectively, for INH. Using drug sensibility testing as a reference standard, the performance of LINE-TB/MDR regarding sensitivity, specificity and agreement was 100%, 100% and 100% (95%), respectively, for RMP and 77%, 100% and 88.7% (82.2-95.1), respectively, for INH. LINE-TB/MDR was compared with GenoType MTBDRplus for 65 isolates, resulting in an agreement of 93.6% (86.7-97.5) for RIF and 87.4% (84.3-96.2) for INH. LINE-TB/MDR warrants further clinical validation and may be an affordable alternative for MDR-TB diagnosis.
Resumo:
Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.
Resumo:
Background: The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manualannotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results.Results: The GENCODE gene features are divided into eight different categories of which onlythe first two (known and novel coding sequence) are confidently predicted to be protein-codinggenes. 5’ rapid amplification of cDNA ends (RACE) and RT-PCR were used to experimentallyverify the initial annotation. Of the 420 coding loci tested, 229 RACE products have beensequenced. They supported 5’ extensions of 30 loci and new splice variants in 50 loci. In addition,46 loci without evidence for a coding sequence were validated, consisting of 31 novel and 15putative transcripts. We assessed the comprehensiveness of the GENCODE annotation byattempting to validate all the predicted exon boundaries outside the GENCODE annotation. Outof 1,215 tested in a subset of the ENCODE regions, 14 novel exon pairs were validated, only twoof them in intergenic regions.Conclusions: In total, 487 loci, of which 434 are coding, have been annotated as part of theGENCODE reference set available from the UCSC browser. Comparison of GENCODEannotation with RefSeq and ENSEMBL show only 40% of GENCODE exons are contained withinthe two sets, which is a reflection of the high number of alternative splice forms with uniqueexons annotated. Over 50% of coding loci have been experimentally verified by 5’ RACE forEGASP and the GENCODE collaboration is continuing to refine its annotation of 1% humangenome with the aid of experimental validation.
Resumo:
The vast majority of the biology of a newly sequenced genome is inferred from the set of encoded proteins. Predicting this set is therefore invariably the first step after the completion of the genome DNA sequence. Here we review the main computational pipelines used to generate the human reference protein-coding gene sets.
Resumo:
The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.