950 results for Copy number variations and polymorphisms


Relevance:

100.00% 100.00%

Publisher:

Abstract:

We searched for disruptive, genic rare copy-number variants (CNVs) among 411 families affected by sporadic autism spectrum disorder (ASD) from the Simons Simplex Collection by using available exome sequence data and CoNIFER (Copy Number Inference from Exome Reads). Compared to high-density SNP microarrays, our approach yielded ∼2× more smaller genic rare CNVs. We found that affected probands inherited more CNVs than did their siblings (453 versus 394, p = 0.004; odds ratio [OR] = 1.19) and that the probands' CNVs affected more genes (921 versus 726, p = 0.02; OR = 1.30). These smaller CNVs (median size 18 kb) were transmitted preferentially from the mother (136 maternal versus 100 paternal, p = 0.02), although this bias occurred irrespective of affected status. The excess burden of inherited CNVs among probands was driven primarily by sibling pairs with discordant social-behavior phenotypes (p < 0.0002, measured by Social Responsiveness Scale [SRS] score), which contrasts with families where the phenotypes were more closely matched or less extreme (p > 0.5). Finally, we found enrichment of brain-expressed genes unique to probands, especially in the SRS-discordant group (p = 0.0035). In a combined model, our inherited CNVs, de novo CNVs, and de novo single-nucleotide variants all independently contributed to the risk of autism (p < 0.05). Taken together, these results suggest that small transmitted rare CNVs play a role in the etiology of simplex autism. Importantly, the small size of these variants aids in the identification of specific genes as additional risk factors associated with ASD.
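The maternal transmission bias quoted above (136 maternal versus 100 paternal) can be sanity-checked with an exact two-sided binomial test against a 50:50 expectation. The abstract does not say which test the authors used, so the test choice and the function name below are assumptions; this is a minimal sketch, and its result should land in the neighborhood of the reported p = 0.02.

```python
from math import comb


def binomial_two_sided_p(k: int, n: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probabilities of every outcome that is no more likely
    than the observed count k (the usual 'minimum likelihood' rule).
    """
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = pmf[k]
    # A small relative tolerance guards against floating-point ties.
    return min(1.0, sum(q for q in pmf if q <= observed * (1 + 1e-9)))


# Counts taken from the abstract; the null hypothesis of equal
# maternal and paternal transmission is an assumption of this sketch.
maternal, paternal = 136, 100
total = maternal + paternal
p_value = binomial_two_sided_p(maternal, total)

print(f"maternal fraction = {maternal / total:.3f}")
print(f"two-sided p = {p_value:.3f}")
```

An exact test is used here instead of a normal approximation because it needs no continuity correction and only the standard library.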

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Phylogenetic analysis of morphometric and biological characters indicated that there are two distinct forms of Lutzomyia whitmani in Brazil: one is present both north and south of the River Amazonas in the State of Pará, while the other occurs in northeast Brazil, in the State of Ceará, and further south, including the type locality in the State of Bahia. The Amazonian form is reportedly neither strongly anthropophilic nor synanthropic, and it is the vector of Leishmania shawi, whereas the southern form is often collected peridomestically while biting man, and has been found infected with Le. (V.) braziliensis. The ratio of the length of the genital filaments to that of the genital pump was found to be consistently smaller in males of the Amazonian populations. A middle repetitive DNA element was isolated by differentially screening a genomic library made using Amazonian material, and the sequence was diagnostic for this form of Lu. whitmani (being absent or occurring in low copy number in the southern form). The total evidence suggests there are at least two geographically isolated forms of Lu. whitmani, which may represent different cryptic species.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Somatic copy number aberrations (CNA) represent a mutation type encountered in the majority of cancer genomes. Here, we present the 2014 edition of arrayMap (http://www.arraymap.org), a publicly accessible collection of pre-processed oncogenomic array data sets and CNA profiles, representing a vast range of human malignancies. Since the initial release, we have enhanced this resource both in content and especially with regard to data mining support. The 2014 release of arrayMap contains more than 64,000 genomic array data sets, representing about 250 tumor diagnoses. Data sets included in arrayMap have been assembled from public repositories as well as additional resources, and integrated by applying custom processing pipelines. Online tools have been upgraded for a more flexible array data visualization, including options for processing user provided, non-public data sets. Data integration has been improved by mapping to multiple editions of the human reference genome, with the majority of the data now being available for the UCSC hg18 as well as GRCh37 versions. The large amount of tumor CNA data in arrayMap can be freely downloaded by users to promote data mining projects, and to explore special events such as chromothripsis-like genome patterns.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The 15q24.1 locus, including CYP1A2, is associated with blood pressure (BP). The CYP1A2 rs762551 C allele is associated with lower CYP1A2 enzyme activity. CYP1A2 metabolizes caffeine and is induced by smoking. The association of caffeine consumption with hypertension remains controversial. We explored the effects of CYP1A2 variants and CYP1A2 enzyme activity on BP, focusing on caffeine as the potential mediator of CYP1A2 effects. Four observational (n = 16 719) and one quasi-experimental studies (n = 106) including European adults were conducted. Outcome measures were BP, caffeine intake, CYP1A2 activity and polymorphisms rs762551, rs1133323 and rs1378942. CYP1A2 variants were associated with hypertension in non-smokers, but not in smokers (CYP1A2-smoking interaction P = 0.01). Odds ratios (95% CIs) for hypertension for rs762551 CC, CA and AA genotypes were 1 (reference), 0.78 (0.59-1.02) and 0.66 (0.50-0.86), respectively, P = 0.004. Results were similar for the other variants. Higher CYP1A2 activity was linearly associated with lower BP after quitting smoking (P = 0.049 and P = 0.02 for systolic and diastolic BP, respectively), but not while smoking. In non-smokers, the CYP1A2 variants were associated with higher reported caffeine intake, which in turn was associated with lower odds of hypertension and lower BP (P = 0.01). In Mendelian randomization analyses using rs1133323 as instrument, each cup of caffeinated beverage was negatively associated with systolic BP [-9.57 (-16.22, -2.91) mmHg]. The associations of CYP1A2 variants with BP were modified by reported caffeine intake. These observational and quasi-experimental results strongly support a causal role of CYP1A2 in BP control via caffeine intake.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

SUMMARY When exposed to heat stress, plants display a particular set of cellular and molecular responses, such as chaperone expression, which are highly conserved in all organisms. In chapter 1, I studied the ability of heat shock genes to become transiently and abundantly induced under various temperature regimes. To this aim, I designed a highly sensitive heat-shock-dependent conditional gene expression system in the moss Physcomitrella patens, using the soybean heat-inducible promoter (hsp17.3B). Heat-induced expression of various reporter genes exceeded three orders of magnitude, in tight correlation with the intensity and duration of the heat treatments. By performing repeated heating/cooling cycles, a massive accumulation of recombinant proteins was obtained. Interestingly, the hsp17.3B promoter was also activated by specific organic chemicals. Thus, in chapter 2, I took advantage of the extreme sensitivity of this promoter to small temperature variations to further address the role of various natural and organic chemicals and develop a plant-based bioassay that can serve as an early warning indicator of toxicity by pollutants and heavy metals. A screen of several organic pollutants from the textile and paper industries showed that chlorophenols as well as sulfonated anthraquinones elicited a heat-shock-like response at non-inducing temperatures. Their effects were synergistically amplified by mildly elevated temperatures. In contrast to standard methods of pollutant detection, this plant-based biosensor made it possible to monitor early stress responses, in correlation with long-term toxic effects, and to attribute effective toxicity thresholds for pollutants, in a context of varying environmental cues. In chapter 3, I deepened the study of the primary mechanism by which plants sense mild temperature variations and trigger a cellular signal leading to the heat shock response. In addition to the above-described heat-inducible reporter line, I generated a P. 
patens transgenic line to measure, in vivo, variations of cytosolic calcium during heat treatment, and another line to monitor the role of protein unfolding in heat-shock sensing and signalling. The heat shock signalling pathway was found to be triggered by the plasma membrane, where temperature upshift specifically induced the transient opening of a putative high-affinity calcium channel. The calcium influx triggered a signalling cascade leading to the activation of the heat shock genes, independently of the presence of misfolded proteins in the cytoplasm. These results strongly suggest that changes in the fluidity of the plasma membrane are the primary trigger of the heat-shock signalling pathway in plants. The present thesis contributes to the understanding of the basic mechanism by which plants perceive and respond to heat and chemical stresses. This may contribute to developing better strategies to enhance plant productivity under the increasingly stressful environment of global warming. RÉSUMÉ Plants exposed to high temperatures rapidly trigger cellular responses that lead to the induction of genes encoding heat shock proteins (HSPs). Depending on the duration of exposure and the rate at which the temperature rises, HSPs are strongly and transiently induced. In the first chapter, this property was used to develop an inducible gene expression system in the moss Physcomitrella patens. Using several reporter genes, I showed that the promoter of the soybean hsp17.3B gene is activated homogeneously in all moss tissues, in proportion to the intensity of the physiological heat shock applied. A very high level of recombinant protein can thus be produced by performing several induction/recovery cycles. 
In addition, this promoter can also be activated by organic compounds, such as anti-inflammatory compounds, which offers a good alternative to heat induction. HSPs are induced to remedy the cellular damage that occurs. Because the hsp17.3B promoter is highly sensitive to small temperature increases as well as to chemical compounds, I used the lines developed in chapter 1 to identify pollutants that trigger a defence response involving HSPs. After screening several compounds, chlorophenols and sulfonated anthraquinones were identified as activators of the stress promoter. Their effects were detected after only a few hours of exposure and correlate perfectly with the toxic effects detected after long periods of exposure. The identified compounds also show a synergistic effect with temperature, which makes the biosensor developed in this chapter a good tool for revealing the actual effects of pollutants in an environment where chemical stresses are combined with abiotic stresses. The third chapter is devoted to the study of the early mechanisms that allow plants to perceive heat and thereby trigger a specific signalling cascade that results in the induction of HSP genes. I generated two new lines in order to measure, in real time, changes in cytosolic calcium concentration as well as the denaturation state of proteins during heat shock. When membrane fluidity increases after a rise in temperature, it appears to induce the opening of a channel that allows calcium to enter the cells. The calcium then initiates a signalling cascade that ultimately activates transcription of the HSP genes independently of the denaturation of cytoplasmic proteins. 
The results presented in this chapter show that heat perception occurs essentially at the plasma membrane, which plays a major role in the regulation of HSP genes. Elucidating the mechanisms by which plants perceive environmental signals is of great value for developing new strategies to improve the productivity of plants subjected to extreme conditions. The present thesis helps to dissect the signalling pathway involved in the heat response.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

To search for novel genes involved in cell proliferation, we hypothesized that infecting primary cells with a cDNA library from immortal cells would reveal immortalizing genes. This approach led to the discovery of CIRP (cold-inducible RNA-binding protein). Mammalian cells exposed to mild hypothermia show a general inhibition of protein synthesis and a concomitant increase in the expression of a small number of cold-shock mRNAs and proteins. Rbm3, another RNA-binding protein belonging to the same family, has been postulated to facilitate protein synthesis under mild cold shock. To investigate whether the same occurs for CIRP, CIRP was overexpressed in primary cells and protein synthesis was measured. Interestingly, CIRP increased protein synthesis; however, this increase did not involve an increase in the polysome fraction or affect the ribosome profile. In addition, the effect of CIRP inhibition or knockdown was analyzed. Different siRNAs against CIRP were tested. Once their efficiency in decreasing CIRP at the mRNA and protein levels had been confirmed, proliferation was assessed by BrdU incorporation, cell number (DAPI) and proliferation curves. Interestingly, CIRP knockdown decreased proliferation in primary cells (MEFs, HMEC) and cancer cells (TERA2 and HeLa). In conclusion, we describe for the first time that CIRP bypasses replicative senescence when overexpressed at physiological temperature (37°C) by increasing general protein synthesis. This effect is achieved through ERK1/2 activation in MEFs. The decrease in growth rate found in mammalian cells treated with mild cold stress is not entirely attributable to arrested metabolism. This decrease may also involve an active process in which CIRP and other stress-responsive proteins play a fundamental role in stimulating proliferation. 
Although most cell proteins are down-regulated or inhibited with cold stress, CIRP is activated to maintain cells in an active proliferative status and its overexpression at 37°C might be potentially oncogenic.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

A total of 797 specimens of wild adult triatomines, belonging to six species from the entomological collections of the Costa Rican National Biodiversity Institute, were studied from the standpoint of their relative abundance, as reflected by light traps, distribution in the country, seasonal variations and climatic and altitudinal preferences. Triatoma dimidiata was the most abundant species (32.9% of the total specimens), with a very extensive distribution in different ecological zones, being more common between 100 and 400 m above sea level, mainly at the end of the dry season. T. dispar was the third in frequency (21.5%), with a narrower distribution, more abundant between 600 and 800 m and scarce during the dry season. Panstrongylus geniculatus and P. rufotuberculatus, second and fourth in frequency (22.1% and 15.1%, respectively), were widely distributed on both the Pacific and Caribbean basins, the former being more common between 80 and 270 m all year round and the latter below 800 m, mainly during the first semester. Eratyrus cuspidatus, which represented only 4.9% of the insects, was also present on both basins, mainly below 200 m, with a tendency to be scarce during certain months of the year, and was found in all types of ecological zones. Finally, Rhodnius pallescens, the least abundant species (3.6%), was restricted to very humid areas below 20 m, on the north side and Caribbean basin. With the exception of R. pallescens, males were more commonly found than females. Some epidemiological implications related to the six species are discussed.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The choice of sample preparation protocol is a critical influential factor for isoelectric focusing which in turn affects the two-dimensional gel result in terms of quality and protein species distribution. The optimal protocol varies depending on the nature of the sample for analysis and the properties of the constituent protein species (hydrophobicity, tendency to form aggregates, copy number) intended for resolution. This review explains the standard sample buffer constituents and illustrates a series of protocols for processing diverse samples for two-dimensional gel electrophoresis, including hydrophobic membrane proteins. Current methods for concentrating lower abundance proteins, by removal of high abundance proteins, are also outlined. Finally, since protein staining is becoming increasingly incorporated into the sample preparation procedure, we describe the principles and applications of current (and future) pre-electrophoretic labelling methods.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Introduction: Cognitive impairment affects 40-65% of multiple sclerosis (MS) patients, often from the early stages of the disease (relapsing remitting MS, RRMS). Frequently affected functions are memory, attention or executive abilities but the most sensitive measure of cognitive deficits in early MS is the information processing speed (Amato, 2008). MRI has been extensively exploited to investigate the substrate of cognitive dysfunction in MS but the underlying physiopathological mechanisms remain unclear. White matter lesion load, whole-brain atrophy and cortical lesion number play a role but correlations are in some cases modest (Rovaris, 2006; Calabrese, 2009). In this study, we aimed at characterizing and correlating the T1 relaxation times of cortical and sub-cortical lesions with cognitive deficits detected by neuropsychological tests in a group of very early RRMS patients. Methods: Ten female patients with very early RRMS (age: 31.6 ±4.7y; disease duration: 3.8 ±1.9y; EDSS disability score: 1.8 ±0.4) and 10 age- and gender-matched healthy volunteers (mean age: 31.2 ±5.8y) were included in the study. All participants underwent the following neuropsychological tests: Rao's Brief Repeatable Battery of Neuropsychological tests (BRB-N), Stockings of Cambridge, Trail Making Test (TMT, part A and B), Boston Naming Test, Hooper Visual Organization Test and copy of the Rey-Osterrieth Complex Figure. Within 2 weeks from neuropsychological assessment, participants underwent brain MRI at 3T (Magnetom Trio a Tim System, Siemens, Germany) using a 32-channel head coil. The imaging protocol included 3D sequences with 1x1x1.2 mm3 resolution and 256x256x160 matrix, except for axial 2D-FLAIR: -DIR (T2-weighted, suppressing both WM and CSF; Pouwels, 2006) -MPRAGE (T1-weighted; Mugler, 1991) -MP2RAGE (T1-weighted with T1 maps; Marques, 2010) -FLAIR SPACE (only for patients 4-10, T2-weighted; Mugler, 2001) -2D Axial FLAIR (0.9x0.9x2.5 mm3, 256x256x44 matrix). 
Lesions were identified by one experienced neurologist and radiologist using all contrasts, manually contoured and assigned to regional locations (cortical or sub-cortical). Lesion number, volume and T1 relaxation time were calculated for lesions in each contrast and in a merged mask representing the union of the lesions from all contrasts. T1 relaxation times of lesions were normalized with the mean T1 value in corresponding control regions of the healthy subjects. Statistical analysis was performed using GraphPad InStat software. Cognitive scores were compared between patients and controls with paired t-tests; p values ≤ 0.05 were considered significant. Spearman correlation tests were performed between the cognitive tests, which differed significantly between patients and controls, and lesions' i) number ii) volume iii) T1 relaxation time iv) disease duration and v) years of study. Results: Cortical and sub-cortical lesion counts, T1 values and volumes are reported in Table 1 (A and B). All early RRMS patients showed cortical lesions (CLs) and the majority consisted of CLs type I (lesions with a cortical component extending to the sub-cortical tissue). The rest of the cortical lesions were characterized as type II (intra-cortical lesions). No type III/IV lesions (large sub-pial lesions) were detected. RRMS patients were slightly less educated (13.5±2.5y vs. 16.3±1.8y of study, p=0.02) than the controls. Signs of cortical dysfunction (i.e. impaired learning, language, visuo-spatial skills or gnosis) were rare in all patients. However, patients showed on average lower scores on measures of visual attention and information processing speed (TMT-part A: p=0.01; TMT-part B: p=0.006; PASAT-included in the BRB-N: p=0.04). The T1 relaxation values of CLs type I negatively correlated with the TMT-part A score (r=0.78, p<0.01). 
The correlations of TMT-part B score and PASAT score with T1 relaxation time of lesions, as well as the correlation between TMT-part A, TMT-part B and PASAT score with lesions' i) number ii) volume iii) disease duration and iv) years of study, did not reach significance. In order to preclude possible influences from partial volume effects on the T1 values, the correlation between lesion volume and T1 value of CLs type I was calculated; no correlation was found, suggesting that partial volume effects did not affect the statistics. Conclusions: The present pilot study reports for the first time the presence and the T1 characteristics at 3 T of cortical lesions in very early RRMS (< 6 y disease duration). It also shows that CLs type I represent the most frequent cortical lesion type in this cohort of RRMS patients. In addition, it reveals a negative correlation between the attentional test TMT-part A and the T1 properties of cortical lesions type I. In other words, lower attention deficits are concomitant with longer T1-relaxation time in cortical lesions. With respect to this last finding, it could be speculated that long relaxation times correspond to a certain degree of tissue loss that is enough to stimulate compensatory mechanisms. This hypothesis is in line with previous fMRI studies showing functional compensatory mechanisms that help maintain normal or sub-normal attention performance in RRMS patients (Penner, 2003).

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Tuberculosis (TB) is a major concern in developing countries. In Brazil, few genotyping studies have been conducted to verify the number of IS6110 copies present in locally prevalent strains of Mycobacterium tuberculosis, or the distribution and clustering of strains. IS6110 DNA fingerprinting was performed on a sample of M. tuberculosis isolates from patients with AFB smear-positive pulmonary TB at a hospital in Brazil. The IS6110 profiles were analyzed and compared to the M. tuberculosis database of the Houston Tuberculosis Initiative (HTI), Houston, US. Seventy-six fingerprints were obtained from 98 patients. All M. tuberculosis strains had an IS6110 copy number between 5 and 21, allowing for differentiation of the isolates. Human immunodeficiency virus infection was confirmed in nearly half the patients for whom data were available. Fifty-eight strains had unique patterns, while 17 strains were grouped in 7 clusters (2 to 6 strains). When compared to the HTI database, 6 strains matched isolates from El Paso, Ciudad de Juarez, Houston, and New York. Recently acquired infections were documented in 19% of cases. The community transmission of infection is intense, since some clustered strains were recovered during the four-year study period. The intercontinental dissemination of M. tuberculosis strains is suspected by demonstration of identical fingerprints in a distant country.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

BACKGROUND: Vitamin D insufficiency has been associated with the occurrence of various types of cancer, but causal relationships remain elusive. We therefore aimed to determine the relationship between genetic determinants of vitamin D serum levels and the risk of developing hepatitis C virus (HCV)-related hepatocellular carcinoma (HCC). METHODOLOGY/PRINCIPAL FINDINGS: Associations between CYP2R1, GC, and DHCR7 genotypes that are determinants of reduced 25-hydroxyvitamin D (25[OH]D3) serum levels and the risk of HCV-related HCC development were investigated for 1279 chronic hepatitis C patients with HCC and 4325 without HCC, respectively. The well-known associations between CYP2R1 (rs1993116, rs10741657), GC (rs2282679), and DHCR7 (rs7944926, rs12785878) genotypes and 25(OH)D3 serum levels were also apparent in patients with chronic hepatitis C. The same genotypes of these single nucleotide polymorphisms (SNPs) that are associated with reduced 25(OH)D3 serum levels were found to be associated with HCV-related HCC (P = 0.07 [OR = 1.13, 95% CI = 0.99-1.28] for CYP2R1, P = 0.007 [OR = 1.56, 95% CI = 1.12-2.15] for GC, P = 0.003 [OR = 1.42, 95% CI = 1.13-1.78] for DHCR7; ORs for risk genotypes). In contrast, no association between these genetic variations and liver fibrosis progression rate (P>0.2 for each SNP) or outcome of standard therapy with pegylated interferon-α and ribavirin (P>0.2 for each SNP) was observed, suggesting a specific influence of the genetic determinants of 25(OH)D3 serum levels on hepatocarcinogenesis. CONCLUSIONS/SIGNIFICANCE: Our data suggest a relatively weak but functionally relevant role for vitamin D in the prevention of HCV-related hepatocarcinogenesis.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

BACKGROUND: Accurate catalogs of structural variants (SVs) in mammalian genomes are necessary to elucidate the potential mechanisms that drive SV formation and to assess their functional impact. Next generation sequencing methods for SV detection are an advance on array-based methods, but are almost exclusively limited to four basic types: deletions, insertions, inversions and copy number gains. RESULTS: By visual inspection of 100 Mbp of genome to which next generation sequence data from 17 inbred mouse strains had been aligned, we identify and interpret 21 paired-end mapping patterns, which we validate by PCR. These paired-end mapping patterns reveal a greater diversity and complexity in SVs than previously recognized. In addition, Sanger-based sequence analysis of 4,176 breakpoints at 261 SV sites reveals additional complexity at approximately a quarter of the structural variants analyzed. We find micro-deletions and micro-insertions at SV breakpoints, ranging from 1 to 107 bp, and SNPs that extend breakpoint micro-homology and may catalyze SV formation. CONCLUSIONS: An integrative approach using experimental analyses to train computational SV calling is essential for the accurate resolution of the architecture of SVs. We find considerable complexity in SV formation; about a quarter of SVs in the mouse are composed of a complex mixture of deletion, insertion, inversion and copy number gain. Computational methods can be adapted to identify most paired-end mapping patterns.
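To make the idea of a paired-end mapping pattern concrete, the sketch below classifies a single read pair into one of the four basic SV types named in the abstract, using the textbook signatures for forward/reverse paired-end libraries. It is not the authors' method and covers none of their 21 patterns beyond the basics. The `ReadPair` type, the `classify_pair` function, the 400 bp expected span and the 100 bp tolerance are all hypothetical, and the span is approximated by the distance between leftmost alignment positions.

```python
from dataclasses import dataclass


@dataclass
class ReadPair:
    """One aligned read pair (hypothetical minimal representation)."""
    chrom1: str
    pos1: int      # leftmost aligned position of mate 1
    strand1: str   # '+' or '-'
    chrom2: str
    pos2: int      # leftmost aligned position of mate 2
    strand2: str


def classify_pair(pair: ReadPair, expected_span: int = 400,
                  tolerance: int = 100) -> str:
    """Label a read pair with the basic SV signature it suggests."""
    if pair.chrom1 != pair.chrom2:
        return "interchromosomal"
    # Order the mates by genomic position so orientation is unambiguous.
    (left_pos, left_strand), (right_pos, right_strand) = sorted(
        [(pair.pos1, pair.strand1), (pair.pos2, pair.strand2)]
    )
    span = right_pos - left_pos
    if left_strand == right_strand:
        return "inversion"            # mates point the same way
    if left_strand == "-" and right_strand == "+":
        return "copy_number_gain"     # everted pair, e.g. tandem duplication
    # Remaining case: normal '+' then '-' orientation; judge by distance.
    if span > expected_span + tolerance:
        return "deletion"             # mates map too far apart
    if span < expected_span - tolerance:
        return "insertion"            # mates map too close together
    return "concordant"


examples = {
    "normal": ReadPair("chr1", 1000, "+", "chr1", 1400, "-"),
    "far apart": ReadPair("chr1", 1000, "+", "chr1", 5000, "-"),
    "too close": ReadPair("chr1", 1000, "+", "chr1", 1100, "-"),
    "same strand": ReadPair("chr1", 1000, "+", "chr1", 1400, "+"),
    "everted": ReadPair("chr1", 1000, "-", "chr1", 1400, "+"),
}
for name, pair in examples.items():
    print(f"{name}: {classify_pair(pair)}")
```

A real caller would cluster many supporting pairs and estimate the expected span from the library itself; a single-pair rule like this only illustrates why complex SVs, which mix several signatures at one site, are harder to resolve.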

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Breastfeeding has important health benefits for both mother and child. Breastfed babies are less likely to present with gastric, respiratory and urinary tract infections and allergic diseases, and they are also less likely to become obese in later childhood. Improving breastfeeding initiation has become a national priority, and a national target has been set "to deliver an increase of two percentage points per annum in breastfeeding initiation rate, focusing especially on women from disadvantaged areas". Despite improvements in data quality in previous years, it remains difficult to construct an accurate and reliable picture of variations and trends in breastfeeding in the East Midlands. It is essential that nationally standardised data collection systems are put in place to enable effective and accurate monitoring and evaluation of breastfeeding status at both a local and a national level.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Injury mortality and morbidity among children aged 0-14 varies substantially depending on the child's age, gender, socio-economic group, cultural and/or ethnic group, and where they live. This report describes and seeks to understand these variations and explains why each factor is associated with injury risk. It then highlights how a range of intervention studies have attempted to address these inequalities.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

To provide a novel resource for analysis of the genome of Biomphalaria glabrata, members of the international Biomphalaria glabrata Genome Initiative (biology.unm.edu/biomphalaria-genome.html), working with the Arizona Genomics Institute (AGI) and supported by the National Human Genome Research Institute (NHGRI), produced a high quality bacterial artificial chromosome (BAC) library. The BB02 strain of B. glabrata, a field isolate (Belo Horizonte, Minas Gerais, Brasil) that is susceptible to several strains of Schistosoma mansoni, was selfed for two generations to reduce haplotype diversity in the offspring. High molecular weight DNA was isolated from ovotestes of 40 snails, partially digested with HindIII, and ligated into the pAGIBAC1 vector. The resulting B. glabrata BAC library (BG_BBa) consists of 61824 clones (136.3 kb average insert size) and provides 9.05 × coverage of the 931 Mb genome. Probing with single/low copy number genes from B. glabrata and fingerprinting of selected BAC clones indicated that the BAC library sufficiently represents the gene complement. BAC end sequence data (514 reads, 299860 nt) indicated that the genome of B. glabrata contains ~ 63% AT, and disclosed several novel genes, transposable elements, and groups of high frequency sequence elements. This BG_BBa BAC library, available from AGI at cost to the research community, gains in relevance because the BB02 strain of B. glabrata is targeted for whole genome sequencing by NHGRI.
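The coverage figure in this abstract follows directly from the clone count, the average insert size and the genome size, and the short sketch below reproduces that arithmetic. The chance that any given locus is represented is an addition not made in the abstract: it uses the classic Clarke-Carbon approximation and assumes clones are sampled uniformly at random, which real libraries only approximate. Variable names are illustrative.

```python
# Figures quoted in the abstract.
clones = 61_824
mean_insert_kb = 136.3
genome_mb = 931.0
bac_end_reads = 514
bac_end_nt = 299_860

# Fold coverage = total cloned DNA / genome size.
total_insert_mb = clones * mean_insert_kb / 1000.0
coverage = total_insert_mb / genome_mb

# Clarke-Carbon approximation (an assumption of this sketch):
# probability that a given locus is present in at least one clone.
fraction_per_clone = (mean_insert_kb / 1000.0) / genome_mb
p_locus_covered = 1.0 - (1.0 - fraction_per_clone) ** clones

# Average BAC end read length implied by the quoted totals.
mean_read_nt = bac_end_nt / bac_end_reads

print(f"total cloned DNA: {total_insert_mb:.1f} Mb")
print(f"fold coverage: {coverage:.2f}x")
print(f"P(locus covered): {p_locus_covered:.5f}")
print(f"mean BAC end read length: {mean_read_nt:.1f} nt")
```

The computed fold coverage rounds to the 9.05 × stated in the abstract, which confirms that the three quoted figures are mutually consistent.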