935 results for Complete Genome Sequence


Relevance: 30.00%

Abstract:

Four related cows showed hairless streaks on various parts of the body with no correlation to the pigmentation pattern. The stripes occurred in a consistent pattern resembling the lines of Blaschko. The non-syndromic hairlessness phenotype occurred across three generations of a single family and was compatible with an X-linked mode of inheritance. Linkage analysis and subsequent whole genome sequencing of one affected female identified two perfectly associated non-synonymous sequence variants in the critical interval on bovine chromosome X. Both variants occurred in complete linkage disequilibrium and were absent in more than 3900 controls. An ERCC6L missense mutation was predicted to cause an amino acid substitution of a non-conserved residue. Analysis in mice showed no specific Ercc6l expression pattern related to hair follicle development, and therefore ERCC6L was not considered the causative gene. A point mutation at the 5'-splice junction of exon 5 of the TSR2 gene (TSR2, 20S rRNA accumulation, homolog (S. cerevisiae)) led to the production of two mutant transcripts, both of which contain a frameshift and generate a premature stop codon predicted to truncate approximately 25% of the protein. Interestingly, in addition to the presence of both physiological TSR2 transcripts, the two mutant transcripts were predominantly detected in the hairless skin of the affected cows. Immunohistochemistry using an antibody against the N-terminal part of the bovine protein demonstrated the specific expression of the TSR2 protein in the skin and the hair of the affected and the control cows as well as in bovine fetal skin and hair. RNA in situ hybridization showed that Tsr2 was expressed in pre- and post-natal phases of hair follicle development in mice. Mammalian TSR2 proteins are highly conserved and are known to be broadly expressed, but their precise in vivo functions are poorly understood. Thus, by dissecting a naturally occurring mutation in a domestic animal species, we identified TSR2 as a regulator of hair follicle development.

Relevance: 30.00%

Abstract:

Hypothyroidism is a complex clinical condition found in both humans and dogs, thought to be caused by a combination of genetic and environmental factors. In this study we present a multi-breed analysis of predisposing genetic risk factors for hypothyroidism in dogs using three high-risk breeds: the Gordon Setter, the Hovawart and the Rhodesian Ridgeback. Using a genome-wide association approach and meta-analysis, we identified a major hypothyroidism risk locus shared by these breeds on chromosome 12 (p = 2.1 × 10⁻¹¹). Further characterisation of the candidate region revealed a shared ~167 kb risk haplotype (4,915,018-5,081,823 bp), tagged by two SNPs in almost complete linkage disequilibrium. This breed-shared risk haplotype includes three genes (LHFPL5, SRPK1 and SLC26A8) and does not extend to the dog leukocyte antigen (DLA) class II gene cluster located in the vicinity. These three genes have not previously been identified as candidate genes for hypothyroid disease, but they have functions that could contribute to the development of the disease. Our results implicate novel genes and pathways in the development of canine hypothyroidism, raising new possibilities for screening, breeding programmes and treatments in dogs. This study may also contribute to our understanding of the genetic etiology of human hypothyroid disease, which is one of the most common endocrine disorders in humans.

Relevance: 30.00%

Abstract:

Background: Simple Sequence Repeats (SSRs) are widely used in population genetic studies, but their classical development is costly and time-consuming. The ever-increasing DNA datasets generated by high-throughput techniques offer an inexpensive alternative for SSR discovery. Expressed Sequence Tags (ESTs) have been widely used as an SSR source for plants of economic relevance, but their application to non-model species is still modest.

Methods: Here, we explored the use of publicly available ESTs (GenBank at the National Center for Biotechnology Information, NCBI) for SSR development in non-model plants, focusing on genera listed by the International Union for the Conservation of Nature (IUCN). We also searched two model genera with fully annotated genomes, Arabidopsis and Oryza, for EST-SSRs and used them as controls for genome distribution analyses. Overall, we downloaded 16 031 555 sequences for 258 plant genera, which were mined for SSRs and their primers with the help of QDD1. Genome distribution analyses in Oryza and Arabidopsis were done by blasting the SSR-containing sequences against the Oryza sativa and Arabidopsis thaliana reference genomes using the Basic Local Alignment Search Tool (BLAST) on the NCBI website. Finally, we performed an empirical test to determine the performance of our EST-SSRs in a few individuals from four species of two eudicot genera, Trifolium and Centaurea.

Results: We explored a total of 14 498 726 EST sequences from the dbEST database (NCBI) in 257 plant genera from the IUCN Red List. We identified a very large number (17 102) of ready-to-test EST-SSRs in most plant genera (193) at no cost. Overall, dinucleotide and trinucleotide repeats were the prevalent types, but the abundance of the various repeat types differed between taxonomic groups. Control genomes revealed that trinucleotide repeats were mostly located in coding regions, while dinucleotide repeats were largely associated with untranslated regions. Our results from the empirical test revealed considerable amplification success and transferability between congenerics.

Conclusions: The present work represents the first large-scale study developing SSRs from publicly accessible EST databases in threatened plants. Here we provide a very large number of ready-to-test EST-SSRs (17 102) for 193 genera. The cross-species transferability suggests that the number of possible target species would be large. Since trinucleotide repeats are abundant and mainly linked to exons, they might be useful in evolutionary and conservation studies. Altogether, our study strongly supports the use of EST databases as an extremely affordable and fast alternative for SSR development in threatened plants.
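To make the mining step concrete, the following is a minimal Python sketch of how perfect dinucleotide and trinucleotide repeats could be located in a sequence with regular expressions. It is not the QDD pipeline used in the study: the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are hypothetical choices made only for illustration, and primer design is not covered.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from the study).
MIN_REPEATS = {2: 6, 3: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect
    di- and trinucleotide repeats in a DNA sequence."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # One motif of motif_len bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip homopolymer runs (e.g. AAAAAA), which are mononucleotide repeats.
            if len(set(motif)) == 1:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCGT" + "GAA" * 5 + "TT"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

A real pipeline would also handle imperfect repeats, longer motifs and flanking-sequence quality, which this sketch deliberately leaves out.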

Relevance: 30.00%

Abstract:

Many endoparasitic wasps inject, along with the egg, polydnavirus into their insect hosts, the virus being a prerequisite for successful parasitoid development. The genome of polydnaviruses consists of multiple circular dsDNA molecules of variable size. We show for a 12 kbp segment of the braconid Chelonus inanitus (CiV12) that it is integrated into the wasp genome. This is the first direct demonstration of integration for a bracovirus. PCR data indicated that the integrated form of CiV12 was present in all male and female stages investigated while the excised circular virus DNA only appeared in females after a specific stage in pupal-adult development. The data also indicated that after excision of virus DNA the genomic DNA was rejoined. This has not yet been reported for any polydnavirus. Sequence analyses in the junction regions revealed the presence of an imperfect consensus sequence of 15 nucleotides in CiV12, in each terminus of the integrated virus DNA and in the rejoined genomic DNA. Within these repeats two sequence types (ATA, TAC) were observed in the various virus clones and in the clones encompassing the rejoined genomic DNA; they corresponded to the sequence type in the right and left junction, respectively. To explain this, we propose a model of virus DNA replication in which the genomic DNA is folded to juxtapose the direct repeat of the left with that of the right junction; recombination at specific sites would then yield the two types of virus and rejoined genomic DNA.

Relevance: 30.00%

Abstract:

Over 250 Mendelian traits and disorders caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have a major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10-20 cases and 10-20 controls are available. However, identifying the genetic variant in associated regions requires fine-mapping and targeted re-sequencing. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof of concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrländer dogs. To our knowledge, this is the first time that this family-trio WGS approach has successfully been used to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrländer dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Functional annotation of the coding sequence differences and prediction of their functional effect with the software packages ANNOVAR and SIFT resulted in seven candidate variants located in six different genes. Of these, only FAM83G:c.155G>C (p.R52P) was found to be concordant in eight additional cases and 16 healthy Kromfohrländer dogs.
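The core segregation filter described above (homozygous in the affected offspring, heterozygous in both healthy parents) can be sketched in a few lines of Python. This is only an illustration of the filtering logic under assumed inputs: the record layout with `child`, `sire` and `dam` keys, the allele coding (0 = reference, 1 = alternate) and the function names are hypothetical, and the study's additional quality, coding-region and ANNOVAR/SIFT filters are not reproduced.

```python
def is_recessive_candidate(child_gt, sire_gt, dam_gt):
    """True if the child is homozygous for the alternate allele and
    both parents are heterozygous carriers (0 = reference, 1 = alternate)."""
    child_hom_alt = tuple(child_gt) == (1, 1)
    parents_het = sorted(sire_gt) == [0, 1] and sorted(dam_gt) == [0, 1]
    return child_hom_alt and parents_het


def filter_trio(variants):
    """Keep only variants compatible with autosomal recessive inheritance.
    Each variant is a dict with hypothetical keys 'child', 'sire' and 'dam'."""
    return [
        v for v in variants
        if is_recessive_candidate(v["child"], v["sire"], v["dam"])
    ]


if __name__ == "__main__":
    calls = [
        {"id": "var1", "child": (1, 1), "sire": (0, 1), "dam": (1, 0)},
        {"id": "var2", "child": (0, 1), "sire": (0, 1), "dam": (0, 0)},
        {"id": "var3", "child": (1, 1), "sire": (1, 1), "dam": (0, 1)},
    ]
    print([v["id"] for v in filter_trio(calls)])
```

In this toy input only `var1` passes, because `var2` is heterozygous in the child and `var3` has a homozygous parent, which would contradict a healthy carrier status.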

Relevance: 30.00%

Abstract:

Current methods for detection of copy number variants (CNV) and aberrations (CNA) from targeted sequencing data are based on the depth of coverage of captured exons. Accurate CNA determination is complicated by uneven genomic distribution and non-uniform capture efficiency of targeted exons. Here we present CopywriteR, which eludes these problems by exploiting 'off-target' sequence reads. CopywriteR allows for extracting uniformly distributed copy number information, can be used without reference, and can be applied to sequencing data obtained from various techniques including chromatin immunoprecipitation and target enrichment on small gene panels. CopywriteR outperforms existing methods and constitutes a widely applicable alternative to available tools.

Relevance: 30.00%

Abstract:

Classical swine fever virus (CSFV) causes a highly contagious disease in pigs that can range from a severe haemorrhagic fever to a nearly inapparent disease, depending on the virulence of the virus strain. Little is known about the viral molecular determinants of CSFV virulence. The nonstructural protein NS4B is essential for viral replication. However, the roles of CSFV NS4B in viral genome replication and pathogenesis have not yet been elucidated. NS4B of the GPE⁻ vaccine strain and of the highly virulent Eystrup strain differ by a total of seven amino acid residues, two of which are located in the predicted trans-membrane domains of NS4B and were described previously to relate to virulence, and five of which cluster in the N-terminal part. In the present study, we examined the potential role of these five amino acids in modulating genome replication and determining pathogenicity in pigs. A chimeric low-virulence GPE⁻-derived virus carrying the complete Eystrup NS4B showed enhanced pathogenicity in pigs. The in vitro replication efficiency of the NS4B chimeric GPE⁻ replicon was significantly higher than that of the replicon carrying only the two Eystrup-specific amino acids in NS4B. In silico and in vitro data suggest that the N-terminal part of NS4B forms an amphipathic α-helix structure. The N-terminal NS4B with these five amino acid residues is associated with the intracellular membranes. Taken together, this is the first gain-of-function study showing that the N-terminal domain of NS4B can determine CSFV genome replication in cell culture and viral pathogenicity in pigs.

Relevance: 30.00%

Abstract:

Around 14 distinct virus species-complexes have been detected in honeybees, each with one or more strains or sub-species. Here we present the initial characterization of an entirely new virus species-complex discovered in honeybee (Apis mellifera L.) and varroa mite (Varroa destructor) samples from Europe and the USA. The virus has a naturally poly-adenylated RNA genome of about 6500 nucleotides with a genome organization and sequence similar to the Tymoviridae (Tymovirales; Tymoviridae), a predominantly plant-infecting virus family. Literature and laboratory analyses indicated that the virus had not previously been described. The virus is very common in French apiaries, mirroring the results from an extensive Belgian survey, but could not be detected in equally-extensive Swedish and Norwegian bee disease surveys. The virus appears to be closely linked to varroa, with the highest prevalence found in varroa samples and a clear seasonal distribution peaking in autumn, coinciding with the natural varroa population development. Sub-genomic RNA analyses show that bees are definite hosts, while varroa is a possible host and likely vector. The tentative name of Bee Macula-like virus (BeeMLV) is therefore proposed. A second, distantly related Tymoviridae-like virus was also discovered in varroa transcriptomes, tentatively named Varroa Tymo-like virus (VTLV).

Relevance: 30.00%

Abstract:

Trypanosomes show an intriguing organization of their mitochondrial DNA into a catenated network, the kinetoplast DNA (kDNA). While more than 30 proteins involved in kDNA replication have been described, only few components of kDNA segregation machinery are currently known. Electron microscopy studies identified a high-order structure, the tripartite attachment complex (TAC), linking the basal body of the flagellum via the mitochondrial membranes to the kDNA. Here we describe TAC102, a novel core component of the TAC, which is essential for proper kDNA segregation during cell division. Loss of TAC102 leads to mitochondrial genome missegregation but has no impact on proper organelle biogenesis and segregation. The protein is present throughout the cell cycle and is assembled into the newly developing TAC only after the pro-basal body has matured indicating a hierarchy in the assembly process. Furthermore, we provide evidence that the TAC is replicated de novo rather than using a semi-conservative mechanism. Lastly, we demonstrate that TAC102 lacks an N-terminal mitochondrial targeting sequence and requires sequences in the C-terminal part of the protein for its proper localization.

Relevance: 30.00%

Abstract:

Ciliates have evolved highly complex and intricately controlled pathways to ensure the precise and complete removal of all genomic sequences not required for vegetative growth. At the same time, they retain a reference copy of all their genetic information for future generations. This chapter describes how different ciliates use RNA-mediated DNA comparison processes to form new somatic nuclei from germline nuclei. While these processes vary in their precise mechanisms, they all use RNA to target genomic DNA sequences—either for retention or elimination. They also all consist of more than one individual pathway acting cooperatively—the two subsets of small RNAs in Paramecium and the guide RNAs and Piwi-interacting RNAs in Oxytricha—to ensure a strong belt-and-braces approach to consistent and precise somatic nucleus development. Nonetheless, this genome comparison approach to somatic nucleus development provides an elegant method for trans-generational environmental adaptation. Conceptually, it is easy to imagine how somatic changes that occur during vegetative growth could be transferred to meiotic offspring, while an unaltered germline genome is retained. Further research in this area will have far-reaching implications for the trans-generational adaptation of more distantly related eukaryotes, such as humans.

Relevance: 30.00%

Abstract:

Historically, morphological features were used as the primary means to classify organisms. However, the age of molecular genetics has allowed us to approach this field from the perspective of the organism's genetic code. Early work used highly conserved sequences, such as ribosomal RNA. The increasing number of complete genomes in the public data repositories provides the opportunity to look not only at a single gene, but at organisms' entire parts list.

Here the Sequence Comparison Index (SCI) and the Organism Comparison Index (OCI), algorithms and methods to compare proteins and proteomes, are presented. The complete proteomes of 104 sequenced organisms were compared. Over 280 million full Smith-Waterman alignments were performed on sequence pairs which had a reasonable expectation of being related. From these alignments a whole proteome phylogenetic tree was constructed. This method was also used to compare the small subunit (SSU) rRNA from each organism, and a tree was constructed from these results. The SSU rRNA tree produced by the SCI/OCI method looks very much like accepted SSU rRNA trees from sources such as the Ribosomal Database Project, thus validating the method. The SCI/OCI proteome tree showed a number of small but significant differences when compared to the SSU rRNA tree and to proteome trees constructed by other methods. Horizontal gene transfer does not appear to affect the SCI/OCI trees until the transferred genes make up a large portion of the proteome.

As part of this work, the Database of Related Local Alignments (DaRLA) was created; it contains over 81 million rows of sequence alignment information. DaRLA, while primarily used to build the whole proteome trees, can also be applied to shared gene content analysis, gene order analysis, and the creation of individual protein trees.

Finally, the standard BLAST method for analyzing shared gene content was compared to the SCI method using 4 spirochetes. The SCI system performed flawlessly, finding all proteins from one organism against itself and finding all the ribosomal proteins between organisms. The BLAST system missed some proteins from its respective organism and failed to detect small ribosomal proteins between organisms.
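For readers unfamiliar with the Smith-Waterman alignments that underlie the comparison, the following minimal Python sketch computes the best local alignment score between two sequences. It assumes a simple match/mismatch scheme with a linear gap penalty; the scoring values and the function name are illustrative assumptions, not the parameters of the SCI method, and protein comparisons in practice normally use a substitution matrix and affine gap penalties.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between sequences a and b
    (illustrative scoring: linear gap penalty, no substitution matrix)."""
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("GGACGTCC", "ACGT"))
```

Only two rows of the matrix are kept because the sketch reports the score alone; recovering the aligned residues would require the full matrix and a traceback.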

Relevance: 30.00%

Abstract:

The genomes of Fusobacterium nucleatum subspecies polymorphum strain ATCC 10953, Rickettsia typhi strain Wilmington, and Francisella tularensis subspecies holarctica strain OSU18 were sequenced, annotated, and analyzed. Each genome was then compared to the sequenced genomes of closely related bacteria. The genome of F. nucleatum ATCC 10953 was compared to two additional F. nucleatum subspecies, subspecies nucleatum and subspecies vincentii. This analysis revealed substantial evidence of horizontal gene transfer along with considerable genetic diversity within the species of F. nucleatum. R. typhi was compared to R. prowazekii and R. conorii. This analysis uncovered a hotspot for chromosomal rearrangements in the Spotted Fever Group but not the Typhus Group Rickettsia and revealed the close genetic relationship between the Typhus Group rickettsial species. F. tularensis OSU18 was compared to two additional F. tularensis strains. These comparisons uncovered significant chromosomal rearrangements between F. tularensis subspecies due to recombination between insertion sequence elements.

Relevance: 30.00%

Abstract:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences in genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously raises the problem of multiple testing and will give false-positive results. Although this problem can be dealt with effectively through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of joint effects of several genes, each with a weak effect, might not be detectable. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset in large data sets where the number of feature SNPs far exceeds the number of observations.

In this study, we took two steps to achieve the goal. First, we selected 1000 SNPs through an effective filter method, and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. We also developed a novel classification method, the sequential information bottleneck (sIB) method, wrapped inside different search algorithms, to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with classical linear discriminant analysis (LDA) in terms of classification performance. Finally, we performed chi-square tests to look at the relationship between each SNP and disease from another point of view.

In general, our results show that filtering features using the harmonic mean of sensitivity and specificity (HMSS) through LDA is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that an exhaustive search of small subsets of one, two or three SNPs, based on the best 100 composite 2-SNPs, can find an optimal subset, and that further inclusion of more SNPs through a heuristic algorithm does not always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent the nesting effect of forward selection, it does not always outperform the latter because of overfitting from observing more complex subset states.

Our results also indicate that HMSS, as a criterion to evaluate the classification ability of a function, can be used on imbalanced data without modifying the original dataset, in contrast to classification accuracy. Our four studies suggest that sIB, a new unsupervised technique, can be adopted to predict the outcome, and that its ability to detect the target status is superior to that of traditional LDA in the study.

From our results, the best test probability (HMSS) for predicting CVD, stroke, CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275, respectively, in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436, respectively, in the four studies if the test accuracy among controls is required to be at least 0.4.

A further genome-wide association study through chi-square tests shows that no significant SNPs are detected at the cut-off level of 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC detect only two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis, most of the top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through the chi-square test at the cut-off value of 1.11E-07.

Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results (95% confidence intervals or statistical tests of differences) require more cost-effective methods or a more efficient computing system, neither of which is currently available for our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability, and that SNPs with good discriminant power are not necessarily causal markers for the disease.
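The point about HMSS on imbalanced data can be illustrated with a short Python sketch. It assumes the standard definition of a harmonic mean, HMSS = 2 · Se · Sp / (Se + Sp), computed from confusion-matrix counts; the function names and the example counts are hypothetical and are not data from the study.

```python
def hmss(tp, fn, tn, fp):
    """Harmonic mean of sensitivity and specificity from confusion-matrix counts."""
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    if sensitivity + specificity == 0:
        return 0.0
    return 2 * sensitivity * specificity / (sensitivity + specificity)


def accuracy(tp, fn, tn, fp):
    """Plain classification accuracy, for comparison."""
    total = tp + fn + tn + fp
    return (tp + tn) / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical imbalanced data: 10 cases, 990 controls, and a classifier
    # that labels every subject as a control.
    print(accuracy(tp=0, fn=10, tn=990, fp=0))  # high despite missing every case
    print(hmss(tp=0, fn=10, tn=990, fp=0))      # zero, exposing the failure
```

Because the harmonic mean is dominated by the smaller of its two inputs, a classifier cannot score well on HMSS by predicting only the majority class, which is why no resampling of the dataset is needed.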

Relevance: 30.00%

Abstract:

The creation, preservation, and degeneration of cis-regulatory elements controlling developmental gene expression are fundamental genome-level evolutionary processes about which little is known. In this study, critical differences in cis-regulatory elements controlling the expression of the sea urchin aboral ectoderm-specific spec genes were identified and explored. In genomes of species within the Strongylocentrotidae family, multiple copies of a repetitive sequence element termed RSR were present, but RSRs were not detected in genomes of species outside Strongylocentrotidae. RSRs are invariably associated with spec genes, and in Strongylocentrotus purpuratus, the spec2a RSR functioned as a transcriptional enhancer displaying greater activity than RSRs from the spec1 or spec2c paralogs. Single base-pair differences at two cis-regulatory elements within the spec2a RSR greatly increased the binding affinities of four transcription factors: SpCCAAT-binding factor at one element and SpOtx, SpGoosecoid, and SpGATA-E at another. The cis-regulatory elements to which SpCCAAT-binding factor, SpOtx, SpGoosecoid, and SpGATA-E bound were recent evolutionary acquisitions that could act either to activate or repress transcription, depending on the cell type. These elements were found in the spec2a RSR ortholog in Strongylocentrotus pallidus but not in the RSR orthologs of Strongylocentrotus droebachiensis or Hemicentrotus pulcherrimus. These results indicate that spec genes exhibit a dynamic pattern of cis-regulatory element evolution while stabilizing selection preserves their aboral ectoderm expression domain.

Relevance: 30.00%

Abstract:

The seed is the organ that ensures the propagation and evolutionary continuity of spermatophyte plants, and it is an essential component of human food and animal feed. During maturation, cereal seeds accumulate reserves in the endosperm, mainly starch and storage proteins. Upon germination, these reserves are degraded by hydrolases synthesized in the aleurone cells in response to gibberellins (GA), providing energy, carbon and nitrogen to the emerging seedling until it becomes photosynthetically active. Both phases of seed development are controlled by a network of transcription factors (TFs) that bind conserved cis- motifs in the promoters of their target genes. TFs are proteins that have played a central role during evolution and domestication, and they constitute one of the most important regulatory mechanisms of gene expression; around 7% of genes in plant genomes encode TFs. TFs are classified into families according to their DNA-binding motif. The DOF (DNA binding with One Finger) family is involved in vital processes specific to higher plants and their close ancestors (algae, mosses and ferns). Several DOF proteins have been described that play important roles in the regulation of gene expression in seeds of the Triticeae tribe (Pooideae subfamily). Brachypodium distachyon is the first member of the Pooideae subfamily whose genome (272 Mbp) has been sequenced. Its small and compactly structured genome, short life cycle, small plant size and the possibility of being transformed with Agrobacterium tumefaciens (Ti plasmid) make Brachypodium the model system for comparative studies of Triticeae cereals of great agronomic importance worldwide, such as wheat and barley.

In this study, 27 Dof genes were identified in the genome of B. distachyon, and the evolutionary relationships among these Dof genes and those from barley (Pooideae subfamily) and rice (Oryzoideae subfamily) were established by building a phylogenetic tree based on the multiple alignment of the DOF DNA-binding domains. The barley genome (Hordeum vulgare) contains 26 Dof genes, and 30 have been annotated in rice (Oryza sativa). The phylogenetic analysis establishes four Major Clusters of Orthologous Genes (MCOGs), which are supported by additional conserved motifs outside the DOF domain shared among proteins of the same MCOG. The global expression study of BdDof genes in different organs and tissues classifies them into two groups; nine of the 27 BdDof genes are abundantly or preferentially expressed in seeds. A more detailed expression analysis of these genes during seed maturation and germination shows that BdDof24, the putative ortholog of barley BPBF-HvDOF24, is the most abundantly expressed gene in germinating seeds of B. distachyon.

Studies of the transcriptional regulation of genes that encode hydrolases in aleurone cells during post-germination of cereal seeds have identified in their promoters a conserved tripartite cis- motif, GARC (GA-Responsive Complex), that binds TFs of the MYB-R2R3, DOF and MYBR1-SHAQKYF classes. In this thesis, the BdCathB gene of Brachypodium, which encodes a cathepsin B-like protease and is orthologous to the wheat Al21 and barley HvCathB genes, was characterized, together with the TFs responsible for its transcriptional regulation, BdDOF24 and BdGAMYB (ortholog of HvGAMYB). In silico analysis of the BdCathB promoter identified a GARC motif conserved in position and sequence with its orthologs in wheat and barley. BdCathB expression is induced upon germination, as is that of the BdDof24 and BdGamyb genes. Moreover, BdDOF24 and BdGAMYB interact in yeast (Yeast 2 Hybrid System, Y2HS) and in planta (Bimolecular Fluorescence Complementation, BiFC). In transient assays in barley aleurone layers, BdGAMYB activates the BdCathB promoter, whereas BdDOF24 represses it; this result is similar to that obtained with the orthologous barley TFs BPBF-HvDOF24 and HvGAMYB. However, when aleurone cells are simultaneously transformed with both TFs, BdDOF24 has an additive effect on the trans-activation mediated by BdGAMYB, while its ortholog BPBF-HvDOF24 produces the opposite effect, reducing the HvGAMYB activation of the BdCathB promoter. The differences between the deduced protein sequences of BdDOF24 and BPBF-HvDOF24 could explain their opposite functions in the interaction with GAMYB. Preliminary results with T-DNA insertion (knock-out) lines and stable over-expression lines of BdGamyb support the data obtained in transient expression assays. In addition, the homozygous BdGamyb T-DNA insertion (knock-out) lines show anther and pollen alterations and do not produce viable seeds.