962 resultados para Clustering a large document collection
Resumo:
The paper explores an efficiency hypothesis regarding the contractual process between large retailers, such as Wal-Mart and Carrefour, and their suppliers. The empirical evidence presented supports the idea that large retailers play a quasi-judicial role, acting as "courts of first instance" in their relationships with suppliers. In this role, large retailers adjust the terms of trade to on-going changes and sanction performance failures, sometimes delaying payments. A potential abuse of their position is limited by the need for re-contracting and preserving their reputations. Suppliers renew their confidence in their retailers on a yearly basis, through writing new contracts. This renovation contradicts the alternative hypothesis that suppliers are expropriated by large retailers as a consequence of specific investments.
Resumo:
A major challenge in community ecology is a thorough understanding of the processes that govern the assembly and composition of communities in time and space. The growing threat of climate change to the vascular plant biodiversity of fragile ecosystems such as mountains has made it equally imperative to develop comprehensive methodologies to provide insights into how communities are assembled. In this perspective, the primary objective of this PhD thesis is to contribute to the theoretical and methodological development of community ecology, by proposing new solutions to better detect the ecological and evolutionary processes that govern community assembly. As phylogenetic trees provide by far, the most advanced tools to integrate the spatial, ecological and evolutionary dynamics of plant communities, they represent the cornerstone on which this work was based. In this thesis, I proposed new solutions to: (i) reveal trends in community assembly on phylogenies, depicted by the transition of signals at the nodes of the different species and lineages responsible for community assembly, (ii) contribute to evidence the importance of evolutionarily labile traits in the distribution of mountain plant species. More precisely, I demonstrated that phylogenetic and functional compositional turnover in plant communities was driven by climate and human land use gradients mostly influenced by evolutionarily labile traits, (iii) predict and spatially project the phylogenetic structure of communities using species distribution models, to identify the potential distribution of phylogenetic diversity, as well as areas of high evolutionary potential along elevation. The altitudinal setting of the Diablerets mountains (Switzerland) provided an appropriate model for this study. The elevation gradient served as a compression of large latitudinal variations similar to a collection of islands within a single area, and allowed investigations on a large number of plant communities. Overall, this thesis highlights that stochastic and deterministic environmental filtering processes mainly influence the phylogenetic structure of plant communities in mountainous areas. Negative density-dependent processes implied through patterns of phylogenetic overdispersion were only detected at the local scale, whereas environmental filtering implied through phylogenetic clustering was observed at both the regional and local scale. Finally, the integration of indices of phylogenetic community ecology with species distribution models revealed the prospects of providing novel and insightful explanations on the potential distribution of phylogenetic biodiversity in high mountain areas. These results generally demonstrate the usefulness of phylogenies in inferring assembly processes, and are worth considering in the theoretical and methodological development of tools to better understand phylogenetic community structure.
Resumo:
BACKGROUND: Along the chromosome of the obligate intracellular bacteria Protochlamydia amoebophila UWE25, we recently described a genomic island Pam100G. It contains a tra unit likely involved in conjugative DNA transfer and lgrE, a 5.6-kb gene similar to five others of P. amoebophila: lgrA to lgrD, lgrF. We describe here the structure, regulation and evolution of these proteins termed LGRs since encoded by "Large G+C-Rich" genes. RESULTS: No homologs to the whole protein sequence of LGRs were found in other organisms. Phylogenetic analyses suggest that serial duplications producing the six LGRs occurred relatively recently and nucleotide usage analyses show that lgrB, lgrE and lgrF were relocated on the chromosome. The C-terminal part of LGRs is homologous to Leucine-Rich Repeats domains (LRRs). Defined by a cumulative alignment score, the 5 to 18 concatenated octacosapeptidic (28-meric) LRRs of LGRs present all a predicted alpha-helix conformation. Their closest homologs are the 28-residue RI-like LRRs of mammalian NODs and the 24-meres of some Ralstonia and Legionella proteins. Interestingly, lgrE, which is present on Pam100G like the tra operon, exhibits Pfam domains related to DNA metabolism. CONCLUSION: Comparison of the LRRs, enable us to propose a parsimonious evolutionary scenario of these domains driven by adjacent concatenations of LRRs. Our model established on bacterial LRRs can be challenged in eucaryotic proteins carrying less conserved LRRs, such as NOD proteins and Toll-like receptors.
Resumo:
Using comprehensive administrative data on France's single largest financialaid program, this paper provides new evidence on the impact of large-scaleneed-based grant programs on the college enrollment decisions, persistenceand graduation rates of low-income students. We exploit sharp discontinuitiesin the grant eligibility formula to identify the impact of aid on student outcomesat different levels of study. We find that eligibility for an annual cashallowance of 1,500 euros increases college enrollment rates by up to 5 percentagepoints. Moreover, we show that need-based grants have positive effectson student persistence and degree completion.
Resumo:
Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected.
Resumo:
SUMMARY: We present a tool designed for visualization of large-scale genetic and genomic data exemplified by results from genome-wide association studies. This software provides an integrated framework to facilitate the interpretation of SNP association studies in genomic context. Gene annotations can be retrieved from Ensembl, linkage disequilibrium data downloaded from HapMap and custom data imported in BED or WIG format. AssociationViewer integrates functionalities that enable the aggregation or intersection of data tracks. It implements an efficient cache system and allows the display of several, very large-scale genomic datasets. AVAILABILITY: The Java code for AssociationViewer is distributed under the GNU General Public Licence and has been tested on Microsoft Windows XP, MacOSX and GNU/Linux operating systems. It is available from the SourceForge repository. This also includes Java webstart, documentation and example datafiles.
Resumo:
The recent release of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) by the American Psychiatric Association has led to much debate. For this forum article, we asked BMC Medicine Editorial Board members who are experts in the field of psychiatry to discuss their personal views on how the changes in DSM-5 might affect clinical practice in their specific areas of psychiatric medicine. This article discusses the influence the DSM-5 may have on the diagnosis and treatment of autism, trauma-related and stressor-related disorders, obsessive-compulsive and related disorders, mood disorders (including major depression and bipolar disorders), and schizophrenia spectrum disorders.
Resumo:
BACKGROUND: After peripheral nerve injury, spontaneous ectopic activity arising from the peripheral axons plays an important role in inducing central sensitization and neuropathic pain. Recent evidence indicates that activation of spinal cord microglia also contributes to the development of neuropathic pain. In particular, activation of p38 mitogen-activated protein kinase (MAPK) in spinal microglia is required for the development of mechanical allodynia. However, activity-dependent activation of microglia after nerve injury has not been fully addressed. To determine whether spontaneous activity from C- or A-fibers is required for microglial activation, we used resiniferatoxin (RTX) to block the conduction of transient receptor potential vanilloid subtype 1 (TRPV1) positive fibers (mostly C- and Adelta-fibers) and bupivacaine microspheres to block all fibers of the sciatic nerve in rats before spared nerve injury (SNI), and observed spinal microglial changes 2 days later. RESULTS: SNI induced robust mechanical allodynia and p38 activation in spinal microglia. SNI also induced marked cell proliferation in the spinal cord, and all the proliferating cells (BrdU+) were microglia (Iba1+). Bupivacaine induced a complete sensory and motor blockade and also significantly inhibited p38 activation and microglial proliferation in the spinal cord. In contrast, and although it produced an efficient nociceptive block, RTX failed to inhibit p38 activation and microglial proliferation in the spinal cord. CONCLUSION: (1) Blocking peripheral input in TRPV1-positive fibers (presumably C-fibers) is not enough to prevent nerve injury-induced spinal microglial activation. (2) Peripheral input from large myelinated fibers is important for microglial activation. (3) Microglial activation is associated with mechanical allodynia.
Resumo:
α-dystroglycan is a highly O-glycosylated extracellular matrix receptor that is required for anchoring of the basement membrane to the cell surface and for the entry of Old World arenaviruses into cells. Like-acetylglucosaminyltransferase (LARGE) is a key molecule that binds to the N-terminal domain of α-dystroglycan and attaches ligand-binding moieties to phosphorylated O-mannose on α-dystroglycan. Here we show that the LARGE modification required for laminin- and virus-binding occurs on specific Thr residues located at the extreme N terminus of the mucin-like domain of α-dystroglycan. Deletion and mutation analyses demonstrate that the ligand-binding activity of α-dystroglycan is conferred primarily by LARGE modification at Thr-317 and -319, within the highly conserved first 18 amino acids of the mucin-like domain. The importance of these paired residues in laminin-binding and clustering activity on myoblasts and in arenavirus cell entry is confirmed by mutational analysis with full-length dystroglycan. We further demonstrate that a sequence of five amino acids, Thr(317)ProThr(319)ProVal, contains phosphorylated O-glycosylation and, when modified by LARGE is sufficient for laminin-binding. Because the N-terminal region adjacent to the paired Thr residues is removed during posttranslational maturation of dystroglycan, our results demonstrate that the ligand-binding activity resides at the extreme N terminus of mature α-dystroglycan and is crucial for α-dystroglycan to coordinate the assembly of extracellular matrix proteins and to bind arenaviruses on the cell surface.
Resumo:
Arenaviruses are enveloped negative-strand RNA viruses that contain a bi-segmented genome. They are rodent-borne pathogens endemic to the Americas and Africa, with the exception of lymphocytic choriomeningitis virus (LCMV) that is world-wide distributed. The arenaviruses include numerous important human pathogens including the Old World arenavirus Lassa virus (LASV), the causative agent of a severe viral hemorrhagic fever in humans with several hundred thousand infections per year in Africa and thousands of deaths. Viruses are obligatory intracellular parasites, strictly depending on cellular processes and factors to complete their replication cycle. The binding of a virus to target cells is the first step of every viral infection, and is mainly mediated by viral proteins that can directly engage cellular receptors, providing a key determinant for viral tropism. This early step of infection represents a promising target to block the pathogen before it can take control over the host cell. Old World arenaviruses, such as LASV and LCMV, bind to host cells via attachment to their main receptor, dystroglycan (DG), an ubiquitous receptor for extracellular matrix proteins. The engagement of DG by LASV results in a fast internalization and transfer the virus to late endosomal compartment suggesting that the virus binding to DG causes marked changes in the dynamics of the receptor. These events could result in the clustering of the receptor and subsequent induction of signaling that could be modulated by the virus. Recently, numerous findings also suggest the presence of alternative receptor(s) for LASV in absence of the main DG receptor. In my first project, I was interested to investigate the effects of virus-receptor binding on the tyrosine phosphorylation of the cytoplasmic domain of DG and to test if this post-translational modification was crucial for the internalization of the LASV-receptor complex. We found that engagement of cellular DG by a recombinant LCMV expressing the envelope GP of LASV in human epithelial cells induced tyrosine phosphorylation of the cytoplasmic domain of DG. LASV GP binding to DG further resulted in dissociation of the adapter protein utrophin from virus-bound DG. Virus-induced dissociation of utrophin and consequent virus internalization were affected by the broadly specific tyrosine kinase inhibitor genistein. We speculate that the detachment of virus- bound DG from the actin-based cytoskeleton following DG phosphorylation may facilitate subsequent endocytosis of the virus-receptor complex. In the second project, I was interested to characterize the newly indentified LASV alternative receptor Axl in the context of productive arenavirus infection. In a first step, we demonstrated that Axl supports productive infection by rLCMV-LASVGP in a DG-independent manner. In line with previous studies, cell entry of rLCMV-LASVGP via Axl was less efficient when compared to functional DG. Interestingly, Axl-mediated infection showed rapid kinetics similar to DG-dependent entry. Using a panel of inhibitors, we found that Axl-mediated cell entry of rLCMV-LASVGP involved a clathrin-independent pathway that critically depended on actin and dynamin and was sensitive to EIPA but not to PAK inhibitors, compatible with a macropinocytosis-like mechanism of entry. In a next step, we aimed to investigate the molecular mechanism by which rLCMV-LASVGP recognizes Axl. Phosphatidylserine (PS) is the natural ligand of Axl via the adaptor protein Gas6. We detected the presence of PS in the envelope of Old World arenaviruses, suggesting that PS could mediate Axl-virus binding, in a mechanism of apoptotic mimicry already described for other viruses. Whether envelope PS and/or the GP of LASV plays any role in virus entry via Axl is still an open question. The molecular mechanisms underlying host cell-virus interaction are of particular interest to answer basic scientific questions as well as to apply key findings to translational research. Understanding pathogen induced-signaling and its link to invasion of the host cell is of great importance to develop drugs for therapeutic intervention against highly pathogenic viruses like LASV. - Les Arenavirus sont des virus enveloppés à ARN négatifs organisés sous forme de génome bisegmenté. Ils sont véhiculés par les rongeurs et se retrouvent de manière endémique aux Amériques et en Afrique avec l'exception du virus de la chorioméningite lymphocytaire (LCMV) qui lui est distribué mondialement. De nombreux pathogènes humains font parti de la famille des Arenavirus dont le virus de l'Ancien Monde Lassa (LASV), un agent responsable de fièvres hémorragiques sévères chez les humains. Le virus de Lassa cause plusieurs centaines de milliers d'infections par année en Afrique ainsi que des milliers de morts. De manière générale, les virus sont des parasites intracellulaires obligatoires qui dépendent strictement de processus et facteurs cellulaires pour clore leur cycle de réplication. L'attachement d'un virus à sa cellule cible représente la première étape de chaque infection virale et est principalement dirigée par des protéines virales qui interagissent directement avec leur récepteurs cellulaires respectifs fournissant ainsi un indicateur déterminant pour le tropisme d'un virus. Cette première étape de l'infection représente aussi une cible prometteuse pour bloquer le pathogène avant qu'il ne puisse prendre le contrôle de la cellule. Les Arenavirus de l'Ancien Monde comme LASV et LCMV s'attachent à la cellule hôte en se liant à leur récepteur principal, le dystroglycan (DG), un récepteur ubiquitaire pour les protéines de la matrice extracellulaire. La liaison du DG par LASV résulte en une rapide internalisation transférant le virus aux endosomes tardifs suggérant ainsi que l'attachement du virus au DG peut provoquer des changements marqués dans la dynamique moléculaire du récepteur. Ces événements sont susceptibles d'induire un regroupement du récepteur à la surface cellulaire, ainsi qu'une induction subséquente qui pourrait être, par la suite, modulée par le virus. Récemment, plusieurs découvertes suggèrent aussi la présence d'un récepteur alternatif pour LASV en l'absence du récepteur principal, le DG. Concernant mon premier projet, j'étais intéressée à étudier les effets de la liaison virus- récepteur sur la phosphorylation des acides aminés tyrosines se trouvant dans la partie cytoplasmique du DG, le but étant de tester si cette modification post-translationnelle était cruciale pour Γ internalisation du complexe LASV-DG récepteur. Nous avons découvert que l'engagement du récepteur DG par le virus recombinant LCMV, exprimant la glycoprotéine de LASV, dans des cellules épithéliales humaines induit une phosphorylation de résidu(s) tyrosine se situant dans le domaine cytoplasmique du DG. La liaison de la glycoprotéine de LASV au DG induit par la suite la dissociation de la protéine adaptatrice utrophine du complexe virus-DG récepteur. Nous avons observé que cette dissociation de l'utrophine, induite par le virus, ainsi que son internalisation, sont affectées par l'inhibiteur à large spectre des tyrosines kinases, la génistéine. Nous avons donc supposé que le détachement du virus, lié au récepteur DG, du cytosquelette d'actine suite à la phosphorylation du DG faciliterait l'endocytose subséquente du complexe virus-récepteur. Dans le second projet, j'étais intéressée à caractériser le récepteur alternatif Axl qui a été récemment identifié dans le contexte de l'infection productive des Arenavirus. Dans un premier temps, nous avons démontré que le récepteur alternatif Axl permet l'infection des cellules par le virus LCMV recombinant LASV indépendamment du récepteur DG. Conformément aux études publiées précédemment, nous avons pu observer que l'entrée du virus recombinant LASV via Axl est moins efficace que via le récepteur principal DG. De façon intéressante, nous avons aussi remarqué que l'infection autorisée par Axl manifeste une cinétique virale d'entrée similaire à celle observée avec le récepteur DG. Utilisant un éventail de différents inhibiteurs, nous avons trouvé que l'entrée du virus recombinant rLCMV-LASVGP via Axl implique une voie d'entrée indépendante de la clathrine et dépendant de manière critique de l'actine et de la dynamine. Cette nouvelle voie d'entrée est aussi sensible à l'EIPA contrairement aux inhibiteurs PAK indiquant un mécanisme d'entrée compatible avec un mécanisme de macropinocytose. L'étape suivante du projet a été d'investiguer le mécanisme moléculaire par lequel le virus recombinant rLCMV-LASVGP reconnaît le récepteur alternatif Axl. La phosphatidylsérine (PS) se trouve être un ligand naturel pour Axl via la protéine adaptatrice Gas6. Nous avons détecté la présence de PS dans l'enveloppe des Arenavirus du Vieux Monde suggérant que la PS pourrait médier la liaison du virus à Axl dans un mécanisme de mimétisme apoptotique déjà observé et décrit pour d'autres virus. Cependant, il reste encore à déterminer qui de la PS ou de la glycoprotéine de l'enveloppe virale intervient dans le processus d'entrée de LASV via le récepteur alternatif Axl. Les mécanismes moléculaires à la base de l'interaction entre virus et cellule hôte sont d'intérêts particuliers pour répondre aux questions scientifiques de base ainsi que dans l'application de découvertes clés pour la recherche translationnelle. La compréhension de la signalisation induite par les pathogènes ainsi que son lien à l'invasion de la cellule hôte est d'une importance considérable pour le développement de drogues pour l'intervention thérapeutique contre les virus hautement pathogènes comme LASV.
Resumo:
Among the types of remote sensing acquisitions, optical images are certainly one of the most widely relied upon data sources for Earth observation. They provide detailed measurements of the electromagnetic radiation reflected or emitted by each pixel in the scene. Through a process termed supervised land-cover classification, this allows to automatically yet accurately distinguish objects at the surface of our planet. In this respect, when producing a land-cover map of the surveyed area, the availability of training examples representative of each thematic class is crucial for the success of the classification procedure. However, in real applications, due to several constraints on the sample collection process, labeled pixels are usually scarce. When analyzing an image for which those key samples are unavailable, a viable solution consists in resorting to the ground truth data of other previously acquired images. This option is attractive but several factors such as atmospheric, ground and acquisition conditions can cause radiometric differences between the images, hindering therefore the transfer of knowledge from one image to another. The goal of this Thesis is to supply remote sensing image analysts with suitable processing techniques to ensure a robust portability of the classification models across different images. The ultimate purpose is to map the land-cover classes over large spatial and temporal extents with minimal ground information. To overcome, or simply quantify, the observed shifts in the statistical distribution of the spectra of the materials, we study four approaches issued from the field of machine learning. First, we propose a strategy to intelligently sample the image of interest to collect the labels only in correspondence of the most useful pixels. This iterative routine is based on a constant evaluation of the pertinence to the new image of the initial training data actually belonging to a different image. Second, an approach to reduce the radiometric differences among the images by projecting the respective pixels in a common new data space is presented. We analyze a kernel-based feature extraction framework suited for such problems, showing that, after this relative normalization, the cross-image generalization abilities of a classifier are highly increased. Third, we test a new data-driven measure of distance between probability distributions to assess the distortions caused by differences in the acquisition geometry affecting series of multi-angle images. Also, we gauge the portability of classification models through the sequences. In both exercises, the efficacy of classic physically- and statistically-based normalization methods is discussed. Finally, we explore a new family of approaches based on sparse representations of the samples to reciprocally convert the data space of two images. The projection function bridging the images allows a synthesis of new pixels with more similar characteristics ultimately facilitating the land-cover mapping across images.
Resumo:
In groundwater applications, Monte Carlo methods are employed to model the uncertainty on geological parameters. However, their brute-force application becomes computationally prohibitive for highly detailed geological descriptions, complex physical processes, and a large number of realizations. The Distance Kernel Method (DKM) overcomes this issue by clustering the realizations in a multidimensional space based on the flow responses obtained by means of an approximate (computationally cheaper) model; then, the uncertainty is estimated from the exact responses that are computed only for one representative realization per cluster (the medoid). Usually, DKM is employed to decrease the size of the sample of realizations that are considered to estimate the uncertainty. We propose to use the information from the approximate responses for uncertainty quantification. The subset of exact solutions provided by DKM is then employed to construct an error model and correct the potential bias of the approximate model. Two error models are devised that both employ the difference between approximate and exact medoid solutions, but differ in the way medoid errors are interpolated to correct the whole set of realizations. The Local Error Model rests upon the clustering defined by DKM and can be seen as a natural way to account for intra-cluster variability; the Global Error Model employs a linear interpolation of all medoid errors regardless of the cluster to which the single realization belongs. These error models are evaluated for an idealized pollution problem in which the uncertainty of the breakthrough curve needs to be estimated. For this numerical test case, we demonstrate that the error models improve the uncertainty quantification provided by the DKM algorithm and are effective in correcting the bias of the estimate computed solely from the MsFV results. The framework presented here is not specific to the methods considered and can be applied to other combinations of approximate models and techniques to select a subset of realizations
Resumo:
AbstractIn addition to genetic changes affecting the function of gene products, changes in gene expression have been suggested to underlie many or even most of the phenotypic differences among mammals. However, detailed gene expression comparisons were, until recently, restricted to closely related species, owing to technological limitations. Thus, we took advantage of the latest technologies (RNA-Seq) to generate extensive qualitative and quantitative transcriptome data for a unique collection of somatic and germline tissues from representatives of all major mammalian lineages (placental mammals, marsupials and monotremes) and birds, the evolutionary outgroup.In the first major project of my thesis, we performed global comparative analyses of gene expression levels based on these data. Our analyses provided fundamental insights into the dynamics of transcriptome change during mammalian evolution (e.g., the rate of expression change across species, tissues and chromosomes) and allowed the exploration of the functional relevance and phenotypic implications of transcription changes at a genome-wide scale (e.g., we identified numerous potentially selectively driven expression switches).In a second project of my thesis, which was also based on the unique transcriptome data generated in the context of the first project we focused on the evolution of alternative splicing in mammals. Alternative splicing contributes to transcriptome complexity by generating several transcript isoforms from a single gene, which can, thus, perform various functions. To complete the global comparative analysis of gene expression changes, we explored patterns of alternative splicing evolution. This work uncovered several general and unexpected patterns of alternative splicing evolution (e.g., we found that alternative splicing evolves extremely rapidly) as well as a large number of conserved alternative isoforms that may be crucial for the functioning of mammalian organs.Finally, the third and final project of my PhD consisted in analyzing in detail the unique functional and evolutionary properties of the testis by exploring the extent of its transcriptome complexity. This organ was previously shown to evolve rapidly both at the phenotypic and molecular level, apparently because of the specific pressures that act on this organ and are associated with its reproductive function. Moreover, my analyses of the amniote tissue transcriptome data described above, revealed strikingly widespread transcriptional activity of both functional and nonfunctional genomic elements in the testis compared to the other organs. To elucidate the cellular source and mechanisms underlying this promiscuous transcription in the testis, we generated deep coverage RNA-Seq data for all major testis cell types as well as epigenetic data (DNA and histone methylation) using the mouse as model system. The integration of these complete dataset revealed that meiotic and especially post-meiotic germ cells are the major contributors to the widespread functional and nonfunctional transcriptome complexity of the testis, and that this "promiscuous" spermatogenic transcription is resulting, at least partially, from an overall transcriptionally permissive chromatin state. We hypothesize that this particular open state of the chromatin results from the extensive chromatin remodeling that occurs during spermatogenesis which ultimately leads to the replacement of histones by protamines in the mature spermatozoa. Our results have important functional and evolutionary implications (e.g., regarding new gene birth and testicular gene expression evolution).Generally, these three large-scale projects of my thesis provide complete and massive datasets that constitute valuables resources for further functional and evolutionary analyses of mammalian genomes.
Resumo:
The proportion of population living in or around cites is more important than ever. Urban sprawl and car dependence have taken over the pedestrian-friendly compact city. Environmental problems like air pollution, land waste or noise, and health problems are the result of this still continuing process. The urban planners have to find solutions to these complex problems, and at the same time insure the economic performance of the city and its surroundings. At the same time, an increasing quantity of socio-economic and environmental data is acquired. In order to get a better understanding of the processes and phenomena taking place in the complex urban environment, these data should be analysed. Numerous methods for modelling and simulating such a system exist and are still under development and can be exploited by the urban geographers for improving our understanding of the urban metabolism. Modern and innovative visualisation techniques help in communicating the results of such models and simulations. This thesis covers several methods for analysis, modelling, simulation and visualisation of problems related to urban geography. The analysis of high dimensional socio-economic data using artificial neural network techniques, especially self-organising maps, is showed using two examples at different scales. The problem of spatiotemporal modelling and data representation is treated and some possible solutions are shown. The simulation of urban dynamics and more specifically the traffic due to commuting to work is illustrated using multi-agent micro-simulation techniques. A section on visualisation methods presents cartograms for transforming the geographic space into a feature space, and the distance circle map, a centre-based map representation particularly useful for urban agglomerations. Some issues on the importance of scale in urban analysis and clustering of urban phenomena are exposed. A new approach on how to define urban areas at different scales is developed, and the link with percolation theory established. Fractal statistics, especially the lacunarity measure, and scale laws are used for characterising urban clusters. In a last section, the population evolution is modelled using a model close to the well-established gravity model. The work covers quite a wide range of methods useful in urban geography. Methods should still be developed further and at the same time find their way into the daily work and decision process of urban planners. La part de personnes vivant dans une région urbaine est plus élevé que jamais et continue à croître. L'étalement urbain et la dépendance automobile ont supplanté la ville compacte adaptée aux piétons. La pollution de l'air, le gaspillage du sol, le bruit, et des problèmes de santé pour les habitants en sont la conséquence. Les urbanistes doivent trouver, ensemble avec toute la société, des solutions à ces problèmes complexes. En même temps, il faut assurer la performance économique de la ville et de sa région. Actuellement, une quantité grandissante de données socio-économiques et environnementales est récoltée. Pour mieux comprendre les processus et phénomènes du système complexe "ville", ces données doivent être traitées et analysées. Des nombreuses méthodes pour modéliser et simuler un tel système existent et sont continuellement en développement. Elles peuvent être exploitées par le géographe urbain pour améliorer sa connaissance du métabolisme urbain. Des techniques modernes et innovatrices de visualisation aident dans la communication des résultats de tels modèles et simulations. Cette thèse décrit plusieurs méthodes permettant d'analyser, de modéliser, de simuler et de visualiser des phénomènes urbains. L'analyse de données socio-économiques à très haute dimension à l'aide de réseaux de neurones artificiels, notamment des cartes auto-organisatrices, est montré à travers deux exemples aux échelles différentes. Le problème de modélisation spatio-temporelle et de représentation des données est discuté et quelques ébauches de solutions esquissées. La simulation de la dynamique urbaine, et plus spécifiquement du trafic automobile engendré par les pendulaires est illustrée à l'aide d'une simulation multi-agents. Une section sur les méthodes de visualisation montre des cartes en anamorphoses permettant de transformer l'espace géographique en espace fonctionnel. Un autre type de carte, les cartes circulaires, est présenté. Ce type de carte est particulièrement utile pour les agglomérations urbaines. Quelques questions liées à l'importance de l'échelle dans l'analyse urbaine sont également discutées. Une nouvelle approche pour définir des clusters urbains à des échelles différentes est développée, et le lien avec la théorie de la percolation est établi. Des statistiques fractales, notamment la lacunarité, sont utilisées pour caractériser ces clusters urbains. L'évolution de la population est modélisée à l'aide d'un modèle proche du modèle gravitaire bien connu. Le travail couvre une large panoplie de méthodes utiles en géographie urbaine. Toutefois, il est toujours nécessaire de développer plus loin ces méthodes et en même temps, elles doivent trouver leur chemin dans la vie quotidienne des urbanistes et planificateurs.