94 results for Geo-referenced database on Recreio dos Bandeirantes
at Université de Lausanne, Switzerland
Abstract:
Mountain ranges are biodiversity hotspots worldwide and provide refuge to many organisms under contemporary climate change. Gathering field information on mountain biodiversity over time is of primary importance to understand the response of biotic communities to climate changes. For plants, several long-term observation sites and networks of mountain biodiversity are emerging worldwide to gather field data and monitor altitudinal range shifts and community composition changes under contemporary climate change. Most of these monitoring sites, however, focus on alpine ecosystems and mountain summits, such as the global observation research initiative in alpine environments (GLORIA). Here we describe the Alps Vegetation Database, a comprehensive community level archive (GIVD ID EU-00-014) which aims at compiling all available geo-referenced vegetation plots from lowland forests to alpine grasslands across the greatest mountain range in Europe: the Alps. This research initiative was funded between 2008 and 2011 by the Danish Council for Independent Research and was part of a larger project to compare cross-scale plant community structure between the Alps and the Scandes. The Alps Vegetation Database currently harbours 35,731 geo-referenced vegetation plots and 5,023 valid taxa across Mediterranean, temperate and alpine environments. The data are mainly used by the main contributors of the Alps Vegetation Database in an ecoinformatics approach to test hypotheses related to plant macroecology and biogeography, but external proposals for joint collaborations are welcome.
Abstract:
The use of Geographic Information Systems has revolutionized the handling and visualization of geo-referenced data and has underlined the critical role of spatial analysis. The usual tools for this purpose are geostatistics, which are widely used in Earth science. Geostatistics rely on several hypotheses that are not always verified in practice. Artificial Neural Networks (ANNs), on the other hand, can a priori be used without special assumptions and are known to be flexible. This paper discusses the application of ANNs to the interpolation of a geo-referenced variable.
Abstract:
We study the effect of civil conflict on social capital, focusing on Uganda's experience during the last decade. Using individual and county-level data, we document large causal effects on trust and ethnic identity of an exogenous outburst of ethnic conflicts in 2002-2005. We exploit two waves of survey data from Afrobarometer (Round 4 Afrobarometer Survey in Uganda, 2000, 2008), including information on socioeconomic characteristics at the individual level, and geo-referenced measures of fighting events from ACLED. Our identification strategy exploits variation in both the spatial and the ethnic intensity of fighting. We find that more intense fighting decreases generalized trust and increases ethnic identity. The effects are quantitatively large and robust to a number of control variables, alternative measures of violence, and different statistical techniques involving ethnic and spatial fixed effects and instrumental variables. Controlling for the intensity of violence during the conflict, we also document that post-conflict economic recovery is slower in ethnically fractionalized counties. Our findings are consistent with the existence of a self-reinforcing process between conflicts and ethnic cleavages.
Abstract:
BACKGROUND: Animal societies are diverse, ranging from small family-based groups to extraordinarily large social networks in which many unrelated individuals interact. At the extreme of this continuum, some ant species form unicolonial populations in which workers and queens can move among multiple interconnected nests without eliciting aggression. Although unicoloniality has been mostly studied in invasive ants, it also occurs in some native non-invasive species. Unicoloniality is commonly associated with very high queen number, which may result in levels of relatedness among nestmates being so low as to raise the question of the maintenance of altruism by kin selection in such systems. However, the actual relatedness among cooperating individuals critically depends on effective dispersal and the ensuing pattern of genetic structuring. In order to better understand the evolution of unicoloniality in native non-invasive ants, we investigated the fine-scale population genetic structure and gene flow in three unicolonial populations of the wood ant F. paralugubris. RESULTS: The analysis of geo-referenced microsatellite genotypes and mitochondrial haplotypes revealed the presence of cryptic clusters of genetically-differentiated nests in the three populations of F. paralugubris. Because of this spatial genetic heterogeneity, members of the same clusters were moderately but significantly related. The comparison of nuclear (microsatellite) and mitochondrial differentiation indicated that effective gene flow was male-biased in all populations. CONCLUSION: The three unicolonial populations exhibited male-biased and mostly local gene flow. The high number of queens per nest, exchanges among neighbouring nests and restricted long-distance gene flow resulted in large clusters of genetically similar nests. 
The positive relatedness among clustermates suggests that kin selection may still contribute to the maintenance of altruism in unicolonial populations if competition occurs among clusters.
Abstract:
The coverage and volume of geo-referenced datasets are extensive and incessantly growing. The systematic capture of geo-referenced information generates large volumes of spatio-temporal data to be analyzed. Clustering and visualization play a key role in the exploratory data analysis and the extraction of knowledge embedded in these data. However, new challenges in visualization and clustering are posed when dealing with the special characteristics of this data. These include complex structures, a large quantity of samples, variables involved in a temporal context, high dimensionality and large variability in cluster shapes.
The central aim of my thesis is to propose new algorithms and methodologies for clustering and visualization, in order to assist the knowledge extraction from spatio-temporal geo-referenced data, thus improving decision-making processes.
I present two original algorithms, one for clustering, the Fuzzy Growing Hierarchical Self-Organizing Networks (FGHSON), and one for exploratory visual data analysis, the Tree-structured Self-organizing Maps Component Planes. In addition, I present methodologies that, combined with FGHSON and the Tree-structured SOM Component Planes, allow the integration of space and time seamlessly and simultaneously in order to extract knowledge embedded in a temporal context.
The originality of the FGHSON lies in its capability to reflect the underlying structure of a dataset in a hierarchical fuzzy way. A hierarchical fuzzy representation of clusters is crucial when data include complex structures with large variability of cluster shapes, variances, densities and number of clusters. The most important characteristics of the FGHSON include: (1) It does not require an a priori setup of the number of clusters. (2) The algorithm executes several self-organizing processes in parallel. Hence, when dealing with large datasets the processes can be distributed, reducing the computational cost. (3) Only three parameters are necessary to set up the algorithm.
In the case of the Tree-structured SOM Component Planes, the novelty of this algorithm lies in its ability to create a structure that allows the visual exploratory data analysis of large high-dimensional datasets. This algorithm creates a hierarchical structure of Self-Organizing Map Component Planes, arranging similar variables' projections in the same branches of the tree. Hence, similarities in variables' behavior can be easily detected (e.g. local correlations, maximal and minimal values and outliers).
Both FGHSON and the Tree-structured SOM Component Planes were applied to several agroecological problems, proving to be very efficient in the exploratory analysis and clustering of spatio-temporal datasets.
In this thesis I also tested three soft competitive learning algorithms. Two of them are well-known unsupervised soft competitive algorithms, namely the Self-Organizing Maps (SOMs) and the Growing Hierarchical Self-Organizing Maps (GHSOMs); the third is our original contribution, the FGHSON. Although the algorithms presented here have been used in several areas, to my knowledge no work applies and compares the performance of those techniques on spatio-temporal geospatial data as is done in this thesis.
I propose original methodologies to explore spatio-temporal geo-referenced datasets through time. Our approach uses time windows to capture temporal similarities and variations by using the FGHSON clustering algorithm. The developed methodologies are used in two case studies. In the first, the objective was to find similar agroecozones through time, and in the second it was to find similar environmental patterns shifted in time.
Several results presented in this thesis have led to new contributions to agroecological knowledge, for instance in sugar cane and blackberry production.
Finally, in the framework of this thesis we developed several software tools: (1) a Matlab toolbox that implements the FGHSON algorithm, and (2) a program called BIS (Bio-inspired Identification of Similar agroecozones), an interactive graphical user interface tool which integrates the FGHSON algorithm with Google Earth in order to show zones with similar agroecological characteristics.
Abstract:
This study aims to assess prevalence and pregnancy outcome for sex chromosome trisomies (SCTs) diagnosed prenatally or in the first year of life. Data held by the European Surveillance of Congenital Anomalies (EUROCAT) database on SCT cases delivered 2000-2005 from 19 population-based registries in 11 European countries covering 2.5 million births were analysed. Cases included were livebirths diagnosed to 1 year of age, fetal deaths from 20 weeks gestation and terminations of pregnancy for fetal anomaly (TOPFA). In all, 465 cases of SCT were diagnosed between 2000 and 2005, a prevalence of 1.88 per 10,000 births (95% CI 1.71-2.06). The prevalences of XXX, XXY and XYY were 0.54 (95% CI 0.46-0.64), 1.04 (95% CI 0.92-1.17) and 0.30 (95% CI 0.24-0.38), respectively. In all, 415 (89%) were prenatally diagnosed and 151 (36%) of these resulted in TOPFA. There was wide country variation in prevalence (0.19-5.36 per 10,000), proportion prenatally diagnosed (50-100%) and proportion of prenatally diagnosed resulting in TOPFA (13-67%). Prevalence of prenatally diagnosed cases was higher in countries with high prenatal detection rates of Down syndrome. The EUROCAT prevalence rate for SCTs diagnosed prenatally or up to 1 year of age represents 12% of the prevalence expected from cytogenetic studies of newborn babies, as the majority of cases are never diagnosed or are diagnosed later in life. There is a wide variation between European countries in prevalence, prenatal detection and TOPFA proportions, related to differences in screening policies as well as organizational and cultural factors.
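The headline figures in this abstract can be roughly reproduced from the counts it quotes. The Python sketch below is only illustrative: it uses the rounded denominator of 2.5 million births, so the rate comes out slightly below the published 1.88 per 10,000, and the normal approximation to the Poisson interval is an assumption made here, not necessarily the method the registry used. The function names are hypothetical.

```python
import math

def rate_per_10k(cases, births):
    """Prevalence expressed per 10,000 births."""
    return 10_000 * cases / births

def approx_poisson_ci(cases, births, z=1.96):
    """Normal approximation to a Poisson interval on the case count
    (an assumption for illustration, not the study's stated method)."""
    half_width = z * math.sqrt(cases)
    return (rate_per_10k(cases - half_width, births),
            rate_per_10k(cases + half_width, births))

# Counts quoted in the abstract; the births figure is rounded there.
cases, births = 465, 2_500_000
prevalence = rate_per_10k(cases, births)
low, high = approx_poisson_ci(cases, births)

# Shares quoted as 89% and 36% in the abstract.
prenatal_share = 100 * 415 / cases
topfa_share = 100 * 151 / 415

print(f"prevalence: {prevalence:.2f} per 10,000 ({low:.2f}-{high:.2f})")
print(f"prenatally diagnosed: {prenatal_share:.0f}%, of which TOPFA: {topfa_share:.0f}%")
```

With the rounded denominator the rate is about 1.86 per 10,000, which is consistent with the published value once the exact number of births is used.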
Abstract:
Forest fire sequences can be modelled as a stochastic point process where events are characterized by their spatial locations and occurrence in time. Cluster analysis permits the detection of the space/time pattern distribution of forest fires. These analyses are useful to assist fire-managers in identifying risk areas, implementing preventive measures and conducting strategies for an efficient distribution of the firefighting resources. This paper aims to identify hot spots in forest fire sequences by means of the space-time scan statistics permutation model (STSSP) and a geographical information system (GIS) for data and results visualization. The scan statistical methodology uses a scanning window, which moves across space and time, detecting local excesses of events in specific areas over a certain period of time. Finally, the statistical significance of each cluster is evaluated through Monte Carlo hypothesis testing. The case study is the forest fires registered by the Forest Service in Canton Ticino (Switzerland) from 1969 to 2008. This dataset consists of geo-referenced single events including the location of the ignition points and additional information. The data were aggregated into three sub-periods (considering important preventive legal dispositions) and two main ignition-causes (lightning and anthropogenic causes). Results revealed that forest fire events in Ticino are mainly clustered in the southern region where most of the population is settled. Our analysis uncovered local hot spots arising from extemporaneous arson activities. Results regarding the naturally-caused fires (lightning fires) disclosed two clusters detected in the northern mountainous area.
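The scanning-window idea described in this abstract can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the authors' implementation: events are reduced to hypothetical (zone, period) labels rather than coordinates and dates, candidate windows are passed in explicitly instead of being generated as cylinders of growing radius, and all function names are invented for the example. Under the space-time permutation model, the expected count in a window is the product of its zone total and period total divided by the overall total, and significance comes from shuffling the time labels.

```python
import math
import random
from collections import Counter

def log_glr(observed, expected, total):
    """Log generalized likelihood ratio of one window (0 when there is no excess)."""
    if observed <= expected:
        return 0.0
    inside = observed * math.log(observed / expected)
    rest = total - observed
    outside = rest * math.log(rest / (total - expected)) if rest > 0 else 0.0
    return inside + outside

def best_window(events, windows):
    """Return (score, window) for the candidate window with the largest excess."""
    total = len(events)
    by_zone = Counter(z for z, _ in events)
    by_period = Counter(t for _, t in events)
    best = (0.0, None)
    for zones, periods in windows:
        observed = sum(1 for z, t in events if z in zones and t in periods)
        # Permutation-model expectation: zone total * period total / grand total.
        expected = (sum(by_zone[z] for z in zones)
                    * sum(by_period[t] for t in periods) / total)
        score = log_glr(observed, expected, total)
        if score > best[0]:
            best = (score, (zones, periods))
    return best

def monte_carlo_p(events, windows, n_sim=999, seed=0):
    """Monte Carlo p-value: shuffle time labels across the fixed locations."""
    rng = random.Random(seed)
    observed_score, _ = best_window(events, windows)
    zones = [z for z, _ in events]
    times = [t for _, t in events]
    hits = 0
    for _ in range(n_sim):
        rng.shuffle(times)
        if best_window(list(zip(zones, times)), windows)[0] >= observed_score:
            hits += 1
    return (hits + 1) / (n_sim + 1)

# Made-up toy data: an excess of events in the "south" zone during period 1.
toy_events = ([("south", 1)] * 8 + [("south", 2)] * 2
              + [("north", 1)] * 2 + [("north", 2)] * 8)
toy_windows = [({"south"}, {1}), ({"north"}, {1})]
toy_score, toy_best = best_window(toy_events, toy_windows)
toy_p = monte_carlo_p(toy_events, toy_windows, n_sim=199)
```

Shuffling only the time labels keeps the spatial and temporal marginals fixed, which is why this model needs no population-at-risk data.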
Abstract:
BACKGROUND: Body mass index (BMI) may cluster in space among adults and be spatially dependent. Whether BMI clusters among children and how age-specific BMI clusters are related remains unknown. We aimed to identify and compare the spatial dependence of BMI in adults and children in a Swiss general population, taking into account the area's income level. METHODS: Geo-referenced data from the Bus Santé study (adults, n=6663) and Geneva School Health Service (children, n=3601) were used. We implemented global (Moran's I) and local (local indicators of spatial association (LISA)) indices of spatial autocorrelation to investigate the spatial dependence of BMI in adults (35-74 years) and children (6-7 years). Weight and height were measured using standardized procedures. Five spatial autocorrelation classes (LISA clusters) were defined including the high-high BMI class (high BMI participant's BMI value correlated with high BMI-neighbors' mean BMI values). The spatial distributions of clusters were compared between adults and children with and without adjustment for area's income level. RESULTS: In both adults and children, BMI was clearly not distributed at random across the State of Geneva. Both adults' and children's BMIs were associated with the mean BMI of their neighborhood. We found that the clusters of higher BMI in adults and children are located in close, yet different, areas of the state. Significant clusters of high versus low BMIs were clearly identified in both adults and children. Area's income level was associated with children's BMI clusters. CONCLUSIONS: BMI clusters show a specific spatial dependence in adults and children from the general population. Using a fine-scale spatial analytic approach, we identified life course-specific clusters that could guide tailored interventions.
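As a minimal sketch of the global index named in the methods above, the following Python computes Moran's I from a list of values and a spatial weight matrix. The inverse-distance weighting helper, the function names and the sample values are illustrative assumptions; the abstract does not state which weights the study used.

```python
import math

def morans_i(values, weights):
    """Global Moran's I for values x_i and spatial weights w_ij (with w_ii = 0)."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)          # sum of all weights
    cross = sum(weights[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    return (n / s0) * (cross / sum(d * d for d in dev))

def inverse_distance_weights(points):
    """Hypothetical weighting choice: w_ij = 1 / distance(i, j), zero on the diagonal."""
    n = len(points)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                w[i][j] = 1.0 / math.dist(points[i], points[j])
    return w

# Made-up example: four residences on a line, BMI rising from west to east.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
bmi = [21.0, 23.0, 27.0, 30.0]
print(morans_i(bmi, inverse_distance_weights(points)))
```

Values above zero indicate that similar BMI values tend to sit near each other, and values below zero indicate that neighbours tend to differ. The local (LISA) indices mentioned in the abstract decompose this sum into one term per participant.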
Abstract:
General Introduction This thesis can be divided into two main parts: the first, corresponding to the first three chapters, studies Rules of Origin (RoOs) in Preferential Trade Agreements (PTAs); the second part, the fourth chapter, is concerned with Anti-Dumping (AD) measures. Despite wide-ranging preferential access granted to developing countries by industrial ones under North-South Trade Agreements, whether reciprocal, like the Europe Agreements (EAs) or NAFTA, or not, such as the GSP, AGOA, or EBA, it has been claimed that the benefits from improved market access keep falling short of the full potential benefits. RoOs are largely regarded as a primary cause of the under-utilization of improved market access of PTAs. RoOs are the rules that determine the eligibility of goods to preferential treatment. Their economic justification is to prevent trade deflection, i.e. to prevent non-preferred exporters from using the tariff preferences. However, they are complex, cost raising and cumbersome, and can be manipulated by organised special interest groups. As a result, RoOs can restrain trade beyond what is needed to prevent trade deflection and hence restrict market access in a statistically significant and quantitatively large proportion. Part I In order to further our understanding of the effects of RoOs in PTAs, the first chapter, written with Prof. Olivier Cadot, Celine Carrère and Prof. Jaime de Melo, describes and evaluates the RoOs governing EU and US PTAs. It draws on utilization-rate data for Mexican exports to the US in 2001 and on similar data for ACP exports to the EU in 2002. The paper makes two contributions. First, we construct an R-index of restrictiveness of RoOs along the lines first proposed by Estevadeordal (2000) for NAFTA, modifying it and extending it for the EU's single-list (SL). This synthetic R-index is then used to compare RoOs under NAFTA and PANEURO. The two main findings of the chapter are as follows.
First, it shows, in the case of PANEURO, that the R-index is useful to summarize how countries are differently affected by the same set of RoOs because of their different export baskets to the EU. Second, it is shown that the R-index is a relatively reliable statistic in the sense that, subject to caveats, after controlling for the extent of tariff preference at the tariff-line level, it accounts for differences in utilization rates at the tariff-line level. Finally, together with utilization rates, the index can be used to estimate total compliance costs of RoOs. The second chapter proposes a reform of preferential RoOs with the aim of making them more transparent and less discriminatory. Such a reform would make preferential blocs more "cross-compatible" and would therefore facilitate cumulation. It would also contribute to move regionalism toward more openness and hence to make it more compatible with the multilateral trading system. It focuses on NAFTA, one of the most restrictive FTAs (see Estevadeordal and Suominen 2006), and proposes a way forward that is close in spirit to what the EU Commission is considering for the PANEURO system. In a nutshell, the idea is to replace the current array of RoOs by a single instrument: Maximum Foreign Content (MFC). An MFC is a conceptually clear and transparent instrument, like a tariff. Therefore changing all instruments into an MFC would bring improved transparency pretty much like the "tariffication" of NTBs. The methodology for this exercise is as follows: In step 1, I estimate the relationship between utilization rates, tariff preferences and RoOs. In step 2, I retrieve the estimates and invert the relationship to get a simulated MFC that gives, line by line, the same utilization rate as the old array of RoOs.
In step 3, I calculate the trade-weighted average of the simulated MFC across all lines to get an overall equivalent of the current system and explore the possibility of setting this unique instrument at a uniform rate across lines. This would have two advantages. First, like a uniform tariff, a uniform MFC would make it difficult for lobbies to manipulate the instrument at the margin. This argument is standard in the political-economy literature and has been used time and again in support of reductions in the variance of tariffs (together with standard welfare considerations). Second, uniformity across lines is the only way to eliminate the indirect source of discrimination alluded to earlier. Only if two countries face uniform RoOs and tariff preference will they face uniform incentives irrespective of their initial export structure. The result of this exercise is striking: the average simulated MFC is 25% of good value, a very low (i.e. restrictive) level, confirming Estevadeordal and Suominen's critical assessment of NAFTA's RoOs. Adopting a uniform MFC would imply a relaxation from the benchmark level for sectors like chemicals or textiles & apparel, and a stiffening for wood products, paper and base metals. Overall, however, the changes are not drastic, suggesting perhaps only moderate resistance to change from special interests. The third chapter of the thesis considers whether the Europe Agreements of the EU, with the current sets of RoOs, could be the potential model for future EU-centered PTAs. First, I have studied and coded at the six-digit level of the Harmonised System (HS) both the old RoOs, used before 1997, and the "Single list" RoOs, used since 1997. Second, using a Constant Elasticity Transformation function where CEEC exporters smoothly mix sales between the EU and the rest of the world by comparing producer prices on each market, I have estimated the trade effects of the EU RoOs.
The estimates suggest that much of the market access conferred by the EAs, outside sensitive sectors, was undone by the cost-raising effects of RoOs. The chapter also contains an analysis of the evolution of the CEECs' trade with the EU from post-communism to accession. Part II The last chapter of the thesis is concerned with anti-dumping, another trade-policy instrument having the effect of reducing market access. In 1995, the Uruguay Round introduced in the Anti-Dumping Agreement (ADA) a mandatory "sunset-review" clause (Article 11.3 ADA) under which anti-dumping measures should be reviewed no later than five years from their imposition and terminated unless there was a serious risk of resumption of injurious dumping. The last chapter, written with Prof. Olivier Cadot and Prof. Jaime de Melo, uses a new database on Anti-Dumping (AD) measures worldwide to assess whether the sunset-review agreement had any effect. The question we address is whether the WTO Agreement succeeded in imposing the discipline of a five-year cycle on AD measures and, ultimately, in curbing their length. Two methods are used: count data analysis and survival analysis. First, using Poisson and Negative Binomial regressions, the count of AD measures' revocations is regressed on (inter alia) the count of "initiations" lagged five years. The analysis yields a coefficient on measures' initiations lagged five years that is larger and more precisely estimated after the agreement than before, suggesting some effect. However, the coefficient estimate is nowhere near the value that would give a one-for-one relationship between initiations and revocations after five years. We also find that (i) if the agreement affected EU AD practices, the effect went the wrong way, the five-year cycle being quantitatively weaker after the agreement than before; (ii) the agreement had no visible effect on the United States except for a one-time peak in 2000, suggesting a mopping-up of old cases.
Second, the survival analysis of AD measures around the world suggests a shortening of their expected lifetime after the agreement, and this shortening effect (a downward shift in the survival function post-agreement) was larger and more significant for measures targeted at WTO members than for those targeted at non-members (for which WTO disciplines do not bind), suggesting that compliance was de jure. A difference-in-differences Cox regression confirms this diagnosis: controlling for the countries imposing the measures, for the investigated countries and for the products' sector, we find a larger increase in the hazard rate of AD measures covered by the Agreement than for other measures.
Abstract:
OBJECTIVES: To conduct a national survey on adolescent health and lifestyles in Georgia and thus to set up a database on adolescents. METHODS: A two-stage cluster sample of around 8000-10000 in-school adolescents aged 15-18 years is being reached through a random selection of classes in Georgia. The sample has been stratified by age, region, type of school and language. A self-administered questionnaire of 87 questions has been developed and translated into the four main languages used in Georgia. RESULTS: Up to June 2004, the researchers have reached 511 classes (9306 pupils). In total, 8039 questionnaires have been considered valid. The main concerns encountered for this survey are linked with acceptance of the survey, cross-cultural issues, political and strategic problems as well as inadequate physical environmental support. CONCLUSION: Despite Georgia's unfavourable economic and political situation, it has been possible to run a national survey on the health of adolescents, according to the usual standards used in the field. This survey should allow for 1) the identification of priorities in the field of health care and health promotion and 2) the monitoring of adolescent health in the future.
Abstract:
Switzerland, the country with the highest health expenditure per capita, is lacking data on trauma care and system planning. Recently, 12 trauma centres were designated to be reassessed through a future national trauma registry by 2015. Lausanne University Hospital launched the first Swiss trauma registry in 2008, which contains the largest database on trauma activity nationwide. METHODS: Prospective analysis of data from consecutively admitted shock room patients from 1 January 2008 to 31 December 2012. Shock room admission is based on physiology and mechanism of injury, assessed by prehospital physicians. Management follows a surgeon-led multidisciplinary approach. Injuries are coded by Association for the Advancement of Automotive Medicine (AAAM) certified coders. RESULTS: Over the 5 years, 1,599 trauma patients were admitted, predominantly males with a median age of 41.4 years and median injury severity score (ISS) of 13. Rate of ISS >15 was 42%. Principal mechanisms of injury were road traffic (40.4%) and falls (34.4%), with 91.5% blunt trauma. Principal patterns were brain (64.4%), chest (59.8%) and extremity/pelvic girdle (52.9%) injuries. Severe (abbreviated injury scale [AIS] score ≥ 3) orthopaedic injuries, defined as extremity and spine injuries together, accounted for 67.1%. Overall, 29.1% underwent immediate intervention, mainly by orthopaedics (27.3%), neurosurgeons (26.3 %) and visceral surgeons (13.9%); 43.8% underwent a surgical intervention within the first 24 hours and 59.1% during their hospitalisation. In-hospital mortality for patients with ISS >15 was 26.2%. CONCLUSION: This is the first 5-year report on trauma in Switzerland. Trauma workload was similar to other European countries. Despite high levels of healthcare, mortality exceeds published rates by >50%. Regardless of the importance of a multidisciplinary approach, trauma remains a surgical disease and needs dedicated surgical resources.
Abstract:
90Y-labelled radiopharmaceuticals offer promising prospects for radionuclide therapies of tumours, e.g. radioimmunotherapies (RIT), (EANM, 2007), peptide receptor radiotherapies (PRRT), (Otte et al., 1998), and selective internal radiotherapies (SIRT), (Salem and Thurston, 2006). 90Y, an almost pure high-energy beta radiation emitter (Eβ,max = 2.28 MeV), is a favourable radionuclide for therapeutic purposes. However, when preparing and performing these therapies, high activities of 90Y (>1 GBq) are to be manipulated and technicians, physicians and nurses may receive high skin exposures to the hands. If radiation protection standards are low, the exposure of staff can exceed the annual skin dose limit of 500 mSv. Within a particular work package (WP4) of the ORAMED project, comprehensive measurements in nuclear medicine departments of several hospitals in 6 European countries were carried out. The study focussed on 90Y-labelled substances such as Zevalin® and DOTATOC to achieve a representative database on staff exposure. This paper summarises the most important results and conclusions for individual monitoring of skin exposure of staff.
Abstract:
The purpose of this study was to assess the safety and efficacy of stenting in upper airway reconstructions for benign laryngotracheal stenosis (LTS) with a newly designed prosthesis, the LT-Mold. The LT-Mold and its proper use during open surgery and endoscopy are described, and the experience gathered from a prospectively collected database on 65 patients treated for complex LTS or severe aspiration is reported. This series is compared to the results of other stenting methods. All patients were available for evaluation. In all but one case, the prosthesis was removed at the end of the study. The new prosthesis did not induce any stent-related trauma to the supraglottis, glottis and subglottis. Before adding a distal round-shaped silicone cap to the LT-Mold, granulation tissue was usually seen at the stent-mucosal interface at the tracheostoma level. In 14 cases, there has been a spontaneous extrusion of the prosthesis through the mouth; this problem was solved by fixing the prosthesis through the reinforced portion of the prosthesis at the cap level and by adding one fixation stitch in the supraglottis. We have to document the loss of the silicone cap in three cases. This problem was resolved by designing a new prototype with an integrated cap, glued with a slow hardening silicone glue. Fifty-four (83%) of 65 patients were decannulated after a mean duration of stenting of 3 months (range 1-12 months). The mean follow-up after decannulation was 23 months (range 1 month to 10 years). The experience gathered with the LT-Mold shows that long-term stenting for complex LTS is safely achieved when the prosthesis is used with its distal integrated silicone cap. The softness and smoothness of the prosthesis with a round-shaped configuration of both extremities help avoid ulceration and granulation tissue formation in the reconstructed airway. Adequate fixation is mandatory to avoid extrusion.
Abstract:
Familial searching consists of searching for a full profile left at a crime scene in a National DNA Database (NDNAD). In this paper we are interested in the circumstance where no full match is returned, but a partial match is found between a database member's profile and the crime stain. Because close relatives share more of their DNA than unrelated persons, this partial match may indicate that the crime stain was left by a close relative of the person with whom the partial match was found. This approach has successfully solved important crimes in the UK and the USA. In a previous paper, a model, which takes into account substructure and siblings, was used to simulate a NDNAD. In this paper, we have used this model to test the usefulness of familial searching and offer guidelines for pre-assessment of the cases based on the likelihood ratio. Siblings of "persons" present in the simulated Swiss NDNAD were created. These profiles (N=10,000) were used as traces and were then compared to the whole database (N=100,000). The statistical results obtained show that the technique has great potential confirming the findings of previous studies. However, effectiveness of the technique is only one part of the story. Familial searching has juridical and ethical aspects that should not be ignored. In Switzerland for example, there are no specific guidelines to the legality or otherwise of familial searching. This article both presents statistical results, and addresses criminological and civil liberties aspects to take into account risks and benefits of familial searching.
Abstract:
Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges, notably at the level of genome annotation. Even if gene predictors and annotation software are increasingly efficient, the ultimate validation still lies in the observation of the predicted gene product(s). Mass-spectrometry-based proteomics provides the necessary high-throughput technology to provide evidence of protein presence and, from the identified sequences, to confirm or invalidate predicted annotations. We review here different strategies used to perform an MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.