986 resultados para Grouped data
Resumo:
The objective of this study was to investigate whether differences in diet and in single-nucleotide polymorphisms (SNPs) found in paraoxonase-1 (PON-1), 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGCR), cholesterol ester transfer protein (CETP) and apolipoprotein E (APOE) genes, are associated with oxidative stress biomarkers and consequently with susceptibility of low-density cholesterol (LDL) to oxidation. A multivariate approach was applied to a group of 55 patients according to three biomarkers: plasma antioxidant activity, malondialdehyde and oxidized LDL (oxLDL) concentrations. Individuals classified in Cluster III showed the worst prognoses in terms of antioxidant activity and oxidative status. Individuals classified in Cluster I presented the lowest oxidative status, while individuals grouped in Cluster II presented the highest levels of antioxidant activity. No difference in nutrient intake was observed among the clusters. Significantly higher gamma- and delta-tocopherol concentrations were observed in those individuals with the highest levels of antioxidant activity. No single linear regression was statistically significant, suggesting that mutant alleles of the SNPs selected did not contribute to the differences observed in oxidative stress response. Although not statistically significant, the p value of the APO E coefficient for oxLDL response was 0.096, indicating that patients who carry the TT allele of the APO E gene tend to present lower plasma oxLDL concentrations. Therefore, the differences in oxidative stress levels observed in this study could not be attributed to diet or to the variant alleles of PON-1, CETP, HMGCR or APO E. This data supports the influence of gamma-tocopherol and delta-tocopherol on antioxidant activity, and highlights the need for further studies investigating APO E alleles and LDL oxidation.
Resumo:
We have investigated the use of hierarchical clustering of flow cytometry data to classify samples of conventional central chondrosarcoma, a malignant cartilage forming tumor of uncertain cellular origin, according to similarities with surface marker profiles of several known cell types. Human primary chondrosarcoma cells, articular chondrocytes, mesenchymal stem cells, fibroblasts, and a panel of tumor cell lines from chondrocytic or epithelial origin were clustered based on the expression profile of eleven surface markers. For clustering, eight hierarchical clustering algorithms, three distance metrics, as well as several approaches for data preprocessing, including multivariate outlier detection, logarithmic transformation, and z-score normalization, were systematically evaluated. By selecting clustering approaches shown to give reproducible results for cluster recovery of known cell types, primary conventional central chondrosacoma cells could be grouped in two main clusters with distinctive marker expression signatures: one group clustering together with mesenchymal stem cells (CD49b-high/CD10-low/CD221-high) and a second group clustering close to fibroblasts (CD49b-low/CD10-high/CD221-low). Hierarchical clustering also revealed substantial differences between primary conventional central chondrosarcoma cells and established chondrosarcoma cell lines, with the latter not only segregating apart from primary tumor cells and normal tissue cells, but clustering together with cell lines from epithelial lineage. Our study provides a foundation for the use of hierarchical clustering applied to flow cytometry data as a powerful tool to classify samples according to marker expression patterns, which could lead to uncover new cancer subtypes.
Resumo:
The purpose of this research is to develop a new statistical method to determine the minimum set of rows (R) in a R x C contingency table of discrete data that explains the dependence of observations. The statistical power of the method will be empirically determined by computer simulation to judge its efficiency over the presently existing methods. The method will be applied to data on DNA fragment length variation at six VNTR loci in over 72 populations from five major racial groups of human (total sample size is over 15,000 individuals; each sample having at least 50 individuals). DNA fragment lengths grouped in bins will form the basis of studying inter-population DNA variation within the racial groups are significant, will provide a rigorous re-binning procedure for forensic computation of DNA profile frequencies that takes into account intra-racial DNA variation among populations. ^
Population genetic and dispersal modeling data for Bathymodiolus mussels from the Mid-Atlantic Ridge
Resumo:
The zip folder comprises a text file and a gzipped tar archive. 1) The text file contains individual genotype data for 90 SNPs, 9 microsatellites and the mitochondrial ND4 gene that were determined in deep-sea hydrothermal vent mussels from the Mid-Atlantic Ridge (genus Bathymodiolus). Mussel specimens are grouped according to the population (pop)/location from which they have been sampled (first column). The remaining columns contain the respective allele/haplotype codes for the different genetic loci (names in the header line). The data file is in CONVERT format and can be directly transformed into different input files for population genetic statistics. 2) The tar archive contains NetCDF files with larval dispersal probabilities for simulated annual larval releases between 1998 and 2007. For each simulated vent location (Menez Gwen, Lucky Strike, Rainbow, Vent 1-10) two NetCDF files are given, one for an assumed pelagic larval duration of 1 year and the other one for an assumed pelagic larval duration of 6 months (6m).
Resumo:
Purpose: To provide for the basis for collecting strength training data using a rigorously validated injury report form. Methods: A group of specialist designed a questionnaire of 45 item grouped into 4 dimensions. Six stages were used to assess face, content, and criterion validity of the weight training injury report form. A 13 members panel assessed the form for face validity, and an expert panel assessed it for content and criterion validity. Panel members were consulted until consensus was reached. A yardstick developed by an expert panel using Intraclass correlation technique was used to assess the reability of the form. Test-retest reliability was assessed with the intraclass correlation coefficient (ICC).The strength training injury report form was developed, and the face, content, and criterion validity successfully assessed. A six step protocol to create a yardstick was also developed to assist in the validation process. Both inter-rater and intra rater reliability results indicated a 98% agreement. Inter-rater reliability agreement of 98% for three injuries. Results: The Cronbach?s alpha of the questionnaire was 0.944 (pmenor que0.01) and the ICC of the entire questionnaire was 0.894 (pmenor que0.01). Conclusion: The questionnaire gathers together enough psychometric properties to be considered a valid and reliable tool for register injury data in strength training, and providing researchers with a basis for future studies in this area. Key Words: data collection; validation; injury prevention; strength training
Resumo:
In this paper, a new method is presented to ensure automatic synchronization of intracardiac ECG data, yielding a three-stage algorithm. We first compute a robust estimate of the derivative of the data to remove low-frequency perturbations. Then we provide a grouped-sparse representation of the data, by means of the Group LASSO, to ensure that all the electrical spikes are simultaneously detected. Finally, a post-processing step, based on a variance analysis, is performed to discard false alarms. Preliminary results on real data for sinus rhythm and atrial fibrillation show the potential of this approach.
Resumo:
Ontology-Based Data Access (OBDA) permite el acceso a diferentes tipos de fuentes de datos (tradicionalmente bases de datos) usando un modelo más abstracto proporcionado por una ontología. La reescritura de consultas (query rewriting) usa una ontología para reescribir una consulta en una consulta reescrita que puede ser evaluada en la fuente de datos. Las consultas reescritas recuperan las respuestas que están implicadas por la combinación de los datos explicitamente almacenados en la fuente de datos, la consulta original y la ontología. Al trabajar sólo sobre las queries, la reescritura de consultas permite OBDA sobre cualquier fuente de datos que puede ser consultada, independientemente de las posibilidades para modificarla. Sin embargo, producir y evaluar las consultas reescritas son procesos costosos que suelen volverse más complejos conforme la expresividad y tamaño de la ontología y las consultas aumentan. En esta tesis exploramos distintas optimizaciones que peuden ser realizadas tanto en el proceso de reescritura como en las consultas reescritas para mejorar la aplicabilidad de OBDA en contextos realistas. Nuestra contribución técnica principal es un sistema de reescritura de consultas que implementa las optimizaciones presentadas en esta tesis. Estas optimizaciones son las contribuciones principales de la tesis y se pueden agrupar en tres grupos diferentes: -optimizaciones que se pueden aplicar al considerar los predicados en la ontología que no están realmente mapeados con las fuentes de datos. -optimizaciones en ingeniería que se pueden aplicar al manejar el proceso de reescritura de consultas en una forma que permite reducir la carga computacional del proceso de generación de consultas reescritas. -optimizaciones que se pueden aplicar al considerar metainformación adicional acerca de las características de la ABox. En esta tesis proporcionamos demostraciones formales acerca de la corrección y completitud de las optimizaciones propuestas, y una evaluación empírica acerca del impacto de estas optimizaciones. Como contribución adicional, parte de este enfoque empírico, proponemos un banco de pruebas (benchmark) para la evaluación de los sistemas de reescritura de consultas. Adicionalmente, proporcionamos algunas directrices para la creación y expansión de esta clase de bancos de pruebas. ABSTRACT Ontology-Based Data Access (OBDA) allows accessing different kinds of data sources (traditionally databases) using a more abstract model provided by an ontology. Query rewriting uses such ontology to rewrite a query into a rewritten query that can be evaluated on the data source. The rewritten queries retrieve the answers that are entailed by the combination of the data explicitly stored in the data source, the original query and the ontology. However, producing and evaluating the rewritten queries are both costly processes that become generally more complex as the expressiveness and size of the ontology and queries increase. In this thesis we explore several optimisations that can be performed both in the rewriting process and in the rewritten queries to improve the applicability of OBDA in real contexts. Our main technical contribution is a query rewriting system that implements the optimisations presented in this thesis. These optimisations are the core contributions of the thesis and can be grouped into three different groups: -optimisations that can be applied when considering the predicates in the ontology that are actually mapped to the data sources. -engineering optimisations that can be applied by handling the process of query rewriting in a way that permits to reduce the computational load of the query generation process. -optimisations that can be applied when considering additional metainformation about the characteristics of the ABox. In this thesis we provide formal proofs for the correctness of the proposed optimisations, and an empirical evaluation about the impact of the optimisations. As an additional contribution, part of this empirical approach, we propose a benchmark for the evaluation of query rewriting systems. We also provide some guidelines for the creation and expansion of this kind of benchmarks.
Resumo:
"GAO-03-176."
Resumo:
Comprehensive published radiocarbon data from selected atmospheric records, tree rings, and recent organic matter were analyzed and grouped into 4 different zones (three for the Northern Hemisphere and one for the whole Southern Hemisphere). These C-14 data for the summer season of each hemisphere were employed to construct zonal, hemispheric, and global data sets for use in regional and global carbon model calculations including calibrating and comparing carbon cycle models. In addition, extended monthly atmospheric C-14 data sets for 4 different zones were compiled for age calibration purposes. This is the first time these data sets were constructed to facilitate the dating of recent organic material using the bomb C-14 curves. The distribution of bomb C-14 reflects the major zones of atmospheric circulation.
Resumo:
The key to the correct application of ANOVA is careful experimental design and matching the correct analysis to that design. The following points should therefore, be considered before designing any experiment: 1. In a single factor design, ensure that the factor is identified as a 'fixed' or 'random effect' factor. 2. In more complex designs, with more than one factor, there may be a mixture of fixed and random effect factors present, so ensure that each factor is clearly identified. 3. Where replicates can be grouped or blocked, the advantages of a randomised blocks design should be considered. There should be evidence, however, that blocking can sufficiently reduce the error variation to counter the loss of DF compared with a randomised design. 4. Where different treatments are applied sequentially to a patient, the advantages of a three-way design in which the different orders of the treatments are included as an 'effect' should be considered. 5. Combining different factors to make a more efficient experiment and to measure possible factor interactions should always be considered. 6. The effect of 'internal replication' should be taken into account in a factorial design in deciding the number of replications to be used. Where possible, each error term of the ANOVA should have at least 15 DF. 7. Consider carefully whether a particular factorial design can be considered to be a split-plot or a repeated measures design. If such a design is appropriate, consider how to continue the analysis bearing in mind the problem of using post hoc tests in this situation.
Data collection of Calanus finmarchicus reproduction life history traits in the North Atlantic Ocean
Resumo:
Observations of egg production rates (EPR) for female Calanus finmarchicus were compared from different regions of the North Atlantic. The regions were diverse in size and sampling frequency, ranging from a fixed time series station in the Lower St Lawrence Estuary, off Rimouski, where nearly 200 experiments were carried out between May and December from 1994 to 2006, to a large-scale survey in the Northern Norwegian Sea, where about 50 experiments were carried out between April and June from 2002 to 2004. For this analysis the stations were grouped mostly along geographic lines, with only limited attention being paid to oceanographic features. There is some overlap between regions, however, where stations were sometimes kept together when they were sampled on the same cruise. As well some stations other than off Rimouski were occupied more than once during different years and/or in different seasons.
Resumo:
Skates (Rajidae) have been commercially exploited in Europe for hundreds of years with some species’ abundances declining dramatically during the twentieth century. In 2009 it became “prohibited for EU vessels to target, retain, tranship or land” certain species in some ICES areas, including the critically endangered common skate and the endangered white skate. To examine compliance with skate bans the official UK landings data for 2011–2014 were analysed. Surprisingly, it was found that after the ban prohibited species were still reported landed in UK ports, including 9.6 t of common skate during 2011–2014. The majority of reported landings of common and white skate were from northern UK waters and landed into northern UK ports. Although past landings could not be validated as being actual prohibited species, the landings’ patterns found reflect known abundance distributions that suggest actual landings were made, rather than sporadic occurrence across ports that would be evident if landings were solely due to systematic misidentification or data entry errors. Nevertheless, misreporting and data entry errors could not be discounted as factors contributing to the recorded landings of prohibited species. These findings raise questions about the efficacy of current systems to police skate landings to ensure prohibited species remain protected. By identifying UK ports with the highest apparent landings of prohibited species and those still landing species grouped as'skates and rays’, these results may aid authorities in allocating limited resources more effectively to reduce landings, misreporting and data errors of prohibited species, and increase species-specific landing compliance.
Resumo:
Skates (Rajidae) have been commercially exploited in Europe for hundreds of years with some species’ abundances declining dramatically during the twentieth century. In 2009 it became “prohibited for EU vessels to target, retain, tranship or land” certain species in some ICES areas, including the critically endangered common skate and the endangered white skate. To examine compliance with skate bans the official UK landings data for 2011–2014 were analysed. Surprisingly, it was found that after the ban prohibited species were still reported landed in UK ports, including 9.6 t of common skate during 2011–2014. The majority of reported landings of common and white skate were from northern UK waters and landed into northern UK ports. Although past landings could not be validated as being actual prohibited species, the landings’ patterns found reflect known abundance distributions that suggest actual landings were made, rather than sporadic occurrence across ports that would be evident if landings were solely due to systematic misidentification or data entry errors. Nevertheless, misreporting and data entry errors could not be discounted as factors contributing to the recorded landings of prohibited species. These findings raise questions about the efficacy of current systems to police skate landings to ensure prohibited species remain protected. By identifying UK ports with the highest apparent landings of prohibited species and those still landing species grouped as'skates and rays’, these results may aid authorities in allocating limited resources more effectively to reduce landings, misreporting and data errors of prohibited species, and increase species-specific landing compliance.