951 results for Statistical hypothesis testing


Relevance:

20.00% 20.00%

Publisher:

Abstract:

New biotechnologies, such as DNA molecular markers, make it possible to characterize the plant genome. Genomic information produced for hundreds or thousands of chromosomal positions allows superior genotypes to be identified in less time than traditional phenotypic selection requires. Most traits of agronomic and economic importance in cultivated plant species are controlled by polygenes that give rise to a phenotype with continuous variation and are strongly affected by the environment. Their inheritance is complex because it results from interaction among genes, on the same or on different chromosomes, and from genotype-by-environment interaction, which makes selection difficult. These biotechnologies produce databases with large amounts of information and complex correlation structures that require specific biometric methods and models for their processing. Statistical models focused on explaining the phenotype from massive genomic information require the estimation of a large number of parameters. No methods within parametric statistics are able to address this problem efficiently. In addition, the models must account for non-additivities (interactions) among gene effects and between these and the environment, which are also difficult to handle from a parametric standpoint. It is hypothesized that the analysis of association between phenotypic traits and molecular genotypes, characterized by abundant genomic information, could be carried out efficiently in the context of semiparametric mixed models and/or nonparametric methods based on machine learning techniques. The aim of this project is to develop new data analysis methods that allow efficient use of massive genomic information in genetic evaluations of agro-biotechnological interest.
The specific objectives include comparing parametric analytical strategies with semiparametric and nonparametric ones with respect to statistical and computational properties. Regression-based approaches to quantitative trait loci analysis will be used under different strategies and scenarios (real and simulated) with different volumes of molecular marker data. In the parametric area, special emphasis will be placed on mixed models, while in the nonparametric area neural network algorithms, support vector machines, multivariate filters, LOESS-type smoothers and recently developed kernel-based methods will be evaluated. The semiparametric proposal will be based on a two-stage analysis strategy aimed at: 1) reducing the dimensionality of the genomic data and 2) modelling the phenotype by introducing only the most significant molecular signals. This work is expected to make available to researchers in our community new tools and analysis procedures that maximize the efficiency with which the resources allocated to the massive capture of genomic data are used, and their application in agro-biotechnological developments.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The aim of this project, framed within the area of analysis methodology in bioengineering and biotechnology applied to the study of cancer, is the analysis and characterization, through statistical models with mixed effects and machine learning techniques, of expression profiles of proteins and genes of the metabolic pathways associated with tumor progression. The study will be carried out using high-throughput technologies. These make it possible to evaluate thousands of genes/proteins simultaneously, thereby generating a large amount of expression data. It is hypothesized that the analysis and interpretation of the underlying information, characterized by its abundance and complexity, could be carried out with efficient statistical-computational techniques in the context of mixed models and machine learning techniques. For the analysis to be effective, the effects of experimental factors unrelated to the biological phenomenon under study must be taken into account. These effects can mask the underlying information, so that information relevant to tumor progression is lost. Identifying these effects will make it possible to obtain, efficiently, the molecular expression profiles that could enable the development of diagnostic methods based on them. This work is expected to make available to researchers in our community tools and analysis procedures that maximize the efficiency with which the resources allocated to the massive capture of genomic/proteomic data are used, and that allow relevant biological information to be extracted for the analysis, classification or prediction of cancer, the design of specific treatments and therapies, and the improvement of detection methods, as well as contributing to the understanding of tumor progression through intensive computational analysis.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In recent decades, the development and use of Geographic Information Systems (GIS) and satellite positioning systems (GPS) have been promoted to improve the productive efficiency of different extensive cropping systems in agronomic, economic and environmental terms. These new technologies make it possible to measure the spatial variability of site properties, such as apparent electrical conductivity and other terrain attributes, as well as their effect on the spatial distribution of yields. Site-specific management can then be applied within fields to improve the efficiency of agrochemical input use, environmental protection and the sustainability of rural life. At present there is a wide range of precision agriculture technologies for capturing spatial variation across sites within a field. Optimal use of the large volume of data generated by precision agriculture machinery depends strongly on the capacity to explore the information on the complex interactions that underlie productive results. The spatial covariation of site properties and crop yield has been studied through classical geostatistical models based on the theory of regionalized variables. New developments in contemporary statistical models, among which linear mixed models stand out, are promising tools for the treatment of spatially correlated data. Moreover, because multiple variables are recorded at each site, multivariate analysis techniques could provide valuable information for the visualization and exploitation of georeferenced data.
Understanding the agronomic basis of the complex interactions that occur at the scale of production fields is now possible with these new technologies. The objectives of the present project are: (i) to develop methodological strategies based on combining multivariate analysis and geostatistical techniques, for classifying within-field sites and studying interdependencies between site variables and yield; (ii) to propose alternative mixed models, based on spatial correlation functions for the error terms, that allow spatial correlation patterns of within-field yields and soil properties to be explored in the delimited sites.
Over the last decades, the use and development of Geographical Information Systems (GIS) and Satellite Positioning Systems (GPS) have been strongly promoted in cropping systems. Such technologies allow measuring the spatial variability of site properties, including electrical conductivity and other soil features, as well as their impact on the spatial variability of yields. Therefore, site-specific management could be applied to improve the efficiency in the use of agrochemicals, environmental protection, and the sustainability of rural life. Currently, there is a wide offer of technological resources to capture spatial variation across sites within a field. However, the optimum use of data coming from precision agriculture machinery strongly depends on the capabilities to explore the information about the complex interactions underlying the productive outputs. The covariation between spatial soil properties and yields from georeferenced data has been treated in a graphical manner or with standard geostatistical approaches. New statistical modeling capabilities from the Mixed Linear Model framework are promising to deal with correlated data such as those produced by precision agriculture.
Moreover, given the multivariate nature of the data collected at each site, several multivariate statistical approaches could be crucial tools for the analysis of georeferenced data. Understanding the basis of complex interactions at the scale of the production field is now within reach with the use of these new techniques. Our main objectives are: (1) to develop new statistical strategies, based on the complementarities of geostatistics and multivariate methods, useful to classify sites within fields grown with grain crops and to analyze the interrelationships of several soil and yield variables; (2) to propose mixed linear models to predict yield according to spatial soil variability and to build contour maps to promote a more sustainable agriculture.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

As digital image processing techniques are increasingly used in a broad range of consumer applications, developers have come to recognise the evaluation of algorithm performance as an area of vital importance. With digital image processing algorithms now playing a greater role in security and protection applications, it is of crucial importance that we are able to study their performance empirically. Apart from the field of biometrics, little emphasis has been put on algorithm performance evaluation until now, and where evaluation has taken place it has been carried out in a somewhat cumbersome and unsystematic fashion, without any standardised approach. This paper presents a comprehensive testing methodology and framework aimed at automating the evaluation of image processing algorithms. Ultimately, the test framework aims to shorten the algorithm development life cycle by helping to identify algorithm performance problems more quickly and efficiently.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The research described in this thesis was developed as part of the Information Management for Green Design (IMAGREE) Project. The IMAGREE Project was funded by Enterprise Ireland under a Strategic Research Grant Scheme as a partnership project between Galway Mayo Institute of Technology and CIMRU, University College Galway. The project aimed to develop a CAD-integrated software tool to support environmental information management for design, particularly for the electronics-manufacturing sector in Ireland.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This is a study of a state-of-the-art implementation of a new computer integrated testing (CIT) facility within a company that designs and manufactures transport refrigeration systems. The aim was to use state-of-the-art hardware, software and planning procedures in the design and implementation of three CIT systems. Typical CIT system components include data acquisition (DAQ) equipment, application and analysis software, communication devices, computer-based instrumentation and computer technology. It is shown that the introduction of computer technology into the area of testing can have a major effect on issues such as efficiency, flexibility, data accuracy, test quality and data integrity. The findings reaffirm that computer integration continues to benefit any organisation, and that with more recent advances in computer technology, communication methods and software capabilities, less expensive and more sophisticated test solutions are now possible. This allows more organisations to benefit from the many advantages associated with CIT. Examples of computer integration test set-ups and the benefits associated with computer integration are discussed.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Results are presented from the analysis of observational data on flash floods in Georgia over a period of 45 years, from 1961 to 2005, provided by the Hydrometeorology Service of Georgia.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The statistical structure of visibility range in Tbilisi is studied for the period from 1980 to 2008. The study uses data from the Hydrometeorological Department of Georgia on the annual number of days with different visibility grades, for observations made at 9, 12 and 15 hours.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Magdeburg, University, Faculty of Mathematics, dissertation, 2011

Relevance:

20.00% 20.00%

Publisher:

Abstract:

ST2 is a member of the interleukin-1 receptor family, and circulating soluble ST2 concentrations are believed to reflect cardiovascular stress and fibrosis. Recent studies have demonstrated soluble ST2 to be a strong predictor of cardiovascular outcomes in both chronic and acute heart failure. It is a new biomarker that meets all required criteria for a useful biomarker. Of note, it adds information to natriuretic peptides (NPs), and some studies have shown it to be even superior in terms of risk stratification. Since the introduction of NPs, this has been the most promising biomarker in the field of heart failure, and it might be particularly useful as a guide to therapy.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: The kinetics of high-sensitivity troponin T (hscTnT) release should be studied in different situations, including functional tests with transient ischemic abnormalities. Objective: To evaluate the release of hscTnT by serial measurements after exercise testing (ET), and to correlate hscTnT elevations with abnormalities suggestive of ischemia. Methods: Patients with acute ST-segment elevation myocardial infarction (STEMI) undergoing primary angioplasty were referred for ET 3 months after infarction. Blood samples were collected to measure hscTnT immediately before ET (baseline, TnT0h) and 2 (TnT2h), 5 (TnT5h), and 8 hours (TnT8h) after ET. The outcomes were peak hscTnT, the TnT5h/TnT0h ratio, and the area under the blood concentration-time curve (AUC) for hscTnT levels. The hscTnT values were log-transformed, and comparisons were expressed as geometric mean ratios with their 95% confidence intervals. Statistical significance was assessed by analysis of covariance, first without adjustment and then adjusted for TnT0h, age and sex, followed by additional variables (metabolic equivalents, maximum heart rate achieved, anterior wall STEMI, and creatinine clearance). Results: This study included 95 patients. The highest geometric means were observed at 5 hours (TnT5h). After adjustments, peak hscTnT, TnT5h/TnT0h and AUC were 59% (p = 0.002), 59% (p = 0.003) and 45% (p = 0.003) higher, respectively, in patients with an abnormal ET as compared to those with normal tests. Conclusion: Higher elevations of hscTnT may occur after an abnormal ET as compared to a normal ET in patients with STEMI.
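
The two summary measures named in this abstract, the area under the concentration-time curve and the geometric mean ratio on log-transformed values, can be illustrated with a short sketch. It rests on assumptions that are not in the abstract: the AUC uses the trapezoidal rule, the comparison is an unadjusted two-group one (the study used analysis of covariance), the interval uses a normal approximation, and the function names and numbers are hypothetical.

```python
import math
from statistics import mean, stdev

def auc_trapezoid(times, values):
    """Area under the concentration-time curve by the trapezoidal rule."""
    points = list(zip(times, values))
    return sum((t1 - t0) * (v0 + v1) / 2
               for (t0, v0), (t1, v1) in zip(points, points[1:]))

def geometric_mean_ratio(group_a, group_b, z=1.96):
    """Ratio of geometric means (a over b) with an approximate 95% interval.

    The difference of mean logs is back-transformed with exp(); the interval
    uses a normal approximation, which is an assumption of this sketch.
    """
    log_a = [math.log(x) for x in group_a]
    log_b = [math.log(x) for x in group_b]
    diff = mean(log_a) - mean(log_b)
    se = math.sqrt(stdev(log_a) ** 2 / len(log_a) + stdev(log_b) ** 2 / len(log_b))
    return math.exp(diff), math.exp(diff - z * se), math.exp(diff + z * se)

# Hypothetical hscTnT values at 0, 2, 5 and 8 hours for one patient.
hours = [0, 2, 5, 8]
patient = [9.0, 10.5, 13.0, 11.0]
print(auc_trapezoid(hours, patient))

# Hypothetical peak values for an abnormal-test and a normal-test group.
abnormal = [18.0, 25.0, 14.0, 30.0, 21.0]
normal = [11.0, 15.0, 9.0, 17.0, 13.0]
ratio, lower, upper = geometric_mean_ratio(abnormal, normal)
print(round(ratio, 2), round(lower, 2), round(upper, 2))
```

A ratio of 1.59 on this scale would correspond to the "59% higher" wording used in the results.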

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The general properties of POISSON distributions and their relations to the binomial distributions are discussed. Two methods of statistical analysis are dealt with in detail. The first is the $\chi^2$-test. In order to carry out the $\chi^2$-test, the mean frequency and the theoretical frequencies for all classes are calculated. Then the observed and the calculated frequencies are compared, using the well-known formula $\chi^2 = \sum \frac{(f_{obs} - f_{esp})^2}{f_{esp}}$. When the expected frequencies are small, one must not forget that the value of $\chi^2$ may only be calculated if the expected frequencies are bigger than 5. If smaller values should occur, the frequencies of neighbouring classes must be pooled. The second test, reintroduced by BRIEGER, consists in comparing the observed and expected standard error of the series. The observed error is calculated by the general formula $\sigma = \pm\sqrt{\frac{\sum f\,(x - m)^2}{n-1}}$, where $x$ is the class value, $m$ the mean frequency and $n$ the number of cases. The theoretical error of a POISSON series with mean frequency $m$ is always $\pm\sqrt{m}$. These two values may be compared either by dividing the observed by the theoretical error and using BRIEGER's tables for $\vartheta$, or by dividing the respective variances and using SNEDECOR's tables for $F$. The number of degrees of freedom for the observed error is one less than the number of cases studied, and that of the theoretical error is always infinite. In carrying out these tests, one important point must never be overlooked. The value for the first class, even if no concrete cases of the type were observed, must always be zero, and the values of the subsequent classes must be 1, 2, 3, etc. This is easily seen in some of the classical experiments. For instance, in BORTKIEWICZ's example of accidents in Prussian army corps, the classes are: no, one, two, etc., accidents. When counting the frequency of bacteria, these values are: no, one, two, etc., bacteria or cultures of bacteria. In studies of plant diseases the frequencies are likewise: no, one, two, etc., diseased plants. However, more complicated cases may occur.
For instance, when analysing the degree of polyembryony, the case of "no polyembryony" frequently corresponds to the occurrence of one embryo per seed. Thus the classes are not: no, one, etc., embryos per seed, but: no additional embryo, one additional embryo, etc., per seed with at least one embryo. Another interesting case was found by BRIEGER in genetic studies on the number of rows in maize. Here the minimum number is of course not: no rows, but: no additional rows beyond eight. The next class is not: nine rows, but: 10 rows, since the row number always varies in pairs of rows. Thus the values of successive classes are: no additional pair of rows beyond 8, one additional pair (or 10 rows), two additional pairs (or 12 rows), etc. The application of the methods is finally illustrated with three examples: the number of seeds per fruit in the oranges "Natal" and "Coco" and in "Calamondin". As shown in the text and the tables, the agreement with a POISSON series is very satisfactory in the first two cases. In the third case BRIEGER's error test indicated a significant reduction of variability, and the $\chi^2$ test showed that there were too many fruits with 4 or 5 seeds and too few with more or fewer seeds. However, the fact that no fruit was found without seed may be taken to indicate that in Calamondin the fruits are not fully parthenocarpic and may develop only with at least one seed. Thus a new analysis was carried out on another class basis. As the value for the first class the following was accepted: no additional seed beyond the indispensable minimum number of one seed, and for the later classes the values were: one, two, etc., additional seeds.
Using this new basis for all calculations, complete agreement between the observed and expected frequencies of the corresponding POISSON series was obtained, thus proving that our hypothesis of the impossibility of obtaining fruits without any seed was correct for Calamondin, while the other two oranges were completely parthenocarpic and fruits without seeds did occur.
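
The two tests described in this abstract can be sketched in a few lines. The sketch is illustrative only: the function names are hypothetical, the pooling rule (merge neighbouring classes from the left, then attach any leftover tail to the last pooled class) is one possible reading of the text, and the example figures are the commonly quoted horse-kick counts for Prussian army corps, which are not given in the abstract itself. The `offset` parameter plays the role of the indispensable minimum, so that a seed count with a minimum of one seed would use `offset=1`.

```python
import math

def poisson_fit(freq, offset=0):
    """Return n, mean frequency m, and observed/expected frequencies per class.

    freq maps a raw count to the number of units showing it; offset is the
    indispensable minimum, so the class value is the raw count minus offset.
    """
    n = sum(freq.values())
    m = sum((x - offset) * f for x, f in freq.items()) / n
    top = max(freq) - offset
    expected = [n * math.exp(-m) * m ** k / math.factorial(k) for k in range(top)]
    expected.append(n - sum(expected))  # last class absorbs the upper tail
    observed = [freq.get(k + offset, 0) for k in range(top + 1)]
    return n, m, observed, expected

def pool_classes(observed, expected, minimum=5.0):
    """Pool neighbouring classes until every expected frequency exceeds minimum."""
    obs_p, exp_p, o_acc, e_acc = [], [], 0.0, 0.0
    for o, e in zip(observed, expected):
        o_acc, e_acc = o_acc + o, e_acc + e
        if e_acc > minimum:
            obs_p.append(o_acc)
            exp_p.append(e_acc)
            o_acc, e_acc = 0.0, 0.0
    if e_acc > 0 and exp_p:  # leftover tail joins the last pooled class
        obs_p[-1] += o_acc
        exp_p[-1] += e_acc
    return obs_p, exp_p

def chi_square(observed, expected):
    """Sum of (observed - expected)^2 / expected over the pooled classes."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def variance_ratio(freq, offset=0):
    """Observed variance (n - 1 degrees of freedom) over the theoretical variance m."""
    n, m, _, _ = poisson_fit(freq, offset)
    s2 = sum(f * (x - offset - m) ** 2 for x, f in freq.items()) / (n - 1)
    return s2 / m

# Commonly quoted horse-kick figures: deaths per corps-year -> number of corps-years.
horse_kicks = {0: 109, 1: 65, 2: 22, 3: 3, 4: 1}
n, m, observed, expected = poisson_fit(horse_kicks)
obs_p, exp_p = pool_classes(observed, expected)
chi2 = chi_square(obs_p, exp_p)
df = len(obs_p) - 2  # one lost for the total, one for the estimated mean
F = variance_ratio(horse_kicks)
print(n, round(m, 2), len(obs_p), df, round(chi2, 3), round(F, 3))
```

The square root of the variance ratio is the ratio of observed to theoretical error, so a value close to 1 points to agreement with a POISSON series, while a value well below 1 signals the kind of reduced variability reported for Calamondin.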