947 resultados para Unconditional maximum likelihood criterion
Resumo:
O objetivo deste trabalho foi avaliar influência da informação de parentesco na seleção de progênies de soja quanto à produtividade e aos teores de óleo e proteína, com base no uso de modelos mistos de predição dos valores genéticos. Novecentas progênies F4:6 e 200 progênies F4:7 de soja foram avaliadas nas safras 2010/2011 e 2011/2012, respectivamente. As progênies foram obtidas de cruzamentos múltiplos a partir de 57 progenitores. Os dados foram analisados por meio de modelos aleatórios (quadrados mínimos) e mistos BLUP/REML ("best linear unbiased prediction/restricted maximum likelihood"). Os maiores valores de ganhos preditos foram obtidos com o BLUP/REML. Os valores genéticos preditos com o método BLUP/REML, sem informação de parentesco, apresentaram alta correlação com aqueles obtidos com o modelo aleatório, além de detectada alta coincidência das progênies selecionadas. A inclusão da matriz de parentesco resultou na seleção de progênies diferentes e em maior acurácia na predição dos valores genéticos.
Resumo:
This paper deals with the goodness of the Gaussian assumption when designing second-order blind estimationmethods in the context of digital communications. The low- andhigh-signal-to-noise ratio (SNR) asymptotic performance of the maximum likelihood estimator—derived assuming Gaussiantransmitted symbols—is compared with the performance of the optimal second-order estimator, which exploits the actualdistribution of the discrete constellation. The asymptotic study concludes that the Gaussian assumption leads to the optimalsecond-order solution if the SNR is very low or if the symbols belong to a multilevel constellation such as quadrature-amplitudemodulation (QAM) or amplitude-phase-shift keying (APSK). On the other hand, the Gaussian assumption can yield importantlosses at high SNR if the transmitted symbols are drawn from a constant modulus constellation such as phase-shift keying (PSK)or continuous-phase modulations (CPM). These conclusions are illustrated for the problem of direction-of-arrival (DOA) estimation of multiple digitally-modulated signals.
Resumo:
This paper analyzes the asymptotic performance of maximum likelihood (ML) channel estimation algorithms in wideband code division multiple access (WCDMA) scenarios. We concentrate on systems with periodic spreading sequences (period larger than or equal to the symbol span) where the transmitted signal contains a code division multiplexed pilot for channel estimation purposes. First, the asymptotic covariances of the training-only, semi-blind conditional maximum likelihood (CML) and semi-blind Gaussian maximum likelihood (GML) channelestimators are derived. Then, these formulas are further simplified assuming randomized spreading and training sequences under the approximation of high spreading factors and high number of codes. The results provide a useful tool to describe the performance of the channel estimators as a function of basicsystem parameters such as number of codes, spreading factors, or traffic to training power ratio.
Resumo:
In this paper, the theory of hidden Markov models (HMM) isapplied to the problem of blind (without training sequences) channel estimationand data detection. Within a HMM framework, the Baum–Welch(BW) identification algorithm is frequently used to find out maximum-likelihood (ML) estimates of the corresponding model. However, such a procedureassumes the model (i.e., the channel response) to be static throughoutthe observation sequence. By means of introducing a parametric model fortime-varying channel responses, a version of the algorithm, which is moreappropriate for mobile channels [time-dependent Baum-Welch (TDBW)] isderived. Aiming to compare algorithm behavior, a set of computer simulationsfor a GSM scenario is provided. Results indicate that, in comparisonto other Baum–Welch (BW) versions of the algorithm, the TDBW approachattains a remarkable enhancement in performance. For that purpose, onlya moderate increase in computational complexity is needed.
Resumo:
This work provides a general framework for the design of second-order blind estimators without adopting anyapproximation about the observation statistics or the a prioridistribution of the parameters. The proposed solution is obtainedminimizing the estimator variance subject to some constraints onthe estimator bias. The resulting optimal estimator is found todepend on the observation fourth-order moments that can be calculatedanalytically from the known signal model. Unfortunately,in most cases, the performance of this estimator is severely limitedby the residual bias inherent to nonlinear estimation problems.To overcome this limitation, the second-order minimum varianceunbiased estimator is deduced from the general solution by assumingaccurate prior information on the vector of parameters.This small-error approximation is adopted to design iterativeestimators or trackers. It is shown that the associated varianceconstitutes the lower bound for the variance of any unbiasedestimator based on the sample covariance matrix.The paper formulation is then applied to track the angle-of-arrival(AoA) of multiple digitally-modulated sources by means ofa uniform linear array. The optimal second-order tracker is comparedwith the classical maximum likelihood (ML) blind methodsthat are shown to be quadratic in the observed data as well. Simulationshave confirmed that the discrete nature of the transmittedsymbols can be exploited to improve considerably the discriminationof near sources in medium-to-high SNR scenarios.
Resumo:
This paper addresses the estimation of the code-phase(pseudorange) and the carrier-phase of the direct signal received from a direct-sequence spread-spectrum satellite transmitter. Thesignal is received by an antenna array in a scenario with interferenceand multipath propagation. These two effects are generallythe limiting error sources in most high-precision positioning applications.A new estimator of the code- and carrier-phases is derivedby using a simplified signal model and the maximum likelihood(ML) principle. The simplified model consists essentially ofgathering all signals, except for the direct one, in a component withunknown spatial correlation. The estimator exploits the knowledgeof the direction-of-arrival of the direct signal and is much simplerthan other estimators derived under more detailed signal models.Moreover, we present an iterative algorithm, that is adequate for apractical implementation and explores an interesting link betweenthe ML estimator and a hybrid beamformer. The mean squarederror and bias of the new estimator are computed for a numberof scenarios and compared with those of other methods. The presentedestimator and the hybrid beamforming outperform the existingtechniques of comparable complexity and attains, in manysituations, the Cramér–Rao lower bound of the problem at hand.
Resumo:
Phylogenetic trees representing the evolutionary relationships of homologous genes are the entry point for many evolutionary analyses. For instance, the use of a phylogenetic tree can aid in the inference of orthology and paralogy relationships, and in the detection of relevant evolutionary events such as gene family expansions and contractions, horizontal gene transfer, recombination or incomplete lineage sorting. Similarly, given the plurality of evolutionary histories among genes encoded in a given genome, there is a need for the combined analysis of genome-wide collections of phylogenetic trees (phylomes). Here, we introduce a new release of PhylomeDB (http://phylomedb.org), a public repository of phylomes. Currently, PhylomeDB hosts 120 public phylomes, comprising >1.5 million maximum likelihood trees and multiple sequence alignments. In the current release, phylogenetic trees are annotated with taxonomic, protein-domain arrangement, functional and evolutionary information. PhylomeDB is also a major source for phylogeny-based predictions of orthology and paralogy, covering >10 million proteins across 1059 sequenced species. Here we describe newly implemented PhylomeDB features, and discuss a benchmark of the orthology predictions provided by the database, the impact of proteome updates and the use of the phylome approach in the analysis of newly sequenced genomes and transcriptomes.
Resumo:
With the increasing availability of various 'omics data, high-quality orthology assignment is crucial for evolutionary and functional genomics studies. We here present the fourth version of the eggNOG database (available at http://eggnog.embl.de) that derives nonsupervised orthologous groups (NOGs) from complete genomes, and then applies a comprehensive characterization and analysis pipeline to the resulting gene families. Compared with the previous version, we have more than tripled the underlying species set to cover 3686 organisms, keeping track with genome project completions while prioritizing the inclusion of high-quality genomes to minimize error propagation from incomplete proteome sets. Major technological advances include (i) a robust and scalable procedure for the identification and inclusion of high-quality genomes, (ii) provision of orthologous groups for 107 different taxonomic levels compared with 41 in eggNOGv3, (iii) identification and annotation of particularly closely related orthologous groups, facilitating analysis of related gene families, (iv) improvements of the clustering and functional annotation approach, (v) adoption of a revised tree building procedure based on the multiple alignments generated during the process and (vi) implementation of quality control procedures throughout the entire pipeline. As in previous versions, eggNOGv4 provides multiple sequence alignments and maximum-likelihood trees, as well as broad functional annotation. Users can access the complete database of orthologous groups via a web interface, as well as through bulk download.
Resumo:
O objetivo deste trabalho foi estimar os parâmetros genéticos e predizer o valor genético de populações e indivíduos oriundos de populações segregantes de trigo, com o uso da metodologia de modelos mistos ("restricted maximum likelihood"/"best linear unbiased prediction", REML/BLUP). Trinta e seis populações segregantes de trigo e quatro controles foram avaliados na geração F3, em delineamento de blocos ao acaso, com informações de indivíduo retiradas de dentro das parcelas. Os caracteres avaliados foram: produção de grãos, índice de colheita, número de perfilhos e altura de planta. Observou-se a existência de variabilidade genética entre populações em todos os caracteres avaliados. A herdabilidade média variou de 39,15 a 92,78%, e a acurácia, de 62,57 a 96,32%, na seleção de populações. A herdabilidade individual no sentido restrito foi baixa dentro das populações, em todos os caracteres. A acurácia na seleção individual apresentou magnitude média, quanto ao caráter altura de plantas, e baixa quanto aos demais caracteres. A herdabilidade individual contribui para maior ganho nos caracteres altura de planta e índice de colheita com o uso do BLUP individual, em comparação ao BLUP de populações. As populações segregantes Embrapa22/BRS207, Embrapa22/VI98053, Embrapa22/IVI01041, BRS254/BRS207, BRS254/VI98053, BRS254/UFVT1Pioneiro e BRS264/BRS207 destacam-se por apresentar valor genético aditivo elevado em dois ou mais caracteres.
Resumo:
El presente trabajo tiene por objetivo principal analizar tres funciones de perfil del fuste sobre tres clones de Populus x euramericana (Canadá Blanco, I-214 y MC) en la Comunidad Foral de Navarra para elaborar una tarifa de cubicación con clasificación de volumen. Para minimizar el efecto de la autocorrelación entre los residuos se emplea una estructura de error continua autorregresiva de orden 2 o de orden 3 en función del clon analizado. Por otra parte, se compara el coeficiente local de forma de cada uno de los clones estudiados mediante dos metodologías: el análisis de la varianza de la estimación individual de dicho coeficiente y el contraste del estadístico de máxima verosimilitud entre ajustes, resultando ser el clon Canadá el más cónico de los tres. Los datos utilizados provienen de 143 chopos de plantaciones coetáneas y con mismo marco de plantación (marco real de 4,5 × 4,5 m).
Resumo:
El presente estudio se enmarca en el proyecto europeo SIBERIA. Trata de explorar el uso de imágenes radar de satélite (ERS y JERS) para la actualización de la cartografía de vegetación de zonas boreales. Se dispone de 8 imágenes de amplitud y coherencia tomadas en 1998, así como de un inventario de vegetación georreferenciado de dos pequeñas zonas. Se proponen tres tipos de clasificaciones supervisadas por el método de máxima verosimilitud. La primera con las imágenes de satélite, la segunda añadiendo algunas imágenes texturales, y la tercera utilizando sólo las imágenes de los componentes principales más significativos. Se siguen los criterios establecidos en el proyecto SIBERIA para la obtención de áreas de entrenamiento. Se propone una doble validación, por una parte vía matrices de confusión a partir de áreas de verdad-terreno obtenidas por el mismo método que las áreas de entrenamiento, y por otra parte contrastando y correlacionando las clasificaciones con los parámetros de inventario disponibles para dos pequeñas áreas de verdad-terreno. Los resultados indican una sensible mejora en la clasificación con la incorporación de imágenes texturales (la precisión aumenta de un 66% a un 75%), y señalan el parámetro biomasa como el mejor correlacionado con las clasificaciones derivadas (coeficiente de correlación r de hasta 0,49). Diferentes fuentes de error permiten augurar un margen de mejora para posteriores estudios.
Resumo:
We consider robust parametric procedures for univariate discrete distributions, focusing on the negative binomial model. The procedures are based on three steps: ?First, a very robust, but possibly inefficient, estimate of the model parameters is computed. ?Second, this initial model is used to identify outliers, which are then removed from the sample. ?Third, a corrected maximum likelihood estimator is computed with the remaining observations. The final estimate inherits the breakdown point (bdp) of the initial one and its efficiency can be significantly higher. Analogous procedures were proposed in [1], [2], [5] for the continuous case. A comparison of the asymptotic bias of various estimates under point contamination points out the minimum Neyman's chi-squared disparity estimate as a good choice for the initial step. Various minimum disparity estimators were explored by Lindsay [4], who showed that the minimum Neyman's chi-squared estimate has a 50% bdp under point contamination; in addition, it is asymptotically fully efficient at the model. However, the finite sample efficiency of this estimate under the uncontaminated negative binomial model is usually much lower than 100% and the bias can be strong. We show that its performance can then be greatly improved using the three step procedure outlined above. In addition, we compare the final estimate with the procedure described in
Resumo:
Membrane proteins account for about 20% to 30% of all proteins encoded in a typical genome. They play central roles in multiple cellular processes mediating the interaction of the cell with its surrounding. Over 60% of all drug targets contain a membrane domain. The experimental difficulties of obtaining a crystal structural severely limits our ability or understanding of membrane protein function. Computational evolutionary studies of proteins are crucial for the prediction of 3D structures. In this project, we construct a tool able to quantify the evolutionary positive selective pressure on each residue of membrane proteins through maximum likelihood phylogeny reconstruction. The conservation plot combined with a structural homology model is also a potent tool to predict those residues that have essentials roles in the structure and function of a membrane protein and can be very useful in the design of validation experiments.
Resumo:
BACKGROUND: Blood pressure (BP) is known to aggregate in families. Yet, heritability estimates are population-specific and no Swiss data have been published so far. We estimated the heritability of ambulatory and office BP in a Swiss population-based sample. METHODS: The Swiss Kidney Project on Genes in Hypertension is a population-based family study focusing on BP genetics. Office and ambulatory BP were measured in 1009 individuals from 271 nuclear families. Heritability was estimated for SBP, DBP, and pulse pressure using a maximum likelihood method implanted in the Statistical Analysis in Genetic Epidemiology software. RESULTS: The 518 women and 491 men included in this analysis had a mean (±SD) age of 48.3 (±17.4) and 47.3 (±17.7) years, and a mean BMI of 23.8 (±4.2) and 25.9 (±4.1) kg/m, respectively. Narrow-sense heritability estimates (±standard error) for ambulatory SBP, DBP, and pulse pressure were 0.37 ± 0.07, 0.26 ± 0.07, and 0.29 ± 0.07 for 24-h BP; 0.39 ± 0.07, 0.28 ± 0.07, and 0.27 ± 0.07 for day BP; and 0.25 ± 0.07, 0.20 ± 0.07, and 0.30 ± 0.07 for night BP, respectively (all P < 0.001). Heritability estimates for office SBP, DBP, and pulse pressure were 0.21 ± 0.08, 0.25 ± 0.08, and 0.18 ± 0.07 (all P < 0.01). CONCLUSIONS: We found significant heritability estimates for both ambulatory and office BP in this Swiss population-based study. Our findings justify the ongoing search for the genetic determinants of BP.
Resumo:
BACKGROUND: The bacterial flagellum is the most important organelle of motility in bacteria and plays a key role in many bacterial lifestyles, including virulence. The flagellum also provides a paradigm of how hierarchical gene regulation, intricate protein-protein interactions and controlled protein secretion can result in the assembly of a complex multi-protein structure tightly orchestrated in time and space. As if to stress its importance, plants and animals produce receptors specifically dedicated to the recognition of flagella. Aside from motility, the flagellum also moonlights as an adhesion and has been adapted by humans as a tool for peptide display. Flagellar sequence variation constitutes a marker with widespread potential uses for studies of population genetics and phylogeny of bacterial species. RESULTS: We sequenced the complete flagellin gene (flaA) in 18 different species and subspecies of Aeromonas. Sequences ranged in size from 870 (A. allosaccharophila) to 921 nucleotides (A. popoffii). The multiple alignment displayed 924 sites, 66 of which presented alignment gaps. The phylogenetic tree revealed the existence of two groups of species exhibiting different FlaA flagellins (FlaA1 and FlaA2). Maximum likelihood models of codon substitution were used to analyze flaA sequences. Likelihood ratio tests suggested a low variation in selective pressure among lineages, with an omega ratio of less than 1 indicating the presence of purifying selection in almost all cases. Only one site under potential diversifying selection was identified (isoleucine in position 179). However, 17 amino acid positions were inferred as sites that are likely to be under positive selection using the branch-site model. Ancestral reconstruction revealed that these 17 amino acids were among the amino acid changes detected in the ancestral sequence. CONCLUSION: The models applied to our set of sequences allowed us to determine the possible evolutionary pathway followed by the flaA gene in Aeromonas, suggesting that this gene have probably been evolving independently in the two groups of Aeromonas species since the divergence of a distant common ancestor after one or several episodes of positive selection. REVIEWERS: This article was reviewed by Alexey Kondrashov, John Logsdon and Olivier Tenaillon (nominated by Laurence D Hurst).