Biblioteca Digital

833 resultados para complete linkage clustering

Heritability of adult body height: A comparative study of twin cohorts in eight countries

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A major component of variation in body height is due to genetic differences, but environmental factors have a substantial contributory effect. In this study we aimed to analyse whether the genetic architecture of body height varies between affluent western societies. We analysed twin data from eight countries comprising 30,111 complete twin pairs by using the univariate genetic model of the Mx statistical package. Body height and zygosity were self-reported in seven populations and measured directly in one population. We found that there was substantial variation in mean body height between countries; body height was least in Italy (177 cm in men and 163 cm in women) and greatest in the Netherlands (184 cm and 171 cm, respectively). In men there was no corresponding variation in heritability of body height, heritability estimates ranging from 0.87 to 0.93 in populations under an additive genes/unique environment (AE) model. Among women the heritability estimates were generally lower than among men with greater variation between countries, ranging from 0.68 to 0.84 when an additive genes/shared environment/unique environment (ACE) model was used. In four populations where an AE model fit equally well or better, heritability ranged from 0.89 to 0.93. This difference between the sexes was mainly due to the effect of the shared environmental component of variance, which appears to be more important among women than among men in our study populations. Our results indicate that, in general, there are only minor differences in the genetic architecture of height between affluent Caucasian populations, especially among men.

Two-locus Linkage Analysis Applied to Putative Quantitative Trait Loci for Lipoprotein(a) Levels

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Plasma levels of lipoprotein(a) _ Lp(a) _ are associated with cardiovascular risk (Danesh et al., 2000) and were long believed to be influenced by the LPA locus on chromosome 6q27 only. However, a recent report of Broeckel et al. (2002) suggested the presence of a second quantitative trait locus on chromosome 1 influencing Lp(a) levels. Using a two-locus model, we found no evidence for an additional Lp(a) locus on chromosome 1 in a linkage study among 483 dizygotic twin pairs.

Complete genomic sequence of the Australian south-west genotype of Sindbis virus: Comparisons with other Sindbis strains and identification of a unique deletion in the 3 '-untranslated region

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Our previous studies have shown that two distinct genotypes of Sindbis (SIN) virus occur in Australia. One of these, the Oriental/Australian type, circulates throughout most of the Australian continent, whereas the recently identified south-west (SW) genetic type appears to be restricted to a distinct geographic region located in the temperate south-west of Australia. We have now determined the complete nucleotide and translated amino acid sequences of a SW isolate of SIN virus (SW6562) and performed comparative analyses with other SIN viruses at the genomic level. The genome of SW6562 is 11,569 nucleotides in length, excluding the cap nucleotide and poly (A) tail. Overall this virus differs from the prototype SIN virus (strain AR339) by 23% in nucleotide sequence and 12.5% in amino acid sequence. Partial sequences of four regions of the genome of four SW isolates were determined and compared with the corresponding sequences from a number of SIN isolates from different regions of the World. These regions are the non-structural protein (nsP3), the E2 gene, the capsid gene, and the repeated sequence elements (RSE) of the 3'UTR. These comparisons revealed that the SW SIN viruses were more closely related to South African and European strains than to other Australian isolates of SIN virus. Thus the SW genotype of SIN virus may have been introduced into this region of Australia by viremic humans or migratory birds and subsequently evolved independently in the region. The sequence data also revealed that the SW genotype contains a unique deletion in the RSE of the 3'UTR region of the genome. Previous studies have shown that deletions in this region of the SIN genome can have significant effects on virus replication in mosquito and avian cells, which may explain the restricted distribution of this genotype of SIN virus.

Prospects for whole genome linkage disequilibrium mapping in domestic dog breeds

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Linkage disequilibrium (LD) mapping is commonly used as a fine mapping tool in human genome mapping and has been used with some success for initial disease gene isolation in certain isolated inbred human populations. An understanding of the population history of domestic dog breeds suggests that LID mapping could be routinely utilized in this species for initial genome-wide scans. Such an approach offers significant advantages over traditional linkage analysis. Here, we demonstrate, using canine copper toxicosis in the Bedlington terrier as the model, that LID mapping could be reasonably expected to be a useful strategy in low-resolution, genome-wide scans in pure-bred dogs. Significant LID was demonstrated over distances up to 33.3 cM. It is very unlikely, for a number of reasons discussed, that this result could be extrapolated to the rest of the genome. It is, however, consistent with the expectation given the population structure of canine breeds and, in this breed at least, with the hypothesis that it may be possible to utilize LID in a genome-wide scan. In this study, LD mapping confirmed the location of the copper toxicosis in Bedlington terrier gene (CT-BT) and was able to do so in a population that was refractory to traditional linkage analysis.

Decompositions of complete graphs into theta graphs with fewer than ten edges

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A theta graph is a graph consisting of three pairwise internally disjoint paths with common end points. Methods for decomposing the complete graph K-nu into theta graphs with fewer than ten edges are given.

O uso da técnica de "Linkage" de sistemas de informação em estudos de coorte sobre mortalidade neonatal

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Discute-se o uso da "linkage" dos Sistemas Oficiais de Informação de Nascido Vivo (SINASC) e de Óbitos (SIM) em estudos de mortalidade neonatal. Essa técnica baseia-se na "ligação" dos bancos de dados obtidos a partir das informações existentes nesses sistemas, o que possibilita o emprego de estudos do tipo de coorte. O estudo foi realizado no Município de Santo André, Região Metropolitana de São Paulo, Brasil. São apresentados os cuidados metodológicos que foram empregados para evitar a presença de viéses de seleção e de efeito, que podem ocorrer. O uso da "linkage" mostrou-se operacionalmente viável, permitindo obter as probabilidades de morte e os riscos relativos dos nascidos vivos, expostos e não expostos, às variáveis que são objeto de registro na declaração de nascido vivo, identificando-se, desta maneira, os recém-nascidos de risco. Essa técnica, de baixo custo operacional, visto que utiliza dados já registrados, permite um dimensionamento mais adequado da assistência pré-natal e ao parto.

Incidence rate and spatio-temporal clustering of type 1 diabetes in Santiago, Chile, from 1997 to 1998

Relevância:

20.00% 20.00%

Publicador:

Resumo:

OBJECTIVE: To estimate the incidence rate of type 1 diabetes in the urban area of Santiago, Chile, from March 21, 1997 to March 20, 1998, and to assess the spatio-temporal clustering of cases during that period. METHODS: All sixty-one incident cases were located temporally (day of diagnosis) and spatially (place of residence) in the area of study. Knox's method was used to assess spatio-temporal clustering of incident cases. RESULTS: The overall incidence rate of type 1 diabetes was 4.11 cases per 100,000 children aged less than 15 years per year (95% confidence interval: 3.06--5.14). The incidence rate seems to have increased since the last estimate of the incidence calculated for the years 1986--1992 in the metropolitan region of Santiago. Different combinations of space-time intervals have been evaluated to assess spatio-temporal clustering. The smallest p-value was found for the combination of critical distances of 750 meters and 60 days (uncorrected p-value = 0.048). CONCLUSIONS: Although these are preliminary results regarding space-time clustering in Santiago, exploratory analysis of the data method would suggest a possible aggregation of incident cases in space-time coordinates.

Probabilistic linkage in household survey on hospital care usage

Relevância:

20.00% 20.00%

Publicador:

Resumo:

OBJECTIVE: To evaluate the potential advantages and limitations of the use of the Brazilian hospital admission authorization forms database and the probabilistic record linkage methodology for the validation of reported utilization of hospital care services in household surveys. METHODS: A total of 2,288 households interviews were conducted in the county of Duque de Caxias, Brazil. Information on the occurrence of at least one hospital admission in the year preceding the interview was obtained from a total of 10,733 household members. The 130 records of household members who reported at least one hospital admission in a public hospital were linked to a hospital database with 801,587 records, using an automatic probabilistic approach combined with an extensive clerical review. RESULTS: Seventy-four (57%) of the 130 household members were identified in the hospital database. Yet only 60 subjects (46%) showed a record of hospitalization in the hospital database in the study period. Hospital admissions due to a surgery procedure were significantly more likely to have been identified in the hospital database. The low level of concordance seen in the study can be explained by the following factors: errors in the linkage process; a telescoping effect; and an incomplete record in the hospital database. CONCLUSIONS: The use of hospital administrative databases and probabilistic linkage methodology may represent a methodological alternative for the validation of reported utilization of health care services, but some strategies should be employed in order to minimize the problems related to the use of this methodology in non-ideal conditions. Ideally, a single identifier, such as a personal health insurance number, and the universal coverage of the database would be desirable.

Definition of MV Load Diagrams via Weighted Evidence Accumulation Clustering using Subsampling

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A definition of medium voltage (MV) load diagrams was made, based on the data base knowledge discovery process. Clustering techniques were used as support for the agents of the electric power retail markets to obtain specific knowledge of their customers’ consumption habits. Each customer class resulting from the clustering operation is represented by its load diagram. The Two-step clustering algorithm and the WEACS approach based on evidence accumulation (EAC) were applied to an electricity consumption data from a utility client’s database in order to form the customer’s classes and to find a set of representative consumption patterns. The WEACS approach is a clustering ensemble combination approach that uses subsampling and that weights differently the partitions in the co-association matrix. As a complementary step to the WEACS approach, all the final data partitions produced by the different variations of the method are combined and the Ward Link algorithm is used to obtain the final data partition. Experiment results showed that WEACS approach led to better accuracy than many other clustering approaches. In this paper the WEACS approach separates better the customer’s population than Two-step clustering algorithm.

Determination of electricity consumers’ load profiles via weighted evidence accumulation clustering using subsampling

Relevância:

20.00% 20.00%

Publicador:

Resumo:

With the electricity market liberalization, the distribution and retail companies are looking for better market strategies based on adequate information upon the consumption patterns of its electricity consumers. A fair insight on the consumers’ behavior will permit the definition of specific contract aspects based on the different consumption patterns. In order to form the different consumers’ classes, and find a set of representative consumption patterns we use electricity consumption data from a utility client’s database and two approaches: Two-step clustering algorithm and the WEACS approach based on evidence accumulation (EAC) for combining partitions in a clustering ensemble. While EAC uses a voting mechanism to produce a co-association matrix based on the pairwise associations obtained from N partitions and where each partition has equal weight in the combination process, the WEACS approach uses subsampling and weights differently the partitions. As a complementary step to the WEACS approach, we combine the partitions obtained in the WEACS approach with the ALL clustering ensemble construction method and we use the Ward Link algorithm to obtain the final data partition. The characterization of the obtained consumers’ clusters was performed using the C5.0 classification algorithm. Experiment results showed that the WEACS approach leads to better results than many other clustering approaches.

Typical load profiles in the smart grid context – a clustering methods comparison

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The present research paper presents five different clustering methods to identify typical load profiles of medium voltage (MV) electricity consumers. These methods are intended to be used in a smart grid environment to extract useful knowledge about customer’s behaviour. The obtained knowledge can be used to support a decision tool, not only for utilities but also for consumers. Load profiles can be used by the utilities to identify the aspects that cause system load peaks and enable the development of specific contracts with their customers. The framework presented throughout the paper consists in several steps, namely the pre-processing data phase, clustering algorithms application and the evaluation of the quality of the partition, which is supported by cluster validity indices. The process ends with the analysis of the discovered knowledge. To validate the proposed framework, a case study with a real database of 208 MV consumers is used.

Demand response programs definition supported by clustering and classification techniques

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The growing importance and influence of new resources connected to the power systems has caused many changes in their operation. Environmental policies and several well know advantages have been made renewable based energy resources largely disseminated. These resources, including Distributed Generation (DG), are being connected to lower voltage levels where Demand Response (DR) must be considered too. These changes increase the complexity of the system operation due to both new operational constraints and amounts of data to be processed. Virtual Power Players (VPP) are entities able to manage these resources. Addressing these issues, this paper proposes a methodology to support VPP actions when these act as a Curtailment Service Provider (CSP) that provides DR capacity to a DR program declared by the Independent System Operator (ISO) or by the VPP itself. The amount of DR capacity that the CSP can assure is determined using data mining techniques applied to a database which is obtained for a large set of operation scenarios. The paper includes a case study based on 27,000 scenarios considering a diversity of distributed resources in a 33 bus distribution network.

Cluster Analysis of Business Data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.

The complete sequence of a 9000 bp fragment of the right arm of Saccharomyces cerevisiae chromosome VII contains four previously unknown open reading frames

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report the sequence of a 9000 bp fragment from the right arm of Saccharomyces cerevisiae chromosome VII. Analysis of the sequence revealed four complete previously unknown open reading frames, which were named G7587, G7589, G7591 and G7594 following standard rules for provisional nomenclature. Outstanding features of some of these proteins were the homology of the putative protein coded by G7589 with proteins involved in transcription regulation and the transmembrane domains predicted in the putative protein coded by G7591.

Clustering of variables with a three-way approach for health sciences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

TPM Vol. 21, No. 4, December 2014, 435-447 – Special Issue © 2014 Cises.

«
1
2
...
8
9
10
11
12
13
14
...
55
56
»