16 resultados para variable sample size
em Indian Institute of Science - Bangalore - Índia
Resumo:
1 Species-accumulation curves for woody plants were calculated in three tropical forests, based on fully mapped 50-ha plots in wet, old-growth forest in Peninsular Malaysia, in moist, old-growth forest in central Panama, and in dry, previously logged forest in southern India. A total of 610 000 stems were identified to species and mapped to < Im accuracy. Mean species number and stem number were calculated in quadrats as small as 5 m x 5 m to as large as 1000 m x 500 m, for a variety of stem sizes above 10 mm in diameter. Species-area curves were generated by plotting species number as a function of quadrat size; species-individual curves were generated from the same data, but using stem number as the independent variable rather than area. 2 Species-area curves had different forms for stems of different diameters, but species-individual curves were nearly independent of diameter class. With < 10(4) stems, species-individual curves were concave downward on log-log plots, with curves from different forests diverging, but beyond about 104 stems, the log-log curves became nearly linear, with all three sites having a similar slope. This indicates an asymptotic difference in richness between forests: the Malaysian site had 2.7 times as many species as Panama, which in turn was 3.3 times as rich as India. 3 Other details of the species-accumulation relationship were remarkably similar between the three sites. Rectangular quadrats had 5-27% more species than square quadrats of the same area, with longer and narrower quadrats increasingly diverse. Random samples of stems drawn from the entire 50 ha had 10-30% more species than square quadrats with the same number of stems. At both Pasoh and BCI, but not Mudumalai. species richness was slightly higher among intermediate-sized stems (50-100mm in diameter) than in either smaller or larger sizes, These patterns reflect aggregated distributions of individual species, plus weak density-dependent forces that tend to smooth the species abundance distribution and 'loosen' aggregations as stems grow. 4 The results provide support for the view that within each tree community, many species have their abundance and distribution guided more by random drift than deterministic interactions. The drift model predicts that the species-accumulation curve will have a declining slope on a log-log plot, reaching a slope of O.1 in about 50 ha. No other model of community structure can make such a precise prediction. 5 The results demonstrate that diversity studies based on different stem diameters can be compared by sampling identical numbers of stems. Moreover, they indicate that stem counts < 1000 in tropical forests will underestimate the percentage difference in species richness between two diverse sites. Fortunately, standard diversity indices (Fisher's sc, Shannon-Wiener) captured diversity differences in small stem samples more effectively than raw species richness, but both were sample size dependent. Two nonparametric richness estimators (Chao. jackknife) performed poorly, greatly underestimating true species richness.
Resumo:
Room temperature, uniaxial compression creep experiments were performed on micro-/nano-sized pillars (having diameters in the range of 250-2000 nm) of a Zr-based bulk metallic glass (BMG) to investigate the influence of sample size on the time-dependent plastic deformation behavior in amorphous alloys. Experimental results reveal that plastic deformation indeed occurs at ambient temperature and at stresses that are well below the nominal quasi-static yield stress. At a given stress, higher total strains accrue in the smaller specimens. In all cases, plastic deformation was found to be devoid of shear bands, i.e., it occurs in homogeneous manner. The stress exponent obtained from the slope of the linear relation between strain rate and applied stress also shows a strong size effect, which is rationalized in terms of the amount of free volume created during deformation and the surface-to-volume ratio of the pillar. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
A computationally efficient agglomerative clustering algorithm based on multilevel theory is presented. Here, the data set is divided randomly into a number of partitions. The samples of each such partition are clustered separately using hierarchical agglomerative clustering algorithm to form sub-clusters. These are merged at higher levels to get the final classification. This algorithm leads to the same classification as that of hierarchical agglomerative clustering algorithm when the clusters are well separated. The advantages of this algorithm are short run time and small storage requirement. It is observed that the savings, in storage space and computation time, increase nonlinearly with the sample size.
Resumo:
The thermal decomposition of ammonium perchlorate based solid composite propellant using carboxyl terminated polybutadiene as binder has been studied employing thermogravimetry and differential thermal analysis techniques. The thermal decomposition characteristics of the propellant have been found to be quite similar to those of pure ammonium perchlorate with activation energy, 32 Kcal/mole and 60 Kcal/mole respectively in the low and high temperature regions. The effect of the sample size and shape on the thermal decomposition has also been evaluated.
Resumo:
Background & objectives: Methylenetetrahydrofolate reductase (MTHFR) is a critical enzyme in folate metabolism and involved in DNA synthesis, DNA repair and DNA methylation. The two common functional polymorphisms of MTHFR, 677C -> T and 1298 A -> C have shown to impact several diseases including cancer. This case-control study was undertaken to analyse the association of the MTHFR gene polymorphisms 677 C -> T and 1298 A -> C and risk of colorectal cancer (CRC).Methods: One hundred patients with a confirmed histopathologic diagnosis of CRC and 86 age and gender matched controls with no history of cancer were taken for this study. DNA was isolated from peripheral blood samples and the genotypes were determined by PCR-RFLP. The risk association was estimated by compounding odds ratio (OR) with 95 per cent confidence interval (CI). Results: Genotype frequency of MTHFR 677 CC, CT and TT were 76.7, 22.1 and 1.16 per cent in controls, and 74,25 and 1.0 per cent among patients. The 'T' allele frequency was 12.21 and 13.5 per cent in controls and patients respectively. The genotype frequency of MTHFR 1298 AA, AC, and CC were 25.6, 58.1 and 16.3 per cent for controls and 22, 70 and 8 per cent for patents respectively. The 'C' allele frequency for 1298 A -> C was 43.0 and 45.3 per cent respectively for controls and patients. The OR for 677 CT was 1.18 (95% CI 0.59-2.32, P = 0.642), OR for 1298 AC was 1.68 (95% CI 0.92-3.08, P = 0.092) and OR for 1298 CC was 0.45(95% CI 0.18-1.12, P = 0.081). The OR for the combined heterozygous state (677 CT and 1298 AC) was 1.18(95% CI 0.52-2.64, P =0.697).Interpretation & conclusion: The frequency of the MTHFR 677 TT genotype is rare as compared to 1298 CC genotype in the population studied. There was no association between 677 C -> T and 1298 A -> C polymorphisms and risk of CRC either individually or in combination. The homozygous state for 1298 A -> C polymorphism appears to slightly lower risk of CRC. This needs to be confirmed with a larger sample size.
Resumo:
Methylenetetrahydrofolate reductase (MTHFR) is a critical enzyme in folate metabolism and is involved in DNA synthesis, DNA repair and DNA methylation. Genetic polymorphisms of this enzyme have been shown to impact several diseases, including cancer. Leukemias are malignancies arising from rapidly proliferating hematopoietic cells having great requirement of DNA synthesis. This case-control study was undertaken to analyze the association of the MTHFR gene polymorphisms 677 C"T and 1298 A"C and the risk of acute lymphoblastic leukemia in children. Materials and Methods: Eighty-six patients aged below 15 years with a confirmed diagnosis of acute lymphoblastic leukemia (ALL) and 99 matched controls were taken for this study. Analysis of the polymorphisms was done using the polymerase chain reaction -restriction fragment length polymorphism (PCR-RFLP) method. Results: Frequency of MTHFR 677 CC and CT were 85.9% and 14.1% in the controls, and 84.9% and 15.1% in the cases. The 'T' allele frequency was 7% and 7.5% in cases and controls respectively. The frequency of MTHFR 1298 AA, AC, and CC were 28.3%, 55.6% and 16.1% for controls and 23.3%, 59.3% and 17.4% for cases respectively. The 'C' allele frequency for 1298 A→C was 43.9% and 47% respectively for controls and cases. The odds ratio (OR) for C677T was 1.08 (95% CI 0.48- 2.45, p = 0.851) and OR for A1298C was 1.29(95% CI 0.65-2.29, p = 0.46) and OR for 1298 CC was 1.31 (95% CI 0.53-3.26, p =0.56). The OR for the combined heterozygous status (677 CT and 1298 AC) was 1.94 (95% CI 0.58 -6.52, p = 0.286). Conclusion: The prevalence of 'T' allele for 677 MTHFR polymorphism was low in the population studied. There was no association between MTHFR 677 C→T and 1298 A→C gene polymorphisms and risk of ALL, which may be due to the small sample size.
Resumo:
CMPs enable simultaneous execution of multiple applications on the same platforms that share cache resources. Diversity in the cache access patterns of these simultaneously executing applications can potentially trigger inter-application interference, leading to cache pollution. Whereas a large cache can ameliorate this problem, the issues of larger power consumption with increasing cache size, amplified at sub-100nm technologies, makes this solution prohibitive. In this paper in order to address the issues relating to power-aware performance of caches, we propose a caching structure that addresses the following: 1. Definition of application-specific cache partitions as an aggregation of caching units (molecules). The parameters of each molecule namely size, associativity and line size are chosen so that the power consumed by it and access time are optimal for the given technology. 2. Application-Specific resizing of cache partitions with variable and adaptive associativity per cache line, way size and variable line size. 3. A replacement policy that is transparent to the partition in terms of size, heterogeneity in associativity and line size. Through simulation studies we establish the superiority of molecular cache (caches built as aggregations of molecules) that offers a 29% power advantage over that of an equivalently performing traditional cache.
Resumo:
Time scales associated with activated transitions between glassy metastable states of a free-energy functional appropriate for a dense hard-sphere system are calculated by using a new Monte Carlo method for the local density variables. In particular, we calculate the time the system, initially placed in a shallow glassy minimum of the free-energy, spends in the neighborhood of this minimum before making a transition to the basin of attraction of another free-energy minimum. This time scale is found to increase as the average density is increased. We find a crossover density near which this time scale increases very sharply and becomes longer than the longest times accessible in our simulation. This time scale does not show any evidence of increasing with sample size
Resumo:
This paper considers the problem of spectrum sensing in cognitive radio networks when the primary user is using Orthogonal Frequency Division Multiplexing (OFDM). For this we develop cooperative sequential detection algorithms that use the autocorrelation property of cyclic prefix (CP) used in OFDM systems. We study the effect of timing and frequency offset, IQ-imbalance and uncertainty in noise and transmit power. We also modify the detector to mitigate the effects of these impairments. The performance of the proposed algorithms is studied via simulations. We show that sequential detection can significantly improve the performance over a fixed sample size detector.
Resumo:
Background: Temporal analysis of gene expression data has been limited to identifying genes whose expression varies with time and/or correlation between genes that have similar temporal profiles. Often, the methods do not consider the underlying network constraints that connect the genes. It is becoming increasingly evident that interactions change substantially with time. Thus far, there is no systematic method to relate the temporal changes in gene expression to the dynamics of interactions between them. Information on interaction dynamics would open up possibilities for discovering new mechanisms of regulation by providing valuable insight into identifying time-sensitive interactions as well as permit studies on the effect of a genetic perturbation. Results: We present NETGEM, a tractable model rooted in Markov dynamics, for analyzing the dynamics of the interactions between proteins based on the dynamics of the expression changes of the genes that encode them. The model treats the interaction strengths as random variables which are modulated by suitable priors. This approach is necessitated by the extremely small sample size of the datasets, relative to the number of interactions. The model is amenable to a linear time algorithm for efficient inference. Using temporal gene expression data, NETGEM was successful in identifying (i) temporal interactions and determining their strength, (ii) functional categories of the actively interacting partners and (iii) dynamics of interactions in perturbed networks. Conclusions: NETGEM represents an optimal trade-off between model complexity and data requirement. It was able to deduce actively interacting genes and functional categories from temporal gene expression data. It permits inference by incorporating the information available in perturbed networks. Given that the inputs to NETGEM are only the network and the temporal variation of the nodes, this algorithm promises to have widespread applications, beyond biological systems. The source code for NETGEM is available from https://github.com/vjethava/NETGEM
Resumo:
We propose a randomized algorithm for large scale SVM learning which solves the problem by iterating over random subsets of the data. Crucial to the algorithm for scalability is the size of the subsets chosen. In the context of text classification we show that, by using ideas from random projections, a sample size of O(log n) can be used to obtain a solution which is close to the optimal with a high probability. Experiments done on synthetic and real life data sets demonstrate that the algorithm scales up SVM learners, without loss in accuracy. 1
Resumo:
Background: India has the third largest HIV-1 epidemic with 2.4 million infected individuals. Molecular epidemiological analysis has identified the predominance of HIV-1 subtype C (HIV-1C). However, the previous reports have been limited by sample size, and uneven geographical distribution. The introduction of HIV-1C in India remains uncertain due to this lack of structured studies. To fill the gap, we characterised the distribution pattern of HIV-1 subtypes in India based on data collection from nationwide clinical cohorts between 2007 and 2011. We also reconstructed the time to the most recent common ancestor (tMRCA) of the predominant HIV-1C strains. Methodology/Principal Findings: Blood samples were collected from 168 HIV-1 seropositive subjects from 7 different states. HIV-1 subtypes were determined using two or three genes, gag, pol, and env using several methods. Bayesian coalescent-based approach was used to reconstruct the time of introduction and population growth patterns of the Indian HIV-1C. For the first time, a high prevalence (10%) of unique recombinant forms (BC and A1C) was observed when two or three genes were used instead of one gene (p<0.01; p = 0.02, respectively). The tMRCA of Indian HIV-1C was estimated using the three viral genes, ranged from 1967 (gag) to 1974 (env). Pol-gene analysis was considered to provide the most reliable estimate 1971, (95% CI: 1965-1976)]. The population growth pattern revealed an initial slow growth phase in the mid-1970s, an exponential phase through the 1980s, and a stationary phase since the early 1990s. Conclusions/Significance: The Indian HIV-1C epidemic originated around 40 years ago from a single or few genetically related African lineages, and since then largely evolved independently. The effective population size in the country has been broadly stable since the 1990s. The evolving viral epidemic, as indicated by the increase of recombinant strains, warrants a need for continued molecular surveillance to guide efficient disease intervention strategies.
Resumo:
This paper describes a new method of color text localization from generic scene images containing text of different scripts and with arbitrary orientations. A representative set of colors is first identified using the edge information to initiate an unsupervised clustering algorithm. Text components are identified from each color layer using a combination of a support vector machine and a neural network classifier trained on a set of low-level features derived from the geometric, boundary, stroke and gradient information. Experiments on camera-captured images that contain variable fonts, size, color, irregular layout, non-uniform illumination and multiple scripts illustrate the robustness of the method. The proposed method yields precision and recall of 0.8 and 0.86 respectively on a database of 100 images. The method is also compared with others in the literature using the ICDAR 2003 robust reading competition dataset.
Resumo:
In order to explore the potential use of fly ash and plastic waste in bulk quantities in civil engineering applications, it is necessary to understand the behavior of fly ash and fly ash mixed with plastic waste. These materials are considered as wastes and in this study, it is shown that combination of fly ash and plastic waste is very useful. In this regard, various tests such as classification tests, unconfined compressive strength and compressibility tests, consolidated undrained tests, and California bearing ratio tests were conducted. The results indicated that the inclusion of plastic waste in fly ash is effective in improving the engineering properties of fly ash in terms of compressive strength, shear strength parameters, and CBR values. In order to understand the effect of sample size on the shear strength parameters of fly ash and fly ash mixed with plastic waste, consolidated undrained tests were conducted with sample sizes of 38x76mm and 50x100mm. The results of the tests indicate that the shear strength increases with the increase in sample size. The implication of the use of fly ash mixed with plastic waste in unpaved roads is presented in terms of reduction of carbon print.
Resumo:
Spin noise phenomenon was predicted way back in 1946. However, experimental investigations regarding spin noise became possible only recently with major technological improvements in NMR hardware. These experiments have several potential novel applications and also demand refinements in the existing theoretical framework to explain the phenomenon. Elegance of noise spectroscopy in gathering information about the properties of a system lies in the fact that it does not require external perturbation, and the system remains in thermal equilibrium. Spin noise is intrinsic magnetic fluctuations, and both longitudinal and transverse components have been detected independently in many systems. Detection of fluctuating longitudinal magnetization leads to field of Magnetic Resonance Force Microscopy (MRFM) that can efficiently probe very few spins even down to the level of single spin utilizing ultrasensitive cantilevers. Transverse component of spin noise, which can simultaneously monitor different resonances over a given frequency range enabling one to distinguish between different chemical environments, has also received considerable attention, and found many novel applications. These experiments demand a detailed understanding of the underlying spin noise phenomenon in order to perform perturbation-free magnetic resonance and widen the highly promising application area. Detailed investigations of noise magnetization have been performed recently using force microscopy on equilibrium ensemble of paramagnetic alkali atoms. It was observed that random fluctuations generate spontaneous spin coherences which has similar characteristics as generated by macroscopic magnetization of polarized ensemble in terms of precession and relaxation properties. Several other intrinsic properties like g-factors, isotope-abundance ratios, hyperfine splitting, spin coherence lifetimes etc. also have been achieved without having to excite the sample. In contrast to MRFM-approaches, detection of transverse spin noise also offers novel applications, attracting considerable attention. This has unique advantage as different resonances over a given frequency range enable one to distinguish between different chemical environments. Since these noise signatures scale inversely with sample size, these approaches lead to the possibility of non-perturbative magnetic resonance of small systems down to nano-scale. In this review, these different approaches will be highlighted with main emphasis on transverse spin noise investigations.