10 resultados para Dataset
em Cochin University of Science
Resumo:
The marine atmospheric boundary layer (MABL) plays a vital role in the transport of momentum and heat from the surface of the ocean into the atmosphere. A detailed study on the MABL characteristics was carried out using high-resolution surface-wind data as measured by the QuikSCAT (Quick scatterometer) satellite. Spatial variations in the surface wind, frictional velocity, roughness parameter and drag coe±cient for the di®erent seasons were studied. The surface wind was strong during the southwest monsoon season due to the modulation induced by the Low Level Jetstream. The drag coe±cient was larger during this season, due to the strong winds and was lower during the winter months. The spatial variations in the frictional velocity over the seas was small during the post-monsoon season (»0.2 m s¡1). The maximum spatial variation in the frictional velocity was found over the south Arabian Sea (0.3 to 0.5 m s¡1) during the southwest monsoon period, followed by the pre-monsoon over the Bay of Bengal (0.1 to 0.25 m s¡1). The mean wind-stress curl during the winter was positive over the equatorial region, with a maximum value of 1.5£10¡7 N m¡3, but on either side of the equatorial belt, a negative wind-stress curl dominated. The area average of the frictional velocity and drag coe±cient over the Arabian Sea and Bay of Bengal were also studied. The values of frictional velocity shows a variability that is similar to the intraseasonal oscillation (ISO) and this was con¯rmed via wavelet analysis. In the case of the drag coe±cient, the prominent oscillations were ISO and quasi-biweekly mode (QBM). The interrelationship between the drag coe±cient and the frictional velocity with wind speed in both the Arabian Sea and the Bay of Bengal was also studied.
Resumo:
The present study helped to understand the trend in rainfall patterns at smaller spatial scales and the large regional differences in the variability of rainfall. The effect of land use and orography on the diurnal variability is also understood. But a better understanding on the long term variation in rainfall is possible by using a longer dataset,which may provide insight into the rainfall variation over country during the past century. The basic mechanism behind the interannual rainfall variability would be possible with numerical studies using coupled Ocean-Atmosphere models. The regional difference in the active-break conditions points to the significance of regional studies than considering India as a single unit. The underlying dynamics of diurnal variability need to be studied by making use of a high resolution model as the present study could not simulate the local onshore circulation. Also the land use modification in this study, selected a region, which is surrounded by crop land. This implies the high possibility for the conversion of the remaining region to agricultural land. Therefore the study is useful than considering idealized conditions, but the adverse effect of irrigated crop is more than non-irrigated crop. Therefore, such studies would help to understand the climate changes occurred in the recent period. The large accumulation of rainfall between 300-600 m height of western Ghats has been found but the reason behind this need to be studied, which is possible by utilizing datasets that would better represent the orography and landuse over the region in high resolution model. Similarly a detailed analysis is needed to clearly identify the causative relations of the predictors identified with the predictant and the physical reasons behind them. New approaches that include nonlinear relationships and dynamical variables from model simulations can be included in the existing statistical models to improve the skill of the models. Also the statistical models for the forecasts of monsoon have to be continually updated.
Resumo:
Microarray data analysis is one of data mining tool which is used to extract meaningful information hidden in biological data. One of the major focuses on microarray data analysis is the reconstruction of gene regulatory network that may be used to provide a broader understanding on the functioning of complex cellular systems. Since cancer is a genetic disease arising from the abnormal gene function, the identification of cancerous genes and the regulatory pathways they control will provide a better platform for understanding the tumor formation and development. The major focus of this thesis is to understand the regulation of genes responsible for the development of cancer, particularly colorectal cancer by analyzing the microarray expression data. In this thesis, four computational algorithms namely fuzzy logic algorithm, modified genetic algorithm, dynamic neural fuzzy network and Takagi Sugeno Kang-type recurrent neural fuzzy network are used to extract cancer specific gene regulatory network from plasma RNA dataset of colorectal cancer patients. Plasma RNA is highly attractive for cancer analysis since it requires a collection of small amount of blood and it can be obtained at any time in repetitive fashion allowing the analysis of disease progression and treatment response.
Resumo:
During the period from 12 to 15 April, 2009 nearly the entire Iran, apart from the southern border, experienced an advective cooling event. While winter freezing concerns are typical, the nature of this freezing event was unusual with respect to its date of occurrence and accompanying synoptic meteorological situation. To analyze the freezing event, the relevant meteorological data at multiple levels of the atmosphere were examined from the NCEP/ NCAR reanalysis dataset. The results showed that a polar vortex was responsible for the freezing event over the country extending southward extraordinarily in such a way that its ridge influenced most parts of Iran. This was recognized as an abnormal extension of a polar vortex in the recent years. The sea-level pressure fields indicated that a ridge of large-scale anticyclone centered over Black Sea extended southward and prevailed over most parts of Iran. This resulted in the formation of a severe cold air advection from high latitudes (Polar region) over Iran. During the study period, moisture pumping was observed from the Arabian Sea and Persian Gulf. The winds at 1000 hPa level blew with a magnitude of 10 m s-1 toward south in the region of convergence (between -2 9 10-6 s-1 and -12 9 10-6 s-1). The vertical profilesof temperature and humidity also indicated that the ICE structural icing occurred at multiple levels of the atmosphere, i.e, from 800 hPa through 400 hPa levels. In addition to the carburetor (or induction), icing occurred between 900 and 700 hPa levels in the selected radiosonde stations during the study period. In addition, the HYSPLIT backward trajectory model outputs were in quite good agreement with the observed synoptic features
Resumo:
A distinct cold tongue has recently been noticed in the South China Sea during the winter monsoon, with the cold tongue temperature minimum occurring in the January or February. This cold tongue shows signi¯cant links with the Maritime Continent's rainfall during the winter period. The cold tongue and its interaction with the Maritime Continent's weather were studied using Reynolds SST data, wind ¯elds from the NCEP{NCAR reanalysis dataset and the quikSCAT dataset. In addition, rainfall from the GOES Precipitation Index (GPI) for the periods 2000 to 2008 was also used. The propagation of the cold tongue towards the south is explained using wind dynamics and the western boundary current. During the period of strong cold tongue, the surface wind is strong and the western boundary current advects the cold tongue to the south. During the period of strong winds the zonal gradient of SST is high [0.5±C (25 km)¡1]. The cold tongue plays an important role in regulating the climate over the Maritime Continent. It creates a zonal/meridional SST gradient and this gradient ultimately leads in the formation of convection. Hence, two maximum precipitation zones are found in the Maritime Continent, with a zone of relatively lower precipitation between, which coincides with the cold tongue's regions. It was found that the precipitation zones have strong links with the intensity of the cold tongue. During stronger cold tongue periods the precipitation on either side of the cold tongue is considerably greater than during weaker cold tongue periods. The features of convection on the eastern and western sides of the cold tongue behave di®erently. On the eastern side convection is preceded by one day with SST gradient, while on the western side it is four days.
Resumo:
Biometrics has become important in security applications. In comparison with many other biometric features, iris recognition has very high recognition accuracy because it depends on iris which is located in a place that still stable throughout human life and the probability to find two identical iris's is close to zero. The identification system consists of several stages including segmentation stage which is the most serious and critical one. The current segmentation methods still have limitation in localizing the iris due to circular shape consideration of the pupil. In this research, Daugman method is done to investigate the segmentation techniques. Eyelid detection is another step that has been included in this study as a part of segmentation stage to localize the iris accurately and remove unwanted area that might be included. The obtained iris region is encoded using haar wavelets to construct the iris code, which contains the most discriminating feature in the iris pattern. Hamming distance is used for comparison of iris templates in the recognition stage. The dataset which is used for the study is UBIRIS database. A comparative study of different edge detector operator is performed. It is observed that canny operator is best suited to extract most of the edges to generate the iris code for comparison. Recognition rate of 89% and rejection rate of 95% is achieved
Resumo:
Biclustering is simultaneous clustering of both rows and columns of a data matrix. A measure called Mean Squared Residue (MSR) is used to simultaneously evaluate the coherence of rows and columns within a submatrix. In this paper a novel algorithm is developed for biclustering gene expression data using the newly introduced concept of MSR difference threshold. In the first step high quality bicluster seeds are generated using K-Means clustering algorithm. Then more genes and conditions (node) are added to the bicluster. Before adding a node the MSR X of the bicluster is calculated. After adding the node again the MSR Y is calculated. The added node is deleted if Y minus X is greater than MSR difference threshold or if Y is greater than MSR threshold which depends on the dataset. The MSR difference threshold is different for gene list and condition list and it depends on the dataset also. Proper values should be identified through experimentation in order to obtain biclusters of high quality. The results obtained on bench mark dataset clearly indicate that this algorithm is better than many of the existing biclustering algorithms
Resumo:
In this paper, we propose a multispectral analysis system using wavelet based Principal Component Analysis (PCA), to improve the brain tissue classification from MRI images. Global transforms like PCA often neglects significant small abnormality details, while dealing with a massive amount of multispectral data. In order to resolve this issue, input dataset is expanded by detail coefficients from multisignal wavelet analysis. Then, PCA is applied on the new dataset to perform feature analysis. Finally, an unsupervised classification with Fuzzy C-Means clustering algorithm is used to measure the improvement in reproducibility and accuracy of the results. A detailed comparative analysis of classified tissues with those from conventional PCA is also carried out. Proposed method yielded good improvement in classification of small abnormalities with high sensitivity/accuracy values, 98.9/98.3, for clinical analysis. Experimental results from synthetic and clinical data recommend the new method as a promising approach in brain tissue analysis.
Resumo:
In our study we use a kernel based classification technique, Support Vector Machine Regression for predicting the Melting Point of Drug – like compounds in terms of Topological Descriptors, Topological Charge Indices, Connectivity Indices and 2D Auto Correlations. The Machine Learning model was designed, trained and tested using a dataset of 100 compounds and it was found that an SVMReg model with RBF Kernel could predict the Melting Point with a mean absolute error 15.5854 and Root Mean Squared Error 19.7576
Resumo:
Knowledge discovery in databases is the non-trivial process of identifying valid, novel potentially useful and ultimately understandable patterns from data. The term Data mining refers to the process which does the exploratory analysis on the data and builds some model on the data. To infer patterns from data, data mining involves different approaches like association rule mining, classification techniques or clustering techniques. Among the many data mining techniques, clustering plays a major role, since it helps to group the related data for assessing properties and drawing conclusions. Most of the clustering algorithms act on a dataset with uniform format, since the similarity or dissimilarity between the data points is a significant factor in finding out the clusters. If a dataset consists of mixed attributes, i.e. a combination of numerical and categorical variables, a preferred approach is to convert different formats into a uniform format. The research study explores the various techniques to convert the mixed data sets to a numerical equivalent, so as to make it equipped for applying the statistical and similar algorithms. The results of clustering mixed category data after conversion to numeric data type have been demonstrated using a crime data set. The thesis also proposes an extension to the well known algorithm for handling mixed data types, to deal with data sets having only categorical data. The proposed conversion has been validated on a data set corresponding to breast cancer. Moreover, another issue with the clustering process is the visualization of output. Different geometric techniques like scatter plot, or projection plots are available, but none of the techniques display the result projecting the whole database but rather demonstrate attribute-pair wise analysis