979 resultados para Multivariate generalized t -distribution
Resumo:
For probability distributions on ℝq, a detailed study of the breakdown properties of some multivariate M-functionals related to Tyler's [Ann. Statist. 15 (1987) 234] ‘distribution-free’ M-functional of scatter is given. These include a symmetrized version of Tyler's M-functional of scatter, and the multivariate t M-functionals of location and scatter. It is shown that for ‘smooth’ distributions, the (contamination) breakdown point of Tyler's M-functional of scatter and of its symmetrized version are 1/q and inline image, respectively. For the multivariate t M-functional which arises from the maximum likelihood estimate for the parameters of an elliptical t distribution on ν ≥ 1 degrees of freedom the breakdown point at smooth distributions is 1/(q + ν). Breakdown points are also obtained for general distributions, including empirical distributions. Finally, the sources of breakdown are investigated. It turns out that breakdown can only be caused by contaminating distributions that are concentrated near low-dimensional subspaces.
Resumo:
With the recognition of the importance of evidence-based medicine, there is an emerging need for methods to systematically synthesize available data. Specifically, methods to provide accurate estimates of test characteristics for diagnostic tests are needed to help physicians make better clinical decisions. To provide more flexible approaches for meta-analysis of diagnostic tests, we developed three Bayesian generalized linear models. Two of these models, a bivariate normal and a binomial model, analyzed pairs of sensitivity and specificity values while incorporating the correlation between these two outcome variables. Noninformative independent uniform priors were used for the variance of sensitivity, specificity and correlation. We also applied an inverse Wishart prior to check the sensitivity of the results. The third model was a multinomial model where the test results were modeled as multinomial random variables. All three models can include specific imaging techniques as covariates in order to compare performance. Vague normal priors were assigned to the coefficients of the covariates. The computations were carried out using the 'Bayesian inference using Gibbs sampling' implementation of Markov chain Monte Carlo techniques. We investigated the properties of the three proposed models through extensive simulation studies. We also applied these models to a previously published meta-analysis dataset on cervical cancer as well as to an unpublished melanoma dataset. In general, our findings show that the point estimates of sensitivity and specificity were consistent among Bayesian and frequentist bivariate normal and binomial models. However, in the simulation studies, the estimates of the correlation coefficient from Bayesian bivariate models are not as good as those obtained from frequentist estimation regardless of which prior distribution was used for the covariance matrix. The Bayesian multinomial model consistently underestimated the sensitivity and specificity regardless of the sample size and correlation coefficient. In conclusion, the Bayesian bivariate binomial model provides the most flexible framework for future applications because of its following strengths: (1) it facilitates direct comparison between different tests; (2) it captures the variability in both sensitivity and specificity simultaneously as well as the intercorrelation between the two; and (3) it can be directly applied to sparse data without ad hoc correction. ^
Resumo:
Current statistical methods for estimation of parametric effect sizes from a series of experiments are generally restricted to univariate comparisons of standardized mean differences between two treatments. Multivariate methods are presented for the case in which effect size is a vector of standardized multivariate mean differences and the number of treatment groups is two or more. The proposed methods employ a vector of independent sample means for each response variable that leads to a covariance structure which depends only on correlations among the $p$ responses on each subject. Using weighted least squares theory and the assumption that the observations are from normally distributed populations, multivariate hypotheses analogous to common hypotheses used for testing effect sizes were formulated and tested for treatment effects which are correlated through a common control group, through multiple response variables observed on each subject, or both conditions.^ The asymptotic multivariate distribution for correlated effect sizes is obtained by extending univariate methods for estimating effect sizes which are correlated through common control groups. The joint distribution of vectors of effect sizes (from $p$ responses on each subject) from one treatment and one control group and from several treatment groups sharing a common control group are derived. Methods are given for estimation of linear combinations of effect sizes when certain homogeneity conditions are met, and for estimation of vectors of effect sizes and confidence intervals from $p$ responses on each subject. Computational illustrations are provided using data from studies of effects of electric field exposure on small laboratory animals. ^
Resumo:
Body fat distribution is a cardiovascular health risk factor in adults. Body fat distribution can be measured through various methods including anthropometry. It is not clear which anthropometric index is suitable for epidemiologic studies of fat distribution and cardiovascular disease. The purpose of the present study was to select a measure of body fat distribution from among a series of indices (those traditionally used in the literature and others constructed from the analysis) that is most highly correlated with lipid-related variables and is independent of overall fatness. Subjects were Mexican-American men and women (N = 1004) from a study of gallbladder disease in Starr County, Texas. Multivariate associations were sought between lipid profile measures (lipids, lipoproteins, and apolipoproteins) and two sets of anthropometric variables (4 circumferences and 6 skinfolds). This was done to assess the association between lipid-related measures and the two sets of anthropometric variables and guide the construction of indices.^ Two indices emerged from the analysis that seemed to be highly correlated with lipid profile measures independent of obesity. These indices are: 2*arm circumference-thigh skinfold in pre- and post-menopausal women and arm/thigh circumference ratio in men. Next, using the sum of all skinfolds to represent obesity and the selected body fat distribution indices, the following hypotheses were tested: (1) state of obesity and centrally/upper distributed body fat are equally predictive of lipids, lipoproteins and apolipoproteins, and (2) the correlation among the lipid-related measures is not altered by obesity and body fat distribution.^ With respect to the first hypothesis, the present study found that most lipids, lipoproteins and apolipoproteins were significantly associated with both overall fatness and anatomical location of body fat in both sex and menopausal groups. However, within men and post-menopausal women, certain lipid profile measures (triglyceride and HDLT among post-menopausal women and apos C-II, CIII, and E among men) had substantially higher correlation with body fat distribution as compared with overall fatness.^ With respect to the second hypothesis, both obesity and body fat distribution were found to alter the association among plasma lipid variables in men and women. There was a suggestion from the data that the pattern of correlations among men and post-menopausal women are more comparable. Among men correlations involving apo A-I, HDLT, and HDL$\sb2$ seemed greatly influenced by obesity, and A-II by fat distribution; among post-menopausal women correlations involving apos A-I and A-II were highly affected by the location of body fat.^ Thus, these data point out that not only can obesity and fat distribution affect levels of single measures, they also can markedly influence the pattern of relationship among measures. The fact that such changes are seen for both obesity and fat distribution is significant, since the indices employed were chosen because they were independent of one another. ^
Resumo:
A large number of ridge regression estimators have been proposed and used with little knowledge of their true distributions. Because of this lack of knowledge, these estimators cannot be used to test hypotheses or to form confidence intervals.^ This paper presents a basic technique for deriving the exact distribution functions for a class of generalized ridge estimators. The technique is applied to five prominent generalized ridge estimators. Graphs of the resulting distribution functions are presented. The actual behavior of these estimators is found to be considerably different than the behavior which is generally assumed for ridge estimators.^ This paper also uses the derived distributions to examine the mean squared error properties of the estimators. A technique for developing confidence intervals based on the generalized ridge estimators is also presented. ^
Resumo:
To reach the goals established by the Institute of Medicine (IOM) and the Centers for Disease Control's (CDC) STOP TB USA, measures must be taken to curtail a future peak in Tuberculosis (TB) incidence and speed the currently stagnant rate of TB elimination. Both efforts will require, at minimum, the consideration and understanding of the third dimension of TB transmission: the location-based spread of an airborne pathogen among persons known and unknown to each other. This consideration will require an elucidation of the areas within the U.S. that have endemic TB. The Houston Tuberculosis Initiative (HTI) was a population-based active surveillance of confirmed Houston/Harris County TB cases from 1995–2004. Strengths in this dataset include the molecular characterization of laboratory confirmed cases, the collection of geographic locations (including home addresses) frequented by cases, and the HTI time period that parallels a decline in TB incidence in the United States (U.S.). The HTI dataset was used in this secondary data analysis to implement a GIS analysis of TB cases, the locations frequented by cases, and their association with risk factors associated with TB transmission. ^ This study reports, for the first time, the incidence of TB among the homeless in Houston, Texas. The homeless are an at-risk population for TB disease, yet they are also a population whose TB incidence has been unknown and unreported due to their non-enumeration. The first section of this dissertation identifies local areas in Houston with endemic TB disease. Many Houston TB cases who reported living in these endemic areas also share the TB risk factor of current or recent homelessness. Merging the 2004–2005 Houston enumeration of the homeless with historical HTI surveillance data of TB cases in Houston enabled this first-time report of TB risk among the homeless in Houston. The homeless were more likely to be US-born, belong to a genotypic cluster, and belong to a cluster of a larger size. The calculated average incidence among homeless persons was 411/100,000, compared to 9.5/100,000 among housed. These alarming rates are not driven by a co-infection but by social determinants. The unsheltered persons were hospitalized more days and required more follow-up time by staff than those who reported a steady housing situation. The homeless are a specific example of the increased targeting of prevention dollars that could occur if TB rates were reported for specific areas with known health disparities rather than as a generalized rate normalized over a diverse population. ^ It has been estimated that 27% of Houstonians use public transportation. The city layout allows bus routes to run like veins connecting even the most diverse of populations within the metropolitan area. Secondary data analysis of frequent bus use (defined as riding a route weekly) among TB cases was assessed for its relationship with known TB risk factors. The spatial distribution of genotypic clusters associated with bus use was assessed, along with the reported routes and epidemiologic-links among cases belonging to the identified clusters. ^ TB cases who reported frequent bus use were more likely to have demographic and social risk factors associated with poverty, immune suppression and health disparities. An equal proportion of bus riders and non-bus riders were cultured for Mycobacterium tuberculosis, yet 75% of bus riders were genotypically clustered, indicating recent transmission, compared to 56% of non-bus riders (OR=2.4, 95%CI(2.0, 2.8), p<0.001). Bus riders had a mean cluster size of 50.14 vs. 28.9 (p<0.001). Second order spatial analysis of clustered fingerprint 2 (n=122), a Beijing family cluster, revealed geographic clustering among cases based on their report of bus use. Univariate and multivariate analysis of routes reported by cases belonging to these clusters found that 10 of the 14 clusters were associated with use. Individual Metro routes, including one route servicing the local hospitals, were found to be risk factors for belonging to a cluster shown to be endemic in Houston. The routes themselves geographically connect the census tracts previously identified as having endemic TB. 78% (15/23) of Houston Metro routes investigated had one or more print groups reporting frequent use for every HTI study year. We present data on three specific but clonally related print groups and show that bus-use is clustered in time by route and is the only known link between cases in one of the three prints: print 22. (Abstract shortened by UMI.)^
Resumo:
Background Past and recent evidence shows that radionuclides in drinking water may be a public health concern. Developmental thresholds for birth defects with respect to chronic low level domestic radiation exposures, such as through drinking water, have not been definitely recognized, and there is a strong need to address this deficiency in information. In this study we examined the geographic distribution of orofacial cleft birth defects in and around uranium mining district Counties in South Texas (Atascosa, Bee, Brooks, Calhoun, Duval, Goliad, Hidalgo, Jim Hogg, Jim Wells, Karnes, Kleberg, Live Oak, McMullen, Nueces, San Patricio, Refugio, Starr, Victoria, Webb, and Zavala), from 1999 to 2007. The probable association of cleft birth defect rates by ZIP codes classified according to uranium and radium concentrations in drinking water supplies was evaluated. Similar associations between orofacial cleft birth defects and radium/radon in drinking water were reported earlier by Cech and co-investigators in another of the Gulf Coast region (Harris County, Texas).50, 55 Since substantial uranium mining activity existed and still exists in South Texas, contamination of drinking water sources with radiation and its relation to birth defects is a ground for concern. ^ Methods Residential addresses of orofacial cleft birth defect cases, as well as live births within the twenty Counties during 1999-2007 were geocoded and mapped. Prevalence rates were calculated by ZIP codes and were mapped accordingly. Locations of drinking water supplies were also geocoded and mapped. ZIP codes were stratified as having high combined uranium (≥30μg/L) vs. low combined uranium (<30μg/L). Likewise, ZIP codes having the uranium isotope, Ra-226 in drinking water, were also stratified as having elevated radium (≥3 pCi/L) vs. low radium (<3 pCi/L). A linear regression was performed using STATA® generalized linear model (GLM) program to evaluate the probable association between cleft birth defect rates by ZIP codes and concentration of uranium and radium via domestic water supply. These rates were further adjusted for potentially confounding variables such as maternal age, education, occupation, and ethnicity. ^ Results This study showed higher rates of cleft births in ZIP codes classified as having high combined uranium versus ZIP codes having low combined uranium. The model was further improved by adding radium stratified as explained above. Adjustment for maternal age and ethnicity did not substantially affect the statistical significance of uranium or radium concentrations in household water supplies. ^ Conclusion Although this study lacks individual exposure levels, the findings suggest a significant association between elevated uranium and radium concentrations in tap water and high orofacial birth defect rates by ZIP codes. Future case-control studies that can measure individual exposure levels and adjust for contending risk factors could result in a better understanding of the exposure-disease association.^
Resumo:
Measures of agro-ecosystems genetic variability are essential to sustain scientific-based actions and policies tending to protect the ecosystem services they provide. To build the genetic variability datum it is necessary to deal with a large number and different types of variables. Molecular marker data is highly dimensional by nature, and frequently additional types of information are obtained, as morphological and physiological traits. This way, genetic variability studies are usually associated with the measurement of several traits on each entity. Multivariate methods are aimed at finding proximities between entities characterized by multiple traits by summarizing information in few synthetic variables. In this work we discuss and illustrate several multivariate methods used for different purposes to build the datum of genetic variability. We include methods applied in studies for exploring the spatial structure of genetic variability and the association of genetic data to other sources of information. Multivariate techniques allow the pursuit of the genetic variability datum, as a unifying notion that merges concepts of type, abundance and distribution of variability at gene level.
Resumo:
This paper reports the concentrations and within-class distributions of long-chain alkenones and alkyl alkenoates in the surface waters (0-50 m) of the eastern North Atlantic, and correlates their abundance and distribution with those of source organisms and with water temperature and other environmental variables. We collected these samples of >0.8 µm particulate material from the euphotic zone along the JGOFS 20°W longitude transect, from 61°N to 24°N, during seven cruises of the UK-JGOFS Biogeochemical Ocean Flux Study (BOFS) in 1989-1991; the biogeographical range of our 53 samples extends from the cold (<10°C), nutrient-rich and highly productive subarctic waters of the Iceland Basin to the warm (>25°C) oligotrophic subtropical waters off Africa. Surface water concentrations of total alkenone and alkenoates ranged from <50 ng/l in oligotrophic waters below 40°N to 2000-4500 ng/l in high latitude E. huxleyi blooms, and were well correlated with E. huxleyi cell densities, supporting the assumption that E. huxleyi is the predominant source of these compounds in the present day North Atlantic. The within-class distribution of the C37 and C38 alkenones and C36 alkenoates varied strongly as a function of temperature, and was largely unaffected by nutrient concentration, bloom status and other surface water properties. The biosynthetic response of the source organisms to growth temperature differed between the cold (<16°C) waters above 47°N and the warmer waters to the south. In cold (<16°C) waters above 47°N, the relative amounts of alkenoates and C38 alkenones synthesized was a strong function of growth temperature, while the unsaturation ratio of the alkenones (C37 and C38) was uncorrelated with temperature. Conversely, in warm (>16°C) waters below 47°N, the relative proportions of alkenoates and alkenones synthesized remained constant with increasing temperature while the unsaturation ratios of the C37 and C38 methyl alkenones (Uk37 and Uk38Me, respectively) increased linearly. The fitted regressions of Uk37 and Uk38Me versus temperature for waters >16°C were both highly significant (r**2 > 0.96) and had identical slopes (0.057) that were 50% higher than the slope (0.034) of the temperature calibration of Uk37 reported by Prahl and Wakeham (1987; doi:10.1038/330367a0) over the same temperature range. These observations suggest either a physiological adjustment in biochemical response to growth temperature above a 16-17°C threshold and/or variation between different E. huxleyi strains and/or related species inhabiting the cold and warm water regions of the eastern North Atlantic. Using our North Atlantic data set, we have produced multivariate temperature calibrations incorporating all major features of the alkenone and alkenoate data set. Predicted temperatures using multivariate calibrations are largely unbiased, with a standard error of approximately ±1°C over the entire data range. In contrast, simpler calibration models cannot adequately incorporate regional diversity and nonlinear trends with temperature. Our results indicate that calibrations based upon single variables, such as Uk37, can be strongly biased by unknown systematic errors arising from natural variability in the biosynthetic response of the source organisms to growth temperature. Multivariate temperature calibration can be expected to give more precise estimates of Integrated Production Temperatures (IPT) in the sedimentary record over a wider range of paleoenvironmental conditions, when derived using a calibration data set incorporating a similar range of natural variability in biosynthetic response.
Resumo:
This study subdivides the Potter Cove, King George Island, Antarctica, into seafloor regions using multivariate statistical methods. These regions are categories used for comparing, contrasting and quantifying biogeochemical processes and biodiversity between ocean regions geographically but also regions under development within the scope of global change. The division obtained is characterized by the dominating components and interpreted in terms of ruling environmental conditions. The analysis includes in total 42 different environmental variables, interpolated based on samples taken during Australian summer seasons 2010/2011 and 2011/2012. The statistical errors of several interpolation methods (e.g. IDW, Indicator, Ordinary and Co-Kriging) with changing settings have been compared and the most reasonable method has been applied. The multivariate mathematical procedures used are regionalized classification via k means cluster analysis, canonical-correlation analysis and multidimensional scaling. Canonical-correlation analysis identifies the influencing factors in the different parts of the cove. Several methods for the identification of the optimum number of clusters have been tested and 4, 7, 10 as well as 12 were identified as reasonable numbers for clustering the Potter Cove. Especially the results of 10 and 12 clusters identify marine-influenced regions which can be clearly separated from those determined by the geological catchment area and the ones dominated by river discharge.
Resumo:
High-resolution quantitative diatom data are tabulated for the early part of the late Pliocene ( 3.25 to 2.08 Ma ) at DSDP Site 580 in the northwestern Pacific. Sample spacing averages 11 k.y. between 3.1 and 2.8 Ma, but increases to 14 to 19 k.y. prior to 3.1 Ma and after 2.8 Ma. Q-mode factor analysis of the middle Pliocene assemblage reveals four factors which explain 92.4% of the total variance of the 47 samples studied between 3.25 and 2.55 Ma. Three of the factors are closely related to modern subarctic, transitional, and subtropical elements, while the fourth factor, which is dominated by Coscinodiscus marginatus and the extinct Pliocene species Neodenticula kamtschatica, appears to correspond to a middle Pliocene precursor of the subarctic water mass. Knowledge of the modern and generalized Pliocene paleoclimatic relationships of various diatom taxa is used to generate a paleoclimate curve ("Twt") based on the ratio of warm-water (subtropical) to cold-water diatoms with warm-water transitional taxa (Thalassionema nitzschioides, Thalassiosira oestrupii, and Coscinodiscus radiatus) factored into the equation at an intermediate (0.5) value. The "Twt" ratios at more southerly DSDP Sites 579 and 578 are consistently higher (warmer) than those at Site 580 throughout the Pliocene, suggesting the validity of the ratio as a paleoclimatic index. Diatom paleoclimatic data reveal a middle Pliocene (3.1 to 3.0 Ma) warm interval at Site 580 during which paleotemperatures may have exceeded maximum Holocene values by 3 °- 5.5 °C at least three times. This middle Pliocene warm interval is also recognized by planktic foraminifers in the North Atlantic, and it appears to correspond with generalized depleted oxygen isotope values suggesting polar warming. The diatom "Twt" curve for Site 580 compares fairly well with radiolarian and silicoflagellate paleoclimatic curves for Site 580, planktic foraminiferal sea-surface temperature estimates for the North Atlantic, and benthic oxygen isotope curves for late Pliocene, although higher resolution studies on paired samples are required to test the correspondence of these various paleoclimatic indices.
Resumo:
An organic-walled dinoflagellate cyst analysis was carried out on 53 surface sediment samples from West Africa (17-6°N) to obtain insight in the relationship between their spatial distribution and hydrological conditions in the upper water column as well as marine productivity in the study area. Multivariate analysis of the dinoflagellate cyst relative abundances and environmental parameters of the water column shows that sea-surface temperature, salinity, marine productivity and bottom water oxygen are the factors that relate significantly to the distribution patterns of individual species in the region. The composition of cyst assemblages and dinoflagellate cyst concentrations allows the identification of four hydrographic regimes; 1) the northern regime between 17 and 14°N characterized by high productivity associated with seasonal coastal upwelling, 2) the southern regime between 12 and 6°N associated with high-nutrient waters influenced by river discharge 3) the intermediate regime between 14 and 12°N influenced mainly by seasonal coastal upwelling additionally associated with fluvial input of terrestrial nutrients and 4) the offshore regime characterized by low chlorophyll-a concentrations in upper waters and high bottom water oxygen concentrations. Our data show that cysts of Polykrikos kofoidii, Selenopemphix quanta, Dubridinium spp., Echinidinium species, cysts of Protoperidinium monospinum and Spiniferites pachydermus are the best proxies to reconstruct the boundary between the NE trade winds and the monsoon winds in the subtropical eastern Atlantic Ocean. The association of Bitectatodinium spongium, Lejeunecysta oliva, Quinquecuspis concreta, Selenopemphix nephroides, Trinovantedinium applanatum can be used to reconstruct past river outflow variations within this region.
Resumo:
To obtain insight in the relationship between the spatial distribution of organic-walled dinoflagellate cysts (dinocysts) and local environmental conditions, fifty-eight surface sediment samples from the coastal shelf off SW Africa were investigated on their dinocyst content with special focus on the two main river systems and the active upwelling that characterise this region. To avoid possible overprint by species-selective preservation, samples have been selected mainly from shelf sites where high sedimentation rates and/or low bottom water oxygen concentrations prevail. Multivariate ordination analyses have been carried out to investigate the relationship between the distribution patterns of individual species to environmental parameters of the upper water column and sediment transport processes. The main oceanographical variables at the surface (temperature, salinity, nutrients chlorophyll-a) in the region show onshore-offshore gradients. This pattern is reflected in the dinocyst associations with high relative abundances of heterotrophic dinocyst species in neritic regions characterised by high chlorophyll-aand low salinity conditions in surface waters. Phototrophic dinocyst species, notably Operculodinium centrocarpum, dominate in the more oceanic area. Differences in the distribution of phototrophic dinocyst species can be related to sea surface salinity and sea surface temperature gradients and to a lesser extent to chlorophyll-a concentrations. Apart from longitudinal gradients the dinocyst distribution clearly reflects regional environmental features. Six groups of species can be distinguished, characteristic for (1) coastal regions (cysts of Polykrikos kofoidii and Selenopemphix quanta), (2) the vicinity of active upwelling (Brigantedinium spp., Echinidinium aculeatum, Echinidinium spp. and Echinidinium transparantum), (3) river mouths (Lejeunecysta oliva, cysts of Protoperidinium americanum, Selenopemphix nephroides and Votadinium calvum), (4) slope and open ocean sediments (Dalella chathamense, Impagidinium patulum and Operculodinium centrocarpum, (5) the southern Benguela region (south of 24°S) (Spiniferites ramosus) and (6) the northern Benguela region (north of 24°S) (Nematosphaeropsis labyrinthus and Pyxidinopsis reticulata). No indication of overprint of the palaeo-ecological signal by lateral transport of allochthonous species could be observed.
Resumo:
One of the most promising areas in which probabilistic graphical models have shown an incipient activity is the field of heuristic optimization and, in particular, in Estimation of Distribution Algorithms. Due to their inherent parallelism, different research lines have been studied trying to improve Estimation of Distribution Algorithms from the point of view of execution time and/or accuracy. Among these proposals, we focus on the so-called distributed or island-based models. This approach defines several islands (algorithms instances) running independently and exchanging information with a given frequency. The information sent by the islands can be either a set of individuals or a probabilistic model. This paper presents a comparative study for a distributed univariate Estimation of Distribution Algorithm and a multivariate version, paying special attention to the comparison of two alternative methods for exchanging information, over a wide set of parameters and problems ? the standard benchmark developed for the IEEE Workshop on Evolutionary Algorithms and other Metaheuristics for Continuous Optimization Problems of the ISDA 2009 Conference. Several analyses from different points of view have been conducted to analyze both the influence of the parameters and the relationships between them including a characterization of the configurations according to their behavior on the proposed benchmark.
Resumo:
Understanding spatial distributions and how environmental conditions influence catch-per-unit-effort (CPUE) is important for increased fishing efficiency and sustainable fisheries management. This study investigated the relationship between CPUE, spatial factors, temperature, and depth using generalized additive models. Combinations of factors, and not one single factor, were frequently included in the best model. Parameters which best described CPUE varied by geographic region. The amount of variance, or deviance, explained by the best models ranged from a low of 29% (halibut, Charlotte region) to a high of 94% (sablefish, Charlotte region). Depth, latitude, and longitude influenced most species in several regions. On the broad geographic scale, depth was associated with CPUE for every species, except dogfish. Latitude and longitude influenced most species, except halibut (Areas 4 A/D), sablefish, and cod. Temperature was important for describing distributions of halibut in Alaska, arrowtooth flounder in British Columbia, dogfish, Alaska skate, and Aleutian skate. The species-habitat relationships revealed in this study can be used to create improved fishing and management strategies.