|
|
|||||||||||
|
|
|
|||||||||||
|
Youth Substance Use: State Estimates From the 1999 National Household Survey on Drug Abuse |
||||||||||||
Appendix G: State Estimation Methodology
In response to the need for State-level information on substance abuse problems, the Substance Abuse and Mental Health Services Administration (SAMHSA) began developing and testing small area estimation (SAE) methods for the National Household Survey on Drug Abuse (NHSDA) in 1994 under a contract with the Research Triangle Institute (RTI). That developmental work used logistic regression models with data from the combined 1991 to 1993 NHSDAs and local area indicators, such as drug-related arrests, alcohol-related death rates, and block group/tract level characteristics from the 1990 Census that were found to be associated with substance abuse. In 1996, the results were published for 25 States for which there were sufficient sample data (SAMHSA, 1996). A subsequent report described the methodology in detail and noted areas in which improvements were needed (Folsom & Judkins, 1997).
The increasing need for State-level estimates of substance use led to the decision to expand the NHSDA to provide estimates for all 50 States and the District of Columbia on an annual basis beginning in 1999. It was determined that, with the use of modeling similar to that used with the 1991 to 1993 NHSDA data in conjunction with a sample designed for State-level estimation, a sample of about 67,500 persons would be sufficient to make reasonably precise estimates.
The State-based NHSDA sample design implemented in 1999 had the following characteristics:
1. States are stratified into field interviewer (FI) regions that covered the geography of each State. The FI regions are comprised of contiguous Census tracts and counties and designed to yield about 75 interviews per region. In the 42 smaller States (by population) and the District of Columbia, there are 12 FI regions; in the eight largest States, there are 48 FI regions.
2. Within each region, eight segments are randomly selected and two are allocated to each calendar quarter of data collection.
3. Within each segment, households are screened, and a sample of one to two persons per household is selected. An average of nine responding persons per segment is sought.
4. The samples are selected so that approximately 900 responding persons, 300 in each age group (12 to 17, 18 to 25, and 26 or older), are drawn in each of the 42 States and the District of Columbia. In the eight large States, the person samples are allocated equally to the three age groups with overall respondent sample sizes ranging from 2,669 to 4,681.
In preparation for the modeling of the 1999 data, RTI used the data from the combined 1994-96 NHSDAs to develop an improved methodology that utilized more local area data and produced better estimates of the accuracy of the State estimates (Folsom, Shah, & Vaish, 1999). That effort involved the development of procedures that would validate the results for geographic areas with large samples. This work was reviewed by a panel with expertise in smallarea estimation.1 They approved of the methodology, but suggested further improvements for the modeling to be used to produce the 1999 State estimates. Those improvements have been incorporated into the methodology finally used for the 1999 State estimates included in this report. The methodology, called Survey-Weighted Hierarchical Bayes Estimation (HB), is described below.
There were several goals underlying the estimation process. The first was to model drug use at the lowest possible level and aggregate over the levels to form the State estimates. The chosen level of aggregation was the 32 age group (12 to 17, 18 to 25, 26 to 34, 35+) by race/ethnicity (white-not Hispanic, black-not Hispanic, Hispanic, Other) by gender cells at the block group level. Estimated population counts could be obtained from a private vendor for each block group for each of the 32 cells. This level of aggregation was desired because the NHSDA first stage of sample selection was at the block group level, so that there would be data at this level to fit a model. In addition, there was a great deal of information from the Census at the block group level that could be used as predictors in the models. If prevalence rates could be estimated for each of the 32 cells at the block group level, it would only be necessary to multiply by the estimated population counts and aggregate to the State level.
Another goal of the estimation process was to include the sampling weight in the model in such a way that the small area estimates would converge to the design-based (sample-weighted) estimate when they are aggregated to a sufficient sample size. There was a desire for the estimates to have this characteristic so that there would be consistency with the survey-weighted national estimates based on the entire sample.
A third goal was to include as much local source data as possible, especially data related to each substance use measure. This would help provide a better fit beyond the strictly sociodemographic information. The desire was to use national sources of these data so that there would be consistency of collection and estimation methodology across States.
Recognizing that estimates based solely on these "fixed" effects would not reflect differences across States due to differences in laws, enforcement activities, advertising campaigns, outreach activities, and other such unique State contributions, a fourth goal was to include "random" effects to compensate for these differences. The types of random effects that could be supported by the NHSDA data were a function of the size of sample and the model fit to the sample data. For the 1999 survey, random effects were included at the State level and for substate regions comprised of three neighboring FI regions. Although this grouping of the three FI regions was principally motivated by the need to accumulate enough sample to support good model fitting for the low prevalence NHSDA outcomes, it was also reasoned that it would be possible to produce substate HB estimates for areas comprised of these FI region groups, once 2 or 3 years of NHSDA data were available, because that would yield substate region samples of at least 400 respondents. For substate areas like counties and large municipalities that do not conform to the substate region boundaries, HB estimates could be derived from their elementalblock group level contributions, but the direct survey data employed in the estimation of the associated substate region effects would not be restricted to the county or city of interest. This mismatch of FI region and county/large municipality boundaries weakens the theoretical appeal of the associated HB estimate. For this reason, substate HB estimates probably should be restricted to areas that can be matched reasonably well to FI region groups.
One of the difficulties of typical SAE has been obtaining good estimates of the accuracy of the estimates with prediction intervals that give a good representation of the true probability of coverage of the intervals. Therefore, the final major goal was to provide accurate prediction intervals-ones that would approach the usual sample-based intervals as the sample size increases.
A set of 20 measures covering a variety of aspects of substance use and abuse was designated. These variables are listed below. The first seven measures in the list were considered priority variables and were discussed in the Summary of Findings from the 1999 National Household Survey on Drug Abuse (SAMHSA, 2000b). The remaining variables have been estimated. Some have been discussed in this report, and the remaining ones will be released in a separate State report later this year.
1. |
past month binge alcohol use |
12. |
perceived great risk of smoking one or more packs of cigarettes everyday | |
2. |
past month cigarette use |
13. |
perceived great risk of having five or more alcoholic drinks once or twice a week | |
3. |
past month marijuana use |
14. |
past year receipt of treatment for illicit drugs | |
4. |
past month any illicit drug use |
15. |
past year receipt of treatment for illicit drugs or alcohol | |
5. |
past month any illicit drug except marijuana use |
16. |
past year needed treatment for illicit drugs or dependent on alcohol | |
6. |
past year dependence on illicit drugs |
17. |
past year needed treatment for illicit drugs | |
7. |
past year dependence on alcohol or illicit drugs |
18. |
past year cocaine use | |
8. |
past month alcohol use |
19. |
past month tobacco use | |
9. |
never use marijuana |
20. |
Food Stamp participation rate | |
10. |
first time use of marijuana in the past 2 years |
|||
11. |
perceived great risk of smoking marijuana once a month |
|||
Local area data used as potential predictor variables in the logistic regression models were obtained from several sources, including Claritas, the Census Bureau, the FBI (Uniform Crime Reports), Health Resources and Services Administration (Area Resource File), SAMHSA (Uniform Facility Data Set), and the National Center for Health Statistics (mortality data). The list of sources and potential data items used in the modeling are provided below.
Claritas
Demographic data package called Building Block Basic, Age by Race from Claritas for 1999 with projections to 2004; the estimates for 1999-population counts were used
Census Bureau
1990 Census, demographic and socioeconomic variables
July 1997 Food Stamp participation rates
Federal Bureau of Investigation
Uniform Crime Report (UCR), UCR arrest totals from: http://fisher.lib.Virginia.EDU/crime/; the most current data are for 1997 for most counties, and previous years data were used in a few cases
Health Resources and Services Administration
Area Resource File (ARF), some variables relating to income and employment from the ARF February 1999 release from the Bureau of Health Professions, Office of Research and Planning
National Center for Health Statistics
Mortality data using International Classification of Diseases, 9th revision (ICD-9), 1992 to 1997; ICD-9 death rate data from the Centers for Disease Control and Prevention at the National Center for Health Statistics
SAMHSA, Office of Applied Studies
Uniform Facility Data Set (UFDS), 1997 to 1998 UFDS data on drug and alcohol treatment rates from Synectics for Management Decisions, Inc.
The following tables list the specific independent variables that were potential predictors in the models.
Claritas Data | |
| Description | Level |
% Population aged 0-18 in block group |
Block group |
% Population aged 19-24 in block group |
Block group |
% Population aged 25-34 in block group |
Block group |
% Population aged 35-44 in block group |
Block group |
% Population aged 45-54 in block group |
Block group |
% Population aged 55-64 in block group |
Block group |
% Population aged 65+ in block group |
Block group |
% Blacks in block group |
Block group |
% Hispanics in block group |
Block group |
% Other race in block group |
Block group |
% Whites in block group |
Block group |
% Males in block group |
Block group |
% Females in block group |
Block group |
% American Indian, Eskimo, Aleut in tract |
Tract |
% Asian, Pacific Islander in tract |
Tract |
% Population aged 0-18 in tract |
Tract |
% Population aged 19-24 in tract |
Tract |
% Population aged 25-34 in tract |
Tract |
% Population aged 35-44 in tract |
Tract |
% Population aged 45-54 in tract |
Tract |
% Population aged 55-64 in tract |
Tract |
% Population aged 65+ in tract |
Tract |
% Blacks in tract |
Tract |
% Hispanics in tract |
Tract |
% Other race in tract |
Tract |
% Whites in tract |
Tract |
% Males in tract |
Tract |
% Females in tract |
Tract |
% Population aged 0-18 in county |
County |
% Population aged 19-24 in county |
County |
% Population aged 25-34 in county |
County |
% Population aged 35-44 in county |
County |
% Population aged 45-54 in county |
County |
% Population aged 55-64 in county |
County |
% Population aged 65+ in county |
County |
% Blacks in county |
County |
% Hispanics in county |
County |
% Other race in county |
County |
% Whites in county |
County |
% Males in county |
County |
% Females in county |
County |
1990 Census Data | |
Description |
Level |
% Population who dropped out of high school |
Tract |
% Housing units built in 1940-1949 |
Tract |
% Persons 16-64 with a work disability |
Tract |
% Hispanics who are Cuban |
Tract |
% Females 16 years or older in labor force |
Tract |
% Females never married |
Tract |
% Females separated/divorced/widowed/other |
Tract |
% One-person households |
Tract |
% Female head of household, no spouse, child #18 |
Tract |
% Males 16 years or older in labor force |
Tract |
% Males never married |
Tract |
% Males separated/divorced/widowed/other |
Tract |
% Housing units built in 1939 or earlier |
Tract |
Average persons per room |
Tract |
% Families below poverty level |
Tract |
% Households with public assistance income |
Tract |
% Housing units rented |
Tract |
% Population 9-12 years of school, no high school diploma |
Tract |
% Population 0-8 years of school |
Tract |
% Population with associate's degree |
Tract |
% Population some college and no degree |
Tract |
% Population with bachelor's, graduate, professional degree |
Tract |
Median rents for rental units |
Tract |
Median value of owner-occupied housing units |
Tract |
Median household income |
Tract |
Uniform Crime Report Data | |
Description |
Level |
Drug possession arrest rate |
County |
Drug sale/manufacture arrest rate |
County |
Drug violations' arrest rate |
County |
Marijuana possession arrest rate |
County |
Marijuana sale/manufacture arrest rate |
County |
Opium cocaine possession arrest rate |
County |
Opium cocaine sale/manufacture arrest rate |
County |
Other drug possession arrest rate |
County |
Other dangerous non-narcotics arrest rate |
County |
Serious crime arrest rate |
County |
Violent crime arrest rate |
County |
Categorical Data | ||
Description |
Source |
Level |
=1 if Hispanic, =0 otherwise |
Sample |
Person |
=1 if non-Hispanic Black, =0 otherwise |
Sample |
Person |
=1 if non-Hispanic Other, =0 otherwise |
Sample |
Person |
=1 if male, =0 if female |
Sample |
Person |
=1 if Northeast region, =0 otherwise |
1990 Census |
State |
=1 if Midwest region, =0 otherwise |
1990 Census |
State |
=1 if South region, =0 otherwise |
1990 Census |
State |
=1 if MSA with 1 million +, =0 otherwise |
1990 Census |
County |
=1 if MSA with <1 million, =0 otherwise |
1990 Census |
County |
=1 if non-MSA urban, =0 otherwise |
1990 Census |
Tract |
Underclass indicator |
Urban Institute |
Tract |
=1 if no Cubans in tract, =0 otherwise |
1990 Census |
Tract |
=1 if urban area, =0 if rural area |
1990 Census |
Tract |
=1 if no arrests for dangerous non-narcotics |
UCR |
County |
=0 otherwise | ||
Miscellaneous Data | ||
Variable Description |
Level |
Source |
Alcohol death rate, direct cause |
County |
ICD-9 |
Alcohol death rate, indirect cause |
County |
ICD-9 |
Cigarettes death rate, direct cause |
County |
ICD-9 |
Cigarettes death rate, indirect cause |
County |
ICD-9 |
Drug death rate, direct cause |
County |
ICD-9 |
Drug death rate, indirect cause |
County |
ICD-9 |
Alcohol treatment rate |
County |
UFDS |
Alcohol and drug treatment rate |
County |
UFDS |
Drug treatment rate |
County |
UFDS |
% Families below poverty level |
County |
ARF |
Unemployment rate |
County |
ARF |
Median personal income |
County |
ARF |
Food stamp participation rate |
County |
Census Bureau |
Independent variables for modeling each of the substance use measures were first identified by a CHAID (Chi-squared Automatic Interaction Detector) algorithm. CHAID is an algorithm that does not use sample weights. Prior to this process, all the continuous variables were categorized using deciles and were treated as ordinal in CHAID. Race, region, and gender were treated as nominal categorical variables in CHAID. Significant independent variables from each model that were final nodes in the tree-growing process were identified as indicator variables destined for inclusion at a later step.
Independently, a SAS stepwise logistic regression model was fit for each dependent variable by age group. The SAS stepwise was used because it was able to quickly run all of the variables for all of the models, although it was recognized that the software would not take into account the complex sample design and the weights. The independent variables included all the first-order or linear polynomial trend contrasts across the 10 levels of the categorized variables plus the gender, region, and race variables. Significant variables (at the 3 percent level) were identified from this process. Based on this list, a list of variables was created that included the second- and third-order polynomials and the interaction of the first-order polynomials with the gender, race, and region variables.
Next, the variables from the CHAID process and the SAS process were entered into a SAS stepwise logistic model at the 1 percent significance level. Because of past concerns about overfitting of the data in earlier estimation using the 1991 to 1993 NHSDA data, the significance levels were made quite stringent. These variables were then entered into a SUDAAN logistic regression model because the SUDAAN software would adjust for the effects of the weights andother aspects of the complex sample design. All variables that were still significant at the 1 percent significance level were entered into the survey weighted HB process.
Independently, a factor-analytic approach was used to determine the important variables to include in the model. This approach would allow the data to self-identify the important dimensions. The concern here was to use an alternate method that would have a certain face validity. That method was utilized to identify an independent set of variables that were then processed through the HB estimation. The results, however, in terms of model-fit and prediction intervals were generally not as good as with the CHAID/SAS/SUDAAN screening process for candidate independent variables. Also, the factor-analytic approach involves an inherently subjective step to attribute names to the various factor loadings, and the interest was more in the predictive ability of variables than in a substantive description of the dimensions. Nevertheless, it was encouraging to see that the results of the two approaches gave reasonably similar results. For these reasons, the estimates in this report were those based on the latter method that started with the CHAID process.
The model can be characterized as a complex mixed model (including both fixed and random effects) of the form:
8=X$ + ZU
Each of the symbols represents a matrix or vector. The leading term X$ is the usual (fixed) regression contribution, and ZU represents random effects for the States and FI region groups that the data will support and for which estimates are desired. Not obvious from the notation is that the form of the model is a logistic model used to estimate dichotomous data. The 8 vector has elements ln[Bijk /(1-Bijk)], where the Bijk is the propensity for the kth person in the jth FI composite region in the ith State to engage in the behavior of interest (e.g., to use marijuana in the past month). Also not obvious from the notation is that the model fitting utilizes the final "sample" weights as discussed above. The "sample" weights have been adjusted for nonresponse and poststratified to known Census counts.
The estimate for each State behaves like a "weighted" average of the direct survey estimate in that State and the predicted value based on the national regression model. The "weights" in this case are functions of the relative precision of the sample based estimate for the State and the predicted estimate based on the national model. The eight large States have large samples, and thus more "weight" is given to the sample estimate relative to the model-based regression estimate. The 42 small States and the District of Columbia put relatively more "weight" on the regression estimate because of their smaller samples. The national regression estimate actually uses national parameters that are based on the full sample of approximately 67,000 persons; however, the regression estimate for a specific State is based on applying the national regression parameters to that State's "local" county, block group, and tract level predictor variables and summing to the State level. Therefore, even the national regression component of the estimate for a State includes "local" State data.
The goal then was to come up with the best estimates of $ and U. This would lead to the best estimates of 8, which would in turn lead to the best estimate of B. Once the best estimate of B for each block group and each age/race/gender cell within a block group has been estimated, the results could be weighted by the projected Census population counts at that level to make estimates for any geographic area larger than a block group.
G.7 Implementation of Modeling
The solution to the equation for 8 in the above section is not straightforward but involves a series of iterative steps to generate values of the desired fixed and random effects from the underlying joint distribution. The details of the technique will be described in more detail in a methodological report currently in progress. In the interim, the basic process can be described as follows.
Let $
denote the matrix of fixed effects, 0
be the matrix of State random effects i = 1-51, and
< denote the matrix of FI composite region
effects j within State i. Because the goal is to estimate separate
models for four age groups, it is assumed that the random effects vectors
are four variate Normal with null mean vectors and 4X4 covariance matrices D0
and D<,
respectively. To estimate the individual effects, a Bayesian approach is used to
represent the joint density function given the data by f($,
0 ,
<, D<
, D0
| y). According to the Bayes process, this can be estimated once the
conditional distributions are known:
f1( $ | 0, <, D< , D0 , y), f2(D< , D0 | $ 0 , <, y), and f3(0 , < | $, D< , D0 , y).
To generate random draws from these distributions, Markov Chain Monte Carlo (MCMC) processes need to be used. These are a body of methods for generating pseudo-random draws from probability distributions via Markov chains. A Markov chain is fully specified by its starting distribution P(X0) and the transition kernel P(Xt |Xt-1).
Each MCMC step that involves the vector of binary outcome variables y in the conditioning set needs first to be modified by defining a pseudo-likelihood using survey weights. In defining pseudo-likelihood, weights are introduced after scaling them to the effective sample size based on a suitable design effect. Note that with the pseudo-likelihood, the covariance matrix of the pseudo-score functions is no longer equal to the pseudo-information matrix, and therefore a sandwich-type of covariance matrix was to compute the design effect. In this process, weights are largely assumed to be noninformative (i.e., unrelated to the outcome variable y). The assumption of noninformative weights is useful in finding tractable expressions for the appropriate information matrix of the pseudo score functions. The pseudo log-likelihood remains an unbiased estimate of the finite-population log-likelihood regardless of this assumption.
Step I [$" | 0, v, y] (this does not depend on D0, Dv )
With flat prior for $", the conditional posterior is proportional to the pseudo-likelihood function. For large samples, this posterior can be approximated by the multivariate normal distribution with mean vector equal to the pseudo-maximum likelihood estimate and with asymptotic covariance matrix having the associated sandwich form. Assuming that the survey weights are noninformative makes the age group specific $" vectors conditionally independent of each other. Therefore, the $" can be updated separately at each MCMC cycle.
Step II [0i | $, vi, D0, y] (this does not depend on Dv )
Here the conditional posterior is proportional to the product of the prior g(0i|.), the pseudo-likelihood function f(y|.) as well as the prior p($,D0); this last prior can be omitted as it does not involve 0i. To calculate the denominator (or the normalization constant) of the posterior distribution for 0i requires multidimensional integration and is numerically intractable. To get around this problem, the Metropolis-Hastings (M-H) algorithm is used that requires a dominating density convenient for Monte Carlo sampling. For this purpose, the mode andcurvature of the conditional posterior distribution are used; these can be simply obtained from its numerator. Then a Gaussian distribution is used with matching mode and curvature to define the dominating density for M-H. As with the age group specific $" parameters, the State-specific random effect vectors 0i are conditionally independent of each other and can be updated separately at each MCMC cycle.
Step III [vij | $, 0i, Dv, y] (this does not depend on D0)
Similar to step II.
Step IV [D0 | 0] , [Dv | v] (here, 0 and v include all the information from y)
Here, the pseudo-likelihood involving design weights comes in implicitly through the conditioning parameters 0 and v evaluated at the current cycle. An exact conditional posterior distribution is obtained because the inverse Wishart priors for D0 and Dv are conjugate.
Remarks
In the NHSDA application, three FI regions were combined to form a minimum of four substate regions with corresponding random effects. This was done to ensure adequate sample sizes for estimation purposes.
There is self-calibration built in to the modeling. This is achieved via design effect-scaling of survey weights incorporated in the conditional posterior density so that small area estimates for large States become asymptotically equivalent to the direct estimates. Similarly, survey-weighted estimates of the fixed parameters (in particular the intercept) give calibration of the aggregate of small area estimates to the national direct estimate.
For posterior variance estimation purposes, the survey weights were largely assumed to be noninformative. The survey design effects on the posterior variance are therefore restricted to unequal weighting effects. It was assumed that all the design-related clustering effects are represented by between State and between substate (within State) variability of random effects. This does not take care of variability at lower levels of clustering. However, sample size is not sufficient at lower levels to support stable estimates of random effects for area segments.
If the logistic mixed model fits well, the variance estimates should be reasonable. The self-calibration property provides some protection against model breakdown. Research is currently under way to develop a new MCMC algorithm that fully accounts for survey design effects on the small area estimate posterior prediction intervals.
The following validation methodology was implemented at the time of the first release of the 1999 NHSDA data (SAMHSA, 2000b) and is based on the seven variables discussed in that report. Subsequently, an error in the imputation program was discovered, and the corrected estimates have been made available on the SAMHSA website. The imputation error should not have affected the results of the validation process in which estimates from repeated simulated samples were compared to the overall direct estimates because the imputation error would have been reflected in both the simulated data and the overall direct estimate. Therefore, those results are presented again below.
To validate the fit of the SAE models, the eight large sample States were used as internal benchmarks. For this purpose, 12 pseudo FI regions within each large sample State were created by pooling the 48 initial regions into groups of 4. Each of these pseudo FI regions were then expected to have 8 area segments per calendar quarter. For each of these pseudo FI region by quarter sets of 8 area segments, any segments that were devoid of interviews were first randomly replaced by a selection from the non-empty segments in the set. The completed set of 8 segments from each pseudo FI region by quarter combination was then randomly partitioned into 4 replicates of 2 segments each. Combined across the 12 pseudo FI regions and the 4 calendar quarters, each of the 4 substate replicates mimicked the size and design structure of a small State sample.
Having created four pseudo small State samples and associated universe level files for each large State, SAEs were then produced for 75 States (43 + 32), including the 43 small States and 32 substate territories defined across the eight large sample States. Tables G.3 and G.4 show these 32 substate SAEs and their direct survey weighted analogs for two of the seven substances included in the validation analysis-one with a medium prevalence, and one with a low prevalence. Full State sample estimates have been included for comparison purposes. Relative absolute biases of the substate estimates are shown where the full State sample direct estimate is used as the benchmark value.
The State specific relative absolute bias (RB) quantities in Tables G.3 and G.4 equal the absolute differences of the averaged four substate small area estimates (SS1, .., SS4) and the State full sample design based benchmark (e.g., California, etc.) divided by the benchmark. The average relative absolute bias (ARB) is the simple average across the eight large States of the RBs. For the two highest prevalence items, binge alcohol and cigarette use, these ARB quantities are quite small; namely 1.30 and 1.71 percent, respectively, for the total age 12 or older age group. For the three items with prevalence rates in the middle range, dependence on illicit drugs or alcohol, marijuana use and any illicit drug use, the ARB measures range from 4.75 to 5.82 percent for the total age group. The two lowest prevalence items, dependence on illicit drugs and use of any illicit drug other than marijuana, have ARBs of 8.38 and 11.49 percent for the total age group. The age groups with the lowest prevalence rates are seen to have the largest ARBs.
Table G.2 provides estimates of the relative absolute bias for the eight large States for three substance measures. The RB for a specific State is the absolute value of the difference between the survey weighted HB estimate and the direct survey estimate based on the full sample, divided by the direct survey estimate. Because models for these States put less reliance on the model, their biases are smaller than for the 42 States and the District of Columbia. For past month use of cigarettes (not shown) among the age 12 or older population, the ARB across the eight States was 1.4 percent. For past month use of any illicit drug, the ARB was 4.2 percent, and for the substance with the lowest prevalence, past year dependence on any illicit drug, the percentage was 7.5.
To compare the overall precision of the small area estimates with the direct survey estimates, ratios of the corresponding 95 percent Bayes (credible) intervals, which fully account for the posterior variance of the fixed and random effect parameters, were compared to the corresponding direct survey confidence intervals. These results are displayed in Tables G.5 and G.6 for past month use of any illicit drug and past year dependence on any illicit drug.
The SAE and direct intervals are summarized by showing average ratios of the relative interval widths (the interval width for a State divided by the corresponding estimate for that State) by State and overall averaged of the ratios across States by outcome. For the eight largeStates for those aged 12 or older, the average ratios are cigarettes .89, any illicit drug .84, and dependence on any illicit drug.78. For the other States and the District of Columbia, the comparable estimates are cigarettes .71, any illicit drug .62, and dependence on any illicit drug .60. This indicates that on average the HB estimates are more precise than the corresponding direct survey estimates.
Table G.1 shows the screening, interview, and overall response rate for each State and the District of Columbia. As mentioned in the text, these variable response rates can be associated with variable levels of nonresponse bias. In addition, there may also be varying levels of response bias as a result of underreporting (and sometimes overreporting) use of illicit substances. For 1999, the assumptions being made are that the biases from these two sources are constant across States so that comparisons among States still hold.
Another possible contributor to bias in the State estimates, and the estimates in general, was the effect of editing and imputation on two substances-past month use of marijuana and past month binge use of alcohol. In developing the editing and imputation process for 1999 and subsequent years, the desire was to minimize the amount of editing that is typically somewhat subjective, and instead let the random imputation process supply any partially missing information. Overall, the percentage of imputed information is quite small for any given substance. The method as described earlier is based on a multivariate imputation in which some demographic and other substance use information from the respondent is used to determine a donor who is similar in those characteristics but has supplied data for the drug in question. Often, information was also available from the partial respondent on the recency of drug use. For example, respondents may have indicated that they used the drug in their lifetime or in the past year, but left blank the question about use in the past month. For many of the records, this auxiliary information was available. In a small portion of the time, no auxiliary information was available, in which case a random donor with similar drug use patterns and demographic characteristics was used. For the different substances, the largest differences between the edited and the imputed estimates typically occurred when there was a lot of auxiliary information. For marijuana, the State with the largest percentage change from edited to imputed data was Alabama, whose edited rate of use of marijuana was 2.1 percent and imputed rate of use was 3.1 percent-a relative increase of almost 50 percent.
Lastly, the differences in State levels of substance use often reflect differences that are due in part to underlying socioeconomic differences. Table G.7 presents State information on a few variables that are may have some association with substance use. These variables include the percentage of persons aged 18 to 25, the percentage of persons by race/ethnicity, the percentage of persons below poverty, the percentage who are urban, the percentage of female heads of household, the unemployment rate, the mean personal income, and the median household income.
Table G.1 1999 NHSDA Weighted CAI Screening and Interview Response Rates, by State
|
State |
Screening Response Rate |
Interview Response Rate |
Overall Response Rate |
State |
Screening Response Rate |
Interview Response Rate |
Overall Response Rate |
Total |
89.63 |
68.55 |
61.44 |
Missouri |
91.32 |
73.59 |
67.21 |
Alabama |
92.60 |
71.36 |
66.08 |
Montana |
92.76 |
76.39 |
70.86 |
Alaska |
91.07 |
77.20 |
70.31 |
Nebraska |
89.99 |
72.05 |
64.84 |
Arizona |
94.43 |
65.87 |
62.21 |
Nevada |
79.89 |
63.05 |
50.37 |
Arkansas |
95.71 |
80.45 |
77.00 |
New Hampshire |
85.36 |
69.87 |
59.65 |
California |
87.47 |
64.12 |
56.08 |
New Jersey |
89.65 |
65.24 |
58.48 |
Colorado |
91.62 |
65.84 |
60.32 |
New Mexico |
96.12 |
77.77 |
74.75 |
Connecticut |
85.62 |
58.60 |
50.17 |
New York |
84.28 |
59.98 |
50.55 |
Delaware |
87.13 |
58.36 |
50.85 |
North Carolina |
92.87 |
71.84 |
66.72 |
District of Columbia |
93.35 |
79.93 |
74.61 |
North Dakota |
89.89 |
77.48 |
69.65 |
Florida |
89.94 |
68.20 |
61.33 |
Ohio |
90.35 |
67.78 |
61.24 |
Georgia |
90.47 |
66.97 |
60.59 |
Oklahoma |
91.58 |
67.79 |
62.08 |
Hawaii |
89.11 |
67.61 |
60.25 |
Oregon |
85.20 |
71.57 |
60.98 |
Idaho |
92.93 |
75.45 |
70.11 |
Pennsylvania |
92.34 |
68.99 |
63.71 |
Illinois |
87.35 |
63.74 |
55.68 |
Rhode Island |
86.68 |
66.72 |
57.83 |
Indiana |
91.68 |
73.06 |
66.98 |
South Carolina |
91.96 |
65.92 |
60.61 |
Iowa |
92.44 |
69.69 |
64.41 |
South Dakota |
94.35 |
76.14 |
71.84 |
Kansas |
90.59 |
72.89 |
66.03 |
Tennessee |
90.92 |
67.70 |
61.56 |
Kentucky |
92.36 |
73.75 |
68.12 |
Texas |
92.57 |
75.12 |
69.54 |
Louisiana |
94.81 |
76.97 |
72.98 |
Utah |
93.16 |
81.70 |
76.11 |
Maine |
89.96 |
75.18 |
67.63 |
Vermont |
90.26 |
74.49 |
67.24 |
Maryland |
87.78 |
64.66 |
56.76 |
Virginia |
89.84 |
66.28 |
59.55 |
Massachusetts |
80.59 |
61.82 |
49.82 |
Washington |
86.49 |
75.06 |
64.92 |
Michigan |
88.21 |
66.54 |
58.70 |
West Virginia |
95.59 |
74.31 |
71.03 |
Minnesota |
89.46 |
77.72 |
69.53 |
Wisconsin |
90.19 |
73.05 |
65.89 |
Mississippi |
94.51 |
82.77 |
78.23 |
Wyoming |
93.79 |
72.62 |
68.11 |
Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999 CAI.
Table G.2 Percentage Relative Absolute Bias of Selected Past Month Drug Use and Past Year Dependence for the Eight Large States
Past Month Use |
Past Year Dependence | |||||||||||
Cigarette |
Any Illicit Drug | |||||||||||
|
State |
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older |
National |
0.57 |
1.14 |
0.38 |
0.79 |
2.45 |
1.37 |
1.16 |
5.32 |
6.85 |
3.30 |
0.78 |
14.54 |
Eight Large States |
||||||||||||
California |
1.77 |
1.73 |
1.57 |
1.83 |
2.81 |
0.19 |
1.03 |
5.75 |
1.68 |
1.20 |
1.17 |
3.48 |
Florida |
0.92 |
10.59 |
1.43 |
0.36 |
1.52 |
10.42 |
1.74 |
3.34 |
10.02 |
7.57 |
0.60 |
21.32 |
Illinois |
0.75 |
0.49 |
0.18 |
1.12 |
1.25 |
3.65 |
0.30 |
1.39 |
7.46 |
1.39 |
17.77 |
3.16 |
Michigan |
4.18 |
1.63 |
0.65 |
5.77 |
3.82 |
6.21 |
3.39 |
3.39 |
3.81 |
0.26 |
8.61 |
19.22 |
New York |
1.54 |
2.21 |
0.89 |
2.32 |
14.32 |
8.13 |
2.81 |
30.79 |
22.79 |
16.97 |
3.73 |
69.97 |
Ohio |
0.51 |
2.70 |
1.83 |
0.97 |
4.20 |
2.46 |
4.10 |
10.80 |
10.66 |
19.88 |
2.01 |
20.40 |
Pennsylvania |
1.08 |
3.26 |
0.93 |
1.37 |
4.01 |
3.77 |
7.39 |
2.60 |
2.74 |
6.49 |
16.35 |
2.32 |
Texas |
0.50 |
1.80 |
0.81 |
0.28 |
1.43 |
1.76 |
0.47 |
2.20 |
0.88 |
2.53 |
10.94 |
6.70 |
Eight Large State Average |
1.41 |
3.05 |
1.04 |
1.75 |
4.17 |
4.58 |
2.66 |
7.53 |
7.51 |
7.04 |
7.65 |
18.32 |
Relative Absolute Bias=|(Small Area Estimate-Design Based Estimate)|/Design Based Estimate
Table G.3 Simulated Small State Prevalence Rates and Relative Absolute Bias for Past Month Any Illicit Drug Use
Past Month Illicit Drug Use | ||||||||
Design Based Estimate |
Small Area Estimate | |||||||
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older | |
CA1 |
9.19 |
11.66 |
14.68 |
7.85 |
8.57 |
11.79 |
16.01 |
6.77 |
CA2 |
7.30 |
11.86 |
15.24 |
5.22 |
8.01 |
12.31 |
16.25 |
5.91 |
CA3 |
7.39 |
10.81 |
17.21 |
5.14 |
8.21 |
10.91 |
17.51 |
6.16 |
CA4 |
8.67 |
13.82 |
21.43 |
5.65 |
8.62 |
13.15 |
19.24 |
6.07 |
California |
8.04 |
11.96 |
17.24 |
5.83 |
8.26 |
11.94 |
17.06 |
6.16 |
REL ABS BIAS |
1.23 |
0.67 |
0.56 |
2.34 |
3.90 |
0.71 |
0.10 |
6.87 |
FL1 |
6.66 |
8.93 |
18.99 |
4.75 |
6.68 |
9.75 |
17.15 |
4.92 |
FL2 |
5.61 |
8.10 |
14.89 |
4.08 |
6.37 |
9.35 |
15.99 |
4.74 |
FL3 |
7.36 |
6.96 |
21.63 |
5.51 |
7.01 |
9.30 |
19.36 |
5.10 |
FL4 |
7.46 |
6.83 |
13.22 |
6.76 |
6.75 |
8.58 |
14.62 |
5.48 |
Florida |
6.86 |
7.57 |
16.99 |
5.42 |
6.75 |
8.36 |
16.69 |
5.24 |
REL ABS BIAS |
1.19 |
1.78 |
1.15 |
2.65 |
2.26 |
22.13 |
1.23 |
6.64 |
IL1 |
7.57 |
14.35 |
17.49 |
4.95 |
7.05 |
12.59 |
18.23 |
4.38 |
IL2 |
7.33 |
15.47 |
19.19 |
4.19 |
6.82 |
12.75 |
18.38 |
4.03 |
IL3 |
6.64 |
9.65 |
12.86 |
5.16 |
6.51 |
10.85 |
15.43 |
4.39 |
IL4 |
7.01 |
13.46 |
20.68 |
3.79 |
6.88 |
12.36 |
19.14 |
4.02 |
Illinois |
6.98 |
13.23 |
17.94 |
4.24 |
6.89 |
12.75 |
17.99 |
4.18 |
REL ABS BIAS |
2.34 |
0.00 |
2.12 |
6.57 |
2.28 |
8.25 |
0.79 |
0.90 |
MI1 |
7.11 |
12.67 |
21.37 |
3.93 |
7.91 |
12.90 |
20.76 |
5.04 |
MI2 |
8.31 |
7.34 |
17.00 |
6.97 |
8.17 |
10.51 |
18.81 |
6.05 |
MI3 |
6.20 |
9.98 |
20.62 |
3.23 |
7.44 |
11.67 |
19.42 |
4.82 |
MI4 |
8.57 |
13.11 |
14.89 |
6.87 |
8.00 |
12.96 |
17.80 |
5.65 |
Michigan |
7.66 |
11.08 |
18.26 |
5.39 |
7.96 |
11.76 |
18.88 |
5.58 |
REL ABS BIAS |
1.51 |
2.75 |
1.11 |
2.68 |
2.82 |
8.43 |
5.11 |
0.10 |
(continued)
Table G.3 (continued) Simulated Small State Prevalence Rates and Relative Absolute Bias for Past Month Any Illicit Drug Use
Past Month Illicit Drug Use | ||||||||
Design Based Estimate |
Small Area Estimate | |||||||
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older | |
NY1 |
5.57 |
9.09 |
17.95 |
3.19 |
7.11 |
10.75 |
18.21 |
4.90 |
NY2 |
5.96 |
8.82 |
18.70 |
3.60 |
7.20 |
10.72 |
17.55 |
5.13 |
NY3 |
6.21 |
11.39 |
20.69 |
3.27 |
7.58 |
11.85 |
18.95 |
5.26 |
NY4 |
6.27 |
11.15 |
18.63 |
3.71 |
7.57 |
11.70 |
19.27 |
5.21 |
New York |
6.10 |
9.93 |
19.04 |
3.59 |
6.98 |
10.74 |
18.51 |
4.69 |
REL ABS BIAS |
1.64 |
1.78 |
0.26 |
4.00 |
20.70 |
13.30 |
2.87 |
42.96 |
OH1 |
6.42 |
8.73 |
18.28 |
4.10 |
6.73 |
10.21 |
16.92 |
4.54 |
OH2 |
7.63 |
10.56 |
16.41 |
5.75 |
7.12 |
10.59 |
16.47 |
5.07 |
OH3 |
5.17 |
9.49 |
16.48 |
2.69 |
6.71 |
10.59 |
17.27 |
4.40 |
OH4 |
6.12 |
11.98 |
17.14 |
3.48 |
7.05 |
11.75 |
18.06 |
4.57 |
Ohio |
6.28 |
10.25 |
16.92 |
3.95 |
6.54 |
10.50 |
16.23 |
4.38 |
REL ABS BIAS |
0.86 |
0.55 |
0.92 |
1.31 |
9.88 |
5.27 |
1.53 |
17.52 |
PA1 |
6.31 |
7.55 |
14.58 |
4.94 |
6.91 |
9.46 |
17.30 |
5.06 |
PA2 |
6.05 |
9.59 |
17.08 |
3.98 |
6.67 |
9.99 |
17.48 |
4.65 |
PA3 |
7.77 |
10.47 |
13.59 |
6.58 |
7.14 |
10.31 |
16.29 |
5.40 |
PA4 |
6.62 |
10.84 |
14.75 |
4.89 |
7.01 |
10.69 |
17.27 |
5.03 |
Pennsylvania |
6.74 |
9.51 |
15.14 |
5.15 |
7.01 |
9.87 |
16.26 |
5.28 |
REL ABS BIAS |
0.71 |
1.09 |
0.94 |
1.01 |
2.93 |
6.34 |
12.85 |
2.17 |
TX1 |
6.03 |
11.53 |
13.70 |
3.62 |
5.48 |
11.03 |
13.62 |
2.97 |
TX2 |
4.43 |
11.49 |
12.43 |
1.71 |
5.25 |
10.68 |
13.70 |
2.71 |
TX3 |
5.54 |
8.17 |
17.42 |
2.76 |
5.45 |
9.00 |
15.78 |
2.83 |
TX4 |
5.43 |
9.75 |
14.63 |
2.90 |
5.39 |
9.74 |
14.43 |
2.90 |
Texas |
5.30 |
10.21 |
14.32 |
2.72 |
5.38 |
10.39 |
14.39 |
2.78 |
REL ABS BIAS |
1.03 |
0.28 |
1.60 |
0.86 |
1.76 |
0.94 |
0.47 |
4.69 |
AVERAGE |
1.31 |
1.11 |
1.08 |
2.68 |
5.82 |
8.17 |
3.12 |
10.23 |
Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999 CAI.
Table G.4 Simulated Small State Prevalence Rates and Relative Absolute Bias for Past Year Illicit Drug Dependence
Past Year Illicit Drug Dependence | ||||||||
Design Based Estimate |
Small Area Estimate | |||||||
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older | |
CA1 |
2.58 |
3.96 |
4.46 |
2.05 |
2.49 |
4.27 |
4.71 |
1.84 |
CA2 |
2.35 |
3.89 |
5.31 |
1.60 |
2.41 |
4.16 |
5.10 |
1.68 |
CA3 |
2.81 |
3.26 |
6.27 |
2.13 |
2.42 |
3.62 |
5.38 |
1.72 |
CA4 |
1.97 |
4.69 |
3.97 |
1.22 |
2.31 |
4.32 |
4.52 |
1.62 |
California |
2.26 |
3.91 |
5.05 |
1.52 |
2.30 |
3.96 |
4.99 |
1.57 |
REL ABS BIAS |
7.60 |
0.99 |
0.91 |
15.04 |
6.59 |
4.60 |
2.43 |
12.64 |
FL1 |
1.34 |
5.25 |
6.58 |
0.18 |
1.49 |
9.75 |
4.86 |
0.78 |
FL2 |
1.09 |
1.48 |
4.41 |
0.61 |
1.38 |
2.74 |
4.36 |
0.82 |
FL3 |
1.32 |
0.79 |
5.19 |
0.87 |
1.39 |
2.69 |
4.62 |
0.81 |
FL4 |
1.21 |
3.51 |
2.41 |
0.79 |
1.41 |
3.15 |
3.83 |
0.89 |
Florida |
1.22 |
2.75 |
4.36 |
0.62 |
1.34 |
2.96 |
4.33 |
0.75 |
REL ABS BIAS |
2.03 |
0.29 |
6.58 |
1.25 |
16.45 |
12.06 |
1.38 |
33.00 |
IL1 |
1.45 |
3.37 |
2.14 |
1.08 |
1.68 |
3.33 |
3.95 |
1.07 |
IL2 |
2.76 |
5.49 |
5.42 |
1.94 |
1.97 |
4.10 |
5.09 |
1.14 |
IL3 |
1.24 |
1.45 |
2.07 |
1.07 |
1.58 |
2.77 |
3.93 |
1.01 |
IL4 |
0.95 |
2.66 |
4.39 |
0.13 |
1.60 |
3.18 |
4.65 |
0.86 |
Illinois |
1.49 |
3.24 |
3.61 |
0.89 |
1.60 |
3.29 |
4.25 |
0.91 |
REL ABS BIAS |
7.77 |
0.02 |
2.85 |
19.07 |
14.66 |
3.20 |
22.10 |
15.03 |
MI1 |
1.69 |
5.89 |
8.20 |
0.00 |
1.92 |
4.03 |
5.65 |
0.99 |
MI2 |
1.47 |
1.46 |
6.69 |
0.59 |
1.88 |
2.93 |
5.52 |
1.12 |
MI3 |
1.98 |
3.23 |
2.24 |
1.76 |
1.87 |
3.31 |
4.22 |
1.27 |
MI4 |
2.03 |
4.21 |
4.30 |
1.34 |
1.90 |
3.60 |
4.76 |
1.18 |
Michigan |
1.76 |
3.47 |
5.57 |
0.87 |
1.83 |
3.48 |
5.09 |
1.04 |
REL ABS BIAS |
1.85 |
6.56 |
3.86 |
5.43 |
7.62 |
0.12 |
9.60 |
30.52 |
(continued)
Table G.4 (continued) Simulated Small State Prevalence Rates and Relative Absolute Bias for Past Year Illicit Drug Dependence
Past Year Illicit Drug Dependence | ||||||||
Design Based Estimate |
Small Area Estimate | |||||||
Total |
12-17 |
18-25 |
26 or Older |
Total |
12-17 |
18-25 |
26 or Older | |
NY1 |
1.55 |
4.26 |
5.50 |
0.59 |
1.86 |
3.91 |
5.61 |
1.00 |
NY2 |
1.31 |
2.78 |
5.90 |
0.40 |
1.80 |
3.55 |
5.60 |
0.99 |
NY3 |
1.66 |
3.98 |
5.97 |
0.69 |
1.86 |
3.73 |
5.68 |
1.02 |
NY4 |
1.48 |
1.54 |
7.19 |
0.57 |
1.84 |
3.30 |
6.18 |
0.98 |
New York |
1.49 |
2.88 |
6.14 |
0.59 |
1.83 |
3.36 |
5.92 |
1.00 |
REL ABS BIAS |
0.49 |
9.14 |
0.04 |
4.10 |
23.26 |
25.97 |
6.12 |
69.88 |
OH1 |
1.39 |
1.87 |
5.63 |
0.61 |
1.64 |
2.94 |
4.85 |
0.92 |
OH2 |
1.49 |
2.01 |
3.93 |
1.01 |
1.71 |
2.97 |
4.46 |
1.07 |
OH3 |
1.34 |
3.04 |
3.75 |
0.71 |
1.68 |
3.19 |
4.45 |
1.01 |
OH4 |
1.90 |
2.73 |
6.16 |
1.07 |
1.84 |
3.15 |
5.31 |
1.08 |
Ohio |
1.45 |
2.38 |
4.82 |
0.75 |
1.60 |
2.86 |
4.72 |
0.90 |
REL ABS BIAS |
5.99 |
1.09 |
1.05 |
13.43 |
18.70 |
28.39 |
1.03 |
36.00 |
PA1 |
1.51 |
4.21 |
5.36 |
0.61 |
1.60 |
3.82 |
5.71 |
0.71 |
PA2 |
1.18 |
3.60 |
5.55 |
0.24 |
1.49 |
3.49 |
5.59 |
0.64 |
PA3 |
0.66 |
3.82 |
2.51 |
0.00 |
1.39 |
3.61 |
4.65 |
0.63 |
PA4 |
2.71 |
5.47 |
4.20 |
2.14 |
1.76 |
4.34 |
5.35 |
0.90 |
Pennsylvania |
1.49 |
3.99 |
4.25 |
0.78 |
1.54 |
3.73 |
4.95 |
0.76 |
REL ABS BIAS |
1.46 |
7.12 |
3.64 |
3.90 |
4.25 |
4.49 |
25.21 |
7.15 |
TX1 |
1.22 |
2.35 |
3.08 |
0.67 |
1.34 |
2.81 |
3.62 |
0.65 |
TX2 |
1.50 |
5.25 |
2.98 |
0.61 |
1.45 |
3.58 |
3.77 |
0.65 |
TX3 |
1.41 |
3.22 |
3.45 |
0.72 |
1.35 |
3.09 |
3.80 |
0.59 |
TX4 |
1.38 |
2.30 |
4.00 |
0.71 |
1.35 |
2.71 |
4.09 |
0.58 |
Texas |
1.37 |
3.21 |
3.38 |
0.67 |
1.38 |
3.13 |
3.75 |
0.63 |
REL ABS BIAS |
0.68 |
2.32 |
0.23 |
0.41 |
0.38 |
4.97 |
12.92 |
8.11 |
AVERAGE |
3.48 |
3.44 |
2.39 |
7.83 |
11.49 |
10.47 |
10.10 |
26.54 |
Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999 CAI.
Table G.5 Ratio of Relative Widths of Small Area Estimate Prediction Intervals to the Design-Based Confidence Intervals for Past Month Any Illicit Drug Use
|
State |
Past Month Illicit Drug Use | |||
Total |
12-17 |
18-25 |
26 or Older | |
California |
78.00 |
86.67 |
73.28 |
80.28 |
Florida |
99.47 |
74.40 |
90.81 |
102.31 |
Illinois |
76.45 |
82.66 |
76.15 |
79.28 |
Michigan |
79.27 |
95.38 |
76.17 |
81.40 |
New York |
102.68 |
81.03 |
95.28 |
96.59 |
Ohio |
77.46 |
80.64 |
62.56 |
75.74 |
Pennsylvania |
85.02 |
77.67 |
78.47 |
82.38 |
Texas |
70.22 |
88.02 |
86.87 |
68.65 |
Average Over Eight Large States |
83.57 |
83.31 |
79.95 |
83.33 |
Alabama |
24.26 |
74.74 |
54.32 |
15.73 |
Alaska |
53.35 |
56.98 |
53.58 |
57.87 |
Arizona |
47.88 |
54.69 |
73.29 |
45.72 |
Arkansas |
37.69 |
74.89 |
51.20 |
33.89 |
Colorado |
60.86 |
66.44 |
47.32 |
59.91 |
Connecticut |
65.45 |
56.36 |
60.98 |
57.71 |
Delaware |
62.82 |
101.82 |
61.49 |
68.57 |
District of Columbia |
62.91 |
92.84 |
76.30 |
59.90 |
Georgia |
43.06 |
72.98 |
85.36 |
47.60 |
Hawaii |
66.13 |
104.85 |
64.98 |
75.05 |
Idaho |
50.39 |
76.25 |
74.77 |
34.71 |
Indiana |
82.32 |
91.90 |
49.66 |
79.41 |
Iowa |
50.62 |
70.66 |
59.03 |
39.57 |
Kansas |
71.94 |
72.86 |
54.24 |
62.72 |
Kentucky |
33.41 |
56.62 |
86.71 |
26.35 |
Louisiana |
64.56 |
56.31 |
84.18 |
48.39 |
Maine |
91.27 |
54.85 |
74.65 |
100.14 |
Maryland |
49.95 |
136.41 |
76.86 |
30.31 |
Massachusetts |
84.53 |
91.57 |
60.54 |
92.49 |
Minnesota |
79.39 |
53.17 |
60.94 |
62.50 |
Mississippi |
63.15 |
56.35 |
61.87 |
63.92 |
Missouri |
50.45 |
75.77 |
60.65 |
52.93 |
Montana |
81.05 |
81.37 |
83.76 |
71.83 |
Nebraska |
53.47 |
78.15 |
41.86 |
50.89 |
Nevada |
63.51 |
67.74 |
74.67 |
56.51 |
New Hampshire |
50.47 |
90.04 |
73.96 |
39.11 |
New Jersey |
95.69 |
77.42 |
79.10 |
79.92 |
New Mexico |
49.65 |
85.69 |
56.83 |
55.71 |
North Carolina |
57.45 |
61.08 |
50.72 |
50.98 |
North Dakota |
71.23 |
76.07 |
80.15 |
31.69 |
Oklahoma |
49.83 |
47.20 |
65.90 |
56.02 |
Oregon |
72.06 |
46.21 |
92.02 |
63.72 |
Rhode Island |
48.22 |
64.55 |
41.35 |
54.68 |
South Carolina |
80.85 |
70.47 |
75.22 |
39.80 |
South Dakota |
62.32 |
103.72 |
63.68 |
73.93 |
Tennessee |
47.68 |
61.69 |
47.69 |
47.02 |
Utah |
48.84 |
75.67 |
62.31 |
48.75 |
Vermont |
101.01 |
111.32 |
100.67 |
77.08 |
Virginia |
53.21 |
113.91 |
55.53 |
40.14 |
Washington |
60.37 |
73.61 |
70.81 |
57.20 |
West Virginia |
62.38 |
84.72 |
113.73 |
21.56 |
Wisconsin |
114.06 |
97.72 |
62.87 |
102.75 |
Wyoming |
60.43 |
71.92 |
72.54 |
51.20 |
Average Over 43 Small States |
62.33 |
76.50 |
67.40 |
55.49 |
Relative Width Ratio=100*(Length of Small Area Estimate Prediction Interval/Small Area Estimate)/(Length of Design-Based Confidence Interval/Design-Based Estimate)
Table G.6 Ratio of Relative Widths of Small Area Estimate Prediction Intervals to the Design-Based Confidence Intervals for Past Year Illicit Drug Dependence
|
State |
Past Year Illicit Drug Dependence | |||
Total |
12-17 |
18-25 |
26 or Older | |
California |
90.43 |
86.67 |
80.60 |
87.04 |
Florida |
75.57 |
70.98 |
79.69 |
57.89 |
Illinois |
81.05 |
65.80 |
72.50 |
86.11 |
Michigan |
69.57 |
67.61 |
61.02 |
64.82 |
New York |
104.74 |
62.31 |
85.19 |
84.55 |
Ohio |
71.25 |
64.35 |
74.03 |
67.66 |
Pennsylvania |
57.36 |
76.09 |
74.57 |
54.05 |
Texas |
71.97 |
77.97 |
65.12 |
65.53 |
Average Over Eight Large States |
77.74 |
71.47 |
74.09 |
70.96 |
Alabama |
49.96 |
28.97 |
58.26 |
18.00 |
Alaska |
71.53 |
73.43 |
46.24 |
107.01 |
Arizona |
30.59 |
41.56 |
33.47 |
28.06 |
Arkansas |
44.00 |
50.36 |
27.94 |
23.28 |
Colorado |
68.44 |
41.02 |
70.09 |
19.06 |
Connecticut |
43.39 |
38.69 |
30.63 |
55.00 |
Delaware |
71.17 |
72.56 |
153.36 |
81.61 |
District of Columbia |
59.67 |
37.09 |
129.69 |
69.42 |
Georgia |
66.46 |
64.54 |
59.53 |
16.47 |
Hawaii |
120.95 |
67.77 |
51.61 |
20.07 |
Idaho |
69.09 |
54.72 |
39.45 |
. |
Indiana |
42.48 |
46.47 |
34.74 |
18.52 |
Iowa |
41.35 |
28.20 |
32.70 |
21.17 |
Kansas |
45.79 |
41.05 |
36.15 |
53.61 |
Kentucky |
67.26 |
41.40 |
80.46 |
16.35 |
Louisiana |
63.95 |
59.62 |
63.27 |
34.27 |
Maine |
52.01 |
47.94 |
47.20 |
37.47 |
Maryland |
62.27 |
53.44 |
56.26 |
31.16 |
Massachusetts |
41.78 |
86.36 |
54.77 |
44.51 |
Minnesota |
71.60 |
52.81 |
54.83 |
57.32 |
Mississippi |
84.53 |
67.04 |
73.10 |
48.89 |
Missouri |
52.37 |
33.48 |
51.23 |
28.88 |
Montana |
83.04 |
65.95 |
48.18 |
. |
Nebraska |
35.65 |
21.13 |
44.17 |
38.79 |
Nevada |
72.32 |
66.06 |
45.31 |
69.28 |
New Hampshire |
124.56 |
31.13 |
95.75 |
. |
New Jersey |
75.90 |
74.82 |
58.74 |
. |
New Mexico |
53.86 |
75.01 |
42.62 |
52.86 |
North Carolina |
38.22 |
90.20 |
51.72 |
29.11 |
North Dakota |
45.49 |
61.66 |
58.58 |
25.86 |
Oklahoma |
65.98 |
35.05 |
80.29 |
91.81 |
Oregon |
50.84 |
49.63 |
43.66 |
54.17 |
Rhode Island |
40.78 |
68.20 |
40.17 |
17.96 |
South Carolina |
71.95 |
45.51 |
58.13 |
. |
South Dakota |
47.76 |
43.68 |
63.46 |
18.96 |
Tennessee |
40.50 |
55.94 |
25.15 |
48.17 |
Utah |
35.02 |
57.04 |
43.78 |
17.99 |
Vermont |
57.39 |
76.81 |
48.74 |
31.22 |
Virginia |
51.49 |
37.88 |
47.18 |
30.22 |
Washington |
47.71 |
58.89 |
149.78 |
25.08 |
West Virginia |
48.81 |
63.30 |
43.58 |
18.05 |
Wisconsin |
75.59 |
55.69 |
34.86 |
34.51 |
Wyoming |
75.23 |
63.17 |
71.31 |
. |
Average Over 43 Small States |
59.51 |
54.08 |
57.68 |
33.35 |
Relative Width Ratio=100*(Length of Small Area Estimate Prediction Interval/Small Area Estimate)/(Length of Design-Based Confidence Interval/Design-Based Estimate)
Table G.7 Estimated Characteristics of Population Distribution, by State
|
Aged 18-251 |
Hispanic1 |
Non-Hispanic White1 |
Non-Hispanic Black1 |
Persons Below Poverty Level2 |
Urban3 |
Female Head of Household4 |
Unemployment Rate5 |
Mean Personal Income6 |
Median Household Income7 | |
Total |
12.87 |
10.41 |
73.76 |
11.42 |
13.20 |
74.94 |
6.38 |
4.20 |
24,442.53 |
35,492.00 |
Alabama |
13.10 |
0.77 |
74.00 |
24.12 |
14.70 |
60.08 |
7.06 |
4.80 |
20,062.43 |
29,518.00 |
Alaska |
14.43 |
4.05 |
73.50 |
3.19 |
8.80 |
67.40 |
7.02 |
6.40 |
24,597.00 |
44,280.00 |
Arizona |
13.19 |
20.17 |
70.83 |
2.82 |
18.10 |
87.75 |
6.22 |
4.40 |
21,338.84 |
32,842.00 |
Arkansas |
13.02 |
1.22 |
83.07 |
14.45 |
17.20 |
53.00 |
6.26 |
4.50 |
18,966.82 |
27,392.00 |
California |
13.47 |
29.77 |
51.18 |
6.31 |
16.30 |
92.38 |
6.36 |
5.20 |
25,375.41 |
38,664.00 |
Colorado |
13.06 |
13.30 |
80.02 |
3.79 |
9.30 |
81.65 |
6.18 |
2.90 |
25,743.41 |
38,772.00 |
Connecticut |
11.26 |
7.76 |
81.65 |
8.23 |
9.90 |
78.92 |
5.83 |
3.20 |
34,182.94 |
45,187.00 |
Delaware |
12.26 |
2.95 |
77.34 |
17.51 |
9.50 |
72.17 |
5.94 |
3.50 |
27,784.11 |
39,723.00 |
District of Columbia |
13.00 |
7.26 |
30.44 |
59.36 |
22.70 |
100.0 |
9.64 |
6.30 |
34,172.00 |
34,697.00 |
Florida |
10.68 |
15.27 |
70.31 |
12.63 |
13.90 |
84.38 |
5.69 |
3.90 |
24,203.27 |
31,064.00 |
Georgia |
13.51 |
2.14 |
68.89 |
27.06 |
14.30 |
62.57 |
7.79 |
4.00 |
23,034.64 |
33,919.00 |
Hawaii |
12.03 |
7.07 |
29.03 |
1.40 |
12.30 |
88.41 |
4.74 |
5.60 |
25,432.27 |
43,815.00 |
Idaho |
15.21 |
6.24 |
91.08 |
0.44 |
13.20 |
57.66 |
5.15 |
5.20 |
19,861.63 |
33,114.00 |
Illinois |
13.23 |
9.58 |
73.01 |
14.01 |
11.10 |
84.46 |
6.46 |
4.30 |
26,860.13 |
39,483.00 |
Indiana |
13.38 |
2.21 |
89.01 |
7.63 |
8.60 |
64.41 |
6.06 |
3.00 |
22,632.68 |
35,542.00 |
Iowa |
13.20 |
1.68 |
94.96 |
1.81 |
9.40 |
61.04 |
4.90 |
2.50 |
22,329.15 |
33,783.00 |
Kansas |
13.37 |
4.71 |
87.23 |
5.53 |
10.10 |
69.71 |
5.34 |
3.00 |
23,128.73 |
33,728.00 |
Kentucky |
13.31 |
0.64 |
92.06 |
6.51 |
15.50 |
51.24 |
6.26 |
4.50 |
19,786.03 |
30,418.00 |
Louisiana |
14.48 |
2.74 |
65.07 |
30.54 |
18.60 |
67.60 |
9.12 |
5.10 |
19,711.26 |
28,742.00 |
Maine |
11.61 |
0.66 |
97.95 |
0.26 |
10.60 |
44.49 |
5.68 |
4.10 |
21,086.97 |
32,809.00 |
Maryland |
11.75 |
3.75 |
65.59 |
26.45 |
8.60 |
80.43 |
6.86 |
3.50 |
27,679.92 |
44,206.00 |
Massachusetts |
11.68 |
6.02 |
85.50 |
4.82 |
10.30 |
84.08 |
6.12 |
3.20 |
29,810.70 |
40,831.00 |
Michigan |
12.98 |
2.46 |
81.79 |
13.62 |
10.80 |
69.68 |
7.68 |
3.80 |
24,604.04 |
38,127.00 |
Minnesota |
13.33 |
1.72 |
92.09 |
2.73 |
9.90 |
69.75 |
5.25 |
2.80 |
25,703.22 |
39,690.00 |
Mississippi |
14.34 |
0.78 |
64.16 |
34.10 |
18.30 |
47.17 |
9.12 |
5.10 |
17,558.16 |
26,925.00 |
Missouri |
12.98 |
1.54 |
86.77 |
10.25 |
10.40 |
67.68 |
6.11 |
3.40 |
22,991.68 |
32,791.00 |
Montana |
12.86 |
1.91 |
91.74 |
0.35 |
16.40 |
52.23 |
5.73 |
5.20 |
19,280.19 |
28,707.00 |
See notes at end of table. (continued)
Table G.7 (continued) Estimated Characteristics of Population Distribution, by State
|
Age 18-251 |
Hispanic1 |
Non-Hispanic White1 |
Non-Hispanic Black1 |
Persons Below Poverty Level2 |
Urban3 |
Female Head of Household4 |
Unemployment Rate5 |
Mean Personal income6 |
Median Household Income7 | |
Nebraska |
13.59 |
3.15 |
91.51 |
3.52 |
10.80 |
67.00 |
5.20 |
2.90 |
22,974.89 |
33,510.00 |
Nevada |
11.73 |
13.58 |
74.90 |
6.18 |
9.90 |
89.06 |
5.98 |
4.40 |
26,059.92 |
38,186.00 |
New Hampshire |
11.71 |
1.27 |
96.91 |
0.51 |
8.40 |
50.98 |
4.72 |
2.70 |
26,771.35 |
40,196.00 |
New Jersey |
11.63 |
11.88 |
69.90 |
12.65 |
9.00 |
89.09 |
5.50 |
4.60 |
31,285.18 |
46,803.00 |
New Mexico |
14.18 |
38.73 |
51.00 |
1.68 |
22.40 |
72.72 |
7.18 |
5.60 |
18,817.74 |
27,303.00 |
New York |
12.24 |
14.26 |
66.26 |
13.90 |
16.60 |
84.20 |
7.09 |
5.20 |
29,222.87 |
35,737.00 |
North Carolina |
12.20 |
1.30 |
75.47 |
20.97 |
12.50 |
50.62 |
6.52 |
3.20 |
22,244.34 |
34,326.00 |
North Dakota |
14.14 |
0.92 |
94.12 |
0.35 |
13.20 |
55.03 |
4.68 |
3.40 |
20,477.47 |
30,713.00 |
Ohio |
12.95 |
1.54 |
86.48 |
10.64 |
11.60 |
73.54 |
6.60 |
4.30 |
23,495.80 |
34,213.00 |
Oklahoma |
13.26 |
3.30 |
80.32 |
7.29 |
14.80 |
67.78 |
6.14 |
3.40 |
19,579.35 |
27,662.00 |
Oregon |
12.45 |
5.03 |
89.04 |
1.59 |
12.80 |
70.27 |
5.68 |
5.70 |
23,115.00 |
35,111.00 |
Pennsylvania |
11.63 |
2.36 |
87.03 |
8.89 |
11.30 |
68.14 |
5.31 |
4.40 |
24,850.88 |
35,140.00 |
Rhode Island |
11.33 |
6.41 |
87.38 |
3.58 |
11.80 |
85.61 |
5.97 |
4.10 |
24,612.70 |
36,326.00 |
South Carolina |
12.40 |
0.99 |
69.61 |
28.42 |
13.30 |
54.43 |
7.49 |
4.50 |
19,892.24 |
32,523.00 |
South Dakota |
14.00 |
1.03 |
91.79 |
0.41 |
13.00 |
50.85 |
5.13 |
2.90 |
20,741.06 |
29,846.00 |
Tennessee |
12.81 |
0.90 |
82.74 |
15.24 |
14.50 |
60.25 |
6.68 |
4.00 |
22,035.27 |
31,128.00 |
Texas |
14.62 |
27.47 |
58.44 |
11.33 |
16.10 |
80.20 |
6.70 |
4.60 |
22,328.80 |
32,719.00 |
Utah |
18.80 |
5.86 |
89.60 |
0.74 |
8.50 |
86.98 |
5.89 |
3.70 |
19,394.61 |
36,287.00 |
Vermont |
12.07 |
0.90 |
97.51 |
0.43 |
10.60 |
32.29 |
5.82 |
3.00 |
22,547.63 |
33,437.00 |
Virginia |
12.21 |
3.45 |
74.12 |
18.58 |
11.30 |
69.23 |
5.81 |
2.80 |
25,287.11 |
38,426.00 |
Washington |
12.54 |
5.36 |
84.79 |
2.74 |
10.00 |
75.85 |
5.93 |
4.70 |
25,282.19 |
37,975.00 |
West Virginia |
12.77 |
0.59 |
95.86 |
2.86 |
17.60 |
35.85 |
5.32 |
6.60 |
18,223.52 |
25,822.00 |
Wisconsin |
13.30 |
2.34 |
90.27 |
5.14 |
8.60 |
64.98 |
5.82 |
3.00 |
23,390.30 |
38,472.00 |
Wyoming |
14.42 |
6.11 |
90.51 |
0.66 |
12.00 |
64.63 |
5.75 |
4.90 |
21,586.26 |
31,180.00 |
1 Percentaged from the Census Bureau website about national population counts of civilian, noninstitutionalized persons aged 12 or older and State residential population for various demographic domains
(www.census.gov/population/www/projections/st_yr95to00.html).
2 Average of Current Population Survey (CPS) percentaged from 1996 to 1998, located on Census Bureau website (http://www.census.gov/hhes/poverty/poverty98/pv98state.html).
3 Percentaged from Area Resource File (ARF), which in turn were computed from 1990 Census data. Adjusted using 1996 population estimates.
4 1990 Census data. Female Head of Household defined as a household with children under 18 years old and female present where there is no husband present.
5 1999 percentaged from Bureau of Labor Statistics website (http://www.bls.gov/sahome.html under Local Area Unemployment Statistics).
6 Data in U.S. dollars from ARF file, which in turn were calculated from the Bureau of Economic Analysis's 1996 Regional Economic Information System.
7 Data in U.S. dollars from 1996 Modeled Small Area Income and Poverty statistics on the Census Bureau website (http://www.census.gov/hhes/www/saipe/stcty/estimate.html).
This page was last updated on May 16, 2008. |
|
SAMHSA, an agency in the Department of Health and Human Services, is the Federal Government's lead agency for improving the quality and availability of substance abuse prevention, addiction treatment, and mental health services in the United States.
* PDF formatted files require that Adobe Acrobat Reader® program is installed on your computer. Click here to download this FREE software now from Adobe. |