Skip To Content
Click for DHHS Home Page
Click for the SAMHSA Home Page
Click for the OAS Drug Abuse Statistics Home Page
Click for What's New
Click for Recent Reports and HighlightsClick for Information by Topic Click for OAS Data Systems and more Pubs Click for Data on Specific Drugs of Use Click for Short Reports and Facts Click for Frequently Asked Questions Click for Publications Click to send OAS Comments, Questions and Requests Click for OAS Home Page Click for Substance Abuse and Mental Health Services Administration Home Page Click to Search Our Site

State Estimates of Substance Use from the
2005-2006 National Surveys on Drug Use and Health

Go to the Table Of Contents

Appendix A: State Estimation Methodology

This report includes estimates of 23 substance use and mental health measures (see Section A.1) using the combined data from the 2005 and 2006 National Surveys on Drug Use and Health (NSDUHs). Also included in this report are estimates of change between 2004-2005 and 2005-2006 State estimates. As discussed in Chapter 1 (Section 1.1), several changes were introduced to the survey in 2002; thus, estimates for 2001 and prior years are not comparable with estimates from 2002 and later years.

The survey-weighted hierarchical Bayes (SWHB) methodology used in the production of State estimates from the 1999-2005 surveys also was used in the production of the 2005-2006 State estimates. The SWHB methodology is described in Appendix E of the 2001 State report (Wright, 2003b) and by Folsom, Shah, and Vaish (1999). The list of predictors used in the 2005-2006 small area estimation (SAE) modeling is given in Section A.2. No new variable selection was done for the 2005-2006 data (as discussed in Section A.3). The goals of SAE modeling, general model description, and the implementation of SAE modeling remain the same and are described in Appendix E of the 2001 State report (Wright, 2003b). At the end of this appendix, tables showing the 2004, 2005, 2006, pooled 2004-2005, and pooled 2005-2006 survey response rates are included (Tables A.1 to A.12).

Small area estimates obtained using the SWHB methodology are design consistent (i.e., for States with large sample sizes, the small area estimates are close to the robust design-based estimates). The State small area estimates when aggregated by using the appropriate population totals result in national small area estimates that are very close to the national design-based estimates. However, for numerous reasons (including internal consistency), it is desirable to have national small area estimates exactly match the national design-based estimates. Beginning in 2002, exact benchmarking was introduced as described in Section A.4. The definition and explanation of the formula used in estimating the marijuana incidence rate is given in Section A.5.

For all outcomes, the age groups for which estimates are provided in this report are 12 to 17, 18 to 25, and 26 or older. Estimates for those aged 12 or older also are provided in this report. Because it was determined that States may find estimates for 18 or older useful, estimates for that age group will be available on the web in the form of HTML tables (see http://oas.samhsa.gov/2k6State/toc.cfm).

Included in this report also are estimates of underage (aged 12 to 20) alcohol use and binge alcohol use. Alcohol consumption is expected to differ significantly across the 18 to 25 age group because of the legalization of alcohol at age 21. Therefore, it was decided that it would be useful to produce small area estimates for persons aged 12 to 20. A short description of the methodology used to produce underage drinking estimates is described in Section A.6.

Section A.7 discusses the criteria used to define dependence and abuse of illicit drugs and alcohol. Section A.8 discusses how the serious psychological distress (SPD) estimates were produced. Section A.9 discusses the production of major depressive episode (MDE) estimates. The methodology used to produce estimates of change between the 2004-2005 and the 2005-2006 State estimates is described in Section A.10.

A.1 Variables Modeled

The 2006 NSDUH data were pooled with the 2005 NSDUH data, and age group–specific State prevalence estimates for 23 binary (0, 1) outcome variables were produced and presented in this report. These estimates were produced for the following outcomes:

  1. past month use of illicit drugs,
  2. past year use of marijuana,
  3. past month use of marijuana,
  4. perception of great risk of smoking marijuana once a month,
  5. average annual rate of first use of marijuana,
  6. past month use of illicit drugs other than marijuana,
  7. past year use of cocaine,
  8. past year nonmedical use of pain relievers,
  9. past month use of alcohol,
  10. past month binge alcohol use,
  11. perception of great risk of having five or more drinks of an alcoholic beverage once or twice a week,
  12. past month use of tobacco products,
  13. past month use of cigarettes,
  14. perception of great risk of smoking one or more packs of cigarettes per day,
  15. past year alcohol dependence or abuse,
  16. past year alcohol dependence,
  17. past year illicit drug dependence or abuse,
  18. past year illicit drug dependence,
  19. past year dependence on or abuse of illicit drugs or alcohol,
  20. needing but not receiving treatment for illicit drug use in the past year,
  21. needing but not receiving treatment for alcohol use in the past year,
  22. past year serious psychological distress (SPD), and
  23. past year major depressive episode (MDE).

Estimates of change between the 2004-2005 and 2005-2006 State estimates were produced for all of these outcomes and are included in this report. Also included at the end of this appendix is a table listing all outcomes and the years for which small area estimates were produced going back to the 2002 and 2003 NSDUHs (Table A.13).

A.2 Predictors Used in Mixed Logistic Regression Models

Local area data used as potential predictor variables in the mixed logistic regression models were obtained from several sources, including Claritas Inc., the U.S. Census Bureau, the Federal Bureau of Investigation (FBI) (Uniform Crime Reports), Health Resources and Services Administration (Area Resource File), the Bureau of Labor Statistics, the Bureau of Economic Analysis, the Substance Abuse and Mental Health Services Administration (SAMHSA) (National Survey of Substance Abuse Treatment Services [N-SSATS]), and the National Center for Health Statistics (mortality data). The values of these predictor variables are updated every year (when possible). Major sources and potential data items used in the modeling are provided in the following text and lists.

The following lists provide the specific independent variables that were potential predictors in the models.

Claritas Data
Description Level
% Population aged 0-19 in block group Block group
% Population aged 20-24 in block group Block group
% Population aged 25-34 in block group Block group
% Population aged 35-44 in block group Block group
% Population aged 45-54 in block group Block group
% Population aged 55-64 in block group Block group
% Population aged 65+ in block group Block group
% Blacks in block group Block group
% Hispanics in block group Block group
% Other race in block group Block group
% Whites in block group Block group
% Males in block group Block group
% Females in block group Block group
% American Indian, Eskimo, Aleut in tract Tract
% Asian, Pacific Islander in tract Tract
% Population aged 0-19 in tract Tract
% Population aged 20-24 in tract Tract
% Population aged 25-34 in tract Tract
% Population aged 35-44 in tract Tract
% Population aged 45-54 in tract Tract
% Population aged 55-64 in tract Tract
% Population aged 65+ in tract Tract
% Blacks in tract Tract
% Hispanics in tract Tract
% Other race in tract Tract
% Whites in tract Tract
% Males in tract Tract
% Females in tract Tract
% Population aged 0-19 in county County
% Population aged 20-24 in county County
% Population aged 25-34 in county County
% Population aged 35-44 in county County
% Population aged 45-54 in county County
% Population aged 55-64 in county County
% Population aged 65+ in county County
% Blacks in county County
% Hispanics in county County
% Other race in county County
% Whites in county County
% Males in county County
% Females in county County

2000 Census Data
Description Level
% Population who dropped out of high school Tract
% Housing units built in 1940-1949 Tract
% Persons aged 16-64 with a work disability Tract
% Hispanics who are Cuban Tract
% Females 16 years or older in labor force Tract
% Females never married Tract
% Females separated/divorced/widowed/other Tract
% One-person households Tract
% Female head of household, no spouse, child #18 Tract
% Males 16 years or older in labor force Tract
% Males never married Tract
% Males separated/divorced/widowed/other Tract
% Housing units built in 1939 or earlier Tract
Average persons per room Tract
% Families below poverty level Tract
% Households with public assistance income Tract
% Housing units rented Tract
% Population with 9-12 years of school, no high school diploma Tract
% Population with 0-8 years of school Tract
% Population with associate's degree Tract
% Population with some college and no degree Tract
% Population with bachelor's, graduate, professional degree Tract
Median rents for rental units Tract
Median value of owner-occupied housing units Tract
Median household income Tract

Uniform Crime Report Data
Description Level
Drug possession arrest rate County
Drug sale/manufacture arrest rate County
Drug violations' arrest rate County
Marijuana possession arrest rate County
Marijuana sale/manufacture arrest rate County
Opium cocaine possession arrest rate County
Opium cocaine sale/manufacture arrest rate County
Other drug possession arrest rate County
Other dangerous non-narcotics arrest rate County
Serious crime arrest rate County
Violent crime arrest rate County
Driving under influence arrest rate County

Other Categorical Data
Description Source Level
=1 if Hispanic, =0 otherwise NSDUH sample Person
=1 if non-Hispanic black, =0 otherwise NSDUH sample Person
=1 if non-Hispanic other, =0 otherwise NSDUH sample Person
NSDUH sample NSDUH sample Person
=1 if male, =0 if female NSDUH sample Person
=1 if MSA with 1 million +, =0 otherwise 2000 Census County
=1 if MSA with <1 million, =0 otherwise 2000 Census County
=1 if non-MSA urban, =0 otherwise 2000 Census Tract
=1 if urban area, =0 if rural area 2000 Census Tract
=1 if no Cubans in tract, =0 otherwise 2000 Census Tract
=1 if no arrests for dangerous non-narcotics,
=0 otherwise
UCR County

Miscellaneous Data
Variable Description Source Level
Alcohol death rate, underlying cause NCHS-ICD-10 County
Cigarettes death rate, underlying cause NCHS-ICD-10 County
Drug death rate, underlying cause NCHS-ICD-10 County
Alcohol treatment rate N-SSATS (formerly called UFDS) County
Alcohol and drug treatment rate N-SSATS (formerly called UFDS) County
Drug treatment rate N-SSATS (formerly called UFDS) County
% Families below poverty level ARF County
Unemployment rate BLS County
Per capita income (in thousands) BEA County
Average suicide rate (per 10,000) NCHS-ICD-10 County
Food stamp participation rate Census Bureau County
Single State agency maintenance of effort National Association of State Alcohol and Drug Abuse Directors (NASADAD) State
Block grant awards SAMHSA State
Cost of Services Factor Index SAMHSA State
Total Taxable Resources Per Capita Index U.S. Department of Treasury State

A.3 Selection of Independent Variables for the Models

No new variable selection was done for any outcome variables in 2005-2006. The updated versions of fixed-effect predictors that were used in modeling the 2004-2005 data were used to model the 2005-2006 data. Because the interest was to estimate change between the 2004-2005 and 2005-2006 State estimates, the same set of fixed-effect predictors was used for producing both sets of estimates.

A.4 Benchmarking the Age Group–Specific Small Area Estimates

The self-calibration built into the SWHB solution ensures that the population-weighted average of the State small area estimates will closely match the national design-based estimates. Given the self-calibration ensured by the SWHB solution, for State reports prior to 2002, the standard Bayes prescription was followed; specifically, the posterior mean was used for the SAE point estimate, and the tail percentiles of the posterior distribution were used for the prediction interval (PI) limits.

Singh and Folsom (2001) extended Ghosh's (1992) results on constrained Bayes estimation to include exact benchmarking to design-based national estimates. In the simplest version of this constrained Bayes solution where only the design-based mean is imposed as a benchmarking constraint, each of the State-by-age group small area estimates (for 2005-2006) is adjusted by adding the common factor Δa = (Da - Pa), where Da is the design-based national prevalence estimate and Pa is the population-weighted mean of the State small area estimates (Psa) for age group-a. The exactly benchmarked State-s and age group-a small area estimates then are given by Θsa = Psa + Δa. Experience with such additive adjustments suggests that the resulting exactly benchmarked State small area estimates will always be between 0 and 100 percent because the SWHB self-calibration ensures that the adjustment factor is small relative to the size of the State-level small area estimates.

Relative to the Bayes posterior mean, these benchmark-constrained State small area estimates are biased by the common additive adjustment factor. Therefore, the posterior mean-squared error for each benchmarked State small area estimate has the square of this adjustment factor added to its posterior variance. To achieve the desirable feature of exact benchmarking, this constrained Bayes adjustment factor was implemented for the State-by-age group small area estimates. The associated credible intervals can be recentered at the benchmarked small area estimates on the logit scale with the symmetric interval end points based on the posterior root mean-squared errors. The adjusted 95 percent PIs (Lowersa, Uppersa) are defined below:

Lowersa = exp(Lsa)/[1 + exp(Lsa)] and Uppersa = exp(Usa)/[1 + exp(Usa)],

where

Lsa = log[Θsa/(1 - Θsa)] - 1.96 * image representing the square root of MSE sub s and a,

Usa = log[Θsa/(1 - Θsa)] + 1.96 * image representing the square root of MSE sub s and a, and

MSEsa = (log[Psa/(1 - Psa)]- log[Θsa/(1 - Θsa)])2 + posterior variance of log[Psa/(1 - Psa)].

The associated posterior coverage probabilities for these benchmarked intervals are very close to the prescribed 0.95 value because the State small area estimates have posterior distributions that can be approximated exceptionally well by a Gaussian distribution.

A.5 Calculation of Average Annual Incidence of Marijuana Use

Incidence rates typically are calculated as the number of new initiates of a substance during a period of time (such as in the past year) divided by an estimate of the number of person years of exposure (in thousands). The incidence definition used in this report employs a simpler form of the at-risk population based on the model-based methodology. This model-based average annual incidence rate is defined as follows:

Average annual rate = 100*{[X1 ÷ (0.5 * X1 + X2)] ÷ 2},

where X1 is the number of marijuana initiates in the past 24 months and X2 is the number of persons who never used marijuana.

In this report, the incidence rate is expressed as a percentage or rate per 100 person years of exposure. Note that this estimate uses a 2-year time period to accumulate incidence cases from each annual survey. By assuming further that the distribution of first use for the incidence cases is uniform across the 2-year interval, the total number of person years of exposure is 1 year on average for the incidence cases plus 2 years for all the "never users" at the end of the time period. This approximation to the person years of exposure permits one to recast the incidence rate as a function of two population prevalence rates, namely, the fraction of persons who first used marijuana in the past 2 years and the fraction who had never used marijuana. Both of these prevalence estimates were estimated using the SWHB estimation approach.

The count of persons who first used marijuana in the past 2 years is based on a "moving" 2-year period that ranges over 3 calendar years. Subjects were asked when they first used marijuana. If a person indicated first use of marijuana between the day of the interview and 2 years prior, the person was included in the count. Thus, it is possible for a person interviewed in the first part of 2006 to indicate first use as early as the first part of 2004 or as late as the first part of 2006. Similarly, a subject interviewed in the last part of 2006 could indicate first use as early as the last part of 2004 or as late as the last part of 2006. Therefore, in the 2006 survey, the reported period of first use ranged from early 2004 to late 2006 and was "centered" in 2005. About half of the 12 to 17 year olds who reported first use in the past 24 months reported first use in 2005, while a quarter each reported first use in 2004 and 2006. Persons who responded in 2006 that they had never used marijuana were included in the count of "never used." Similarly, reports of first use in past 24 months from the 2005 survey ranged from early 2003 to late 2005 and were centered in 2004. Half of the 12 to 17 year olds who reported first use in the past 24 months reported first use in 2004, while a quarter each reported first use in 2003 and 2005. Note that only incidence rates for marijuana use are provided in this report.

A.6 Underage Drinking

To obtain small area estimates for persons aged 12 to 20 for past month alcohol and binge alcohol use, a separate set of models was fit for these two outcomes for the 12 to 17 age group and the 18 to 20 age group. For the 2005-2006 models, no new variable selection was done. Updated versions of the predictors were used to produce the small area estimates.

Model-based estimates for persons aged 12 to 20 were produced by taking the population-weighted average of the individual age group (12 to 17 and 18 to 20) estimates. Estimates for underage drinking for past month alcohol and binge alcohol use were benchmarked to match national design-based estimates for that age group using the process described in Section A.4. Estimates of change between the 2004-2005 and 2005-2006 estimates for underage drinking in the States also are presented in this report.

A.7 Illicit Drug and Alcohol Dependence and Abuse

The NSDUH computer-assisted interviewing (CAI) instrumentation includes questions that are designed to measure dependence on and abuse of illicit drugs and alcohol. For these substances,8 dependence and abuse questions were based on the criteria in the DSM-IV (APA, 1994).

Specifically, for marijuana, hallucinogens, inhalants, and tranquilizers, a respondent was defined as having dependence if he or she met three or more of the following six dependence criteria:

  1. Spent a great deal of time over a period of a month getting, using, or getting over the effects of the substance.
  2. Used the substance more often than intended or was unable to keep set limits on the substance use.
  3. Needed to use the substance more than before to get desired effects or noticed that the same amount of substance use had less effect than before.
  4. Inability to cut down or stop using the substance every time tried or wanted to.
  5. Continued to use the substance even though it was causing problems with emotions, nerves, mental health, or physical problems.
  6. The substance use reduced or eliminated involvement or participation in important activities.

For alcohol, cocaine, heroin, pain relievers, sedatives, and stimulants, a seventh withdrawal criterion was added. A respondent was defined as having dependence if he or she met three or more of seven dependence criteria. The seventh withdrawal criterion is defined by a respondent reporting having experienced a certain number of withdrawal symptoms that vary by substance (e.g., having trouble sleeping, cramps, hands tremble).

For each illicit drug and alcohol, a respondent was defined as having abused that substance if he or she met one or more of the following four abuse criteria and was determined not to be dependent on the respective substance in the past year:

  1. Serious problems at home, work, or school caused by the substance, such as neglecting your children, missing work or school, doing a poor job at work or school, or losing a job or dropping out of school.
  2. Used the substance regularly and then did something that might have put you in physical danger.
  3. Use of the substance caused you to do things that repeatedly got you in trouble with the law.
  4. Had problems with family or friends that were probably caused by using the substance and continued to use the substance even though you thought the substance use caused these problems.

For additional details on how respondents were classified as being dependent on or having abused illicit drugs and alcohol, see Section B.4.3 in Appendix B of the 2006 NSDUH's national results report (OAS, 2007, pp. 125-127).

A.8 Serious Psychological Distress

In 2005 and 2006, serious psychological distress (SPD) was measured using the "short-form" module consisting only of the K6 screening instrument for nonspecific psychological distress (Kessler et al., 2003). In the 2004 NSDUH, however, the sample of respondents aged 18 or older was split evenly between the "long-form" module, which included all items in the mental health module used in the 2003 NSDUH (sample A) and a "short-form" module consisting only of the K6 items (sample B). For more details on how SPD was measured in the 2004 or earlier NSDUHs and how State estimates for SPD were produced using the pooled 2003-2004 NSDUH data, see Wright and Sathe (2006).

To produce the pooled 2004-2005 SPD estimates, the 2004 sample A "long-form" scores were transformed to match the distributional characteristics of the 2004 sample B "short-form" scores using the cumulative distribution function (CDF) adjustment method described by Wright and Sathe (2006). These adjusted 2004 sample A scores were used in conjunction with the 2004 sample B "short-form" scores and the 2005 "short-form" SPD scores to produce the 2004-2005 SPD estimates (Wright, Sathe, & Spagnola, 2007).

To produce the pooled 2005-2006 SPD estimates, data from both years (2005 and 2006) were combined. No transformations were required because both the survey years used the same K6 scale for getting information on SPD. Estimates of change between the 2004-2005 and the 2005-2006 small area estimates also are presented in this report.

A.9 Major Depressive Episode

Beginning in 2004, a module was included in the questionnaire that was related to having a major depressive episode (MDE); it was derived from the criteria specified for major depression in the DSM-IV (APA, 1994). These questions permit estimates to be calculated for lifetime and past year prevalence of MDE, treatment for MDE, and role impairment resulting from MDE. In this report, estimates of having at least one MDE in the past year are reported.

In 2004, a split-sample design was implemented where adults aged 18 or older in half of the sample (sample B) received the depression module, while adult respondents in the other half (sample A) did not. All youths aged 12 to 17 were administered the adolescent depression module. In 2005 and 2006, however, all adult and adolescent respondents were administered their respective depression modules. Due to minor wording differences in the questions in the adult and adolescent MDE modules, data from youths aged 12 to 17 were not combined with data from persons aged 18 or older to get an overall estimate for those aged 12 or older. Instead, an estimate for those aged 18 or older was produced. To produce the pooled 2004-2005 MDE estimates, the 2005 MDE data were pooled with the 2004 sample B MDE data. Because the 2004 sample A was not used in the estimation process, the 2004 sample B weights were properly adjusted to account for the missing 2004 sample A MDE data (Wright et al., 2007). To produce the pooled 2005-2006 MDE estimates, the 2005 MDE data were pooled with the 2006 MDE data. Estimates of change between 2004-2005 and 2005-2006 MDE small area estimates also are included for the first time in this report.

According to DSM-IV, a person is defined as having had MDE in his or her lifetime if he or she has had at least five or more of the following nine symptoms nearly every day in the same 2-week period, where at least one of the symptoms is a depressed mood or loss of interest or pleasure in daily activities (APA, 1994): (1) depressed mood most of the day; (2) markedly diminished interest or pleasure in all or almost all activities most of the day; (3) significant weight loss when not sick or dieting, or weight gain when not pregnant or growing, or decrease or increase in appetite; (4) insomnia or hypersomnia; (5) psychomotor agitation or retardation; (6) fatigue or loss of energy; (7) feelings of worthlessness; (8) diminished ability to think or concentrate or indecisiveness; and (9) recurrent thoughts of death or suicidal ideation.

For details on the adult and adolescent modules for MDE, see Section B.4.5 in Appendix B of the 2006 NSDUH's national results report (OAS, 2007, pp. 129-131).

A.10 Measuring Change in State Estimates between 2004-2005 and 2005-2006

The estimates of change between State small area estimates displayed in Appendix C are based on the 2004 through 2006 NSDUHs. The State estimates for 2004-2005 are the previously published model-based small area estimates (Wright et al., 2007). The State estimates for 2005-2006 are the small area estimates given in Appendix B. The moving average State prevalence estimates for the overlapping 2004-2005 and 2005-2006 time periods were obtained from independent applications of RTI's SWHB methodology; that is, the 2005-2006 models were fit independently of the previously fitted 2004-2005 models. This independent analysis approach was followed because there was no desire to revise the previously published 2004-2005 estimates. Moreover, the same fixed predictor variables were used in the 2004-2005 and 2005-2006 models, but annual updates were made when more current versions became available. The age group–specific fixed predictor variables were defined at five levels (namely, person-level, census block group-level, tract-level, county-level, and State-level). Also, each age group model had 51 State-level random effects and 300 "within-State" area-level random effects.

To estimate change in State estimates, let πsa(1) and πsa(2) denote 2004-2005 and 2005-2006 prevalence rates, respectively, for State-s and age group-a. The change between πsa(1) and πsa(2) is defined in terms of the log-odds ratio (lorsa) as opposed to the simple difference because the posterior distribution of the lorsa is closer to Gaussian than the posterior distribution of the simple difference (πsa(2) – πsa(1)). The lorsa is defined as

Equation A-2.     D

The p value given in the Appendix C tables is computed to test the null hypothesis of no change (i.e., πsa(2) = πsa(1) or equivalently lorsa = 0). An estimate of lorsa is given by

Equation A-3     D

where the psa(1) are previously published 2004-2005 State estimates and the psa(2) are the 2005-2006 State estimates presented in this report (see Appendix B). To compute the variance of The estimate of the log-odds ratio, lor hat, sub s and a, i.e., Variance v of the estimate of the log-odds ratio, lor hat, sub s and a let Theta 1 hat is defined as the ratio of p 1 sub s and a and 1 minus p 1 sub s and a and Theta 2 hat is defined as the ratio of p 2 sub s and a and 1 minus p 2 sub s and a, then

Equation A-8,     D

where covariance between the logarithm of Theta 1 hat and the logarithm of Theta 2 hat denotes the covariance between image represents the logarithm of Theta 1 hat and image represents the logarithm of Theta 2 hat. This covariance is defined in terms of the associated correlation as follows:

Equation A-10.     D

Note that the variance of the logarithm of Theta 1 hat and variance of the logarithm of Theta 2 hat used here to calculate Variance v of the estimate of the log-odds ratio, lor hat, sub s and a are the same variances used in calculating the previously published 2004-2005 prediction intervals (PIs) and the 2005-2006 PIs given in this report, respectively.

The correlation between image represents the logarithm of Theta 1 hat and image represents the logarithm of Theta 2 hat was obtained by simultaneously modeling the 2004, 2005, and 2006 NSDUH data. This simultaneous modeling approach was adopted based on the results of the validation study (see Appendix E, Section E.2, of Wright, 2003b) conducted for measuring change in 1999-2000 and 2000-2001 State estimates. For this simultaneous model, four age groups by 3 years (i.e., 12 subpopulation-specific models) were fitted, each with its own set of fixed and random effects. In this case, the general covariance matrices for the State and within-State random effects were 12 by 12 matrices corresponding to the 12 element (age group by year) vectors of random effects. Note that the survey-weighted Bernoulli-type log likelihood employed in SWHB methodology was appropriate for this simultaneous model because the 12 age group by year subpopulations were nonoverlapping. The correlation [image represents the logarithm of Theta 1 hat, image represents the logarithm of Theta 2 hat] was approximated by the correlation calculated using the posterior distributions of log[πsa(1) / (1 – πsa(1))] and log[πsa(2) / (1 – πsa(2))] from the simultaneous model.

To calculate the p value for testing the null hypothesis of no difference (lor = 0), it is assumed that the posterior distribution of lor is normal with Mean is equal to estimate of the log-odds ratio, lor hat, sub s and a and Variance is equal to variance v of the estimate of the log-odds ratio, lor hat, sub s and a. With the null value of lor = 0, the Bayes p value or posterior probability of no difference is p value = 2*P [Zabs(z)], where Z is a standard normal random variate, Quantity z is the estimate of the log-odds ratio, lor hat, sub s and a, divided by the square room of the variance v of the estimate of the log-odds ratio, lor hat, sub s and a, and abs(z) denotes the absolute value of z.

Table A.1 Sample Sizes, Weighted Screening and Interview Response Rates, and Population Estimates, by State, for Persons Aged 12 or Older: 2004
State Total Selected DUs Total Eligible DUs Total Completed Screeners Weighted DU Screening Response Rate Total Selected Total Responded Population Estimate Weighted Interview Response Rate Weighted Overall Response Rate
DU = dwelling unit.
Source: SAMHSA, Office of Applied Studies, National Survey on Drug Use and Health, 2004.
Total U.S. 169,514 142,612 130,130 90.92% 81,973 67,760 240,514,815 77.00% 70.01%
Northeast 37,347 31,454 27,750 86.79% 16,674 13,523 45,497,345 75.14% 65.21%
Midwest 46,219 39,482 36,137 91.61% 22,920 18,889 54,211,630 77.63% 71.12%
South 52,946 43,682 40,257 92.40% 24,820 20,807 86,141,190 78.65% 72.67%
West 33,002 27,994 25,986 91.42% 17,559 14,541 54,664,650 75.38% 68.91%
                   
Alabama 1,991 1,611 1,477 91.72% 1,055 880 3,740,924 74.76% 68.57%
Alaska 1,902 1,525 1,399 91.61% 1,078 894 511,059 79.21% 72.57%
Arizona 2,226 1,858 1,750 94.21% 1,119 903 4,616,821 77.92% 73.41%
Arkansas 2,369 1,933 1,833 94.83% 1,062 900 2,259,150 80.09% 75.95%
California 7,911 6,957 6,192 88.60% 4,631 3,725 29,016,735 72.88% 64.57%
Colorado 2,207 1,822 1,712 93.92% 1,135 934 3,735,710 77.90% 73.17%
Connecticut 2,493 2,209 2,013 90.99% 1,098 897 2,901,872 75.85% 69.02%
Delaware 2,253 1,954 1,794 91.90% 1,144 932 688,666 77.70% 71.41%
District of Columbia 3,155 2,606 2,242 86.24% 1,041 903 466,433 82.55% 71.19%
Florida 10,456 8,488 7,581 88.99% 4,526 3,662 14,478,448 73.89% 65.75%
Georgia 2,141 1,752 1,597 91.32% 1,054 890 7,063,198 80.38% 73.41%
Hawaii 1,959 1,715 1,575 91.94% 1,088 903 1,014,184 77.42% 71.18%
Idaho 2,015 1,704 1,607 94.31% 1,051 902 1,125,089 82.42% 77.74%
Illinois 8,457 7,458 6,342 85.01% 4,444 3,575 10,387,581 75.12% 63.86%
Indiana 2,176 1,833 1,742 95.05% 1,085 891 5,098,367 77.64% 73.79%
Iowa 1,990 1,745 1,641 94.14% 1,039 890 2,468,073 81.10% 76.35%
Kansas 2,294 1,953 1,841 94.22% 993 828 2,226,734 78.58% 74.04%
Kentucky 2,372 2,059 1,949 94.67% 1,144 933 3,421,489 73.82% 69.88%
Louisiana 2,106 1,713 1,614 94.17% 1,082 933 3,646,863 81.16% 76.43%
Maine 2,731 2,168 2,025 93.40% 1,064 896 1,127,062 81.46% 76.08%
Maryland 2,122 1,855 1,617 86.77% 1,039 901 4,557,984 81.39% 70.63%
Massachusetts 2,218 1,895 1,686 89.13% 1,087 877 5,380,703 76.92% 68.56%
Michigan 9,530 7,969 7,155 89.78% 4,490 3,670 8,364,197 75.61% 67.88%
Minnesota 2,001 1,714 1,578 91.98% 1,066 907 4,237,627 83.72% 77.00%
Mississippi 1,931 1,549 1,482 95.71% 1,053 914 2,341,802 80.45% 77.00%
Missouri 2,190 1,872 1,764 94.23% 1,104 897 4,751,346 77.96% 73.46%
Montana 2,511 1,990 1,874 94.18% 1,080 907 781,536 79.58% 74.95%
Nebraska 2,044 1,729 1,629 94.21% 1,072 897 1,430,465 80.70% 76.03%
Nevada 1,903 1,641 1,552 93.71% 1,053 888 1,898,843 78.32% 73.39%
New Hampshire 2,348 1,908 1,765 92.38% 1,114 904 1,095,589 76.40% 70.58%
New Jersey 2,764 2,359 2,033 85.50% 1,153 886 7,172,774 72.04% 61.60%
New Mexico 2,190 1,799 1,719 95.54% 1,072 922 1,552,672 80.98% 77.37%
New York 10,475 8,940 7,372 82.28% 4,585 3,638 15,978,304 73.79% 60.72%
North Carolina 2,185 1,733 1,635 94.33% 1,029 869 6,927,805 79.39% 74.89%
North Dakota 2,576 2,128 2,020 94.95% 1,071 911 530,030 81.21% 77.11%
Ohio 8,599 7,463 7,026 94.14% 4,404 3,613 9,489,788 76.91% 72.40%
Oklahoma 2,382 1,889 1,769 93.71% 1,054 867 2,867,524 76.21% 71.42%
Oregon 2,234 1,931 1,825 94.50% 1,108 910 3,001,872 76.30% 72.10%
Pennsylvania 9,599 8,236 7,448 90.44% 4,360 3,590 10,399,693 77.05% 69.68%
Rhode Island 2,030 1,785 1,588 89.11% 1,126 911 907,154 76.31% 68.00%
South Carolina 2,392 1,946 1,844 94.73% 1,042 885 3,437,860 81.78% 77.47%
South Dakota 2,024 1,674 1,594 95.24% 1,034 893 630,156 82.20% 78.30%
Tennessee 2,387 2,049 1,933 94.37% 1,023 896 4,888,070 85.51% 80.70%
Texas 7,923 6,599 6,254 94.72% 4,334 3,631 17,783,855 79.21% 75.03%
Utah 1,718 1,464 1,389 94.70% 1,040 910 1,851,896 83.73% 79.28%
Vermont 2,689 1,954 1,820 93.02% 1,087 924 534,195 81.75% 76.04%
Virginia 2,060 1,773 1,587 89.40% 1,080 902 6,027,395 79.88% 71.41%
Washington 1,998 1,769 1,677 94.81% 1,086 886 5,134,850 75.97% 72.03%
West Virginia 2,721 2,173 2,049 94.31% 1,058 909 1,543,726 79.17% 74.67%
Wisconsin 2,338 1,944 1,805 92.86% 1,118 917 4,597,266 77.89% 72.33%
Wyoming 2,228 1,819 1,715 94.28% 1,018 857 423,382 81.54% 76.88%
Table A.2 Sample Sizes, Weighted Interview Response Rates, and Population Estimates, by State and Three Age Groups: 2004
State 12-17 18-25 26+
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Source: SAMHSA, Office of Applied Studies, National Survey on Drug Use and Health, 2004.
Total U.S. 25,141 22,309 25,214,390 88.56% 27,408 23,075 32,193,946 83.87% 29,424 22,376 183,106,479 74.22%
Northeast 4,999 4,363 4,536,148 86.58% 5,543 4,579 5,684,244 81.81% 6,132 4,581 35,276,953 72.67%
Midwest 7,147 6,311 5,681,355 87.55% 7,635 6,397 7,441,099 83.46% 8,138 6,181 41,089,176 75.22%
South 7,611 6,822 9,000,856 89.90% 8,328 7,134 11,537,980 85.58% 8,881 6,851 65,602,354 75.87%
West 5,384 4,813 5,996,031 89.01% 5,902 4,965 7,530,623 83.19% 6,273 4,763 41,137,996 72.01%
                         
Alabama 335 300 380,438 88.15% 317 277 506,024 87.42% 403 303 2,854,462 70.97%
Alaska 343 301 68,234 87.37% 376 308 71,635 80.38% 359 285 371,190 77.66%
Arizona 355 307 504,134 86.71% 356 280 632,441 79.72% 408 316 3,480,247 76.36%
Arkansas 336 301 232,624 89.62% 372 312 309,270 83.12% 354 287 1,717,256 78.00%
California 1,408 1,251 3,256,862 88.81% 1,523 1,259 3,971,071 82.89% 1,700 1,215 21,788,802 68.82%
Colorado 339 309 392,567 92.63% 435 358 502,509 81.78% 361 267 2,840,634 75.05%
Connecticut 351 310 297,475 88.74% 341 290 340,627 82.95% 406 297 2,263,770 72.90%
Delaware 344 296 67,017 86.70% 402 330 91,920 81.90% 398 306 529,729 75.84%
District of Columbia 324 291 33,936 90.50% 369 328 67,513 88.13% 348 284 364,984 80.63%
Florida 1,422 1,248 1,392,381 88.13% 1,426 1,197 1,690,586 83.29% 1,678 1,217 11,395,480 70.76%
Georgia 310 281 770,391 90.24% 384 325 974,428 85.42% 360 284 5,318,379 77.85%
Hawaii 314 290 100,117 92.32% 374 313 121,874 84.55% 400 300 792,193 74.27%
Idaho 310 279 127,641 90.53% 362 318 170,720 87.99% 379 305 826,729 80.03%
Illinois 1,316 1,166 1,096,436 89.10% 1,483 1,214 1,405,081 81.40% 1,645 1,195 7,886,063 72.15%
Indiana 339 284 547,820 80.66% 370 321 712,431 87.14% 376 286 3,838,117 75.51%
Iowa 354 319 241,677 90.80% 322 283 354,834 89.24% 363 288 1,871,562 78.38%
Kansas 309 279 235,602 90.08% 331 278 326,635 84.04% 353 271 1,664,497 75.58%
Kentucky 338 297 336,208 88.01% 379 324 454,337 85.35% 427 312 2,630,944 70.36%
Louisiana 315 288 401,563 91.61% 384 345 546,374 89.71% 383 300 2,698,926 77.88%
Maine 325 292 109,324 88.79% 378 310 136,314 82.23% 361 294 881,424 80.39%
Maryland 331 311 490,535 94.06% 350 299 564,517 86.07% 358 291 3,502,932 78.60%
Massachusetts 320 280 511,108 87.59% 372 304 678,194 81.46% 395 293 4,191,401 74.97%
Michigan 1,441 1,273 906,283 88.40% 1,503 1,266 1,113,043 83.80% 1,546 1,131 6,344,871 72.25%
Minnesota 346 305 440,475 87.61% 333 280 594,051 85.11% 387 322 3,203,101 82.96%
Mississippi 292 276 255,992 94.84% 415 367 350,329 88.32% 346 271 1,735,480 76.64%
Missouri 349 296 488,189 84.08% 355 293 650,694 81.04% 400 308 3,612,463 76.59%
Montana 320 277 78,581 87.14% 373 324 108,216 85.88% 387 306 594,739 77.30%
Nebraska 266 236 149,210 88.31% 413 342 210,327 82.97% 393 319 1,070,927 79.27%
Nevada 307 281 197,330 89.52% 356 307 234,194 87.69% 390 300 1,467,319 75.18%
New Hampshire 340 292 115,175 86.06% 335 285 136,081 83.41% 439 327 844,334 74.06%
New Jersey 308 265 741,001 83.21% 393 297 825,494 76.88% 452 324 5,606,279 70.03%
New Mexico 341 315 173,978 91.56% 333 296 222,316 88.48% 398 311 1,156,379 77.93%
New York 1,345 1,144 1,583,424 85.11% 1,564 1,275 2,048,409 81.31% 1,676 1,219 12,346,471 71.15%
North Carolina 336 307 710,225 91.75% 338 285 893,651 84.47% 355 277 5,323,929 76.67%
North Dakota 350 314 51,236 89.71% 368 315 83,256 84.18% 353 282 395,539 79.50%
Ohio 1,418 1,243 982,106 87.60% 1,428 1,186 1,258,053 83.17% 1,558 1,184 7,249,629 74.37%
Oklahoma 325 288 293,667 89.22% 386 324 417,990 85.22% 343 255 2,155,867 72.18%
Oregon 349 311 297,975 88.86% 365 309 394,016 85.49% 394 290 2,309,881 72.97%
Pennsylvania 1,314 1,177 1,037,595 89.81% 1,433 1,197 1,321,982 84.56% 1,613 1,216 8,040,116 74.30%
Rhode Island 342 285 87,882 85.57% 377 326 127,105 86.38% 407 300 692,166 73.19%
South Carolina 349 307 357,948 87.80% 292 258 463,134 89.41% 401 320 2,616,779 79.59%
South Dakota 277 257 67,385 91.02% 387 346 94,182 89.49% 370 290 468,590 79.79%
Tennessee 295 273 476,738 91.61% 341 298 640,352 88.72% 387 325 3,770,980 84.21%
Texas 1,350 1,205 2,044,166 89.33% 1,444 1,236 2,607,359 85.92% 1,540 1,190 13,132,330 76.31%
Utah 348 324 227,860 93.80% 343 301 354,811 86.85% 349 285 1,269,225 80.91%
Vermont 354 318 53,165 89.86% 350 295 70,039 87.11% 383 311 410,991 79.80%
Virginia 296 268 619,572 89.10% 374 310 765,684 80.74% 410 324 4,642,140 78.68%
Washington 345 301 527,781 86.17% 378 311 685,109 80.79% 363 274 3,921,960 73.76%
West Virginia 313 285 137,455 91.56% 355 319 194,513 90.54% 390 305 1,211,758 76.03%
Wisconsin 382 339 474,936 89.82% 342 273 638,512 80.49% 394 305 3,483,818 75.77%
Wyoming 305 267 42,970 89.29% 328 281 61,714 86.59% 385 309 318,699 79.61%
Table A.3 Sample Sizes, Weighted Screening and Interview Response Rates, and Population Estimates, by State, for Persons Aged 12 or Older: 2005
State Total Selected DUs Total Eligible DUs Total Completed Screeners Weighted DU Screening Response Rate Total Selected Total Responded Population Estimate Weighted Interview Response Rate Weighted Overall Response Rate
DU = dwelling unit.
Source: SAMHSA, Office of Applied Studies, National Survey on Drug Use and Health, 2005.
Total U.S. 175,958 146,912 134,055 91.33% 83,805 68,308 243220283 76.19% 69.58%
Northeast 38,755 32,817 28,810 86.72% 16,994 13,711 45630823 73.66% 63.88%
Midwest 47,200 40,222 36,706 91.59% 23,542 19,154 54524718 76.42% 69.99%
South 55,686 45,138 41,691 92.62% 25,411 20,818 87601607 77.16% 71.47%
West 34,317 28,735 26,848 92.91% 17,858 14,625 55463137 76.42% 71.01%
                   
Alabama 2,320 1,779 1,653 93.00% 1,118 914 3773741 77.10% 71.70%
Alaska 2,245 1,717 1,592 92.71% 1,137 921 519047 75.22% 69.74%
Arizona 1,945 1,609 1,518 94.18% 1,112 908 4791433 78.75% 74.16%
Arkansas 2,194 1,854 1,753 94.54% 1,040 851 2285001 77.70% 73.45%
California 7,672 6,875 6,297 91.57% 4,633 3,699 29214010 75.57% 69.20%
Colorado 2,333 1,951 1,839 94.26% 1,110 895 3793427 75.30% 70.97%
Connecticut 2,602 2,250 2,042 90.77% 1,201 978 2915935 77.45% 70.31%
Delaware 2,473 1,994 1,824 91.53% 1,160 942 700649 76.05% 69.61%
District of Columbia 3,628 3,072 2,655 86.34% 1,071 851 461073 74.67% 64.47%
Florida 10,631 8,280 7,581 91.61% 4,606 3,669 14828967 72.57% 66.47%
Georgia 2,328 1,849 1,721 92.99% 1,108 920 7294559 78.52% 73.01%
Hawaii 2,404 1,900 1,735 91.06% 1,134 895 1027252 71.95% 65.52%
Idaho 2,036 1,745 1,646 94.39% 1,087 915 1158701 81.04% 76.50%
Illinois 9,357 8,281 6,864 82.81% 4,731 3,661 10446542 71.84% 59.49%
Indiana 2,290 1,944 1,845 94.87% 1,117 900 5133632 73.79% 70.01%
Iowa 2,010 1,733 1,636 94.39% 1,088 923 2486265 79.03% 74.59%
Kansas 2,383 2,034 1,895 92.97% 1,133 938 2242553 79.53% 73.94%
Kentucky 2,403 2,070 1,940 93.74% 1,086 895 3447472 74.87% 70.18%
Louisiana 2,273 1,740 1,645 94.56% 1,017 840 3667177 76.58% 72.41%
Maine 2,834 2,113 1,940 91.83% 1,041 891 1133884 80.22% 73.66%
Maryland 2,315 2,027 1,739 85.78% 1,156 941 4595815 76.80% 65.88%
Massachusetts 2,538 2,246 2,009 89.32% 1,187 960 5368881 74.44% 66.49%
Michigan 9,190 7,629 6,898 90.37% 4,503 3,655 8384776 76.32% 68.97%
Minnesota 1,899 1,641 1,555 94.74% 1,063 904 4273652 81.74% 77.44%
Mississippi 2,369 1,780 1,697 95.39% 1,106 930 2361852 80.33% 76.63%
Missouri 2,119 1,762 1,666 94.57% 1,073 884 4802657 78.08% 73.84%
Montana 2,571 1,976 1,866 94.42% 1,083 914 791608 79.72% 75.28%
Nebraska 2,377 2,072 1,953 94.24% 1,127 935 1442367 77.51% 73.05%
Nevada 2,262 1,907 1,797 94.28% 1,111 917 1969076 76.12% 71.77%
New Hampshire 2,500 2,086 1,883 87.02% 1,098 881 1107223 77.35% 67.31%
New Jersey 2,466 2,114 1,866 88.21% 1,197 925 7195333 70.39% 62.09%
New Mexico 2,176 1,811 1,713 94.56% 1,036 902 1578514 83.61% 79.06%
New York 10,878 9,398 7,676 81.75% 4,683 3,622 16034185 71.14% 58.15%
North Carolina 2,308 1,789 1,684 94.11% 1,035 861 7058554 79.25% 74.59%
North Dakota 2,487 2,059 1,950 94.68% 1,097 933 533566 81.83% 77.48%
Ohio 8,990 7,750 7,310 94.37% 4,403 3,579 9513391 76.84% 72.51%
Oklahoma 2,497 1,989 1,872 94.15% 1,159 946 2897287 78.34% 73.76%
Oregon 2,423 2,093 1,962 93.89% 1,142 920 3049330 74.93% 70.35%
Pennsylvania 10,195 8,787 7,893 89.74% 4,463 3,684 10436338 76.71% 68.84%
Rhode Island 2,332 1,964 1,760 89.63% 1,074 890 902072 79.22% 71.01%
South Carolina 2,594 2,076 1,970 94.91% 1,086 910 3493487 80.56% 76.46%
South Dakota 1,955 1,593 1,522 95.51% 1,104 927 635910 78.13% 74.62%
Tennessee 2,273 1,934 1,762 91.06% 1,101 921 4950513 80.14% 72.97%
Texas 7,790 6,411 6,096 95.10% 4,276 3,562 18113028 78.62% 74.77%
Utah 1,622 1,402 1,342 95.61% 1,077 939 1926464 81.72% 78.14%
Vermont 2,410 1,859 1,741 93.90% 1,050 880 536973 78.31% 73.54%
Virginia 2,318 1,999 1,759 88.35% 1,156 941 6125856 75.60% 66.79%
Washington 2,061 1,737 1,641 94.54% 1,074 876 5216989 76.04% 71.88%
West Virginia 2,972 2,495 2,340 93.84% 1,130 924 1546578 76.22% 71.53%
Wisconsin 2,143 1,724 1,612 93.54% 1,103 915 4629408 78.18% 73.13%
Wyoming 2,567 2,012 1,900 94.43% 1,122 924 427287 77.40% 73.09%
Table A.4 Sample Sizes, Weighted Interview Response Rates, and Population Estimates, by State and Three Age Groups: 2005
State 12-17 18-25 26+
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Total
Selected
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Source: SAMHSA, Office of Applied Studies, National Survey on Drug Use and Health, 2005.
Total U.S. 25,840 22,565 25,354,871 87.10% 27,337 22,764 32,485,929 83.06% 30,628 22,979 185,379,484 73.50%
Northeast 5,266 4,563 4,545,576 85.14% 5,618 4,612 5,709,650 80.84% 6,110 4,536 35,375,597 70.94%
Midwest 7,264 6,348 5,665,730 86.77% 7,654 6,355 7,448,881 82.93% 8,624 6,451 41,410,106 73.90%
South 7,899 6,899 9,065,330 87.79% 8,245 6,960 11,704,787 85.08% 9,267 6,959 66,831,490 74.34%
West 5,411 4,755 6,078,235 87.83% 5,820 4,837 7,622,611 81.78% 6,627 5,033 41,762,291 73.83%
                         
Alabama 344 293 379,863 84.92% 368 312 506,216 85.30% 406 309 2,887,663 74.27%
Alaska 312 272 68,090 88.06% 399 338 75,289 84.35% 426 311 375,668 71.25%
Arizona 314 282 517,262 90.59% 401 314 655,373 78.31% 397 312 3,618,797 77.24%
Arkansas 304 273 231,565 90.65% 370 309 311,085 85.29% 366 269 1,742,351 74.43%
California 1,408 1,211 3,324,479 86.65% 1,492 1,201 3,977,199 80.78% 1,733 1,287 21,912,332 72.98%
Colorado 333 293 393,077 88.81% 334 270 510,901 80.43% 443 332 2,889,449 72.26%
Connecticut 387 335 300,551 82.84% 449 363 347,469 81.66% 365 280 2,267,915 75.90%
Delaware 379 329 67,891 87.13% 375 312 92,587 84.10% 406 301 540,171 73.13%
District of Columbia 319 272 34,763 87.27% 298 252 62,881 84.05% 454 327 363,429 72.10%
Florida 1,410 1,235 1,415,728 88.36% 1,515 1,255 1,748,510 82.80%