Proceedings of the Joint Statistical Meetings 2024 Survey Research Methods Section
August 3 - August 8, 2024

Bayesian Dependent Data Models and Machine Learning for Official Statistics and Survey Methodology — Invited Paper Session
Organizer(s): Yajuan Si, University of Michigan
Chair(s): Scott Holan, University of Missouri/U.S. Census Bureau

Bayesian Hierarchical Models For Multi-type Survey Data Using Spatially Correlated Covariates Measured With Error (View Presentation) — Scott Holan, University of Missouri/U.S. Census Bureau; Saikat Nandy, St. Jude Children's Research Hospital; Jonathan Bradley, Florida State University; Christopher Wikle, University of Missouri-Columbia

Incorporating heterogeneous types of uncertainty in small area estimates from multiple demographic data sources(View Presentation) — Lance Waller, Emory University

Multi-Source Hierarchical Models for Geographically Granular Retail Sales Estimates(View Presentation) — Darcy Morris, U.S. Census Bureau

Statistical Deep Learning for Dependent Establishment Data(View Presentation) — Paul Parker, University of California Santa Cruz

Innovative Modeling Approaches for Small Area Estimation in the Presence of Complex Dependence Structures
Organizer(s): Scott Holan, University of Missouri/U.S. Census Bureau; Paul Parker, University of California Santa Cruz
Chair(s): Scott Holan, University of Missouri/U.S. Census Bureau

A Socio-Demographic Latent Space Approach to Spatial Data When Geography is Important but not All-Important(View Presentation) — Saikat Nandy, St. Jude Children's Research Hospital; Scott Holan, University of Missouri/U.S. Census Bureau; Michael Schweinberger, The Pennsylvania State University

Bayesian Unit-level Modeling of Categorical Survey Data with a Longitudinal Design(View Presentation) — Daniel Vedensky, University of Missouri; Scott Holan, University of Missouri/U.S. Census Bureau; Paul Parker, University of California Santa Cruz

Estimation for Multi-type Survey Data(View Presentation) — Zewei Kong; Paul Parker, University of California Santa Cruz; Scott Holan, University of Missouri/U.S. Census Bureau

Exploring patterns and determinants of the agro carbon footprint in the Po Valley (Italy) using spatio-temporal small area models(View Presentation) — Paolo Maranzano; Riccardo Borgoni, University of Milano-Bicocca; Felicetta Carillo; Riccardo Pajno

Spatially Selected and Dependent Random Effects for Small Area Estimation with Application to Rent Burden(View Presentation) — Sho Kawano; Paul Parker, University of California Santa Cruz; Zehang Li, UCSC

A deeper look into education bias in web surveys
Chair(s): Martha McRoy, NORC at the University of Chicago

A deeper look into education bias in web surveys(View Presentation) — Mark Trappmann, Institute for Employment Research (IAB); Mustafa Coban, IAB; Christine Distler, IAB

Adapting Geographical Sampling Unit Size and Structure for a Changing Survey Landscape(View Presentation) — Holly Cast, NORC at The University of Chicago; Nicholas Davis, NORC at The University of Chicago; Whitney Murphy, NORC at The University of Chicago; Chrystine Tadler, NORC at The University of Chicago; Nathaniel Poland, NORC at the University of Chicago

Assessing Data Quality and Inference in a Web Respondent Driven Sampling Study of Ethnic Minorities(View Presentation) — Kaidar Nurumov; Sunghee Lee, University of Michigan

Do Young People Still Prefer the Web? Revisiting Predictors of Survey Mode Preference(View Presentation) — Kristen Olson, University of Nebraska-Lincoln; Jolene Smyth, University of Nebraska, Lincoln

Exploring Mode Effect Adjustment Approaches for a Web and Face-to-face Survey(View Presentation) — Brian Wells, NORC at the University of Chicago; Sara Lafia, NORC at the University of Chicago; Martha McRoy, NORC at the University of Chicago

The application of sample overlap control to surveys of mothers and fathers in Ohio — Jamie Ridenhour, RTI International

Using Principal Stratification to Detect Mode Effects in a Longitudinal Setting(View Presentation) — Wenshan Yu; Trivellore Raghunathan, University of Michigan; Michael Elliott, University of Michigan

Statistical Methods for Survey Data Integration and its Related Topics — Topic Contributed Session
Organizer(s): Jae-Kwang Kim, Iowa State University
Chair(s): Zhengyuan Zhu, Iowa State University

Weight Smoothing via Design Modeling in Complex Surveys(View Presentation) — F. Jay Breidt, NORC at The University of Chicago

Integrating a non-probability sample and its complementary probability sample(View Presentation) — Andrius Čiginas, Vilnius University; Jae-Kwang Kim, Iowa State University; Ieva Burakauskaite, Vilnius University

A Generalized Least Squares Estimator for Combining Probability and Nonprobability Samples when the Variable of Interest is Measured in Both Surveys(View Presentation) — Emily Berg; Zhengyuan Zhu, Iowa State University; Chengpeng Zeng

Presentation(View Presentation) — Yumou Qiu

Singular Propensity Scores: Reducing Variance in Weighted Estimators(View Presentation) — Kosuke Morikawa, Osaka University

Innovative Methods for Survey Sampling — Contributed Paper Session
Chair(s): Jamie Ridenhour, RTI International

A Novel Estimate for the Respondent-Driven Sampling Methods: A Resampling Approach(View Presentation) — Hui Yi, University of Georgia; Kyle Vincent; David Okech, University of Georgia Jody Clay-Warner, University of Georgia; Nnenne Onyioha-Clayton, University of Georgia; Anne Waswa, University of Georgia
Keywords: Respondent-Driven Sampling (RDS), Network Sampling, New Estimates for Network Sampling (NE4NS), Volz-Heckathorn (VH) Weighting Scheme, Hidden or Hard-to-Reach Population, Resampling Approach

Sunshine or Rainbows? Deconstructing referendum results in Florida, 2012-2022(View Presentation) — Jonathan Fischer
Keywords: Election data, public opinion, decomposition, deconvolution, public policy, regression

Cycling of Non-Self-Representing Primary Sample Units in the National Health Interview Survey(View Presentation) — Padraic Murphy, US Census Bureau; John Chesnut, Census Bureau
Keywords: Survey Sample Design, Multi-stage Sampling, Primary Sample Units, Cycling, National Health Interview Survey

Monthly Sample Size Prediction for the Current Population Survey(View Presentation) — John Jones, US Census Bureau, DSSD; Brian Shaffer, US Census Bureau; Timothy Trudell
Keywords: Sample Frame, Modeling

Prevalence and Effect of Misclassification on Measuring Census Coverage(View Presentation) — Timothy Kennel, Federal Government
Keywords: Measurement Error, Census, Coverage, Dual-System Estimation, Capture-Recapture

Navigating Complexity: Recent Advances in Analysis of Data from Complex Surveys — Topic Contributed Papers
Organizer(s): Paul Parker, University of California Santa Cruz; Scott Holan, University of Missouri/U.S. Census Bureau
Chair(s): Paul Parker, University of California Santa Cruz

A Pseudo-likelihood Approach to Under-5 Mortality Estimation — Taylor Okonek; Jon Wakefield, University of Washington; Katherine Wilson

Bayesian Classification Trees for Binary Survey Data under Informative Samples — Diya Bhaduri, University of Missouri-Columbia; Scott Holan, University of Missouri/U.S. Census Bureau; Daniell Toth, US Bureau of Labor Statistics

Echo state network for spatial temporal areal data — Zhenhua Wang

Latent Dirichlet Allocation for Complex Surveys Under Informative Sampling — Namitha Pais, University of Connecticut; Paul Parker, University of California Santa Cruz; Scott Holan, University of Missouri/U.S. Census Bureau

Preliminary Blended Index Variance Estimation with Census Trade Data — Daniel Yang, Bureau of Labor Statistics

Combining Probability and Non-Probability Data: Considerations, Methods, and Applications — Invited Paper Session
Organizer(s): Morgan Earp, National Center for Health Statistics; Katherine Irimata, National Center for Health Statistics
Chair(s): Brady West, Institute for Social Research

A Look at Propensity-based Methods for Combining Probability and Non-probability Sample Data(View Presentation) — Matt Williams, RTI

A New Evaluation of the Impact of Combining Probability and Non-probability Sample Data(View Presentation) — Jon Krosnick, Stanford University; Sierra Davis, Stanford University

Comparing Alternative Estimation Methods Using Combined Probability and Nonprobability Samples(View Presentation) — Michael Yang, NORC at The University of Chicago; Soubhik Barari, NORC at the University of Chicago; David Dutwin, NORC at the University of Chicago; Chien-Min Huang, NORC at the University of Chicago; Stanislav Kolenikov, NORC at The University of Chicago

Leveraging Non-Probability Data at the National Center for Health Statistics(View Presentation) — Katherine Irimata, National Center for Health Statistics; Paul Scanlon, National Center for Health Statistics; Lauren Rossen, National Center for Health Statistics; Guangyu Zhang, National Center for Health Statistics

Utilizing Data from an Incomplete Sample to Supplement the Probability-Based U.S. PIAAC Cycle II(View Presentation) — Wendy Van de Kerckhove, Westat; Tom Krenzke, Westat; Benjamin Schneider, Westat; Mike Kwanisai, Westat

Recent Advances in Estimation Methods for Survey Data — Contributed Papers
Chair(s): Robyn Ferg, Westat

Evaluation of a Modified Gross Flows Estimator for The Current Population Survey(View Presentation) — Stephen Miller, Bureau of Labor Statistics; Connor Doherty
Keywords: Monthly Labor Force Transitions, Survey Weights

Exploratory Analysis that redefined the parameter of a variable in Consumer Price Index Housing Age(View Presentation) — Alice Yu; Ayme Tomson, BLS; Benjamin Houck, BLS; Chun Wing Tse, BLS
Keywords: CPI, building's age bias, exploratory analysis, python

Fitting multilevel models using STEPS data from multiple countries for estimating health outcomes(View Presentation) — Timothy Raxworthy; Yajuan Si, University of Michigan
Keywords: multilevel logit model; diabetes; chronic disease; public health; disease reporting

Future of Tuned Ratio Unbiased Mean Predictor (TRUMP) with the Unified Scrambling Approach (USA)(View Presentation) — Sarjinder Singh, Texas A&M University-Kingsville; Stephen Sedory, Texas A & M University - Kingsville
Keywords: Population Mean; Scrambled Responses; Jackknifing; TRUMP cuts; Linear model

Implementing weighted interval censoring survival analysis on tobacco regulatory science(View Presentation) — Adriana Perez, University of Texas At Houston, Health Science Center; Sarah Valencia, Michael & Susan Dell Center for Healthy Living. University of Texas Health Science Center Houston; Pushan P Jani, The University of Texas Health Science Center at Houston, School of Medicine; Melissa B Harrell, The University of Texas Health Science Center at Houston, School of Public Health
Keywords: Interval Censoring Hazard Function; Balanced Repeated Replicate Weights; Sampling weights; Fay's variance estimation

Non-commercial catch estimation of pelagic species in the main Hawaiian Islands(View Presentation) — Hongguang Ma, PIFSC, NOAA Fisheries; Toby Matthews, PIFSC, NOAA Fisheries
Keywords: Hawaii Marine Recreational Fishing Survey (HMRFS); pelagic species; non-commercial catch; fishing effort; catch rate; stock assessment and fishery evaluation (SAFE)

Some Results from the Continuous Count Study(View Presentation) — Mary Mulry, US Government; Vincent T. Mule, U.S. Census Bureau
Keywords: administrative records; population estimates; small area estimation

Big Data Initiatives in Survey Statistics — Contributed Papers
Chair(s): Jamie Ridenhour, RTI Internationa

Adaptive Sampling Design for Estimating Spatiotemporal Pathogen Prevalence in Cities(View Presentation) — Katherine McLaughlin, Oregon State University; Jeffrey Bethel, Oregon State University; Nicole Breuner, Oregon State University; Benjamin Dalziel, Oregon State University; Kathryn Higley, Oregon State University; Allison Myers, Oregon State University; Justin Preece, Oregon State University; Tyler Radniecki, Oregon State University
Keywords: adaptive sampling, COVID-19, prevalence estimation, spatial sampling, wastewater-based epidemiology

Estimating Control Total Acres for Desired Geographies Using Cropland Data Layer(View Presentation) — Mingyue Hu
Keywords: Sample surveys, Spatial data analysis, Machine learning

Exploring the big data paradox for various estimands using vaccination data from a global survey(View Presentation) — Youqi Yang, Walter Dempsey; Peisong Han, Gilead Sciences; Yashwant Deshmukh, CVoter Foundation; Sylvia Richardson, Cambridge College London; Brian Tom, University of Cambridge; Bhramar Mukherjee, University of Michigan
Keywords: Big Data Paradox, Non-probability sample, Selection bias, Online survey, Vaccine uptake

False Discovery Rate in Large-Scale Data Error Localization(View Presentation) — Chin-Fang Weng, US Census Bureau; Paul Smith, University of Maryland (retired); Eric Slud, U. S. Census Bureau
Keywords: data editing, response errors, over-editing, multiple hypothesis tests, periodic surveys

Selection bias in big data in official statistics from a practitioner’s point of view(View Presentation) — Martin Hyllienmark, Stockholm University; Dan Hedlin, Stockholm University; Edgar Bueno, Stockholm University
Keywords: Selection bias, Simulation study, Bias-variance tradeoff, Non-response bias

The Use of Big Data-Based Model Prediction for Stratification of Household Addresses(View Presentation) — Noah Bassel, NORC
Keywords: Big Data, Machine Learning, Stratification, Sample Design

The utility of big data for evaluating public opinion(View Presentation) — Michael Robbins, RAND Corporation
Keywords: Weighting, Big data, Sentiment analysis

Survey Data Analysis and Small Area Estimation: Some Innovative Contributions of Dr. Ralph Folsom — Topic Contributed Paper Session
Organizer(s): Akhil Vaish, RTI International; Phillip Kott, RTI International
Chair(s): Kathryn Spagnola
Discussant(s): Phillip Kott, RTI International(View Presentation)

On the Definition of Response Propensity for Survey Nonresponse(View Presentation) — Roderick Little, University of Michigan

Credible Distributions of Overall Ranking of Entities(View Presentation) — Gauri Datta, University of Georgia

Bias Evaluation for Web Health Surveys, A Sensitivity Analysis Approach(View Presentation) — Yulei He, National Center for Health Statistics; Katherine Irimata, National Center for Health Statistics; Yan Li, University of Maryland, College Park; Guangyu Zhang, National Center for Health Statistics

Unit-level Survey Weighted Hierarchical Bayes Small Area Estimation for Binary Outcomes(View Presentation) — Akhil Vaish, RTI International

Challenges in Error Estimation for Survey Data — Contributed Papers
Chair(s): Adriana Perez, University of Texas At Houston, Health Science Center

Comparison of Variance Estimators for Self-Representing Primary Sample Units — Stephen Ash, Bureau of Labor Statistics
Keywords: Variance estimation, Self-representing strata, Balanced-repeated replication, Delete-a-group jackknife, Successive difference replication

Improved estimators of variance of the regression estimator in two-phase sampling — Lane Christiansen; Sarjinder Singh, Texas A&M University-Kingsville
Keywords: Two-phase sampling; Jackknife; Regression estimator; Variance estimation

Incorporating Inclconlsuve Outcomes in Error Rate Estimation with Applications in Forensic Science — Sydney Campbell, University of Virginia; Karen Kafadar, University of Virginia; Jordan Rodu, University of Virginia
Keywords: error rates, inconclusive decisions, standardization, small sample size, quality, forensic science

Jackknife Variance Estimation for Web Panel Health Survey Estimates Based on a Propensity-Score Meth — Hee-Choon Shin, National Center for Health Statistics
Keywords: Variance, Complex Sample, Jackknife

Survey data integration with applications to hypertension among US children and adolescents — Chengpeng Zeng; Emily Berg, Iowa State University; Zhengyuan Zhu, Iowa State University
Keywords: Nonprobability sample, Probability sample, Informative sampling, Missing at random, Variance estimation, NHANES

The Effects of Measurement Error on Health Estimates in Web vs Face-to-Face National Health Surveys — Leanna Moron, Westat
Keywords: web survey; face-to-face survey; total survey error; secondary data analysis; significance testing

Generating Select Synthetic Data — Topic Contributed Panel Session
Chair(s): Minsun Riddles, Westat
Organizer(s): Thomas Krenzke, Westat
Panelist(s): Fang Liu, University of Notre Dame(View Presentation); Lin Li, Westat(View Presentation); Hang Kim, University of Cincinnati(View Presentation); Aaron Williams(View Presentation); Trivellore Raghunathan, University of Michigan(View Presentation); Saki Kinney, RTI International(View Presentation)

Contributed Poster Presentations: Survey Research Methods Section — Contributed Posters
Chair(s): Ryan Peterson, University of Colorado - Anschutz Medical Campus

60 A Survival Analysis for Respondent Burden in the American Community Survey Across Household Language — Heather Smalley, Willamette University; Kristen Gore
Keywords: American Community Survey, Survival, Break-off, Non-response, Mode Effect

61 Generalized Least Squares in Non-Monotone Missing Data — Caleb Leedy; Jae-Kwang Kim, Iowa State University
Keywords: Non-monotone missingness, Survey sampling, Generalized least squares, Data integration

62 Improving generalizability in the Penalized Spline of Propensity Methods for Treatment Comparison — Katherine Li, University of Michigan; Michael Elliott, University of Michigan; Yajuan Si, University of Michigan
Keywords: Penalized Spline of Propensity Methods for Treatment Comparison (PENCOMP), External Validity, Generalizability, Complex Sample Design, Observational Study, Causal Inference

64 The Use of QR Codes in a National, Multimode Survey — Kayla Varela, US Census Bureau
Keywords: Data Collection, QR Codes

65 Unique Challenges of Weighting Calibration and Impact on COVID-19 Vaccine Hesitancy Outcomes — Adrian Diaz; Elizabeth Allen, NORC at The University of Chicago; Vicki Pineau, NORC at The University of Chicago; Jason Boim; Michael Chen, Centers for Disease Control and Prevention; James Singleton, CDC; David Yankey, CDC; Carla Black, Centers for Disease Control and Prevention; Jennifer Kriss, Centers for Disease Control and Prevention; Yi Mu
Keywords: COVID-19 Vaccination Coverage, Calibration, Weighting Methods

Leveraging External Data Sources to Improve Federal Government Surveys — Topic Contributed Paper Session
Organizer(s): Minsun Riddles, Westat
Chair(s): Amy Lin, Westat
Discussant(s): Jean Opsomer, Westat; (View Presentation)Gizem Korkmaz, Westat(View Presentation)

Age-Eligibility Oversampling to Reduce Screening Costs in a Multimode Survey(View Presentation) — Stephanie Zimmer, RTI International; Stephanie Zimmer, RTI International; Joe McMichael, RTI International; Taylor Lewis

Enhancing Weighting in the National Health and Nutrition Examination Survey (NHANES) with External Data(View Presentation) — Jay Clark, Westat; Minsun Riddles, Westat; Matt Jans, National Center for Health Statistics; Te-Ching Chen, CDC/NCHS

Improving Survey Efficiency with Linked Data: The Survey of Doctorate Recipients Story(View Presentation) — Wan-Ying Chang, National Science Foundation; Lynn Milan, National Center for Science and Engineering Statistics, NSF; Flora Lan, National Center for Science and Engineering Statistics, NSF; Kelly Phou, National Center for Science and Engineering Statistics, NSF

Nonresponse Adjustment Methods for Survey Data — Contributed Papers
Chair(s): Don Jang, NORC at The University of Chicago

Efficient Multiple-Robust Estimation for Nonresponse Data Under Informative Sampling(View Presentation) — Kosuke Morikawa, Osaka University, Kenji Beppu, Osaka University, Wataru Aida, Osaka University
Keywords: Survey sampling, Data integration, Semiparametric efficiency, Empirical likelihood, Missing data

Fighting or Accepting Nonresponse: Considerations and Plans at Statistics Canada(View Presentation) — Eric Rancourt, Statistics Canada
Keywords: Administrative Data, Coefficient of Variation, Error, Resources, Sample Size

Local Sensitivity to Nonignorable Missingness in Overdispersed Count Using Negative Binomial Model(View Presentation) — Bocheng Jing; Hui Xie, Faculty of Health Sciences, Simon Fraser University; Grisell Diaz-Ramirez, University of California, San Francisco; Qian Yi, Sauder School of Business, University of British Columbia; Joan Hu, Simon Fraser University; W. Boscardin, UCSF Medicine & Biostatistics
Keywords: Missing Data; ISNI Index; Negative Binomial Model; Count outcome; Simulation Study

Participation rates in educational surveys: An identification and statistical inference perspective(View Presentation) — Diego Cortes
Keywords: Identification and statistical inference of population parameters, Estimating survey participation rates in complex sampling designs, Survey policy: reporting participation rates as a measure of data quality, International large-scale assessments in education, A contribution to the survey methodology and policy of the Trends in Mathematics and Science Study.

SPEED 7: Statistical Methods in Surveys & Policy Applications, Part 1 — Contributed Speed
Chair(s): Barbara Bailey, San Diego State University

A Partially Observed Merton’s Jump Model for Ultra-High Frequency Financial Data with Bayesian Learn — Jamila Kridan; Yong Zeng, National Science Foundation
Keywords: Ultrahigh-frequency data, Partially observed Merton’s jump model, Normalized filtering equation, Bayes factors

A Practical Approach for Case Prioritization in A Panel Survey — Rui Jiao, Westat; Xiaoshu Zhu, Westat; Nicholas Askew, WESTAT; Ting Yan, Westat; Sylvia Dohrmann, Westat
Keywords: case prioritization, response propensity, dynamic adaptive design, nonresponse bias

Accounting for reporting delays in real-time phylodynamic analyses with preferential sampling — Catalina Medina, University of California, Irvine; Julia Palacios, Stanford University; Lorenzo Cappello, Pompeu Fabra University; Volodymyr Minin, University of California-Irvine
Keywords: infectious disease dynamics, disease surveillance, Bayesian phylogenetics, genomic epidemiology, Bayesian nonparametrics

Analysis of Total Survey Error in the 2022 National Immunization Survey-Child — YUHEI KOSHINO, Zachary Seeskin, NORC at The University of Chicago; Benjamin Skalland, NORC at the University of Chicago; Kirk Wolter, NORC at The University of Chicago & University of Chicago; Holly Hill, Centers for Disease Control and Prevention; David Yankey, CDC; Laurie D Elam-Evans, CDC; Yi Mu; Kushagra Vashist, CDC
Keywords: Total survey error, Sampling-frame coverage error, Nonresponse error, Nonresponse error, Random digit dialing

Analyzing Survey Data with Tree Models: rpms R Package — Daniell Toth, US Bureau of Labor Statistics
Keywords: Sample Design, Regression Tree, Machine Learning, Government Survey Data, Statistical Inference, Statistical Model

By ignoring statistics, the government sometimes spread pandemic misinformation. — Alan Salzberg, Salt Hill Statistical Consulting
Keywords: masks, vaccines, myocarditis, CDC, Covid

Contrastive dimension estimation — Sam Hawke, Didong Li
Keywords: Dimension reduction, Contrastive dimension

Cross-fitting model evaluation for small area estimation using complex survey data. — Qianyu Dong, Zehang Richard Li, University of California, Santa Cruz
Keywords: Cross validation, Small Area Estimation, Complex survey data

Evaluation of Data Quality and Imputation Methods for EIA’s Liquefied Natural Gas Storage Report — Makayla Cowles, Energy Information Administration, Preston McDowney, DOE/EIA/SMG, Pushpal Mukhopadhyay, U.S. Energy Information Administration, Hongbin Weng, Energy Information Administration
Keywords: Energy Statistics, Clustering

Inference of effective reproduction number dynamics from wastewater data in small populations — Isaac Goldstein, University of California, Irvine, Volodymyr Minin, University of California-Irvine
Keywords: Bayesian Statistics, Infectious Disease Statistics, Stochastic Processes, Nowcasting, Epidemic Modeling, Infectious Disease Surveillance

Multifaceted Gender Identity Measurement As An Alternative to Forced-Choice Assessments — Thomas Belin, University of California-Los Angeles; Hilary Aralis, University of California Los Angeles; Zichen Liu, Univ; Andrew Chuang, University of California, Los Angeles; Sung-Jae Lee; Donatello Telesca, UCLA School of Public Health
Keywords: gender identity, sexual orientation, ordinal data, cluster analysis, nonbinary, gender fluidity

Multilevel Regression and Poststratification with Population Margins: Application to HIV Inference — Amy Pitts, Columbia University; Maiko Yomogida, Columbia University; Angela Aidala, Columbia University; Andrew Gelman, Columbia University; Qixuan Chen, Columbia University
Keywords: Multilevel Regression and Poststratification (MRP), Bayesian, Survey Methods, COVID-19, HIV

Restricted Adaptive Probability-Based Latin Hypercube Design — HUIJUAN LI
Keywords: Adaptive sampling, Environmental sampling, Latin Hypercube Design, Rao-Blackwell

Simulating Low-cost Rotating Panel Designs for the Commercial Buildings Energy Consumption Survey — Adebowale Sijuwade, U.S. Energy Information Administration; Janice Lent, U.S. Energy Information Administration; Michael Winkler, Energy Information Administration
Keywords: rotating panel surveys, complex surveys, simulation

Spatial Smoothing and FDR Control in Climate — Kyle McEvoy; Karen McKinnon, University of California, Los Angeles
Keywords: FDR, Spatial, Climate, Smoothing, Multiple Hypotheses, Regression

Statistical Behaviour of Mixed Crowds of Humans and Automata — Guillermo Frank; Claudio Dorso, Instituto de Fı́sica de Buenos Aires, CONICET
Keywords: crowd dynamics, social force model, emergency, automata, safety

Supplementing a Non-probability Sample with a Probability Sample to Predict the Population Mean — Zihang Xu; Balgobin Nandram, Worcester Polytechnic Institute
Keywords: adjusted survey weight, Gibbs sampling, logistic regression, missing data, propensity score, robust model

Cleaning Products for Your Data: Four Studies in Editing and Imputation — Topic-Contributed Paper Session
Organizer(s): Darcy Miller, USDA/NASS; Luca Sartore, National Institute of Statistical Sciences
Chair(s): Luca Sartore, National Institute of Statistical Sciences
Discussant(s): Megan Lipke, USDA/NASS

Applying Non-Survey Data and Machine Learning Techniques to Address Nonresponse in an Agricultural Area Frame Survey(View Presentation) — Darcy Miller, USDA/NASS; Tara Murphy, USDA National Agricultural Statistics; Service; Luca Sartore, National Institute of Statistical Sciences; Jonathon Abernethy, USDA/NASS; Robert Emmet; Linda Young, USDA NASS; Arthur Rosales

Developing a Hot Deck Imputation Procedure for the Annual Economic Integrated Survey(View Presentation) — Katherine Thompson, US Census Bureau

Ensuring Data Quality in Student Lists Submitted for Sampling for the 2022 National Assessment of Educational Progress(View Presentation) — Leslie Wallace, Westat

Rule-Based Data Validation and Reconciliation of Survey Responses(View Presentation) — Gunnar Ingle, Summit Consulting LLC; Albert Lee, Summit Consulting, LLC

Challenging Aspects of Small Area and Survey Research — Invited Paper Session
Organizer(s): Snigdhansu Chatterjee, University of Minnesota
Chair(s): Snigdhansu Chatterjee, University of Minnesota

Challenges of Estimating Inflation in Small Areas in Official Statistics(View Presentation) — Vladislav Beresovsky, U.S. Bureau of Labor Statistics

On an Empirical Likelihood-based Method for Complex Survey Data with Applications to Non-probability Sampling(View Presentation) — Sanjay Chaudhuri, National University of Singapore

Towards Developing Best Practices for Using Small Area Estimation (SAE) for Diversity, Racial Equity, and Inequality (DREI) Research(View Presentation) — Carolina Franco, NORC at The University of Chicago

Vector-weighted Mechanisms for Utility Maximization under Differential Privacy(View Presentation) — Terrance Savitsky, US Bureau of Labor Statistics

Innovations in Survey Methodology — Contributed Papers
Chair(s): Jeramiah Yeksavich, US Department of Energy

Automating Quality Control in Recorded Interviews with Machine Learning(View Presentation) — Peter Baumgartner, RTI International; Kirsty Weitzel; Jerry Timbrook
Keywords: Machine Learning, CATI, CARI, Automated Transcription, Survey Administration, Automated Quality Control

Evaluating the Impact of Three Incentive Schemes on Survey Responses in Online Longitudinal Panels(View Presentation) — Htay-Wah Saw
Keywords: Longitudinal data collections; Probability-based online panels; Panel attrition; Survey nonresponse; Survey incentives; Student mental well-being

Evaluating the Measurement of Household Expectations with Audio Recordings and Machine Learning(View Presentation) — Nicolás Forteza, Bank of Spain; Javier J. Alonso, Bank of Spain; Laura Crespo
Keywords: Machine Learning; Audio Transcription; Survey Methodology; Household Expectations

Improving Sexual Identity Measures in Health Disparity Studies with Machine Learning and Resampling(View Presentation) — Rona Hu; Brady West, Institute for Social Research
Keywords: Sexual Identity Measurement, Machine Learning, Health Disparity Estimates, Survey Research, National Survey of Family Growth (NSFG), Bootstrap Resampling

Interviewer Morale, Field Effort and Field Efficiency in the National Health Interview Survey(View Presentation) — Adena Galinsky; Galila Haile, National Center for Health Statistics; Beth Taylor, NCHS(CDC); Grace Medley, NCHS/ CDC; Maria Villarroel, NCHS(CDC); Antonia Warren, NCHS(CDC); Jonaki Bose, NCHS; Lindsay Howden, U.S. Census Bureau; Aaron Maitland, National Center for Health Statistics; Lillian Hoffmann, Census; James Dahlhamer, National Center for Health Statistics
Keywords: Interviewer support, Interviewer morale, CAPI survey, Field effort, Field efficiency

Leveraging Wearables Data to Improve Self-Reports in Survey Research: An Imputation-Based Approach(View Presentation) — Deji Suolang, University of Michigan - Ann Arbor; Brady West, Institute for Social Research
Keywords: missing data imputation, wearable sensor data, self-report survey, data integration, NHANES, NHIS

Testing Whether Text and Email Contacts Improve Response in a Large ABS Mixed-Mode Study(View Presentation) — Martha McRoy, NORC at the University of Chicago; Leah Christian, NORC; Zoe Slowinski, NORC; Christopher Hansen, NORC
Keywords: mixed-mode, contact strategies, text messaging, text reminders, response rates, text invitations

h3 align="center"> Bolstering Health Survey Data with Health-Related Administrative Data: Organizational Infrastructure and Analytic Examples — Topic-Contributed Paper Session
Organizer(s): Stephanie Coffey, US Census Bureau
Chair(s): Haley Hunter-Zinck

Leveraging Administrative Records and Record Linkages to Enhance Health Data at the U.S. Census Bureau(View Presentation) — Victoria Udalova, U.S. Census Bureau

Integration of Immunization Information Systems for Efficiencies in the National Immunization Survey(View Presentation) — Elizabeth Allen, NORC at The University of Chicago; Megha S. Ravanam, NORC at The University of Chicago; Kaitlin Peterson, NORC at The University of Chicago; Madeleine Valier, Center for Disease Control; Sean Hu, CDC; Lauren Shaw, Centers for Disease Control and Prevention; James Singleton, CDC; Laurie D Elam-Evans, CDC

Revisiting Response Error and the Medicaid Undercount in the Current Population Survey(View Presentation) — James Noon, US Census Bureau; Maria Perez-Patron

Can Real World Data (Including Electronic Health Records) Replace Probability Surveys for Estimating Health Conditions at the State Level?(View Presentation) — David Marker, Retired

Utilizing Additional Data Sources to Improve National Representative Estimates on Patient Care in Hospitals(View Presentation) — Geoffrey Jackson, National Center for Health Statistics

Model-Based Estimation Methods for Survey Data — Contributed Papers
Chair(s): Samson Adeshiyan, U.S. Energy Information Administration

A Novel Application of Small Area Estimation Methodology to Study Child-Trafficked Population in Sierra Leone(View Presentation) — Hui Yi, University of Georgia; David Okech, University of Georgia; Gauri Datta, University of Georgia; Pedro Goulart; Jiacheng Li, Wells Fargo; Anna Cody, University of Georgia; Jody Clay-Warner, University of Georgia
Keywords: Small Area Estimation (SAE), Empirical Best Linear Unbiased Prediction (EBLUP) Method, Hierarchical Bayes Model, Household Survey, Child Trafficking, Sierra Leone

A regularized hidden Markov model for analyzing multiple tobacco product use in the US(View Presentation) — Xinyu Yan; Ji-Hyun Lee, University of Florida; XiangYang Lou
Keywords: Hidden Markov model, tobacco control, regularization

Assessing small area estimates using artificial populations via the Approximate Bayesian Bootstrap(View Presentation) — Jerzy Wieczorek, Colby College; Kelly McConville, Harvard University; Grayson White; Tracey Frescino, US Forest Service
Keywords: k nearest neighbors, forest inventory data, model evaluation, design-based simulation

Bayesian Resolution of Discrepant Self-Reported Network Ties — Dongah Kim; Krista Gile, University of Massachusetts Amherst; Maryclare Griffin James Kitts, University of Massachusetts, Amherst; David Nolin, University of Massachusetts, Amherst
Keywords: Bayesian modeling, network analysis, measurement error, multiplex networks

Estimation of job vacancies in small population domains using web-scraped data(View Presentation) — Ieva Burakauskaite, Vilnius University; Andrius Čiginas, Vilnius University
Keywords: small area estimation, non-probability sample, big data, data integration

Recent Bayesian and Non-Bayesian Evaluating Methods for Analyzing Sparse Classifications(View Presentation) — Yves Thibaudeau, U.S. Census Bureau
Keywords: Regularization, Bayesian Regularization, Orthogonal Subspaces, Log-Linear Models

Test for Detecting Sampling Bias in a Semi-parametric Model(View Presentation) — Zixiang Xu; Jiayang Sun, George Mason University; Mary Meyer, Colorado State University; Michael Woodroofe, Univ of Michigan
Keywords: selection-bias, semi-parametric, empirical process, spline, shape constraint

Innovative Statistical and Machine Learning Methods for Survey Data — Invited Paper Session
Organizer(s): Samiran Sinha, Texas A&M University
Chair(s): Malay Ghosh, University of Florida

Revisiting Neural Networks for large scale survey data analysis(View Presentation) — Tapabrata Maiti, Michigan State University

A Tree-based Dual-frame Estimation Approach for Combining Probability and Non-probability Samples(View Presentation) — Chien-Min Huang, NORC at the University of Chicago

Improve Survey Inference Using Bayesian Machine Learning(View Presentation) — Qixuan Chen, Columbia University

Bayesian Neural Network in the Partial Linear Regression with an Application to the NHANES Survey Data(View Presentation) — Samiran Sinha, Texas A&M University

Novel Imputation and Raking Methods in Sample Surveys — Contributed Papers
Chair(s): Katherine McLaughlin, Oregon State University

A New Joint Confidence Region for A Ranking(View Presentation) — Tommy Wright, US Census Bureau
Keywords: Joint Confidence Region, Overall Ranking, Tightness, Uncertainty

Domain-Specific Raking, With Application to Mortality Modeling(View Presentation) — Ariane Ducellier, University of Washington; Alexander Hsu, University of Washington; Aleksandr Aravkin, University of Washington; Peng Zheng, University of Washington - Institute for Health Metrics and Evaluation
Keywords: Raking, Optimization, Uncertainty, Mortality modeling

Improving population estimates through raking by joint distribution of multiple grouping variables(View Presentation) — Yilan Huang, Department of Biostatistics, Fielding School of Public Health, UCLA; Honghu Liu
Keywords: Survey Design, Sampling Weight, Raking, Post-stratification, Joint Distribution

Using State-Level Medicaid Claims Data to Enhance Survey Estimation(View Presentation) — Nicholas Davis, NORC at The University of Chicago; Michael Trierweiler, NORC at the University of Chicago; Holly Cast, NORC at The University of Chicago
Keywords: Medicare, Medicaid, Claims, Survey, Estimation, Imputation

MULTIPLE IMPUTATION FOR RANK-ORDERED SURVEY ITEMS(View Presentation) — Robert Petrin, Ipsos Public Affairs; Brittany Alexander; Sadie Larsen, National Center for PTSD, Medical College of Wisconsin; Jessica Hamblen, National Center for PTSD, Geisel School of Medicine at Dartmouth
Keywords: Imputation, Ranked Survey Items, PTSD, Health Statistics, Survey Data, Nonlinear Models

Multiple Imputation of Hierarchical Nonlinear Time Series Data: an Application to School Enrollment(View Presentation) — Daphne Liu, University of Washington; Adrian Raftery, University of Washington
Keywords: Multiple imputation, Missing data, Hierarchical time series data, Bayesian hierarchical model, School enrollment rates

Survey Data Collection, Estimation, and Disclosure Limitation Methods — Contributed Papers
Chair(s): David Kinyon, Energy Information Administration

Effective data visualizations for large meta-analyses: Evidence from a randomized survey experiment(View Presentation) — David Khella; Kaitlyn Fitzgerald, Azusa Pacific University; Avery Charles, Azusa Pacific University
Keywords: meta-analysis, data visualization, survey experiment, statistical cognition

Design-Based Methods for State-Level Survey Estimation under Three-year Data Pooling(View Presentation) — William Waldron, National Center for Health Statistics
Keywords: Survey Methods, Cross-sectional data, Design effect, Variance Estimation, Taylor-Series, Sample Design, Clustering, Intraclass Correlation, Small Area Estimation, Design-based methods, Markov Chain methods

Unbiased Survey Estimation with Population Auxiliary Variables(View Presentation) — Robyn Ferg, Westat; Johann Gagnon-Bartsch; Jaylin Lowe, University of Michigan; James Green, Westat
Keywords: model-assisted inference, survey estimation, auxiliary data, finite population inference, machine learning, regression

How to Collect Data on Agricultural Nutrient Management Practices: Survey Results from Iowa(View Presentation) — Zhengyuan Zhu, Iowa State University; Kunal Das, Iowa State Univ; Rob Davis, Iowa State University; Matthew Helmers, Iowa State University; Ben Gleason, Iowa Nutrient Research and Education Council; Isenhert Thomas, Iowa State University
Keywords: Nutrient reduction plans, Balanced sampling, Local Pivotal Method, Stratified sampling

Primary Sampling Unit Stratification for the Current Population Survey 2020 Sample Redesign(View Presentation) — Brian Shaffer; Yarissa Gonzalez, US Census Bureau; Timothy Trudell
Keywords: Sample Design, Stratification

Evaluating the Disclosure Risk and Analytic Utility of Synthetic Data in a Municipal Health Survey(View Presentation) — Stephen Immerwahr, NYC Department of Health and Mental Hygiene; Wen Qin Deng, NYC Department of Health and Mental Hygiene; Jingchen Hu, Vassar College; Tashema Bholanath, NYC Department of Health and Mental Hygiene; Fangtao He, NYC Department of Health and Mental Hygiene; Nneka Lundy De La Cruz, NYC Department of Health and Mental Hygiene
Keywords: Health Surveys, Data Privacy Risk, Synthetic Data, Survey Research Methods, Government Statistics

Small Area Modeling for Differentially Private Counts(View Presentation) — Kyle Irimata
Keywords: Small Area Estimation, Differential Privacy, Generalized variance function

Using Advanced AI Methods to Improve Statistical Estimation and Official Statistics at the Internal Revenue Service — Topic Contributed Paper Session
Organizer(s): Barry Johnson, Statistics of Income, Internal Revenue Service
Chair(s): Barry Johnson, Statistics of Income, Internal Revenue Service
Discussant(s): Jae-Kwang Kim, Iowa State University(View Presentation)

A semi-supervised approach to anomaly detection for tax compliance(View Presentation) — Evan Schulz

Estimating Undetected Income: A Descriptive Analysis and an Application of Empirical Bayes Methods(View Presentation) — Jonathan Hennessy; Patrick Vossler, Stanford University; John Guyton, Internal Revenue Service; Andrew Johns, IRS; Valentina Kachanovskaya, IRS; Dan Rosenbaum, IRS; Jacob Goldin; Daniel Ho, Stanford Law School

Improving Operational Agility at the IRS by Leveraging Intermediate Examination Results in a Sequential Decision-Making ML Pipeline(View Presentation) — Brandon Anderson

Integrating Probability and Nonprobability Data to Estimate Tax Compliance: Selection Bias and Measurement Error (View Presentation) — Ishani Roy, Internal Revenue Service; Brett Collins, IRS; Sean Roh, IRS; Alex Turk, IRS