Measures of Departure from Probability Sampling
—
Roderick Joseph Little, University of Michigan
Keywords: Probability Sampling; Selection Bias; Pattern-Mixture Models
Bayesian Analysis of Tests with Unknown Specificity and Sensitivity
—
Andrew Gelman, Columbia University; Bob Carpenter, Flatiron Institute
Keywords: Bayesian inference; multilevel regression and poststratification (MRP); specificity; sensitivity; coronavirus
A Design for Hospital-Based Coronavirus Tracking
—
Yajuan Si, University of Michigan; Len Covello, Community Hospital; Andrew Gelman, Columbia University; Siquan Wang, Columbia University
Keywords: Coronavirus; selection bias; multilevel regression and poststratification
Using the Facebook Platform to Create a Global Symptom and Pandemic Effects Monitoring Resource
—
Frauke Kreuter, University of Maryland and University of Mannheim; Neta Barkay, Facebook Research; Curtiss Cobb, Facebook Research; Roee Eilat, Facebook Research; Tal Galili, Facebook Research; Daniel Haimovich, Facebook Research; Sarah LaRocca, Facebook Research; Katherine Morris, Facebook Research; Tal Sarig, Facebook Research
Keywords: covid-19; survey methods; health; survey weighting; survey sampling; surveys
An International Man of Action
—
Paul Biemer, RTI International
Keywords:
Lars Lyberg in the Context of the Evolution of Survey Methodology
—
Bob Groves, Georgetown University
Keywords:
Lars, the Journal of Official Statistics, and Statistics Sweden
—
Ingegerd Jansson, Statistics Sweden
Keywords:
Lars Lyberg: Mentor and Editor
—
Edith de Leeuw, Utrecht University
Keywords:
A Fond Memory of Lars
—
Phillip Kott, RTI International
Keywords:
Lars Lyberg: A Leader in Survey Research
—
Dan Kaspryzyk, NORC
Keywords:
Lars Lyberg's Contributions
—
Frauke Kreuter, University of Maryland and University of Mannheim
Keywords:
Bayesian Dasymetric Modeling
—
Matthew Simpson, SAS Institute; Scott Holan, University of Missouri; Christopher Wikle, University of Missouri
Keywords: Bayesian statistics; Survey statistics; Dasymetric modeling; Downscaling
Hierarchical Bayesian Mixed Effect Models for Spatially Correlated Areal Count-Valued Data When Covariates Are Measured with Error
—
Saikat Nandy, University of Missouri; Scott Holan, University of Missouri
Keywords: American Community Survey; Bayesian hierarchical model; Generalized Transformation Model; Markov chain Monte Carlo; Measurement error; non-Gaussian data
An Accurate Corset Methodology for Efficient Reduction of Spatial Data
—
Ranadeep Daw, University of Missouri; Christopher Wikle, University of Missouri
Keywords: Coreset; Accurate Coreset; Predictive Process; Gaussian Process; Data Reduction; Spatial Thinning
A Spatial Change of Support Model for Differentially Private Measurements, with Application to Estimation of Counts of Persons in AIAN Areas by Detailed Race Groups
—
Ryan Janicki, US Census Bureau; Andrew Raim, U. S. Census Bureau; Kyle Irimata, U.S. Census Bureau; James A Livsey, U. S. Census Bureau; Scott Holan, University of Missouri
Keywords:
A Bayesian Functional Data Model for Surveys Collected Under Informative Sampling with Application to Mortality Estimation Using NHANES
—
Paul Parker, University of Missouri; Scott Holan, University of Missouri
Keywords: Functional principal components; Horseshoe prior; National Health and Nutri- tion Examination Survey (NHANES); Po ´lya-Gamma; Pseudo-likelihood
Inspecting Regression Trees of Subjective Burden Perception with Objective Burden Measures in a Household Survey
(View Presentation)
—
Daniel Yang, Bureau of Labor Statistics
Keywords: Respondent burden; Survey design; Nonparametric; Regression Tree model
A Unified Bayesian Framework for Statistical Matching
—
Gauri Kamat, Brown University; Roee Gutman, Brown University
Keywords: matching; likelihood; Bayesian; checkerboard Metropolis; record linkage ; missing data
Administrative Record Use in the 2020 Census for Modeling and Processing
—
Andrew D. Keller, US Census Bureau
Keywords: administrative data
Relative Standard Error: A Misleading Indicator for Cell Suppression and Publishing Guidelines for Estimates of Proportions
—
Sadeq R Chowdhury, Agency for Healthcare Research and Quality; David K Kashihara, Agency for Healthcare Research and Quality
Keywords: RSE; Cell suppression; Publishing guidelines; MEPS
Nonparametric Model-Assisted Estimation from Quantitative Randomized Response Models
—
Sayed Mostafa, North Carolina A&T State University
Keywords: randomized response model; complex surveys; model-assisted estimation; local linear regression; Horvitz-Thompson estimation
Bayesian Inference for Nonprobability Samples
—
Lyric Liu, Worcester Polytechnic Institute ; Nandram Balgobin, Worcester Polytechnic Insttute
Keywords: Non-probability Samples; Regression ; Bayesian Statistics ; Data Integration ; Grid Method
A Model-Assisted Approach for Distinguishing Two Nonresponses in Achievement Test or Survey Data
—
Yu-Wei Chang, National Chengchi University
Keywords: Item Response Theory tree model; non-response; Laplace-approximated maximum likelihood estimation
Web Survey Response Times: What to Do and What Not to Do
—
Ioannis Andreadis, Aristotle University of Thessaloniki
Keywords: web surveys; response times; data quality
Machine Learning-Based Data Integration Procedures for Handling Nonprobability Sample
—
James Cutler, University of Oklahoma Health Sciences Center; Sixia Chen, University of Oklahoma Health Sciences Center; Chao Xu, University of Oklahoma Health Sciences Center
Keywords: machine learning; mass imputation; survey methodology; non-probability sample
Assessing the Quality of Blended Statistics in a Federal Health Survey
(View Presentation)
—
Liz Kantor, NORC at the University of Chicago; Ann Bisognano, NORC at the University of Chicago; Michael Trierweiler, NORC at the University of Chicago
Keywords: administrative data; survey data; blended statistics; data quality; record linkage; item nonresponse
How Can We Use Mixture, Multi-Process, and Other Multi-Dimensional Item Response Theory Models to Account for Midpoint and Extreme Response Style Use in Personality Assessment?
—
Michael Lucci, University of Pittsburgh at Greensburg
Keywords: Personality Subscales/Assessment; Midpoint/Extreme Response Styles; Multi-process (IRT tree) Model; Mixture Item Response Model; Multi-dimensional Item Response model; K-means clustering
Improving the Representativeness of Tribal BRFSS Through Data Integration
—
Sixia Chen, University of Oklahoma Health Sciences Center; Janis Campbell, University of Oklahoma Health Science Center; Erin Spain, Southern Tribal Health Board; Alexandra Milligan, University of Oklahoma Health Science Center; Cuyler Snider, Southern Tribal Health Board
Keywords: Calibration; Data integration; Mass imputation; Non-probability samples
A Comparison of Indirect Questioning Techniques for Binary Responses
—
Jessica K. Fox, LSU Health Sciences Center School of Public Health Biostatistics Program; Evrim Oral, LSU Health Sciences Center School of Public Health Biostatistics Program
Keywords: Randomized response techniques; Non-randomized response techniques; Social desirability bias; Monte-Carlo simulation; Intimate partner violence
A Public Health Application of a Randomized Response Model Based on Applied Neuroscience
—
Augustus Jayaraj, Cornell University; Sarjinder Singh, Texas A&M University -Kingsville; Oluseun Odumade, Deloitte & Touche
Keywords: neuroscience; randomized response; survey sampling; sensitive data
An Approach to Estimate the Reidentification Risk in Longitudinal Survey Microdata
—
Jianzhu Li, Westat; Lin Li, Westat; Tom Krenzke, Westat; Wan-Ying Chang, National Science Foundation
Keywords: log-linear models; disclosure risk; risk assessment
A Model-Assisted Approach for Finding Coding Errors in Manual Coding of Open-Ended Questions
—
Matthias Schonlau, University of Waterloo; Zhoushanyue He, University of Waterloo
Keywords: machine learning ; statistical learning; intercoder disagreement; coding error
Boat-Based Fishing Surveys in the US Pacific Island Territories: Survey Design, Catch and Effort Estimation, and Sensitivity Analyses
(View Presentation)
—
Hongguang Ma, NOAA Pacific Islands Fisheries Science Center; Toby Matthews, University of Hawaii ; Felipe Carvalho, NOAA Pacific Islands Fisheries Science Center; Marc Nadon, NOAA Pacific Islands Fisheries Science Center / University of Hawaii
Keywords: Boat-based fishing survey; Catch estimate; Fishing effort estimate; Survey design; Expansion algorithm; Sensitivity analysis
Overview of Administrative Records Modeling in the 2020 Census
—
Mary Mulry, U.S. Census Bureau; Tom Mule, U.S. Census Bureau; Andrew D. Keller, US Census Bureau; Scott Konicki, U.S. Census Bureau
Keywords: 2020 U.S. Census; Nonresponse Followup
OptimalFisheryDesign: An R Package for Fishery Sampling Designs
—
Zhaoce Liu, Mathematica Inc.; Lynne Stokes, Southern Methodist University
Keywords: Non-probability sample; Data Integration; Recreational Fishing; Optimal Survey Design
Measurement and Selection Errors in Recall and Diary Responses
—
Gradon Nicholls, Bank of Canada; Heng Chen, Bank of Canada
Keywords: total survey error; non-response; measurement error
Another Invisible Threat: Unequal Privacy Cost of Tracing COVID-19
—
Claire McKay Bowen, Urban Institute; Joshua Snoke, RAND Corp.
Keywords: data privacy; COVID-19; structural racism
Contextual Integrity as a Framework for Assessing Privacy Violations from Digital Trace Data
—
Jessica Vitak, University of Maryland
Keywords: privacy; contextual integrity; privacy paradox; trace data; factorial vignettes
Learning from the People: Responsibly Encouraging Adoption of Contact Tracing Apps
—
Elissa M. Redmiles, Max Planck Institute for Software Systems
Keywords:
Data Collection Using Smartphones: Concerns About Data Sharing and Framing Effects
—
Florian Keusch, University of Mannheim
Keywords: smartphones; sensors; log files; privacy
Inference from Blended Probability and Nonprobability Samples
—
Marcus Berzofsky, Research Triangle Institute
Keywords: Blended data; Nonprobability Samples; Variance Estimation
The Role of Statistics in Open Science and Publicly Accessible Research Data
—
Sarah Nusser, Iowa State University
Keywords: FAIR principles; data reuse; data sharing; privacy; restricted data; open science
Combining Data from Probability and Non-Probability Samples Using Penalized Spline of Propensity Prediction
(View Presentation)
—
Michael R. Elliott, University of Michigan; Ali Rafei, University of Michigan; Carol Flannagan, University of Michigan
Keywords: Double robustness; Non-parametric Bayesian modeling; Augmented inverse propensity weighting
Estimators with Combined Probability and Nonprobability Samples Using Small Area Models
—
Michael Yang, NORC at the University of Chicago; Nada Ganesh, NORC at the University of Chicago; Evan Herring-Nathan, NORC at the University of Chicago; Vicki Pineau, NORC at the University of Chicago
Keywords: nonprobability sample estimation; statistical matching; propensity weighting; small area estimation; doubly robust
Creating Statistically-Defensible Calibrated Weights for a Blended Sample and Measuring the Accuracy of the Resulting Estimates
(View Presentation)
—
Phillip Kott, RTI International; Jamie Ridenhour, RTI International
Keywords: Selection model; Output model; calibration equation; double protection; logit function; WTADJX
Exploring Nonprobability Methods with Simulations from a Common Data Source: Culture and Community in a Time of Crisis
—
Jennifer Benoit-Bryan, Slover Linett Audience Research
Keywords: nonprobability sample; Monte Carlo simulation; Arts and culture
BLS Business Surveys in the Wake of COVID-19: Changes to Data Collection, Imputation, and Estimation
—
Kenneth W Robertson, U.S. Bureau of Labor Statistics
Keywords: COVID-19; BLS; estimation; imputation; data collection; business
Methodological Innovations Behind the COVID-19 Household Pulse Survey
—
Jason Fields, U.S. Census Bureau; Jennifer Hunter Childs, US Census Bureau; Anthony Tersine, U.S. Census Bureau
Keywords: Household Pulse Survey; COVID; Census; Rapid response; Online; Self response
Producing National Health Estimates During a Pandemic
—
Morgan Earp, National Center for Health Statistics; Stephen Blumberg, National Center for Health Statistics; Carol DeFrances, National Center for Health Statistics; Ryne Paulose, National Center for Health Statistics
Keywords: COVID-19; NCHS; NHIS; NHANES; RANDS
General-Purpose Multiply Robust Estimation Procedure for Handling Missing Data
—
Sixia Chen, University of Oklahoma Health Sciences Center; David Haziza, University of Ottawa
Keywords: Estimating equation; Imputation; Nonresponse adjustment; Variance estimation
Random Forests Imputation in Surveys
—
Mehdi Dagdoug, Université de Bourgogne Franche Comté; Camelia Goga, Université de Bourgogne Franche Comté; David Haziza, University of Ottawa
Keywords: survey; missing data; random forest; imputation
Multiply robust bootstrap procedure in the presence of imputed survey data
—
Zeinab Mashreghi, University of Winnipeg; Sixia Chen, University of Oklahoma Health Sciences Center; David Haziza, University of Ottawa
Keywords: Item Nonresponse; Multiply Robustness Property; Variance Estimation ; Complex Survey
Use of Administrative Data for Surveys
—
Michael D Larsen, St. Michael's College
Keywords: Administrative Records; Survey Methodology; Sample Design; Record Linkage
The Conditions of America’s Learners During COVID-19: What Do Federal Statistics Tell Us?
—
Marilyn Seastrom, National Center for Education Statistics
Keywords: COVID-19; Household Pulse Survey; Education Statistics
Using Machine Learning and Statistical Models to Predict Survey Costs
—
James Wagner, University of Michigan; Brady T. West, University of Michigan; Michael R. Elliott, University of Michigan; Stephanie Coffey, U.S. Census Bureau
Keywords: Responsive survey design; Nonresponse error; Survey costs
Is the Price Right? Predicting Future Survey Costs Using Previous Cycles
(View Presentation)
—
Kayla Marie Varela, US Census Bureau
Keywords: Cost Modeling; Survey Costs
Examining Variation in Survey Costs Across Surveys
—
Kristen Olson, University of Nebraska-Lincoln; Jennifer Dykema, University of Wisconsin Survey Center; John Stevenson, University of Wisconsin Survey Center; Lindsey Witt-Swanson, Bureau of Sociological Research
Keywords: survey costs; mixed mode surveys; self-administered surveys
Biased Data, Biased Models? Bridging Advances in Survey Research and Computer Science for Improving Fairness in Algorithmic Decision-Making
—
Keywords: Algorithmic decision-making; Fair ML; Data Bias
Visual and Interactive Tools for Assessing Data Quality
(View Presentation)
—
Zachary H. Seeskin, NORC at the University of Chicago; Kiegan Rice, NORC at the University of Chicago
Keywords: Exploratory data analysis; Integrating data from multiple sources; Administrative data; Interactive data visualization; R
Transportation Data for Decision-Making During the COVID-19 Pandemic
(View Presentation)
—
Rolf Schmitt, Bureau of Transportation Statistics
Keywords: trasnportation; covid
Expanding Data Linkage Opportunities and Measuring Quality
—
Lisa B. Mirel, CDC/NCHS/DAE; Dean Resnick, NORC; Christine Cox, NORC; Jonathan Aram, CDC/NCHS/DAE
Keywords: National Center for Health Statstics; Data Linkage Program; privacy preserving record linkage
'Ask US' About Coverage Error: An Investigation from a New Inter-Agency Online Panel
—
Emilia Peytcheva, RTI International; Jennifer Hunter Childs, US Census Bureau
Keywords: online panel; pilot; coverage error
Evaluation and Performance of Privacy Preserving Record Linkage (PPRL)
—
Dean Resnick, NORC
Keywords: Administrative Records; Record Linkage; Privacy-Preserving Record Linkage
Machine Learning with Complex Survey Data
—
Jerzy Wieczorek, Colby College
Keywords: Complex Sample Design; Machine Learning; Survey Methodology; Cross-Validation
Inference from Non-Random Samples Using Bayesian Machine Learning
—
Yutao Liu, Vertex Pharmaceuticals, Inc.; Andrew Gelman, Columbia University; Qixuan Chen, Columbia University
Keywords: Bayesian machine learning; High-dimensional auxiliary variables; Non-random samples; Probability and non-probability surveys; Propensity score; Soft Bayesian additive regression trees
Individual and Community-Level Risks for COVID-19 Mortality in the US
—
Neha Agarwala, University of Maryland, Baltimore County; Prosenjit Kundu, Bloomberg School of Public Health, Johns Hopkins University; Jin Jin, Bloomberg School of Public Health, Johns Hopkins University; Nilanjan Chatterjee, Johns Hopkins University; Yuqi Zhang, Johns Hopkins University; Benjamin Harvey, Bloomberg School of Public Health, Johns Hopkins University; Eliza Wallace, 4PolicyMap, Inc.
Keywords:
A Two-Stage Cox Process Model with Spatial and Nonspatial Covariates
—
Claire Kelling, Penn State; Murali Haran, Pennsylvania State University
Keywords: spatial statistics; point process; policing; crime
Varying Impacts of Letters of Recommendation on College Admissions: Approximate Balancing Weights for Subgroup Effects in Observational Studies
—
Eli Ben-Michael, Harvard University; Avi Feller, UC Berkeley; Jesse Rothstein, UC Berkeley
Keywords: Causal Inference; Observational Studies
Refinement: Measuring Informativeness of Ratings in the Absence of a Gold Standard
—
Sheridan Lloyd Grant, University of Washington; Marina Meila, University of Washington; Elena Erosheva, University of Washington; Carole Lee, University of Washington
Keywords: Ratings; Entropy; Peer Review; Decision-Making
Applying Cluster Analysis to Improve the American Housing Survey Hot Deck
—
Brian Shaffer, U.S. Census Bureau; Stephen Ash, Bureau of Labor Statistics; Kathy Zha, U.S. Census Bureau
Keywords: imputation; cluster analysis
Coverage and Nonresponse Bias in the 2016 American Community Survey Content Follow-Up Reinterview
(View Presentation)
—
Samantha Spiers, U.S. Census Bureau
Keywords: Nonresponse Bias; Coverage Error; Reinterview
Variational Bayesian Multiple Imputation for Clustered High-Dimensional Data
—
Qiushuang Li, University at Albany SUNY; Recai Yucel, Temple University
Keywords: Clustered data; variational inference; multiple imputation; sequential hierarchical regression imputation; calibration-based imputation; spike-and-slab variable selection
A Comparison of Two CHAID Packages for Modeling Survey Nonresponse
—
Tien-Huan Lin, Westat; Carlos Arieira, Westat; Ismael Flores Cervantes, Westat; Mike Kwanisai, Westat; Jennifer Kali, Westat
Keywords: Survey weighting adjustments; Weighting class; Nonresponse; CHAID; SAS; HPSPLIT
Imputation of Missing Values by Low Rank Matrix Approximation
—
MoonJung Cho, U.S. Bureau of Labor Statistics
Keywords: Auxiliary variables; Correlation; Rank estimation; Singular value decomposition
Modeling Survey Nonresponse Under a Cluster Sample Design: Classification and Regression Tree Methodologies Compared
—
Michael Jones, Westat; William Everett Cecere, Westat; Tien-Huan Lin, Westat; Jennifer Kali, Westat; Ismael Flores Cervantes, Westat
Keywords: classification trees; clustering; nonresponse bias; response propensities; survey weights; weighting class adjustments
The spsurvey R Package: New Options for Selecting and Analyzing Spatially Balanced Probability Samples
—
Michael Dumelle, Environmental Protection Agency ; Anthony R. Olsen, Environmental Protection Agency; Tom Kincaid, Environmental Protection Agency; Marc Weber, Environmental Protection Agency
Keywords: Design-Based; Estimation; GRTS; Geometry; Selection Probability; Stratification
Multi-Objective Sample Allocation in a US Veteran Health Survey
—
Michael D Larsen, St. Michael's College
Keywords: Oversampling; Poststratification; Response rates; Sample Design; Stratification; Two-phase sampling
Creating Base Weights and Replicate Weights for a PPS Sample with a Supplemental Sample When the Eligibility Frame Information Is Available After Sampling
—
Ismael Flores Cervantes, Westat; Mike Kwanisai, Westat; Jianru Angela Chen, WESTAT
Keywords: systematic PPS sample design; eligibility; weighting; supplemental sample for PPS designs; simulation
Sample Design with Operational Constraints for Zero-Inflated Response Data
—
Chauncey M Dayton, BDS Data Analytics, LLC; Mary Batcher, BDS Data Analytics, LLC; NJ Scheers, BDS Data Analytics, LLC
Keywords: Sample design; Zero-inflated data
Association of National Health and Nutrition Examination Survey (NHANES) Screener Health Questions Responses to Survey Participation Propensity
—
Te-Ching Chen, CDC/NCHS; Jay Clark, Westat; Chia-Yih Wang, CDC/NCHS; Steven Fink, CDC\NCHS; Allan Uribe, CDC/NCHS; Orlando Davy, CDC/NCHS
Keywords: NHANES; response propensity; survey screening
WITHDRANW: Rejective Sampling, Rerandomization, and Regression Adjustment in Survey Experiments
—
Zihao Yang, University of Illinois at Urbana-Champaign; Tianyi Qu, University of Illinois at Urbana-Champaign; Xinran Li, University of Illinois
Keywords: causal inference; potential outcome; randomization-based inference; covariate balance; Mahalanobis distance; randomization test
Designing and Implementing Multi-Wave Sampling Surveys in R
(View Presentation)
—
Jasper Yang, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania; Bryan Shepherd, Department of Biostatistics, Vanderbilt University School of Medicine; Thomas Lumley, University of Auckland; Pamela Shaw, Kaiser Permanente Washington Health Research Institute
Keywords: multi-wave sampling; Neyman allocation; optimal design; R
Evaluation of Methods to Form Segments from Census Blocks in Area Sample Designs
(View Presentation)
—
Jennifer Kali, Westat; Tom Krenzke, Westat; Ying Chen, Westat; Jianru Angela Chen, WESTAT; Jim Green, Westat
Keywords: area sampling; multi-stage sampling; segments; in-person surveys
Basic Imputing Design Explored Now (BIDEN) in 2020 with TRUMP Cuts
—
Sarjinder Singh, Texas A&M University -Kingsville; Stephen A. Sedory, Texas A&M University-Kingsville
Keywords: Imputation; Jackknifing; Auxiliary information; Ratio type estimators; TRUMP Cuts; Non response
Methodological Development of Person-Day Level Nonresponse Adjusted Weights for the FoodAPS-1
—
Jeffrey Gonzalez, Economic Research Service
Keywords:
Evaluation and Utility of Address-Level Predictive Models for Address-Based Sampling (ABS) Sample Designs
—
Joseph McMichael, RTI International; Stephanie Zimmer, RTI International
Keywords: ABS; stratification; sample design; auxiliary data
Part 2: Construction of Strata Boundaries in Tax Auditing
—
Zac Rhyne, Ryan, LLC.
Keywords: Sampling; Audit; Estimation; Tax Sampling
A Bayesian Approach to Differential Recruitment with Respondent-Driven Sampling Data
—
Isabelle Beaudry, Pontificia Universidad Católica de Chile
Keywords: Respondent-Driven sampling; Survey Sampling methodology; Hard-to-reach populations; Model-Based inference; Non-response bias; Network inference
Imputation Vis-a-Vis Research Standards in Clearinghouses
—
Keywords: WWC; Randomized trials; Evaluation research; Sensitivity analysis
Extending the Mann-Whitney-Wilcoxon Rank Sum Test to Survey Data for Comparing Mean Ranks
—
Tuo Lin, University of California, San Diego
Keywords: inverse probability weighting; median; NHANES; rank; sampling weights; U-statistics
Utilizing Stability Criteria in Choosing Feature Selection Methods Yields Reproducible Results in Microbiome Data
—
Lingjing Jiang, Johnson & Johnson
Keywords: classification; prediction; reproducible; feature selection; microbiome; stability
Short-Term Transition Impact on Change Point Detection and Monitoring Existing State
—
Turkan Kumbaraci Gardenier, Teka Trends, Inc.
Keywords: Moniotoring; Change point identificationnt; Cumulative Transitional State Score (CTSS)TSS; Trnasitional Score (TS) ; Cumulative Sums (CUSUM); Control charts
A Sparse Negative Binomial Mixture Model for Clustering RNA-Seq Count Data
—
YUJIA LI, Department of Biostatistics, University of Pittsburgh; Tanbin Rahman, Department of Biostatistics, MD Anderson Cancer Center; Tianzhou Ma, Department of Epidemiology and Biostatistics, University of Maryland; Lu Tang, University of Pittsburgh; George Tseng, Department of Biostatistics, University of Pittsburgh
Keywords: cluster analysis; Gaussian mixture model; sparse K-means; feature selection
Prediction of Dispositional Dialectical Thinking from Resting-State Electroencephalography
—
Kun Huang, Tsinghua University; Dian Chen, Tsinghua University; Fei Wang, Tsinghua University; Lijian Yang, Tsinghua University
Keywords: FDA; resting-EEG; machine learning; alpha wave; dialectical thinking
Evaluation of the Impact of Correlation Between Competing/Semi-Competing Event and Event of Interest in Trials with Heterogeneity Population
—
Ran Liao, Eli Lilly; Junjing Lin, Takeda Pharmaceuticals; Margaret Gamalo-Siebers, Pfizer Inc.
Keywords: competing risk; semi-competing risk; correlation ; heterogeneity population ; Subgroup
Data Analysis for International Investments into Territories: Assessment of International Entrepreneurs' Decision-Making as the Generalized Erlang(N)Risk Stand as a Brownian Particle
—
VALERY MFONDOUM, Toulouse 1 Capitol University / CNAM of Pays de la Loire
Keywords: Decision-making; International entrepreneurs; Territories; Brownian particle; information’s atom; Erlang(n) risk
A General Adaptive Framework for Testing a Multivariate Point Null Hypothesis
—
Adam Elder, University of Washington; Alex Luedtke, University of Washington; Marco Carone, University of Washington
Keywords: Hypothesis testing; nonparametric; multivariate point null; asymptotic linearity
Nonconstant Error Variance in Generalized Propensity Score Model
—
Doyoung Kim, Sungkyunkwan University; Chanmin Kim, Sungkyunkwan University
Keywords: Causal Inference; Continuous Treatment; Covariate Balance; Heteroskedasticity; Observational Study
Testing Microbiome Association Using Integrated Quantile Regression Models
—
Tianying Wang, Center for Statistical Science, Tsinghua University; Wodan Ling, Fred Hutchinson Cancer Research Center; Michael C Wu, Fred Hutchinson Cancer Research Center; Anna Plantinga, Williams College; Xiang Zhan , Penn State University
Keywords: quantile regression; microbiome association; kernel machine; integrated quantile
Pathfinder: A parallel quasi-Newton algorithm for reaching regions of high probability mass
—
Lu Zhang, Columbia University; Bob Carpenter, Flatiron Institute; Aki Vehtari, Aalto University; Andrew Gelman, Columbia University
Keywords: Quasi-Newton Optimization; Laplace Approximation; Variational Inference; Markov chain Monte Carlo; Importance Resampling; Wasserstein Distance
ASCEND for Veteran Suicide Prevention: Evaluating Sample Design and Response Characteristics of a National Veteran Survey Surveillance System
—
Evan Herring-Nathan, NORC at the University of Chicago; Christopher Johnson, NORC; Claire Hoffmire, Department of Veterans Affairs; Lindsey Monteith, Department of Veterans Affairs; Alexandra Schneider, Department of Veterans Affairs
Keywords: Sample Design; Administrative Data; Complex Surveys; Stratified Sample
Network-Based Differential Co-Expression Analysis Using Breast Cancer Data Set
—
Yonghui Ni, University of Kansas-Medical Center,Department of Biostatistics & Data Science; Prabhakar Chalise, University of Kansas Medical Center; Jianghua He, University of Kansas Medical Center
Keywords: differential co-expression; differential network
A Reinforcement Learning Algorithm for Online Personalized Tutor Recommendation
—
Mohamad Kazem Shirani Faradonbeh, University of Georgia
Keywords: reinforcement learning; computerized education; intelligent tutoring; data-driven recommendation; statistical machine learning; decision-making algorithms
Data Fusion of Distance Sampling and Spatial Capture-Recapture Data
—
Narmadha Meenabhashini Mohankumar, Kansas State University; Trevor Hefley, Kansas State University, Department of Statistics; Katy Silber, Kansas State University; W. Alice Boyle, Kansas State University
Keywords: Data fusion; Integrated population models; Ecological statistics; Spatial statistics; Distance sampling; Spatial capture-recapture
Storytelling with Receiver Operating Characteristic (ROC) Curves for Environmental Remediation of Unexploded Ordnance (UXO)
—
Shelley Cazares, Institute for Defense Analyses; Jacob Bartel, Institute for Defense Analyses
Keywords: Environmental Remediation; Unexploded Ordnance (UXO); Receiver Operating Characteristic Curve (ROC Curve); Probability of Detection; Probability of False Alarm; False Alarm Rate
A Comparative Study of Time Series Forecasting Using Deep Learning Methods
—
Simachew Endale Ashebir, North Carolina A&T State University; Seong-Tae Kim, North Carolina A&T State University
Keywords: Time Series; Forecasting; ARIMA- GARCH; Deep learning ; Artificial neural network
Robust Online Linear Discriminant Analysis for Data with Outliers
—
Soshi Kawarai, Doshisha University; Ippei Takasawa, Doshisha University; Hiroshi Yadohisa, Doshisha University
Keywords: Online learning; Linear discriminant analysis; Averaged stochastic gradient descent; Median covariance matrix; Incremental learning
Gene Clustering Method for Multi-Omics Data: A Canonical Correlation and Weighted Correlation Network Analysis Approach
—
Ulrich Kemmo Tsafack, Medical College of Wisconsin; Kwang Woo Ahn, Medical College of Wisconsin; Chien-Wei Lin, Medical College of Wisconsin
Keywords: Canonical correlation; Gene Clustering; Multi-omics data; Multi-view clustering; Weighted correlation network analysis
Online Registration and Inference for Functional Data
—
Yoonji Kim, The Ohio State University; Oksana Chkrebtii, The Ohio State University; Sebastian Kurtek, The Ohio State University
Keywords: Bayesian Registration; Functional Data; Online Inference; Sequential Monte Carlo; Square-Root Velocity Function
Measures of Selection Bias in Regression Coefficients Estimated from Nonprobability Samples
—
Brady T. West, University of Michigan; Roderick Joseph Little, University of Michigan; Rebecca R Andridge, The Ohio State University College of Public Health; Philip Boonstra, University of Michigan; Erin Ware, Institute for Social Research, University of Michigan-Ann Arbor; Anita Pandit, University of Michigan; Fernanda Alvarado-Leiton, Institute for Social Research, University of Michigan-Ann Arbor
Keywords: Selection Bias; Non-Probability Samples; Linear Regression; Probit Regression; Non-Ignorable Selection; Polygenic Scores
Robust Bayesian Inference for Non-Probability Samples Using Gaussian Process of Propensity Prediction
—
Ali Rafei, University of Michigan; Michael R. Elliott, University of Michigan; Carol Flannagan, University of Michigan
Keywords: Non-probability sample; double robustness; Gaussian processes; prediction model; Bayesian inference
Efficient and Robust Propensity-Score-Based Methods for Finite Population Inference with Nonprobability Epidemiologic Cohorts
—
Lingxiao Wang, National Cancer Institute, DCEG, Biostatistics Branch; Yan Li, University of Maryland, College Park; Barry Graubard, National Cancer Institute, DCEG, Biostatistics Branch; Hormuzd Katki, National Cancer Institute
Keywords: Epidemiologic cohort; Non-Probability sample; Propensity score weighting; Survey Sampling; Taylor series linearization variance
Variable Selection Strategies for Effective Quota Sampling and Propensity Weighting: An Application to SARS-Cov-2 Infection Prevalence Estimation
—
Yan Li, University of Maryland, College Park; Barry Graubard, National Cancer Institute, DCEG, Biostatistics Branch; Michael Fay, Biostatistics Research Branch, Division of Clinical Research, NIAID; Sally Hunsberger, NIAID
Keywords: balanced distribution; propensity score; quota variable
High-Dimensional, Robust, Unsupervised Record Linkage
—
Ansu Chatterjee, University of Minnesota; Sabyasachi Bera, University of Minnesota
Keywords: Record Linkage; High Dimensional; Unsupervised; Robust
Multiple Systems Estimation for the Quantification of Modern Slavery: A Comparative Study Regarding the Accuracy of Current Methods
—
Olivier Binette, Duke University ; Rebecca C. Steorts, Duke University
Keywords: multiple systems estimation; modern slavery ; human trafficking
Bayesian one-inflated models for population size estimation
—
Tiziana Tuoto, Italian National Institute for Statistics; Davide Di Cecco, Sapienza University of Rome; Andrea Tancredi, Sapienza University of Rome
Keywords: record linkage; capture-recapture; zero-truncated one-inflated count data; population size estimation
A Theoretical Framework for Probabilistic Record Linkage in Multiple-Frame Surveys
—
Takumi Saegusa, University of Maryland, College Park; Partha Lahiri, University of Maryland, College Park
Keywords: domain membership; multiple-frame survey; record linkage
Constructing UpSet Plot for Survey Data with Weights Using SAS Survey Package and R UpSetR Package
—
Julia Soulakova, College of Medicine at Universtity of Central Florida; Camilo Gomez, Graduate Student, UCF; Alexander Goponenko, Graduate Student, UCF
Keywords: multi-stage sampling; exploratory analysis; data visualization; software development
Statistical Improvements in Weekly-Updated Cumulative Estimates of Flu Vaccination Coverage for Children in the United States
—
Kennon Copeland, NORC at the University of Chicago; Nada Ganesh, NORC at the University of Chicago; Lin Liu, NORC at the University of Chicago; Tammy Santibanez, CDC; James Singleton, CDC
Keywords: Cumulative vaccination coverage; Composite estimation; Variance reduction; Estimates revisions; Surveys and questionnaires
A General Stopping Rule for Survey Data Collection
—
Xinyu Zhang, University of Michigan - Ann Anbor; James Wagner, University of Michigan; Michael R. Elliott, University of Michigan; Brady T. West, University of Michigan; Stephanie Coffey, U.S. Census Bureau
Keywords: Stopping rule; Responsive survey design; Nonresponse bias
A Bayesian Model for Inference on Multiple Panel Public Opinion Surveys
—
Brittany Alexander, Texas A&M University
Keywords: political polling; bayesian modelling; hierarchical models; mrp; panel survey
Machine Learning Methods: A Case Study Using Online Web-Based Panel Surveys
(View Presentation)
—
Yulei He, US Centers for Disease Control and Prevention; Guangyu Zhang, CDC; Van Parsons, CDC
Keywords: probability panel; online survey; machine learning; prediction; performance; statistical estimates
Linear Quantile Mixed Models Under Informative Cluster Sampling
—
Sixia Chen, University of Oklahoma Health Sciences Center; Daniel E Zhao, OUHSC Hudson College of Public Health; Chao Xu, University of Oklahoma Health Sciences Center
Keywords: Cluster sample; Informative sampling; Mixed model; Quantile regression
Response Rates in the Consumer Assessment of Health Care Providers and Systems Outpatient and Ambulatory Surgery Survey (OAS-CAHPS) Mode Experiment
—
Kathryn Spagnola, RTI; Elizabeth Goldstein, CMS; Memuna Ifedirah, CMS; Christine Payne, CMS; Patrick Chen, RTI; Shampa Saha, RTI; Marjorie Hinsdale-Shouse, RTI
Keywords: Mode Experiment; Mode Effect; Response Rates; CAHPS; Mixed Mode
Assessing differences of mean estimates from longitudinal surveys, with an application to Research and Development Survey
—
Rong Wei, National Center for Health Statistics, CDC; Yulei He, US Centers for Disease Control and Prevention; Van Parsons, CDC; Paul Scanlon, National Center for Health Statistics, CDC
Keywords: US population health; RANDS; COVID-19 Pandemic; on-line surveys; longitudinal design; partial overlapping samples
An Overview of the 2019 Research and Development Survey (RANDS)
—
Li-Yen Rebecca Hu, NCHS; Yulei He, US Centers for Disease Control and Prevention; Paul Scanlon, National Center for Health Statistics, CDC; Kristen Miller, NCHS; Katherine Irimata, NCHS; Guangyu Zhang, CDC; Kristen Cibelli Hibben, NCHS
Keywords: RANDS; questionnaire design; web survey; probability panel
Calculating Confidence Limits and the Application of Reliability Standards for Proportions Estimated from Complex Survey Data
—
Crescent B Martin, National Center for Health Statistics; Margaret D Carroll, National Center for Health Statistics
Keywords: National Health and Nutrition Evaluation Survey; NHANES; National Health Interview Survey; NHIS; confidence intervals
Calibration of RANDS During COVID-19: A Web-Based Panel Survey
—
Katherine Irimata, NCHS; Yulei He, US Centers for Disease Control and Prevention; Van Parsons, CDC; Hee-Choon Shin, National Center for Health Statistics; Guangyu Zhang, CDC
Keywords: calibration; National Health Interview Survey; Research and Development Survey; web surveys
Unadjusted and Adjusted Receiver Operating Characteristic Curve and Precision Recall Curve Analyses for Complex Survey Data
—
Alok Kumar Dwivedi, Texas Tech University Health Sciences Center El Paso
Keywords: Receiver operating characteristic curve ; Precision-recall curve ; Survey data analysis; Adjusted area under the curve; Predictive study ; Diagnostic study
Joinpoint Regression Methods of Aggregated Outcomes for Complex Survey Data
—
Benmei Liu, National Cancer Institute; Hyune-Ju Kim, Syracuse University ; Eric J. Feuer, National Cancer Institute; Barry Graubard, National Cancer Institute, DCEG, Biostatistics Branch
Keywords: Joinpoint regression; variance-covariance matrix; complex survey data
Modeling and Computation of Multi-Step Batch Testing for Infectious Diseases
—
Haoran Jiang, Stony Brook University; Hongshik Ahn, Stony Brook University; Xiaolin Li, Stony Brook University
Keywords: optimal batch size sample pooling ; coronavirus ; specificity; sensitivity; hierarchical batch testing
A New Approach to Composite Estimation for Repeated Surveys with Rotating Panels
—
Takis Merkouris, Athens University of Economics and Business
Keywords: composite calibration; composite regression estimator; AK-composite estimator; MR-composite estimator; sample overlap
COVID-19 Vaccine Effects on Seroprevalence
—
Richard Lee Harding, ICF; Ronaldo Iachan, ICF; Adam Lee, ICF; Yangyang Deng, ICF; Tonja Kyle, ICF; Myrna Charles, CDC; Ryan Wiegand, CDC
Keywords: COVID-19; Seroprevalence Rate; Vaccination Data
WITHDRAWN Changes at the Margins: Evaluating MRP Performance for Data Sets with Clustered Sparsity
—
Robert Petrin, Ipsos Public Affairs; Kevin Fenton, Ipsos Public Affairs; Meng Li, Ipsos Public Affairs
Keywords: MRP; Sparsity; Design; Health; Polling
Utilizing Occupational Employment Statistics (OES) Survey to Improve Small Domain Estimation (SDE) in the Occupational Requirements Survey (ORS)
—
Xingyou Zhang, Bureau Of Labor Statistics; Erin McNulty, Bureau of Labor Statistics; Ellen Galantucci, Bureau of Labor Statistics; Patrick Kim, Bureau of Labor Statistics; Joan Coleman, Bureau of Labor Statistics; Tom Kelly, Bureau of Labor Statistics
Keywords: small domain estimation; multilevel statistical models; Standard Occupational Classification (SOC); Occupational Requirements Survey (ORS); Occupational Employment Statistics (OES) survey; establishment survey
Privacy and Confidentiality Views Toward Surveys Collected in the Era of COVID-19
—
Jennifer Hunter Childs, US Census Bureau; Aleia Clark Fobia, US Census Bureau; Casey Eggleston, US Census Bureau; Yazmin Garcia Trejo, US Census Bureau; Shaun S. Genter, U.S. Census Bureau
Keywords: privacy; confidentiality; trust; covid-19; public opinion
Using Anonymized Data to Help Public Health Officials Combat COVID-19
—
Damien Desfontaines, Google
Keywords: differential privacy
Privacy Attitudes in Times of Crisis: Acceptance of Data Sharing for Public Health?
—
Frederic Gerdon, University of Mannheim; Helen Nissenbaum, Cornell Tech; Ruben Bach, University of Mannheim; Frauke Kreuter, University of Maryland and University of Mannheim; Stefan Zins, Institute for Employment Research (IAB) Nuremberg
Keywords: privacy; Covid-19; health data; survey experiment; data sharing; public good
Privacy by Design When Launching a Global COVID Symptom Survey
—
Frauke Kreuter, University of Maryland and University of Mannheim; Sarah LaRocca, Facebook Research; Katherine Morris, Facebook Research; Adrianne Bradford, University of Maryland; Samantha Lee-Ming Chiu, University of Maryland
Keywords: covid-19; survey methods; privacy
Protecting Privacy in Facebook Mobility Data During the COVID-19 Response
—
Amaç Herda?delen, Facebook; P. Alex Dow, Facebook; Payman Mohassel, Facebook; Alex Pompe, Facebook; Bogdan State, Victoria University of Wellington
Keywords:
Bayesian Hierarchical Spatial Models for Small Area Estimation
—
Gauri Sankar Datta, US Census Bureau/University of Georgia; Hee Cheol Chung, Texas A&M University
Keywords: Conditional autoregression; Intrinsic autoregression; Simultaneous autoregression; Fay-Herriot model; Unsampled small areas; Current Population Survey
Transformation-Based Variable Selection for Three-Fold Sub-Sub-Area-Level Models in Small Area Estimation
—
Song Cai, Carleton University; J.N.K. Rao, Carleton University
Keywords:
A Bayesian Approach for Integrating a Small Probability Sample with a Non-Probability Sample
—
Nandram Balgobin, Worcester Polytechnic Insttute
Keywords: Big data; Covariates; Inverse probability weighting; Power prior; Small area; Surrogate sampling
Construction of a Database for Small Area Estimation
—
Emily Berg, Iowa State University
Keywords: Imputation; Bootstrap; Informative Sampling
Model-Based Estimates for Farm Labor Quantities
—
Lu Chen, National Institute of Statistical Sciences/USDA, NASS; Nathan B. Cruze, NASS, USDA; Linda J Young, USDA National Agricultural Statistics Service
Keywords: Auxiliary Information; Agricultural Statistics; Hierarchical Bayes; Small Area Estimation
Design Consistent Random Forest Models for Survey Data
—
daniell toth, US Bureau of Labor Statistics; Kelly McConville, Reed College
Keywords: desgin consistent; sample design; tree models; machine learning; small area; official statistics
Comparing the Performance of Alternative Imputation Methods for the Advance Monthly Retail Trade Survey
—
Katherine Jenny Thompson, U.S. Census Bureau; Stephen Kaputa, U.S. Census Bureau; Nicole Czaplicki, U.S. Census Bureau; Brian Dumbacher, U.S. Census Bureau
Keywords: regression trees; RegARIMA forecasts; Bayesian hierarchical models; link relative estimator
Computationally Efficient Bayesian Unit-Level Models for Non-Gaussian Data Under Informative Sampling
—
Scott Holan, University of Missouri; Paul Parker, University of Missouri; Ryan Janicki, US Census Bureau
Keywords: American Community Survey; Bayesian analysis; Informative sampling; Polya-Gamma; Pseudo-likelihood; Variational Bayes
Imputation Procedures in Surveys Using Nonparametric and Machine Learning Methods: An Empirical Comparison
—
David Haziza, University of Ottawa; Mehdi Dagdoug, Université de Bourgogne Franche Comté; Camelia Goga, Université de Bourgogne Franche Comté
Keywords: Additive models; Boosting; Random forests; Neareast-neighbour imputation; Cubist; BART
Changing Modes on the Fly: Transitioning a Complex Longitudinal Survey from In-Person to Phone Due to COVID-19
—
Becky Reimer; NORC at the University of Chicago; Kylie Carpenter, NORC at the University of Chicago; Ann Bisognano, NORC at the University of Chicago; Liz Kantor, NORC at the University of Chicago
Keywords: survey mode, mode analysis, in-person interviewing, telephone interviewing, data quality, COVID-19
Assessing Reproducibility of Analytic Findings Derived Through National Survey Data Integration Efforts: A Case Study Linking Patient-Level Clinical Trial Data with Medical Expenditure Panel Survey Data
—
Steven B. Cohen, RTI International; Jennifer Unangst, RTI International; Feng Yu, RTI International
Keywords: Project Data Sphere, Data Integration, Health Disparities, MEPS, Reproducibility
Hello ... Good-Bye. Hang-Ups and Breakoffs by Mode, Topic, and Geography in Oregon
—
Debbie Krug Mangipudi, ICF; Samantha McCoy, Oregon Criminal Justice Commission; Kelly Officer, Oregon Criminal Justice Commission; Ken Sanshagrin, Oregon Criminal Justice Commission; Michael Weinerman, Oregon Criminal Justice Commission; Zoe Padgett, ICF; Melinda Scott, ICF; Matt Jans, ICF; Robynne Locke, ICF; Stephen Haas, ICF; John Boyle, ICF; Randy ZuWallack, ICF; Lizzie Remrey, ICF; Heather Driscoll. ICF; Siobhan McAlister, Oregon Criminal Justice Commission
Keywords: nonresponse, paradata, hang-ups, breakoffs, web survey, phone survey
LGBTQ Disparities in Health and Social Determinants of Health in a National Monitoring Survey of Americans During the COVID-19 Pandemic
—
Rachel Kinder, ICF; Matt Jans, ICF; Dierdre Middleton, ICF; John Boyle, ICF; Thomas Brassell, ICF; James Dayton, ICF
Keywords: LGBTQ, COVID-19, survey, health, employment, income
Pilot Testing the Shift from In-Person to Phone Data Collection on the Medicare Current Beneficiary Survey (MCBS)
—
Shena Patel, NORC at the University of Chicago; Samantha Rosner, NORC at the University of Chicago; Andrea Mayfield, NORC at the University of Chicago; Jennifer Vanicek, Survey Director, NORC at the University of Chicago, Chicago, IL
Keywords: Medicare Current Beneficiary Survey, Phone Pilot Test, COVID-19, Data Collection, Interviewer
The Impact of COVID-19 on Large-Scale Phone Survey (Sample) Productivity and Response Rates
—
Matt Jans, ICF; Zoe Padgett, ICF; James Dayton, ICF; Randy ZuWallack, ICF; Don Allen, ICF; Josh Duell, ICF; Andy Dyer, ICF; Thomas Brassell, ICF; Sam Collins, ICF; Traci Creller, ICF
Keywords: response rates, COVID-19, RDD survey, survey productivity
The Impact of the COVID-19 Pandemic on RDD Cell Phone Response in the National Immunization Surveys
—
Jason Boim, NORC at the University of Chicago; Benjamin Skalland, NORC at the University of Chicago, Chicago, IL; Holly A. Hill, US Centers for Disease Control and Prevention; Michael Chen, US Centers for Disease Control and Prevention; Natalie Sterrett, US Centers for Disease Control and Prevention; David Yankey, US Centers for Disease Control and Prevention; Laurie Elam-Evans, National Center for Immunization and Respiratory Diseases, CDC
Keywords: National Immunization Surveys, COVID-19, Response Rates
Using R-Indicators to Make Case-Level Decisions for GSS 2020
—
Steven Pedlow, NORC at the University of Chicago
Keywords: General Social Survey, Representativity Indicators
An Innovative Approach to Cognitive Testing: Using Cognitive Probes in Production CATI Interviews
—
Samantha Collins, ICF; Georgette Lavetsky, MD Department of Health; Matt Jans, ICF
Keywords: cognitive testing, questionnaire pretesting, BRFSS