Mathematics, risk, and messy survey data
Keywords: data, data de-identification, anonymization, anonymity, survey data
Research funder mandates, such as those from the U.S. National Science Foundation (2011), the Canadian Tri-Agency (draft, 2018), and the UK Economic and Social Research Council (2018), now often include requirements for data curation, including, where possible, data sharing in an approved archive. Data curators need to be prepared for the possibility that researchers who have not previously shared data will need assistance cleaning and depositing datasets so that they can meet these requirements and maintain funding. Data de-identification, or anonymization, is a major ethical concern when survey data is to be shared, and one that data professionals may find themselves ill-equipped to handle. This article provides an accessible and practical introduction to the theory and concepts behind data anonymization and risk assessment, describes two case studies that demonstrate how these methods were carried out on actual datasets requiring anonymization, and discusses some of the difficulties encountered. Much of the literature dealing with statistical risk assessment of anonymized data is abstract and aimed at computer scientists and mathematicians, while material aimed at practitioners often does not consider more recent developments in the theory of data anonymization. We hope this article will help bridge the gap.
Copyright (c) 2020 Kristi Anne Thompson, Carolyn Sullivan
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
"This license lets others remix, tweak, and build upon your work non-commercially, and although their new works must also acknowledge you and be non-commercial, they don’t have to license their derivative works on the same terms."