What we're about

People generally interested in R (professionals, students, academics ...) from any discipline.

Fork stuff from: https://github.com/michaelschulte/BernR

Upcoming events (1)

Anonymizing data using Statistical Disclosure Control

Online event

A presentation by Dr. Matthias Templ from the ZHAW School of Engineering.

The event takes place online. The link to the meeting will be visible for registered participants shortly before the beginning of the event (at the latest).

----

New technologies and research in the field of machine learning and deep learning methods and new ways of accessing, integrating and analyzing sensitive personal data increase the demand for solutions to be able to respect laws on data privacy and confidentiality. Fields of applications include official statistics and social sciences, financial transactions, social network activities, location trajectories, CRM, insurance data and medical records.
New data protection regulations, that especially include high penalties for violating privacy, put the topic of statistical disclosure control in focus.

There are a lot of different concepts to consider data privacy. To name a few: privacy preserving computation, remote execution, remote access, synthetic data, and statistical disclosure control. We focus on the latter one. Statistical disclosure control includes the measurement of the re-identification risk of persons in a data set, the anonymization of data and the measurement of the information loss after anonymization. After anonymization the data include no link to persons, and thus all the rules on privacy do no longer apply.

Based on an application using the European Survey of Income and Living Condition, we show the S4 class R package sdcMicro in action. Note, that each anonymization should be done very data- and use-case specific, and an anonymization of different kinds of data (e.g. mobility data, event history data, time series, longitudinal data) needs different solutions.

The audience is expected to have knowledge in base R.

Further readings, resources and documents:

Publications (selection):
- Journal of Statistical Software: sdcMicro
https://www.jstatsoft.org/article/view/v067i04
- Publication about sdc app
https://www.mdpi.com/1999-4893/12/9/191/htm
- Online test version
https://sdctools.shinyapps.io/sdcapp/
- Book on SDC in Springer
https://www.springer.com/de/book/9783319502700

Resources:
- sdcMicro development on github
https://github.com/sdcTools/sdcMicro
- sdcMicro stable CRAN version
https://cran.r-project.org/web/packages/sdcMicro/index.html

See also:
- International Household Survey Network
http://www.ihsn.org/software/disclosure-control-toolbox
- World Bank Group
http://surveys.worldbank.org/sdcmicro

Past events (13)

BernR Meetup: Why and how to build your own R packages

Schweizerischer Nationalfonds SNF

Photos (7)