Identifying the Privacy Risks of Semi-Anonymous Online Medical Forums: Content Analysis (Preprint)

Author:

Sangari Ayush,Sood Aditya,Horinek Maddison,Sangari Anish,Sood NitishORCID,Siddiqui Sulaimaan,Soni AakritiORCID

Abstract

BACKGROUND

Online medical forums, such as Breastcancer.org, allow individuals to come together in a safe space to share personal details about themselves, diagnoses, and treatment plans. Such a space is meant to be anonymous in order to protect the identities of all patients involved. However, the information shared is anonymized only by the chosen username, and usernames on these forums can be used again by patients on other public social media websites. In addition, content posted by patients often contains sensitive information that can be used to uncover their identities. This threatens the privacy of patients by offering the opportunity to link their public-facing social media profiles with their private-facing profiles on online medical support groups.

OBJECTIVE

The aim of this study was to design a methodology that could analyze the dual privacy threat derived from username linkage and from disclosure of personal identifiable information (PII) or protected health information (PHI) for semi-anonymous medical forums.

METHODS

12,000 usernames on Breastcancer.org were randomly selected and then cross-referenced with Facebook, Twitter, Instagram, and Reddit using an online tool called socialscan. The entropy of each username was then calculated as a proxy for username uniqueness using Dropbox’s zxcvbn tool. Using the username uniqueness probabilities, the expected number of profiles that could be linked to a public-facing social media profile was subsequently determined. Analysis was further conducted on the nature of PII and PHI being shared on these forums on a randomly chosen message board using Microsoft Presidio, a tool designed for scrubbing sensitive text.

RESULTS

Substantial reuse of usernames was detected between Breastcancer.org and any of the four social media sites. Out of these 12,000 sampled users, 169 patients on Breastcancer.org were expected to be linked to at least one other social media website. 66 patients were expected to be linked to Facebook, 96 to Twitter, 102 to Instagram, and 66 to Reddit. This analysis suggests that of Breastcancer.org’s 227,862 users, approximately 3,190 users in total can be linked to at least one of the four social media websites examined in this study solely based on username. In addition, when content from a randomly chosen thread was analyzed, each of the 2,781 posts from 64 patients contained an average of 9.5 pieces of PII.

CONCLUSIONS

These results demonstrate a substantial risk to patients on semi-anonymous medical forums with regards to medical privacy and the need to add warnings for patients when they are creating usernames. This tool, or a similar one, could be used by online medical support groups to warn patients when signing up for the platform to use different usernames for their public-facing and private-facing profiles. Such a warning would strengthen patient privacy.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3