Building High Performance Explainable Machine Learning Models for Social Media-based Substance Use Prediction-Reference-Cited by-同舟云学术

Building High Performance Explainable Machine Learning Models for Social Media-based Substance Use Prediction

Published:2020-06 Issue:03n04 Volume:29 Page:2060009
ISSN:0218-2130
Container-title:International Journal on Artificial Intelligence Tools
language:en
Short-container-title:Int. J. Artif. Intell. Tools

Author:

Ding Tao¹,Hasan Fatema¹^ORCID,Bickel Warren K.²,Pan Shimei¹

Affiliation:

1. Department of Information Systems, University of Maryland, Baltimore County, Baltimore, 21250, USA

2. Addiction Recovery Research Center, Virginia Tech Carilion School of Medicine and Research Institute, Roanoke, VA 24016, USA

Abstract

Social media contain rich information that can be used to help understand human mind and behavior. Social media data, however, are mostly unstructured (e.g., text and image) and a large number of features may be needed to represent them (e.g., we may need millions of unigrams to represent social media texts). Moreover, accurately assessing human behavior is often difficult (e.g., assessing addiction may require medical diagnosis). As a result, the ground truth data needed to train a supervised human behavior model are often difficult to obtain at a large scale. To avoid overfitting, many state-of-the-art behavior models employ sophisticated unsupervised or self-supervised machine learning methods to leverage a large amount of unsupervised data for both feature learning and dimension reduction. Unfortunately, despite their high performance, these advanced machine learning models often rely on latent features that are hard to explain. Since understanding the knowledge captured in these models is important to behavior scientists and public health providers, we explore new methods to build machine learning models that are not only accurate but also interpretable. We evaluate the effectiveness of the proposed methods in predicting Substance Use Disorders (SUD). We believe the methods we proposed are general and applicable to a wide range of data-driven human trait and behavior analysis applications.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Artificial Intelligence

Link

https://www.worldscientific.com/doi/pdf/10.1142/S021821302060009X

Reference25 articles.

1. Private traits and attributes are predictable from digital records of human behavior

2. Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach

3. A Multilinear Singular Value Decomposition

4. Causal inference in statistics: An overview

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Examining the Postdictive Validity of Self-Report Big Five Personality Traits with Objective Recordings of Online Behaviors: A Ten-Year Retrospective Study Using Facebook Page Likes;Heliyon;2024-06

2. A Median-based Resilient Distributed Optimization Algorithm Against Byzantine Attack;International Journal on Artificial Intelligence Tools;2022-09

3. Social media mining in drug development—Fundamentals and use cases;Drug Discovery Today;2021-12