BACKGROUND
e-Cigarette use among youth is high, which may be due in part to pro–e-cigarette content on social media such as YouTube. YouTube is also a valuable resource for learning about e-cigarette use, trends, marketing, and e-cigarette user perceptions. However, there is a lack of understanding on how similar e-cigarette–related search items result in similar or relatively mutually exclusive search results. This study uses novel methods to evaluate the relationship between e-cigarette–related search items and results.
OBJECTIVE
The aim of this study is to apply network modeling and rule-based classification to characterize the relationships between e-cigarette–related search items on YouTube and gauge the level of importance of each search item as part of an e-cigarette information network on YouTube.
METHODS
We used 16 fictitious YouTube profiles to retrieve 4201 distinct videos from 18 keywords related to e-cigarettes. We used network modeling to represent the relationships between the search items. Moreover, we developed a rule-based classification approach to classify videos. We used betweenness centrality (BC) and correlations between nodes (ie, search items) to help us gain knowledge of the underlying structure of the information network.
RESULTS
By modeling search items and videos as a network, we observed that broad search items such as <i>e-cig</i> had the most connections to other search items, and specific search items such as <i>cigalike</i> had the least connections. Search items with similar words (eg, <i>vape</i> and <i>vaping</i>) and search items with similar meaning (eg, <i>e-liquid</i> and <i>e-juice</i>) yielded a high degree of connectedness. We also found that each node had 18 (SD 34.8) connections (common videos) on average. BC indicated that general search items such as <i>electronic cigarette</i> and <i>vaping</i> had high importance in the network (BC=0.00836). Our rule-based classification sorted videos into four categories: e-cigarette devices (34%-57%), cannabis vaping (16%-28%), e-liquid (14%-37%), and <i>other</i> (8%-22%).
CONCLUSIONS
Our findings indicate that search items on YouTube have unique relationships that vary in strength and importance. Our methods can not only be used to successfully identify the important, overlapping, and unique e-cigarette–related search items but also help determine which search items are more likely to act as a gateway to e-cigarette–related content.
CLINICALTRIAL