From voice to ink (VINK): Development and assessment of an automated, free-of-charge transcription tool

Author:

Tolle Hannah, Castro Maria del Mar, Wachinger Jonas, Putri Agrin Zauyani, Kempf Dominic, Denkinger Claudia M., McMahon Shannon A.

Abstract

Verbatim transcription of qualitative data is a cornerstone of analytic quality and rigor, yet the time and energy required for such transcription can drain resources, delay analysis, and hinder the timely dissemination of qualitative insights. In recent years, software programs have presented a promising mechanism to accelerate transcription, but their broad application has been constrained by expensive licensing or “per-minute” fees, data protection concerns, and limited availability in many languages. In this article, we outline our process of developing and adapting a free, open-source speech-to-text algorithm (Whisper by OpenAI) into a usable and accessible tool for qualitative transcription. Our program, which we have dubbed “Vink” for voice to ink, is available under a permissive open-source license (and thus free of cost). We assessed Vink’s reliability in transcribing authentic interview audio data in 14 languages, and identified high accuracy and limited correction times in most languages. A majority (9 out of 12) of reviewers evaluated the software performance positively, and all reviewers whose transcript had a word-error-rate below 20% (n=9) indicated that they were likely or very likely to use the tool in their future research. Our usability assessment indicates that Vink is easy to use, and we are continuing further refinements based on reviewer feedback to increase user-friendliness. With Vink, we hope to facilitate rigorous qualitative research processes globally by reducing the time and costs associated with transcription, and by expanding the availability of this transcription software to several global languages. Because Vink runs on the researcher’s own computer, the data privacy issues that arise with many other solutions do not apply.

Summary box

What is already known on this topic: Transcription is a key element to ensure quality and rigor of qualitative data for analysis. Current practices, however, often entail high costs, variable quality, data privacy concerns, stress for human transcribers, or long delays of analysis.

What this study adds: We present the development and assessment of a transcription tool (Vink) for qualitative research drawing upon an open-source automatic speech recognition system developed by OpenAI and trained on multilingual audio data (Whisper). Initial validation in real-life data from 14 languages shows high accuracy in several languages, and an easy-to-use interface.

How this study might affect research, practice or policy: Vink overcomes limitations of transcription by providing a ready-to-use, open-source and free-of-cost tool, with minimal data privacy concerns, as no data is uploaded to the web during transcription.
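
The abstract reports accuracy as a word-error-rate (WER) and uses a 20% threshold when describing reviewer responses. As an illustration only, the following minimal Python sketch computes WER under its standard definition: the word-level edit distance (substitutions, deletions and insertions) between a corrected reference transcript and the automated transcript, divided by the number of words in the reference. The function name `word_error_rate` and the simple normalization (lowercasing and whitespace splitting) are assumptions made for this sketch; the abstract does not state how the authors computed or normalized their WER.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption for this sketch: text is lowercased and split on
    whitespace; punctuation is left untouched.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into nothing
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    corrected = "the interview started at nine in the morning"
    automated = "the interview started at nine in morning"
    print(f"WER: {word_error_rate(corrected, automated):.3f}")
```

Under this definition, a transcript with a WER below 0.20 needs fewer than one word-level correction for every five words of the reference. WER can exceed 1.0 when the automated transcript contains many inserted words.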

Publisher

Cold Spring Harbor Laboratory
