Design and evaluation of a speech interface for remote database searching

Author:

Philip G.1,Peters B.F.1,Smith F.J.2,Crookes D.2,Rafferty T.2

Affiliation:

1. Department of Information Management, School of Finance and Information, Queen's University of Belfast

2. Department of Computer Science, School of Electronic Engineering and Computer Science. Queen's University, of Belfast, Belfast BT7 INN, Northern Ireland, UK

Abstract

Advances in speech technology have made it possible to use speech as an effective man-machine interface in infor mation retneval applications. We believe that since it is among the most natural means of communication, limited use of speech should increase the speed and ease of access to large document databases. The purpose of this paper is to descnbe a two-year research programme which involved the design, im plementation and evaluation of a speech interface for the British Library Blaise online service The project began by examining an existing pnmitive voice interface which was developed during the early 1980s for an in-house text retneval system known as MicroBIRD. Evaluation of this interface has provided invaluable insights into the problems of voice interface design for online searching. The main lessons learned from this initial study were: (a) take full advantage of the well defined syntax of the query language to limit the difficulty of the speech recognition process; and (b) avoid antagonising the user by providing full control of the configuration of the interface, enabling varying degrees of audio reinforcement of visually presented data. Based on the experience gained from the MicroBiRD inter face we embarked on the more challenging task of designing a speech interface for the Blaise system. We will outline the hardware configuration, software development, analysis of the Blaise query language syntax and design features of the new interface. Having successfully developed the system the next logical step was to study the reactions of users to the interface. particularly in relation to the preferred mix of keyboard/voice input and screen/speech output. We have carried out a series of expenments using subjects from a wide range of back grounds. The evaluation expenments have shown that the use of voice for the input of commands and associated parameters is the area of greatest advantage of a voice interface. Indeed voice input is almost as fast as keyboard input, and sometimes slightly faster. Speech output, on the other hand, should be used pnmanly for providing short prompts (e.g. help messages) to the user. The reading out of an entire bibliographic record would be irksome and, because of the senal nature of speech. far too slow. If speech output is desired it should be limited to selected parts of a record such as author and title fields. Finally. our results have also shown that currently available speech recognition and synthesis hardware, along with intelli gent software, can provide an interface well suited to the needs of online information retrieval systems.

Publisher

SAGE Publications

Subject

Library and Information Sciences,Information Systems

Reference16 articles.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3