Users’ voice-switching behaviors, embodiments and perceptions of voice assistants

Date24 January 2025
Pages29-55
DOIhttps://doi.org/10.1108/ILS-07-2024-0081
Published date24 January 2025
AuthorDania Bilal,Li-Min Cassandra Huang
Usersvoice-switching behaviors,
embodiments and perceptions of
voice assistants
Dania Bilal
School of Information Sciences, The University of Tennessee Knoxville, Knoxville,
Tennessee, USA, and
Li-Min Cassandra Huang
Department of Library of Information Science, National Taiwan University,
Taipei, Taiwan
Abstract
Purpose This paper aims to investigate user voice-switching behavior in voice assistants (VAs),
embodiments and perceivedtrust in information accuracy, usefulnessand intelligence. The authors addressed
four research questions: RQ1. What is the nature of usersvoice-switching behavior in VAs?RQ2: What are
user preferences for embodiedvoice interfaces (EVIs), and do their preferred EVIs inuence their decision to
switch the voice on their VAs? RQ3: What are the usersperceptions of their VAs concerning: a. information
accuracy, b. usefulness,c. intelligence and d. the most important characteristics they must possess? RQ4:Do
users prefer theirvoice interface to match their characteristics (age, gender,accent and race/ethnicity)?
Design/methodology/approach The authors used a 52-question survey questionnaire to collect
quantitative and qualitativedata. The population was undergraduate students(freshmen and sophomores) at a
research university in the USA. The students were enrolled in two required courses with a research
participation assignment offered for credits. Students must register for research participation credits in the
SONA Research Participation System www.sona-systems.com/platform/research-management/ Registered
students cannot be invitedor sampled to participate in a research study. Therewere 1,700 students enrolled in
both courses. Afterthe surveys URL was posted in SONA, the authors received (n= 632) responses.Of these,
(n= 150) completedthe survey and provided valid responses.
Findings Participants(43%) switched the voice interface in their VAs.They preferred American and British
accents but trustedthe latter. The British accent with a male voice was moretrusted than the American accent
with a female voice.Voice-switchingdecisions varied in the case of most and least preferredEVIs. Participants
preferred EVIs that matched theircharacteristics. Most trusted their VAsinformation accuracy because they
used the internet to nd information, reecting inadequate mental models. Lack of trust is attributed to
misunderstandingrequests and inability to respond accurately.A signicant correlation was found between the
participantsperceivedintelligence of their VAsand trust in informationaccuracy.
Research limitations/implications Due to the wide variability in the data (e.g. 84% White, 6% Asian and
6% Black), the authors did not perform a statistical test to identify the signicance betweenthe selected EVIs and
participantsraces or ethnicities. The self-reported survey questionnaire may be prone to inaccuracy. The
participantsinterest in earning research credit for participation in this study and using SONA is a potential bias.
The EVIs the authors used as embodiments are limited in their representation of peo ple from diverse backgrounds,
races, ethnicities, ages and genders. However,they could be examples for building prototypes to test in VAs.
Practical implications Educators and informationprofessionals should lead the way in offering articial
intelligence (AI) literacy programsto enable young adults to form more adequate mental models of VAsand
support their learning and interactions. VA designers should address the failures and other issues the
participants experiencedin VAs to minimize frustrations.They should also train machine learning models on
large data sets of complex queries to augmentsuccess. Furthermore, they should consider augmenting VAs
This research was partially funded by the University of Tennessee School of Information Sciences
Faculty Research Fund. The authors thank the School for supporting this research.
Information and
Learning Sciences
29
Received15 July2024
Revised4December 2024
Accepted10 De cember 2024
Informationand Learning
Sciences
Vol.126 No. 1/2, 2025
pp. 29-55
© Emerald Publishing Limited
2398-5348
DOI 10.1108/ILS-07-2024-0081
The current issue and full text archive of this journal is available on Emerald Insight at:
https://www.emerald.com/insight/2398-5348.htm
personication with EVIs to enrich voice interactionsand enhance personalization. Researchers should use a
mixed researchmethod with data triangulation instead of only a survey.
Social implications There is a dire need to teach young adults AI literacy skills to enable them to build
adequate mental modelsofVAs. Failures in VAs could affect userswillingness to usethem in the future. VAs
can be effective teaching and learning tools, supporting studentsautonomous and personalized learning.
Integrating EVIs with diverse characteristics could advance inclusivity in designing VAs and support
personalizationbeyond language, accent and gender.
Originality/value This study advances research on user voice-switching behavior in VAs, which has
hardly been investigated in VA research. It brings attention to usersexperiential learning and the need for
exposure to AI literacy to enable them to form adequate mental models of VAs. This studycontributes to
research on personifying VAs through EVIs with diverse characteristics to visualize voice interactions.
Reasons for not switching the voiceinterface due to satisfaction with the current voice or a lack of knowledge
of this feature did not supportthe status quo theory. Incorporatingsatisfaction and lack of knowledge as new
factors could advancethis theory. Switching the voice interface to avoid visualizingthe least preferred EVIs in
VAsis a new theme emerging from this study. Userstrust in VAsinformation accuracy is intertwinedwith
perceivedintelligence and usefulness, but perceivedintelligence is the strongest factor inuencingtrust.
Keywords Voiceassistants, Embodied voice interfaces (EVIs), Interface mirroring,
Perception of VAintelligence, Perception of VA usefulness, Trust in VA information accuracy,
Voice-switchingbehavior
Paper type Research paper
Introduction
Voice assistants (VAs) are software programs that use articial intelligence (AI), natural
language processing, speech recognition and large language models to provide information,
perform various tasks, engage in conversations with users and generate responses based on
text or speech inputs (Terzopoulos and Satratzemi, 2020). They offer exible features,
allowing users to switch the defaultvoice interface by language, accent and gender or choose
the preferred voice and gender while settingup the device (Bilal and Bareld, 2021a). These
features have intrigued researchersinterest in exploring voice-switching behavior as a new
form of interaction behavior with VAs. Nonetheless, we still know very little about this
behavior,especially from young adult users(i.e. college students) perspectives.
VAs have been implemented in many disciplines, including education, to enhance teaching
and learning and provide students access to coursework, augment autonomous and personalized
learning, and promote computational thinking (Al Shamsi et al., 2022). Students learn to use
VAsthrough trial and error or other experiential experiences. Eliciting young adults perceptions
of VAsinformation accuracy, intelligence, trust and usefulness. Eliciting the students
perceptions will unveil their VAsliteracy knowledge and skills and whether interventions are
needed to enable them to use these devices effectively and efciently.
Studies revealed that users assign VAs anthropomorphic attributes (Calahorra-Candao and
Martín-de Hoyos, 2024), treat them as humans (Ki et al., 2020;Seymour and Van Kleek, 2021;
Pitardi and Marriott, 2021), personify them by ascribing friendship (Lopatovska et al., 2019;
Pradhan et al., 2019;Schweitzer et al., 2019;Wienrich et al., 2021), trust (Girouard-Hallam and
Danovitch, 2022;Seymour and Van Kleek, 2021;Wienrich et al., 2021) and personality traits
(Bilal and Bareld, 2021a;Lopatovska and Williams, 2018;Snyder et al., 2023). Users also
prefer their VAs voice to match their gender, language and accent (e.g. Bilal and Bareld,
2021b;Brahnam and De Angeli, 2021). In the case of chatbots and embodied voice interfaces
(EVIs), users prefer the characteristics of the agents EVIs to match their race and ethnicity (e.g.
Bilal and Bareld, 2021b;Bonfert et al., 2021;Liao and He, 2020).
With the growth of Generative AI (GenAI), such as ChatGPT and similar tools,VAs are
increasing in sophisticationby performing more conversational than transactional tasks (e.g.
ILS
126,1/2
30

Get this document and AI-powered insights with a free trial of vLex and Vincent AI

Get Started for Free

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

Unlock full access with a free 7-day trial

Transform your legal research with vLex

  • Complete access to the largest collection of common law case law on one platform

  • Generate AI case summaries that instantly highlight key legal issues

  • Advanced search capabilities with precise filtering and sorting options

  • Comprehensive legal content with documents across 100+ jurisdictions

  • Trusted by 2 million professionals including top global firms

  • Access AI-Powered Research with Vincent AI: Natural language queries with verified citations

vLex

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT