Smart speaker
from Wikipedia
Original Google Home smart speaker, released in 2016

A smart speaker is a type of loudspeaker and voice command device with an integrated virtual assistant that offers interactive actions and hands-free activation through one or more "wake words". Some smart speakers also act as smart home hubs, using Wi-Fi, Bluetooth, Thread, and other protocol standards to extend usage beyond audio playback and to control home automation devices connected through a local area network.

History


Early voice-activated devices began in 2013 with MIT's Jasper project,[1] which used multiple microphones and cloud software to enable hands-free interaction from across a room.

The first commercial smart speaker was the Amazon Echo, released in 2014 and powered by Alexa and a ring of far-field microphones. Google followed in 2016 with the Home, powered by Google Assistant. In 2017 the Echo Show added a touchscreen and video, creating the "smart display" subcategory, and Google's Home Hub (later renamed the Nest Hub) followed in 2018. In 2018, Apple joined the smart speaker market with the HomePod, which focused on high-quality audio alongside its built-in assistant, Siri.

In the early 2020s, smart speakers gained on-device voice processing for faster responses and improved privacy. New standards such as Matter and Thread allowed smart-home devices from different brands to work together.[2]

Features


Audio and Voice


Smart speakers use multiple microphones together with noise-cancelling software to pick up a user's voice from across the room, even when music is playing or the assistant is already talking. Noise suppression and echo cancellation let the device focus on the person speaking and ignore background sounds. Most models can also recognize who is speaking by voiceprint, allowing the speaker to retrieve that person's calendar, preferences, or music playlists.[citation needed]
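One common technique behind this multi-microphone pickup is delay-and-sum beamforming. The following Python sketch illustrates the idea under stated assumptions; the array geometry, sample rate, and steering angle are invented for the example and do not reflect any particular product:

```python
import numpy as np

SPEED_OF_SOUND = 343.0   # m/s at room temperature
SAMPLE_RATE = 16_000     # Hz, a common rate for far-field voice capture

def delay_and_sum(signals: np.ndarray, mic_x: np.ndarray, angle_deg: float) -> np.ndarray:
    """Steer a linear microphone array toward angle_deg.

    signals: (n_mics, n_samples) time-aligned microphone recordings
    mic_x:   (n_mics,) microphone positions along one axis, in metres
    """
    # Per-microphone arrival delay of a plane wave from angle_deg.
    delays_s = mic_x * np.sin(np.deg2rad(angle_deg)) / SPEED_OF_SOUND
    delays_n = np.round(delays_s * SAMPLE_RATE).astype(int)
    out = np.zeros(signals.shape[1])
    for channel, d in zip(signals, delays_n):
        out += np.roll(channel, -d)   # compensate each channel's delay
    # Averaging keeps the steered voice in phase while uncorrelated
    # noise from other directions partially cancels.
    return out / len(signals)

# Example: a 4-microphone array with 5 cm spacing, steered 30 degrees.
mic_positions = np.arange(4) * 0.05
recordings = np.random.randn(4, SAMPLE_RATE)   # stand-in for captured audio
enhanced = delay_and_sum(recordings, mic_positions, angle_deg=30.0)
```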

Audio quality matters most when listening to music. Entry-level speakers such as the Home Mini or the Echo Dot use a single full-range driver, so their music playback is comparatively poor. More advanced units such as the Home Max or Echo Studio have separate tweeters and woofers for higher-fidelity playback.[citation needed]

Connectivity and smart-home control


Most smart speakers connect over Wi-Fi or Bluetooth and support hub protocols such as Thread and Matter. This lets them not only stream music but also control smart lights, thermostats, door locks, cameras, and other devices from many brands, all from one point of control. Each device can have its own interface and features, usually launched or controlled via an application or home automation software.[3] These devices can communicate with each other via peer-to-peer mesh networking, and the speakers and related smart devices are typically controlled with one smartphone application.[4]

Assistant services and skills


The built-in assistants handle timers, alarms, reminders, news briefings, weather updates, messages to other smart devices, texts, calls, and simple questions. Actions can be combined into what are typically known as routines (for example, saying "good morning" turns on the lights, starts the coffee maker, reports the weather, and reads the news), and extra functions known as skills or actions can be added (for ordering food or playing trivia games, for instance). Hands-free use of smart speakers can assist people with disabilities: most other technologies require the user to physically interact with the device, whereas smart speakers have no such limitation and can serve as an excellent tool for those who are unable to use their arms or legs or who have vision impairments.[5]
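As a toy illustration of the routine concept, the sketch below maps one trigger phrase to an ordered list of actions; every function here is an invented placeholder rather than any vendor's API:

```python
from typing import Callable

def lights_on() -> None:     print("lights: on")
def start_coffee() -> None:  print("coffee maker: brewing")
def say_weather() -> None:   print("weather: 18 C and clear")
def read_news() -> None:     print("news: reading top headlines...")

# A routine is simply a trigger phrase fanned out to several actions.
ROUTINES: dict[str, list[Callable[[], None]]] = {
    "good morning": [lights_on, start_coffee, say_weather, read_news],
}

def handle_utterance(text: str) -> None:
    for action in ROUTINES.get(text.lower().strip(), []):
        action()   # run each step of the routine in order

handle_utterance("Good morning")
```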

Although these tasks can be completed by a phone or computer, consumers tend to prefer smart speakers for two reasons: their microphone range is much greater than a phone's, and no physical interaction is needed to summon the assistant, whereas on most smartphones some part of the device may need to be touched to activate the voice assistant.[6]

Smart displays

Original Google Home Hub (now known as the Nest Hub), released in 2018.

Some smart speakers include a screen to show the user a visual response. A smart speaker with a touchscreen is known as a smart display;[7][8] these integrate a conversational user interface with a display to augment voice interaction with images and video. They are powered by one of the common voice assistants and offer additional controls for smart home devices, streaming apps, and web browsers with touch controls for selecting content. The first smart display was Amazon's Echo Show, introduced in 2017, followed by Google's Home Hub (later the Nest Hub) in 2018.

Artificial intelligence


The newest speakers can use on-device AI or cloud-based generative models to carry on much more natural conversations, draft emails or recipes, suggest ideas based on context, or even create short pieces of music or art, extending these devices far beyond their earlier capabilities.[9]

Accuracy


According to a study published in the Proceedings of the National Academy of Sciences of the United States of America in March 2020, speech recognition systems from six major technology companies (Amazon, Apple, Google, Yandex, IBM and Microsoft) misidentified more words spoken by Black people than by white people. Across the systems tested, error rates were 19 percent for white speakers versus 35 percent for Black speakers, and 2 percent of audio from white speakers was unreadable versus 20 percent for Black speakers.[10]

The North American Chapter of the Association for Computational Linguistics (NAACL) also identified a discrepancy between male and female voices. According to its research, Google's speech recognition software is 13 percent more accurate for men than for women, although it still performs better than the systems used by Bing, AT&T, and IBM.[11]

Privacy concerns


The built-in microphone in smart speakers continuously listens for a wake word followed by a command, and these always-listening microphones raise privacy concerns among users.[12] In a survey of 1,007 people in Western Europe, privacy was the biggest concern holding consumers back from buying "smart" products.[13] These concerns include what is being recorded, how the data will be used, how it will be protected, and whether it will be used for invasive advertising.[14][15] Furthermore, an analysis of Amazon Echo Dots showed that 30–38% of "spurious audio recordings were human conversations", suggesting that these devices capture audio beyond strict wake-word detection.[16]

As a wiretap


There are strong concerns that the ever-listening microphone of smart speakers makes them a natural candidate for wiretapping. In 2017, British security researcher Mark Barnes showed that pre-2017 Echo models have exposed debug pads that allow a compromised operating system to be booted.[17]

According to Umar Iqbal, an assistant professor at Washington University in St. Louis, research indicates that data from consumer interactions with Alexa was used to target advertisements and products to consumers, with over 40% of transmitted data lacking proper encryption, raising privacy concerns.[18] Further data indicates that because smart speakers can always capture audio, they pick up conversations unrelated to the commands given to them: other members of the household, people on the phone, and even TV audio can be captured by these speakers and stored for future use by companies.[19]

Voice assistance vs privacy


While voice assistants provide a valuable service, there can be some hesitation towards using them in various social contexts, such as in public or around other users.[20] Only more recently have users begun interacting with voice assistants through smart speakers rather than through the phone. On the phone, most voice assistants can be engaged by a physical button (e.g., Siri with a long press of the home button) rather than solely by the wake word-based engagement of a smart speaker. While this distinction increases privacy by limiting when the microphone is on, users felt that having to press a button first removed the convenience of voice interaction.[21] This trade-off is not unique to voice assistants; as more and more devices come online, there is an increasing trade-off between convenience and privacy.[22]

Security concerns


When configured without authentication, smart speakers can be activated by people other than the intended user or owner. For example, visitors to a home or office, or people in a publicly accessible area outside an open window, partial wall, or security fence, may be heard by the speaker. This may allow others to access the owner's personal information without permission. One team demonstrated the ability to inject signals into the microphones of smart speakers and smartphones through a closed window, from another building across the street, using a laser.[23]

Smart speakers are often overlooked in home network security because of their convenience and simplicity, yet one in three breaches now involves an IoT device.[24] The risk of being hacked through a smart speaker can be greatly reduced by a few simple steps: choose a trustworthy brand whose security record matches your needs; keep devices updated to the latest firmware, since updates patch known exploits; and, once updated, review the settings and disable any features you do not need, further minimizing security risks.[25]

Usage statistics


As of summer 2022, it is estimated by NPR and Edison Research that 91 million Americans (35% of the population over 18) own a smart speaker.[26]

from Grokipedia
A smart speaker is a standalone device featuring integrated microphones, speakers, and artificial intelligence-driven virtual assistants that enable voice command interactions for functions including audio playback and control of interconnected smart home appliances. These devices rely on voice recognition and cloud-based computation to interpret user queries, distinguishing them from conventional speakers by their autonomous responsiveness without requiring paired smartphones or computers.

Commercial smart speakers emerged in the mid-2010s, with Amazon's Echo launching in 2014 as the pioneering consumer model powered by the Alexa voice assistant, followed by competitors such as Google's Home series and Apple's HomePod. By 2025, the global market has expanded substantially, generating over $19 billion in revenue, dominated by Amazon, which holds the largest share through its Echo lineup, alongside key players like Alphabet's Google and Apple. This growth stems from enhanced voice recognition accuracy and ecosystem integrations that facilitate automation in households, though proliferation has been tempered by hardware limitations in audio fidelity compared to dedicated hi-fi systems.

Prominent characteristics include always-on listening for wake words, which activates recording and transmission of audio snippets to remote servers for processing, enabling seamless multi-room audio and interoperability via standards like Matter. However, these affordances carry empirical risks, including unauthorized capture of voice and ambient conversations, as documented in systematic reviews and user perception studies revealing widespread apprehension over data collection and third-party access despite manufacturer mitigations like deletion options. Such concerns underscore the causal trade-offs between utility and privacy in always-connected environments.

History

Early Precursors and Foundational Technologies

The development of smart speakers relied on foundational advancements in speech synthesis and speech recognition technologies, which originated in the early 20th century with analog systems designed to mimic human vocalization. In 1939, Bell Laboratories introduced the Voice Operation DEmonstrator (Voder), the first electronic speech synthesizer capable of producing intelligible speech through manual control of filters and oscillators simulating vocal tract resonances; it was publicly demonstrated at the New York World's Fair and marked a milestone in generating synthetic voice output from electrical signals. Earlier mechanical precursors, such as Christian Kratzenstein's 1779 organ pipes tuned to produce individual vowel sounds, laid conceptual groundwork by isolating acoustic elements of speech, though limited to basic phonemes without electronic amplification.

Automatic speech recognition (ASR) emerged in the mid-20th century, initially focusing on digit and isolated word detection to enable voice-to-text conversion. Bell Laboratories' system, unveiled in 1952, represented the first functional ASR prototype, accurately recognizing spoken digits 0-9 with about 90% success for a single trained speaker using analog pattern-matching circuits. By the early 1960s, IBM's Shoebox demonstrated recognition of 16 words through digital filtering and threshold-based decision logic, expanding beyond digits but still constrained to speaker-dependent, isolated utterances. These systems employed template-matching techniques, comparing input spectrograms to stored references, which highlighted early challenges in handling variability from accents, noise, and coarticulation effects.

Advancements in the 1970s and 1980s integrated statistical modeling, paving the way for the continuous speech recognition essential to smart speaker interactivity. Carnegie Mellon University's Harpy system (1976) achieved recognition of a 1,010-word vocabulary using a network of phonetic rules and dynamic programming, approaching practical accuracy for limited domains. The adoption of Hidden Markov Models (HMMs) in the mid-1980s, as refined in DARPA-funded research, enabled probabilistic modeling of temporal speech sequences, improving accuracy for larger vocabularies and speaker independence; this shift from rule-based to data-driven paradigms underpinned subsequent natural language processing (NLP) integration for intent parsing in voice commands. Parallel progress in text-to-speech (TTS) synthesis, such as formant-based synthesizers like Klatt's 1980 cascade/parallel models, provided natural-sounding output by parameterizing source-filter vocal tract simulations, forming the acoustic backbone for responsive smart speaker feedback. These technologies converged in the 2000s with hybrid HMM-neural approaches, enabling cloud-accessible processing that later powered always-listening devices, though early implementations required computational resources unavailable in consumer hardware until the 2010s.

Commercial Launch and Early Adoption (2014–2019)

The Amazon Echo marked the commercial debut of smart speakers, launching on November 6, 2014, as an invite-only product limited to approximately 5,000 initial U.S. customers. This cylindrical device featured a ring of seven far-field microphones and integrated Amazon's Alexa voice assistant, supporting voice-activated music streaming from services like Amazon Music, basic queries via connected cloud services, and rudimentary smart home control through compatible devices. Initial adoption was modest due to the exclusive release model and lack of widespread awareness, but Amazon's bundling with Prime memberships and iterative updates to Alexa's capabilities began building a user base focused on the convenience of hands-free interaction.

Amazon accelerated adoption by introducing the compact, low-cost Echo Dot in March 2016, priced at $49.99, which prioritized affordability over audio quality and drove broader household integration. This expansion coincided with growing developer support for Alexa skills, enabling third-party integrations for tasks like weather updates and online ordering. By late 2016, Amazon had sold millions of devices, establishing early dominance in the U.S. market, where smart speaker ownership rose from near zero in 2014 to significant traction among tech enthusiasts and early adopters.

Competitors entered rapidly to challenge Amazon's lead. Google launched the Google Home in November 2016, a puck-shaped speaker powered by Google Assistant, emphasizing conversational search and integration with Google services. Priced at $129, it gained quick adoption through aggressive bundling and appeal to Android users, contributing to a surge in U.S. smart speaker users exceeding 47 million households by January 2018. Apple followed with the HomePod in February 2018, a premium $349 speaker leveraging Siri and high-fidelity audio from seven tweeters, targeting audiophiles despite criticism for limited smart home interoperability outside the Apple ecosystem. Other entrants included the Harman Kardon Invoke with Cortana in 2017 and the Sonos One supporting Alexa or Google Assistant, though these captured smaller shares amid the duopoly of Amazon and Google.

Early adoption accelerated post-2016, fueled by price reductions, holiday promotions, and expanding use cases like music streaming and smart home control. Global smart speaker shipments grew exponentially, reaching 146.9 million units in 2019, a 70% increase from the prior year, with Amazon holding over 50% U.S. market share and Google rising to 31%. By early 2019, Amazon alone had shipped more than 100 million Echo-family devices worldwide, reflecting penetration into over 28% of U.S. households and highlighting the shift toward voice-first interfaces in the home. This period solidified smart speakers as a gateway to IoT ecosystems, though concerns over always-on listening emerged as adoption scaled.

Maturation and Recent Developments (2020–2025)

The smart speaker market expanded significantly from 2020 to 2025, with global revenues growing from approximately $7.1 billion in 2020 to projected figures around $15-21 billion by the end of 2025, reflecting a compound annual growth rate (CAGR) of 17-22% driven by enhanced AI capabilities and broader smart home integration. Shipments and adoption surged amid increased demand for voice-activated home automation, particularly during the COVID-19 pandemic, though growth moderated post-2022 as markets saturated in developed regions. Manufacturers emphasized premium audio features and multi-room systems, with advancements in low-power AI chips enabling more efficient on-device processing and extended functionality.

Privacy concerns prompted notable enhancements across major platforms during this period. By 2025, nearly 60% of consumers prioritized privacy features in purchasing decisions, leading companies to implement physical mute buttons, encryption for voice data, and user-controlled deletion options. Amazon introduced improved data controls in Echo devices, while Google and Apple expanded opt-in recording policies and on-device processing to minimize cloud transmissions. The Connectivity Standards Alliance's Matter protocol, launched in late 2022, aimed to foster interoperability among smart speakers and IoT devices, reducing ecosystem lock-in; however, its adoption for audio streaming remained limited by 2025, with primary benefits seen in unified control rather than seamless speaker-to-speaker integration.

Major vendors released iterative hardware and software updates emphasizing generative AI. Amazon unveiled Alexa+ in 2025, powering new Echo models like the Echo Dot Max and a redesigned Echo Studio with enhanced processing for proactive, personalized interactions, alongside refreshed Echo Show displays for visual responses. Google integrated its Gemini AI across Nest speakers, including legacy models from 2016, via firmware updates that added advanced conversational abilities and dynamic lighting cues, though a flagship Google Home speaker launch was deferred to 2026. Apple advanced HomePod capabilities with chip upgrades, such as the S9 or newer in refreshed minis, to support Apple Intelligence and a revamped Siri, focusing on spatial audio and ecosystem integration but facing criticism for its slower pace. These developments marked a shift toward AI-driven maturity, prioritizing reliability and cross-device interoperability over rapid hardware proliferation.

Hardware Design

Audio Output and Acoustic Engineering

Smart speakers utilize compact electro-acoustic transducers, typically including full-range drivers, woofers, and tweeters, to produce audio output suitable for both voice responses and music playback across room-scale distances. These configurations prioritize omnidirectional or near-360-degree sound dispersion to accommodate variable listener positions, achieved through enclosure geometries and driver placement rather than the directional beaming common in traditional systems. Acoustic engineering focuses on maximizing sound pressure levels (SPL) and frequency response within physical constraints, often targeting 60 Hz to 20 kHz with emphasis on clarity for intelligible speech.

A core challenge in audio output design is the limited internal volume of cylindrical or spherical form factors, which restricts low-frequency extension and bass response due to the physics of small enclosures and driver excursion limits. Manufacturers address this via passive radiators or high-excursion woofers; for instance, the Apple HomePod employs an upward-firing 4-inch high-excursion woofer paired with a seven-tweeter array to distribute sound evenly while enhancing spatial imaging through beamforming-like dispersion control. Similarly, Amazon's Echo Studio integrates a 5.25-inch woofer, three 2-inch full-range drivers, and a 1-inch tweeter, enabling up to 100 dB SPL with automatic room acoustic adaptation via onboard microphones that measure reflections and apply digital equalization in real time. The 2025 redesign of the Echo Studio reduces size by 40% while upgrading drivers for improved efficiency, with acoustically transparent 3D-knit fabric enclosures.

Digital signal processing (DSP) plays a pivotal role in compensating for enclosure limitations and environmental variability, incorporating algorithms for dynamic range compression, harmonic distortion reduction, and adaptive filtering to maintain clarity at far-field distances up to 10 meters. Google's Nest Audio, for example, uses a 75 mm (3-inch) woofer tuned with DSP for 50% stronger bass than its predecessors, supporting multi-room synchronization where phase alignment ensures coherent wavefronts. Objective metrics like total harmonic distortion (THD) below 1% at nominal levels and consistent off-axis response are benchmarked to evaluate performance, revealing trade-offs such as elevated distortion in bass-heavy content due to nonlinear driver behavior in compact designs. Innovations like Apple's computational audio in the HomePod mini employ a custom acoustic waveguide to direct output from a single full-range driver and dual passive radiators, yielding uniform 360-degree coverage with computational adjustments for room modes.

Engineering efforts also mitigate reverberation and multipath interference in untreated rooms by optimizing direct-to-reverberant energy ratios, often verified through anechoic and in-room measurements. While peer-reviewed analyses confirm DSP efficacy in flattening response curves, real-world efficacy depends on microphone-accurate room profiling, with limitations in highly reverberant spaces where echoes degrade perceived fidelity. Overall, acoustic design balances cost, size, and performance, prioritizing voice intelligibility over audiophile-grade neutrality, as evidenced by frequency responses favoring 200-5000 Hz for voice assistant interactions.
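To make the THD figure concrete, the following sketch estimates total harmonic distortion from a spectrum in the standard way (ratio of harmonic energy to the fundamental); the 1 kHz test tone and harmonic levels are synthetic, illustrative values:

```python
import numpy as np

fs = 48_000                  # sample rate, Hz
f0 = 1_000                   # fundamental test frequency, Hz
t = np.arange(fs) / fs       # one second of samples
# Synthetic driver output: fundamental plus small 2nd and 3rd harmonics.
x = (np.sin(2 * np.pi * f0 * t)
     + 0.005 * np.sin(2 * np.pi * 2 * f0 * t)
     + 0.003 * np.sin(2 * np.pi * 3 * f0 * t))

spectrum = np.abs(np.fft.rfft(x)) / len(x)
bin_of = lambda f: int(round(f * len(x) / fs))      # 1 Hz bins here
fundamental = spectrum[bin_of(f0)]
harmonics = [spectrum[bin_of(k * f0)] for k in range(2, 6)]  # 2nd to 5th
thd = np.sqrt(sum(h ** 2 for h in harmonics)) / fundamental
print(f"THD ~= {100 * thd:.2f} %")                  # ~0.58 % for this signal
```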

Microphone Arrays and Sensor Integration

Smart speakers employ microphone arrays consisting of multiple microphones arranged in geometric patterns, such as circular or linear configurations, to facilitate far-field voice capture and enhance recognition accuracy. These arrays leverage beamforming algorithms, which apply phase shifts and weighting to microphone signals, directing sensitivity toward the sound source while suppressing ambient noise and echoes. This enables reliable detection of wake words and commands from distances up to several meters, even in reverberant environments.

The Amazon Echo series exemplifies advanced implementation, featuring a seven-microphone circular array in its first-generation model for 360-degree voice pickup. This setup, combined with acoustic echo cancellation and noise reduction processed on-device, supports hands-free interaction without requiring users to face the device. Later variants, such as those powered by the AZ3 neural edge processor introduced in 2025, incorporate upgraded arrays for improved far-field performance, filtering background noise during natural conversations. Apple's HomePod utilizes a six-microphone array integrated with an A8 processor for continuous multichannel signal processing, enabling echo cancellation and dereverberation tailored to room acoustics. The second-generation HomePod (2023) adds an internal calibration microphone for automatic bass adjustment and room-sensing capabilities, which analyze spatial reflections to optimize audio output dynamically. Google Nest devices typically integrate three far-field microphones with Voice Match technology for speaker identification, supporting beamforming to isolate user voices amid household noise.

Sensor integration extends functionality beyond audio capture; for instance, ultrasonic sensing in Nest Hubs and Minis emits inaudible tones via the speakers and detects reflections using the microphones to gauge user proximity, activating displays or lighting capacitive controls only when someone approaches. This reduces unintended activations and enhances privacy by limiting always-on processing. Touch and proximity sensors are commonly fused with the arrays for contextual awareness. Capacitive touch surfaces on devices like the HomePod allow gesture-based controls for volume or muting, while integrated sensors trigger activation or suppression based on detected presence, minimizing false triggers from distant sounds. Such multimodal integration relies on low-latency onboard processing to correlate sensor data with audio streams, improving responsiveness and energy efficiency.
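The acoustic echo cancellation mentioned above is commonly built on adaptive filtering. Below is a minimal normalized-LMS sketch, assuming a known loudspeaker reference signal and an invented echo path; the filter length and step size are illustrative, not any product's parameters:

```python
import numpy as np

def nlms_echo_cancel(mic: np.ndarray, ref: np.ndarray,
                     taps: int = 128, mu: float = 0.5) -> np.ndarray:
    """Subtract an adaptively estimated echo of ref from the mic signal."""
    w = np.zeros(taps)                    # adaptive estimate of the echo path
    out = np.zeros_like(mic)
    for n in range(taps, len(mic)):
        x = ref[n - taps:n][::-1]         # most recent reference samples
        e = mic[n] - w @ x                # error = mic minus predicted echo
        w += mu * e * x / (x @ x + 1e-8)  # normalized LMS weight update
        out[n] = e
    return out

# Example: the "echo path" is a delayed, attenuated copy of the playback.
rng = np.random.default_rng(0)
playback = rng.standard_normal(16_000)              # what the speaker played
echo = 0.6 * np.concatenate([np.zeros(40), playback[:-40]])
near_speech = 0.1 * rng.standard_normal(16_000)     # stand-in for the talker
cleaned = nlms_echo_cancel(echo + near_speech, playback)
```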

Processors, Connectivity, and Form Factors

Smart speakers utilize system-on-chip (SoC) processors tailored for low-power operation, voice processing, and on-device AI tasks, predominantly employing Arm cores for efficiency in embedded applications. Amazon's Echo lineup incorporates custom AZ-series neural edge processors, such as the AZ1 developed in collaboration with MediaTek, which handle local wake-word detection and basic command interpretation to reduce latency. Apple's HomePod (second generation) employs the S7 processor, derived from Apple Watch silicon, enabling computational audio features like spatial processing across its driver array. Google devices, including the Nest Hub (second generation), feature quad-core processors clocked at up to 1.9 GHz to manage local processing and Assistant interactions.

Connectivity in smart speakers centers on Wi-Fi as the primary interface for cloud-based services, with most models supporting IEEE 802.11n or ac standards over 2.4 GHz and 5 GHz bands to ensure reliable streaming and updates. Bluetooth, typically version 4.2 or higher, supplements this for direct pairing with mobile devices, multi-room audio synchronization, and auxiliary input. Integrated hubs for low-power protocols like Zigbee appear in select units, such as the Amazon Echo (fourth generation), allowing direct orchestration of compatible smart home devices to minimize ecosystem fragmentation. Emerging standards including Thread and Matter enable broader interoperability, with adoption in post-2022 models to bridge vendor silos through IP-based communication.

Form factors prioritize acoustic projection, user interaction, and space constraints, evolving from bulky cylinders to compact, multifunctional designs. The original Amazon Echo adopted a 9.25-inch cylindrical chassis to house a 2.5-inch woofer and omnidirectional tweeter. Compact disc or puck shapes, as in the Echo Dot or Nest Mini, measure under 4 inches in diameter, facilitating countertop or shelf placement while relying on DSP to compensate for limited driver size. Spherical aesthetics in early Google Home models integrated fabric covers for visual subtlety, whereas display hybrids like the Echo Show incorporate 8- to 15-inch screens alongside speakers for video and control interfaces. Shrinking enclosures demand trade-offs in battery life for portables and in thermal management, often addressed via efficient SoCs and passive cooling.

Core Features and Capabilities

Voice Processing and Natural Language Understanding

Smart speakers operate via a structured voice processing pipeline: (1) constant listening through microphone arrays with beamforming and noise cancellation; (2) on-device wake word detection for phrases like "Alexa," "Hey Google," or "Hey Siri"; (3) audio capture, preprocessing, and transmission to cloud servers via Wi-Fi or on-device handling; (4) automatic speech recognition (ASR) transcribing audio to text while managing accents, noise, and dialects; (5) natural language understanding (NLU) to discern user intent and entities; (6) request processing, including actions like querying databases or controlling devices; (7) response generation; (8) text-to-speech (TTS) synthesis; and (9) audio playback. Modern devices execute more steps locally for reduced latency and enhanced privacy, reserving cloud resources for complex queries.

Voice processing initiates with wake word detection, where microphone arrays continuously monitor audio for predefined activation phrases using low-power, on-device keyword spotting models. These algorithms employ lightweight neural networks to identify the wake word amid ambient noise while minimizing false positives and power consumption, often achieving detection latencies under 100 milliseconds in optimized systems. Upon wake word confirmation, the device captures a subsequent audio segment, typically 2-5 seconds, and preprocesses it through noise suppression, echo cancellation, and gain control to enhance signal quality before transmission to remote servers or on-device processors. Automatic speech recognition (ASR) then transcribes the audio into text, leveraging acoustic models, language models, and architectures like recurrent neural networks or transformers; commercial systems report word accuracies of 90-95% under typical home conditions as of 2025, though performance degrades with accents, dialects, or reverberant environments.

Natural language understanding (NLU) follows ASR, parsing the text to identify intents (e.g., "play music" or "set timer") and extract slot values (e.g., song title or duration) via probabilistic models that incorporate context, dialogue history, and domain-specific grammars. In platforms like Alexa, NLU integrates syntactic analysis for sentence structure and semantic interpretation for meaning, handling paraphrases and ambiguities through classifiers trained on vast utterance datasets; similar approaches in Google Assistant and Siri emphasize contextual disambiguation to resolve coreferences or anaphora. The integrated voice-to-intent pipeline faces causal challenges from ASR errors propagating to NLU, such as homophone confusions or transcription gaps reducing intent accuracy by up to 20-30% in noisy scenarios, prompting hybrid on-device-cloud architectures for latency-sensitive tasks like wake word detection and basic commands. Multilingual NLU variants, supporting over 100 languages in leading systems by 2023, contend with data scarcity and performance disparities across low-resource tongues, often relying on transfer learning from high-resource models. Advances in end-to-end neural models have improved joint ASR-NLU efficiency, enabling responses under 1 second in optimized deployments.
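The nine-stage pipeline can be summarized in code. The toy sketch below folds the stages into a runnable loop; string matching stands in for the neural ASR, NLU, and TTS components, and none of these functions is a real vendor API:

```python
def detect_wake_word(frame: str) -> bool:
    return frame.lower().startswith("computer")    # stage 2: keyword spotting

def asr_transcribe(frame: str) -> str:             # stage 4: speech -> text
    return frame.split(",", 1)[1].strip().lower() if "," in frame else ""

def nlu_parse(text: str) -> dict:                  # stage 5: text -> intent
    if text.startswith("set a timer for"):
        return {"intent": "SetTimer", "minutes": int(text.split()[-2])}
    return {"intent": "Unknown"}

def fulfill(intent: dict) -> str:                  # stages 6-7: act and reply
    if intent["intent"] == "SetTimer":
        return f"Timer set for {intent['minutes']} minutes."
    return "Sorry, I didn't catch that."

def play(reply: str) -> None:                      # stages 8-9: TTS + playback
    print("speaker:", reply)

for frame in ["background chatter",
              "Computer, set a timer for 10 minutes"]:
    if detect_wake_word(frame):                    # stages 1-3 folded together
        play(fulfill(nlu_parse(asr_transcribe(frame))))
```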

Smart Home Control and IoT Integration

Smart speakers function as central controllers for Internet of Things (IoT) devices in residential environments, allowing users to issue voice commands that adjust lighting, thermostats, locks, appliances, and security systems through integrated voice assistants. This integration relies on wireless protocols such as Wi-Fi for direct internet-connected devices, Bluetooth for short-range pairing, and low-power mesh networks like Zigbee or Thread to extend reach and reliability across multiple devices without constant cloud dependency. For instance, Amazon's Echo devices incorporate built-in hubs supporting Zigbee, enabling seamless control of compatible sensors and bulbs without additional hardware, while newer models from 2020 onward also handle Matter-over-Thread for certified interoperable ecosystems.

Amazon's Alexa exemplifies broad IoT compatibility, routing commands from speakers to over 100,000 device types via cloud APIs or local execution, with Matter support introduced in 2022 allowing direct pairing of certified devices like smart plugs and cameras across ecosystems, bypassing proprietary skills. Google Assistant on Nest speakers leverages Thread border routers in devices like the Nest Hub (2nd gen), facilitating Matter-enabled control of lights and sensors with reduced latency through local mesh networking, where speakers relay signals to extend coverage up to 100 devices per network. Apple's HomePod series, particularly the HomePod mini released in 2020, serves as a HomeKit hub using Wi-Fi, Bluetooth Low Energy, and Thread to manage accessories, enforcing encryption via protocols like Station-to-Station for secure local communication even when the user is remote.

The Matter standard, developed by the Connectivity Standards Alliance and launched for certification in October 2022, aims to unify these protocols by enabling devices to work interchangeably with Alexa, Google Assistant, and HomeKit without vendor lock-in, using IP-based communication over Thread or Wi-Fi for low-bandwidth efficiency. By mid-2025, Matter supports categories including lights, locks, and thermostats, with Thread providing robust meshing where smart speakers act as routers to maintain connections amid interference, though adoption remains uneven due to certification delays and incomplete backward compatibility for legacy Zigbee or Z-Wave gear. Integration challenges persist, as proprietary ecosystems like HomeKit prioritize security isolation, often requiring VLAN segmentation for IoT traffic, while cross-platform Matter implementations still demand firmware updates and can suffer from fragmented Thread support across speakers. Despite these, voice-controlled routines, such as automating lights at dusk or integrating with energy monitors, have driven smart home device shipments to exceed 1 billion units globally by 2025, underscoring speakers' role in causal chains of automation from user intent to physical actuation.
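Conceptually, the speaker-as-hub role amounts to routing a parsed intent to the right device over the right transport. The following is a hedged sketch with invented device names, addresses, and send functions; real hubs use certified Zigbee, Thread, or Matter stacks rather than these stand-ins:

```python
from dataclasses import dataclass

@dataclass
class Device:
    name: str
    transport: str      # "zigbee" | "thread" | "wifi" (illustrative)
    address: str

# Stand-ins for protocol stacks; each just prints the outgoing command.
def send_zigbee(addr: str, payload: dict) -> None: print(f"zigbee -> {addr}: {payload}")
def send_thread(addr: str, payload: dict) -> None: print(f"thread -> {addr}: {payload}")
def send_wifi(addr: str, payload: dict) -> None:   print(f"wifi   -> {addr}: {payload}")

SENDERS = {"zigbee": send_zigbee, "thread": send_thread, "wifi": send_wifi}

REGISTRY = {  # invented devices a hub might have paired
    "kitchen light": Device("kitchen light", "zigbee", "0x3f21"),
    "thermostat": Device("thermostat", "thread", "fd00::12"),
}

def dispatch(intent: dict) -> None:
    """Route a parsed intent to the target device over its transport."""
    device = REGISTRY[intent["device"]]
    SENDERS[device.transport](device.address, {"command": intent["command"]})

dispatch({"device": "kitchen light", "command": "on"})
dispatch({"device": "thermostat", "command": {"set_celsius": 20}})
```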

Extensible Services, Skills, and Third-Party Ecosystems

Amazon's Alexa platform pioneered extensible services through its Skills framework, launched in 2015 via the Alexa Skills Kit (ASK), which enables third-party developers to create custom voice applications that integrate with Echo devices. By October 2024, over 160,000 skills were available globally, covering categories like smart home control, entertainment, and productivity, though many remain low-usage due to discoverability challenges and competition from native features. Developers access APIs for intent recognition, account linking, and monetization options such as in-skill purchases, fostering an ecosystem where skills can invoke external services like weather APIs or transactions.

Google Assistant extends functionality via Actions, introduced in 2017 through the Actions on Google platform, allowing developers to build conversational experiences using tools like Dialogflow for natural language design. The number of Actions grew to approximately 19,000 in English by late 2019, with similar expansion in other languages, though recent adoption has shifted toward integrated Google services amid a focus on AI advancements like Gemini. Actions support custom fulfillment via webhooks and integrations with Google Cloud, enabling third-party apps for tasks like booking services or querying databases, but the ecosystem lags behind Alexa in sheer volume due to stricter conversational design requirements.

Apple's HomePod and Siri ecosystem offers limited extensibility compared to competitors, relying on SiriKit for predefined intents in areas like media playback, messaging, and workouts, with developers integrating via App Intents for Siri-specific features. Third-party music services, such as Pandora or Deezer, can link directly to the HomePod for seamless playback, but broader skill-like customizations are constrained by Apple's closed HomeKit framework, which prioritizes certified accessories over open developer submissions. Siri support for third-party hardware, enabled since 2021, allows select devices like thermostats to process voice commands locally, yet lacks the app-store model of rivals, resulting in fewer extensible services.

Third-party ecosystems enhance interoperability across smart speakers via platforms like IFTTT and SmartThings, which automate workflows between devices and services (for instance, triggering a HomePod light scene from an Echo command) without native skills. Open-source alternatives such as Home Assistant provide maximal extensibility by aggregating protocols like Zigbee and Z-Wave, integrating with Alexa, Google Assistant, and Siri through cloud bridges or local APIs, enabling custom automations on dedicated hardware that bypass vendor lock-in. These tools address fragmentation in proprietary ecosystems, where empirical data shows Alexa leading in third-party device compatibility (over 100,000 supported products as of 2023), followed by Google and Apple with more curated integrations. Developer privacy practices vary, with studies indicating persistent vulnerabilities in skill permissions, underscoring the need for user scrutiny in extensible deployments.
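For a sense of the skill model, the sketch below shows the general shape of a custom-skill fulfillment handler patterned on the Alexa Skills Kit request/response JSON; the intent name "GetFactIntent" and the replies are invented for illustration:

```python
def lambda_handler(event: dict, context=None) -> dict:
    req = event["request"]
    if req["type"] == "LaunchRequest":
        speech = "Welcome. Ask me for a fact."
    elif req["type"] == "IntentRequest" and req["intent"]["name"] == "GetFactIntent":
        speech = "Smart speakers wake on a local keyword-spotting model."
    else:
        speech = "Sorry, I can't help with that."
    # Shape of an Alexa Skills Kit response envelope.
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": speech},
            "shouldEndSession": True,
        },
    }

# Example invocation with a minimal IntentRequest payload.
print(lambda_handler({
    "request": {"type": "IntentRequest", "intent": {"name": "GetFactIntent"}}
}))
```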

Embedded AI and Machine Learning Functions

Smart speakers rely on embedded AI and machine learning algorithms to perform critical on-device tasks, enabling low-latency responses, power efficiency, and enhanced privacy by minimizing cloud dependency for initial processing. These functions typically include wake word detection, acoustic signal enhancement, and basic personalization, processed via specialized hardware like digital signal processors (DSPs) or neural processing units (NPUs). For instance, keyword spotting models use deep neural networks (DNNs) to continuously monitor audio streams without transmitting data to the cloud unless triggered.

Wake word detection represents a foundational embedded ML capability, employing lightweight DNN-based classifiers trained on acoustic patterns to distinguish the trigger phrase, such as "Alexa," "Hey Google," or "Hey Siri," from background noise or unrelated speech. Amazon's Alexa implements a two-stage on-device system: an initial acoustic model filters potential candidates, followed by a verification stage using background noise modeling to reduce false positives, achieving high accuracy with minimal computational overhead. Apple's Siri voice trigger similarly utilizes a multi-stage DNN pipeline on-device, converting audio frames into probability distributions for the wake phrase while incorporating user-specific adaptation for improved personalization over time. This local execution prevents unnecessary data transmission, addressing privacy concerns inherent in always-listening devices.

Beyond detection, embedded ML handles real-time audio preprocessing, including beamforming for microphone arrays, acoustic echo cancellation, and noise suppression, often via convolutional neural networks (CNNs) optimized for edge deployment. In far-field scenarios, such as those optimized for Apple's HomePod, ML models adapt to room acoustics and speaker distance, enhancing signal-to-noise ratios through techniques like dereverberation and directional filtering. Speaker identification and diarization further leverage on-device models to differentiate household voices, enabling personalized responses without cloud reliance for routine commands.

From 2020 to 2025, advancements in edge AI have expanded these functions to include hybrid local-cloud inference for simple intents, federated learning for privacy-preserving model updates, and adaptive personalization, such as routine prediction based on usage patterns. Devices increasingly incorporate efficient ML frameworks like TensorFlow Lite or Core ML to run quantized models on resource-constrained hardware, reducing latency to under 100 milliseconds for wake-to-response in optimal conditions. However, limitations persist; for example, Apple's second-generation HomePod, powered by the S7 chip lacking a dedicated Neural Engine, relies more on cloud processing for complex Apple Intelligence features introduced in 2024, constraining full on-device AI scalability. These embedded capabilities underscore a shift toward causal, data-driven optimizations prioritizing empirical performance metrics over expansive cloud architectures.
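The two-stage detection scheme can be sketched as follows, with an energy threshold standing in for the first-stage acoustic model and a fixed score standing in for the verification network; both stand-ins are illustrative only:

```python
import numpy as np

def stage1_energy_gate(frame: np.ndarray, threshold: float = 0.01) -> bool:
    """Ultra-cheap always-on check: is there enough signal energy at all?"""
    return float(np.mean(frame ** 2)) > threshold

def stage2_verify(window: np.ndarray) -> float:
    """Stand-in for a small DNN returning P(wake word | audio window)."""
    # A real verifier runs a quantized neural net over log-mel features.
    return 0.97 if stage1_energy_gate(window, 0.05) else 0.02

def wake_detected(window: np.ndarray, accept: float = 0.9) -> bool:
    if not stage1_energy_gate(window):       # stage 1 rejects frames cheaply
        return False
    return stage2_verify(window) >= accept   # stage 2 runs only on candidates

rng = np.random.default_rng(1)
silence = 0.001 * rng.standard_normal(16_000)
speech = 0.5 * rng.standard_normal(16_000)
print(wake_detected(silence), wake_detected(speech))   # False True
```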

Variants and Extensions

Smart Displays and Visual Interfaces

Smart displays integrate the voice-activated capabilities of smart speakers with touchscreen interfaces, allowing users to view visual content such as recipes, calendars, weather maps, and live video feeds from connected cameras. Unlike audio-only smart speakers, which rely solely on verbal responses, smart displays support touch interactions for direct navigation and video calling via built-in cameras on models like the Echo Show series. This combination enhances usability for tasks requiring graphical representation or real-time visuals, such as monitoring smart home devices or streaming content.

Amazon introduced the first widely available smart display with the Echo Show (1st generation) on June 28, 2017, featuring a 7-inch screen, 5-megapixel camera, and Alexa integration for video calls and music streaming with on-screen lyrics. Google followed with the Home Hub, later rebranded as Nest Hub, in October 2018, offering a 7-inch display without a camera to prioritize privacy, alongside Google Assistant for similar functions plus ambient computing features like photo frames. Subsequent models expanded screen sizes and capabilities; for instance, Amazon's Echo Show 8 (3rd generation, 2023) includes an 8-inch HD display with spatial audio, while Google's Nest Hub Max (2019) adds a 10-inch screen and a camera with auto-framing for calls. Apple has not released a dedicated smart display product as of 2025, relying instead on its other devices for visual smart home control.

Key advantages over audio-only speakers include improved accuracy in disambiguating queries via on-screen options and support for multimedia consumption, such as streaming video or recipe videos with step-by-step visuals. However, smart displays consume more power due to backlit screens, typically 10-15 watts idle versus 2-5 watts for speakers, and occupy more counter space, limiting portability. Market data indicates robust growth, with the global smart display sector valued at approximately USD 3 billion in 2023 and projected to reach USD 33 billion by 2032 at a CAGR of over 30%, driven by demand for integrated hubs. Privacy-focused designs, like Google's initial camera-less Nest Hub, address concerns over always-on cameras, though many models now include manual privacy shutters or microphone mutes. Integration with ecosystems remains vendor-specific: Amazon devices excel in Alexa skills for shopping and routines, while Google leverages ambient EQ for adaptive display and broader Google service ties. As of 2025, Amazon and Google dominate with iterative releases emphasizing AI enhancements, such as auto-summarizing video calls or smart home controls, positioning smart displays as central smart home interfaces.

Portable, Automotive, and Niche Applications

Portable smart speakers incorporate rechargeable batteries and compact designs to enable voice assistant functionality beyond stationary home environments, supporting outdoor or on-the-go use for tasks like music streaming and queries. The Sonos Roam, announced on March 9, 2021, delivers up to 10 hours of continuous playback on a single charge, features IP67 water and dust resistance, and integrates with Alexa and Google Assistant via the Sonos app for voice commands including smart home control and multi-room audio syncing. The Bose Portable Smart Speaker, released on September 19, 2019, provides up to 12 hours of battery life, 360-degree sound output, and built-in support for both Alexa and Google Assistant over Wi-Fi or Bluetooth, allowing seamless transitions between home and portable modes and voice-activated control of services like Spotify via Spotify Connect. Other portable models extend this capability in boombox-style form factors; for instance, the JBL Boombox 3 Wi-Fi supports Spotify Connect and Alexa Multi-Room Music for voice commands, while older devices such as the Ultimate Ears Megablast integrate Alexa with Spotify voice control. Additional examples include the JBL Authentics 300, which supports Alexa and Google Assistant with Spotify Connect integration, and the Bang & Olufsen Beosound A1 (2nd Generation), featuring Alexa for voice control of Spotify.

In automotive contexts, smart speaker technology manifests as dedicated in-car devices or embedded systems that leverage voice assistants for driver safety and convenience, routing audio through vehicle speakers while minimizing distractions. Amazon's Echo Auto, with its second-generation model released in 2022, mounts via a clip and pairs with a smartphone to enable Alexa in any compatible car, supporting functions such as navigation, calling, and media playback using the phone's connection and the car's auxiliary input. Vehicle manufacturers have integrated similar capabilities; for example, BMW partnered with Amazon in January 2024 to deploy generative AI-augmented Alexa in select models, permitting natural language interactions for climate control, route adjustments, and other functions without requiring cloud dependency for basic commands. Other brands, including Ford, offer Alexa built into infotainment systems as of 2025, often via app-linked integration for voice-activated calls, music, and smart home extensions.

Niche applications of smart speakers appear in specialized domains like healthcare, where they aid remote monitoring and patient support through voice interfaces. In medical settings, devices function as health conversation agents, delivering medication reminders, vital sign queries via connected sensors, and guided instructions to promote adherence among elderly or chronic patients, as demonstrated in feasibility studies showing high usability for such programs. Enterprise and hospitality sectors employ them for customer interactions, such as room service requests or information dissemination, while educational uses involve interactive aids for learning or administrative tasks, though industrial adoption for announcements or commands remains limited by environmental durability needs.

Performance Metrics

Accuracy in Voice Recognition and Response

Accuracy in voice recognition for smart speakers relies on automatic speech recognition (ASR) systems that transcribe spoken input into text, followed by natural language understanding (NLU) to discern user intent and formulate responses. Performance is commonly evaluated via word error rate (WER), the proportion of words incorrectly recognized relative to a ground-truth transcript, with lower rates indicating higher accuracy. In ideal, close-field conditions, advanced ASR engines like Google's have attained WERs under 5%, nearing the 4% average for human transcribers. Real-world deployment on smart speakers, however, involves far-field audio capture amid ambient noise, reverberation, and variable acoustics, elevating WERs and necessitating microphone arrays and beamforming algorithms for mitigation.

Empirical benchmarks reveal inter-vendor differences. A 2018 controlled test of music identification and command fulfillment showed Google Home achieving superior recognition rates over the Amazon Echo, attributed to stronger acoustic modeling, while Apple's Siri lagged at 80.5% overall success, hampered by stricter wake-word sensitivity and NLU constraints. Broader industry data from around 2021 pegged WERs between 15.82% and 18.42% across the major assistants, with Amazon at the higher end, reflecting aggregate performance across diverse inputs. Response accuracy, integrating ASR with NLU and knowledge retrieval, averages 93.7% for typical queries, though complex or domain-specific requests yield lower rates due to hallucination risks or incomplete training.

Demographic and environmental factors introduce variability. A 2020 study across Siri, Alexa, and Google Assistant found WER nearly doubling to 35% for black speakers versus 19% for white speakers, stemming from training datasets skewed toward majority accents and dialects, which undermines causal generalization to underrepresented groups. Non-native accents, ambient noise exceeding 20 dB SPL, and multi-speaker interference further degrade accuracy by 10-20% in household settings, per evaluations finding a roughly 9.5% WER advantage for native speakers. Vendor self-reports, such as Google's sustained 95% word accuracy into the 2020s, must be contextualized against independent audits, as proprietary optimizations favor clean, monolingual inputs over edge cases.

Advancements in embedded machine learning, including transformer-based end-to-end ASR and federated learning for privacy-preserving adaptation, have incrementally lowered error rates since 2020, with on-device inference reducing latency to under 500 ms for responses. Nonetheless, persistent gaps in noisy or accented scenarios highlight limitations in first-principles scaling of data-driven models without diverse, causal-aware training paradigms. Comprehensive metrics like human-aligned WER variants, which weigh semantic errors over literal mismatches, better capture user-perceived response quality, averaging 7-9% in segmented evaluations.
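WER as used throughout this section is the word-level edit distance between hypothesis and reference divided by the reference length. A minimal implementation follows (the sample sentences are invented):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: edit distance over reference length."""
    ref, hyp = reference.lower().split(), hypothesis.lower().split()
    # Dynamic-programming edit distance (substitution, insertion, deletion).
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution or match
    return d[len(ref)][len(hyp)] / len(ref)

print(wer("turn on the kitchen lights", "turn on the chicken lights"))  # 0.2
```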

Reliability, Latency, and Error Rates

Smart speakers demonstrate reliability through high operational uptime in controlled environments, but real-world performance is affected by factors including connectivity disruptions and acoustic interference, with misactivation rates (unintended activations due to background speech or similar-sounding phrases) reported in studies as occurring up to several times per day per device in households. A 2020 analysis of Amazon Echo and Google Home devices found misactivation events averaging 1-19 times daily, often triggered by TV audio or conversations, contributing to perceived unreliability despite hardware uptime exceeding 99% in manufacturer tests. Network-dependent cloud processing introduces failure points, as offline modes are limited to basic functions on most models.

Error rates in voice recognition and command fulfillment vary by assistant and conditions like accents, noise, or query complexity, with word error rate (WER), the percentage of transcription errors including substitutions, insertions, and deletions, serving as a primary metric. Modern automatic speech recognition (ASR) systems integrated in smart speakers achieve WER below 10% in clean, controlled settings, reflecting improvements from models trained on vast datasets. However, an analysis of local search queries across devices revealed higher practical failure rates, with 6.3% of queries unanswered on average: Amazon's Alexa at 23%, Google Assistant at 8.4%, Apple's Siri at 2%, and others like Microsoft Cortana at 14.6%. These discrepancies arise from causal factors such as domain-specific gaps and real-time processing limitations, with peer-reviewed evaluations emphasizing that error rates rise to 20-30% in noisy or accented speech scenarios.
Voice assistant       Unanswered query rate (%)
Amazon Alexa          23
Google Assistant      8.4
Apple Siri            2
Average               6.3
Latency, encompassing wake-word detection, transcription, intent parsing, and response synthesis, typically spans 1-4 seconds end-to-end for cloud-processed commands, with wake-word response under 500 milliseconds on devices like the Google Home. A 2020 measurement tool for smart speaker performance quantified response times via automated audio playback, revealing averages of 2-3 seconds for simple queries on Echo and Nest models, prolonged by server load or weak signals. Backend skill latencies for Alexa, from request to fulfillment, target under 2 seconds but can exceed 5 seconds during peak usage, as monitored in developer consoles. Causal delays stem from sequential cloud dependencies rather than local computation, though edge AI enhancements in newer models reduce this by 20-30% for routine tasks.
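An end-to-end latency measurement of this kind can be approximated by playing a recorded command at the device and timing until its reply becomes audible on a monitoring microphone. The harness below simulates the audio I/O; the helper names are placeholders, not a real library:

```python
import time

def play_clip(path: str) -> None:
    pass                                   # stand-in: drive a test loudspeaker

def mic_level(t0: float) -> float:
    # Simulated monitoring mic: the device "starts replying" after 1.8 s.
    return 0.2 if time.monotonic() - t0 > 1.8 else 0.01

def measure_response_latency(command_wav: str, timeout_s: float = 10.0) -> float:
    play_clip(command_wav)                 # e.g. "..., what time is it?"
    t0 = time.monotonic()
    while time.monotonic() - t0 < timeout_s:
        if mic_level(t0) > 0.05:           # the reply became audible
            return time.monotonic() - t0
        time.sleep(0.01)
    raise TimeoutError("no audible response within timeout")

print(f"{measure_response_latency('query.wav'):.2f} s")   # ~1.81 s simulated
```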

Security Concerns

Known Vulnerabilities and Hacking Incidents

Smart speakers have been subject to various security vulnerabilities, primarily stemming from their always-on microphones, network connectivity, and integration with third-party services, enabling potential eavesdropping, unauthorized control, and data exfiltration. In 2020, a vulnerability in Amazon Alexa's web services allowed attackers to access users' entire voice history, including recorded interactions, by exploiting flaws in authentication and data retrieval mechanisms; Amazon patched the issue after it was reported by security researchers. Similarly, CVE-2023-33248 affected Amazon Echo Dot devices on software version 8960323972, permitting attackers to inject security-relevant information via crafted audio signals, though no widespread exploitation was reported.

Google Home devices faced a critical flaw disclosed in late 2022, where a weakness in the device's account-linking process enabled remote backdoor installation, allowing hackers to control the speaker, access the microphone feed for eavesdropping, and execute arbitrary commands; Google awarded the discovering researcher a $107,500 bug bounty and issued a firmware update. In 2019, third-party apps approved for the Alexa and Google Home ecosystems were found modified to covertly record and transmit audio snippets to unauthorized servers, bypassing review processes and compromising user conversations. Another Google Home vulnerability involved script-based location tracking, where attackers could pinpoint device positions within minutes via network queries, exposing users' physical locations.

Apple HomePod and related HomeKit systems encountered AirPlay protocol weaknesses in 2025, comprising 23 vulnerabilities that permitted zero-click attacks for device takeover, including remote code execution and potential microphone hijacking on unpatched units; Apple addressed these through SDK updates following reports from Oligo Security. Earlier, in 2017, a HomeKit authentication bypass allowed remote attackers to seize control of connected IoT accessories, such as locks and lights, prompting Apple to deploy a fix. These incidents highlight persistent risks from unverified inputs and legacy protocols, though manufacturers have mitigated many through patches, underscoring the need for regular updates to counter exploitation.

Defense Mechanisms and Best Practices

Manufacturers incorporate several built-in defense mechanisms in smart speakers to counter unauthorized access and data interception. Amazon Echo devices, for example, feature microphones that can be muted via a physical button or voice command, preventing audio capture when activated, and employ encryption for data transmitted to the cloud. Google Nest speakers include automatic updates to patch vulnerabilities and integrate with device firewalls that block unsolicited inbound connections. These mechanisms rely on secure boot processes and over-the-air (OTA) updates, which major vendors like Amazon and Google release periodically; Amazon, for instance, issued patches for Echo devices addressing remote code execution flaws as recently as 2023.

Network-level protections form a critical layer of defense against lateral movement by attackers within a home network. Experts recommend segmenting smart speakers onto a separate VLAN or guest Wi-Fi network to isolate them from sensitive devices like computers, reducing the impact of compromises. Firewalls should be configured to restrict outbound traffic to only necessary cloud endpoints, and WPA3 encryption on networks enhances protection against interception, as the older WPA2 protocol has known key reinstallation vulnerabilities (KRACK). For enterprise or high-security environments, dedicated access points with port/protocol restrictions, such as limiting Echo devices to port 443 for HTTPS, further harden connectivity.

User-implemented best practices significantly bolster these defenses by addressing human factors in security. Changing default passwords immediately upon setup prevents trivial attacks, with recommendations emphasizing passphrases of at least 12 characters combining letters, numbers, and symbols. Enabling multi-factor authentication (MFA) on associated accounts, available for Amazon and Google services, adds a second verification layer, thwarting 99% of account takeover attempts according to industry data. Users should routinely review and revoke third-party skill or routine permissions via app dashboards, disable always-listening modes when unnecessary, and physically secure devices to deter tampering. Regular auditing of voice history logs, combined with opting into features like Amazon's Alexa Guard for sound detection (e.g., glass breaking), enables proactive monitoring without constant manual oversight.

Advanced mitigations target specific attack vectors identified in research. Against ultrasonic or laser injection attacks, firmware hardening includes input validation and signal-filtering algorithms, as demonstrated in post-2020 updates for devices vulnerable to such exploits. For Bluetooth-related risks, disabling the interface when unused, where supported, or using Bluetooth Low Energy (BLE) with secure pairing mitigates stack overflows like SweynTooth. Cybersecurity agencies such as CISA advocate prioritizing vendor patches and limiting device exposure, noting that unpatched IoT firmware accounts for over 50% of breaches in analyzed incidents. Independent security audits and selecting devices from vendors with transparent vulnerability disclosure policies enhance long-term resilience.

Privacy Considerations

Data Acquisition and Transmission Protocols

Smart speakers employ microphone arrays to continuously monitor ambient audio in a low-power state, performing local wake-word detection via embedded algorithms to identify activation phrases such as "Alexa," "Hey Google," or "Hey Siri" without transmitting data during idle listening. Upon wake-word confirmation, the device buffers and records a brief audio segment, typically 1-8 seconds including pre-wake context, to capture the full user query, applying local preprocessing like noise suppression and echo cancellation to enhance signal quality before transmission (a simplified sketch of this pipeline follows below). This design minimizes false activations through acoustic modeling trained on device-specific hardware, though empirical studies indicate misactivation rates of up to 19% in noisy environments, potentially leading to unintended recordings.

Transmission occurs over Wi-Fi using encrypted protocols, primarily HTTPS layered over TLS 1.2 or higher, with certificate pinning to prevent man-in-the-middle attacks; for Echo devices, this integrates the Alexa Voice Service (AVS) protocol for audio streaming to cloud endpoints, compressing clips in formats like Opus for bandwidth efficiency. Google Assistant-enabled speakers similarly encrypt data in transit to Google servers using TLS, protecting it from device to processing clusters without local storage of raw audio beyond temporary buffering. Apple's HomePod follows suit, initiating encrypted uploads only after wake detection, with anonymized identifiers to obscure user linkage, leveraging iCloud-secured channels for query fulfillment. In real-time features like video calls on compatible models, peer-to-peer protocols may supplement the connection for audio and video, but core voice interactions default to proprietary cloud-bound streams authenticated via device tokens. These protocols prioritize efficiency: local detection reduces latency to under 1 second for wake confirmation while offloading natural language understanding to remote servers equipped for vast computational scale, but they necessitate reliable internet connectivity, with fallback to offline modes limited to basic commands on select devices. Metadata such as timestamps, device IDs, and session tokens accompanies audio payloads to enable response routing, and stored data is protected with AES-256 encryption at rest on servers post-transmission. Independent analyses confirm no persistent local audio retention in standard configurations, though firmware updates can modulate protocol parameters as security standards evolve.

Users of major smart speakers, such as Amazon Echo devices with Alexa, Google Nest speakers, and Apple HomePod, can access various controls to manage activity and data handling, including physical mute buttons that disable the microphone and prevent listening or recording. Software-based options allow deletion of individual voice recordings or entire interaction histories through companion apps; for instance, Amazon users can review and delete Alexa voice data via the Alexa app, while Google provides tools to manage and export Assistant activity. However, these controls vary in granularity: Amazon and Google offer fine-grained settings for retention and sharing, such as opting out of voice storage or limiting third-party app access, whereas Apple provides more limited options focused on on-device processing. Consent models typically rely on initial agreement to terms of service during device setup, which includes broad permissions for audio capture upon wake-word detection; users can adjust preferences post-setup but often face default-enabled features that prioritize functionality over minimal data use.
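The following is a minimal sketch of the local buffering-and-upload pipeline described above, assuming hypothetical `mic`, `detector`, and `uploader` objects (no vendor exposes such an API publicly). It illustrates how a rolling pre-wake buffer lets the device capture query context without streaming idle audio off-device.

```python
# Sketch of wake-word-gated audio capture: ambient audio stays in a short
# local ring buffer; transmission happens only after wake-word confirmation.
from collections import deque

FRAME_MS = 20        # assumed audio frame duration
PRE_WAKE_MS = 1000   # ~1 s of pre-wake context kept locally
QUERY_MS = 7000      # capture window after the wake word (within the 1-8 s range)

# Ring buffer: old frames fall off the back, so idle audio is never retained.
pre_wake = deque(maxlen=PRE_WAKE_MS // FRAME_MS)

def capture_query(mic, detector, uploader):
    """Buffer ambient audio locally; upload only after a confirmed wake word."""
    while True:
        frame = mic.read_frame()          # one PCM frame from the mic array
        pre_wake.append(frame)            # idle listening never leaves the device
        if detector.detect_wake_word(pre_wake):
            # Include pre-wake context so the start of the query is not clipped.
            clip = list(pre_wake)
            for _ in range(QUERY_MS // FRAME_MS):
                clip.append(mic.read_frame())
            # In production this would be compressed (e.g., Opus) and sent
            # over TLS 1.2+ with a pinned server certificate.
            uploader.upload_over_tls(clip)
            pre_wake.clear()
```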
Explicit consent is required for certain integrations, like sharing recordings with third-party developers, but critics note that these models embed consent within lengthy privacy policies, potentially leading to uninformed acceptance. Recent developments, such as Amazon's March 28, 2025, discontinuation of the "Do Not Send Voice Recordings" option for Echo devices, illustrate how manufacturers can alter consent frameworks, compelling cloud uploads that were previously avoidable and reducing user agency over local storage. Data retention policies differ by provider: Amazon retains voice recordings indefinitely unless users opt for deletion, with text transcripts kept for 30 days even without audio storage; Google maintains activity data until manually deleted or per user-configured auto-delete timelines (e.g., 3, 18, or 36 months); and Apple holds data only as long as necessary for service fulfillment, emphasizing shorter retention without routine storage of raw audio. These durations support service improvement and legal compliance but raise concerns over indefinite access risks, as evidenced by user studies recommending shorter default retention to align with preferences. Providers like Google and Apple state that they do not sell voice data, though anonymized aggregates may inform service improvement or model training.

In the United States, the Federal Trade Commission (FTC) and Department of Justice (DOJ) initiated enforcement against Amazon in May 2023 for violations of the Children's Online Privacy Protection Act (COPPA) involving Alexa-enabled smart speakers, alleging the company retained children's voice recordings indefinitely by default and undermined parental controls for deletion, thereby failing to delete such data upon request. Amazon settled the case in July 2023 with a $25 million civil penalty and injunctive relief mandating overhauled deletion mechanisms, enhanced privacy assessments for voice data, and limits on retaining audio recordings unless necessary for functionality or legal compliance. This action underscored COPPA's applicability to always-listening devices that process audio from users under 13, requiring verifiable parental consent for data collection.

Private litigation has established further precedents on user consent for voice data. In August 2025, a U.S. federal judge certified a nationwide class action against Amazon, encompassing millions of Alexa users who claim the devices recorded private conversations without adequate notice or consent, retaining and potentially sharing snippets for training purposes in violation of state privacy laws and implied contracts. Similar suits since 2019 have targeted other smart speaker makers, including allegations that Amazon leveraged Alexa interactions for unauthorized ad targeting based on inferred user preferences from voice queries. For Google, claims have proceeded on grounds of unconsented recording and transmission of private audio to its servers. These cases emphasize that incidental audio capture beyond wake-word activation can constitute unlawful interception requiring explicit opt-in mechanisms, with courts scrutinizing default settings as presumptively non-consensual.

In criminal proceedings, smart speaker data has been subpoenaed as evidence, prompting Fourth Amendment challenges over warrantless access. A 2017 district court ruling in the Bates case held that voice assistant recordings seized via warrant receive no unique evidentiary protection, treating them akin to other digital records if probable cause exists for the underlying crime. Instances include a 2016 murder investigation where Amazon provided limited Echo data post-subpoena, revealing no direct utility but highlighting chain-of-custody issues for audio logs.
In the European Union, General Data Protection Regulation (GDPR) enforcement has indirectly addressed smart speaker audio processing through investigations into opaque data flows used for personalization. EU regulators have flagged voice assistants' continuous listening as risking breaches of data minimization and purpose limitation principles, with fines potentially reaching 4% of global annual turnover for non-compliance. Although no speaker-specific mega-fines have materialized as of 2025, broader actions like the 2021 €746 million penalty against Amazon for advertising-related data processing signaled heightened scrutiny of voice-derived behavioral profiles. National data protection authorities continue probing consent models for always-on devices, prioritizing anonymization of incidental recordings.

Market Dynamics

Adoption Rates and Usage Statistics

In the United States, smart speaker household penetration reached approximately 35% in 2024, with over 100 million individuals aged 12 and older owning at least one device. This figure reflects sustained growth from earlier years, driven primarily by Amazon Echo and Google Nest products, which together exceed 30% penetration in U.S. households. Globally, unit shipments surpassed 87 million in 2024, indicating expanding adoption amid increasing integration with smart home ecosystems.

Regional variations highlight differing market maturities. In the United Kingdom, adoption stood at 18.3% of households, while another major market reported a higher rate of 20.9%, fueled by affordable entry-level models and rising connectivity. U.S. dominance in the category is evident, with Amazon's Echo lineup commanding a 65-70% share as of 2023, followed by Google at around 23% and Apple at 2%. These shares underscore platform-specific ecosystems, where Alexa-enabled devices lead due to broader compatibility with third-party services.

Usage patterns emphasize entertainment and convenience. A significant portion of owners, over 70% in surveys, engage daily with music streaming services via smart speakers, making it the most frequent application. Broader penetration is projected to nearly double globally to 30.8% by 2026 from 16.1% in 2022, supported by declining device prices and enhanced voice AI capabilities.
| Region/Country | Household Adoption Rate (2024) | Primary Drivers |
| --- | --- | --- |
| United States | 35% | Amazon Echo and Google Nest ecosystems |
| United Kingdom | 18.3% | Integration with existing smart home tech |
|  | 20.9% | Affordable models and mobile-first users |

Key Manufacturers, Market Shares, and Competition

The dominant manufacturers in the smart speaker market are Amazon, Google (Alphabet Inc.), and Apple Inc., which leverage their proprietary voice assistants (Alexa, Google Assistant, and Siri, respectively) to drive device sales and ecosystem integration. These companies account for the bulk of global shipments, with Amazon maintaining leadership through its Echo lineup, which emphasizes affordability and broad third-party skill compatibility. Google focuses on search-derived AI strengths in its Nest devices, while Apple prioritizes premium audio and privacy features in HomePod models.

Global market shares vary by region, but as of 2024, Amazon commanded approximately 23% worldwide, followed by Apple at 15%, with Google holding a significant portion amid competition from regional players like Xiaomi and Alibaba in China. In the United States, a key market, Amazon's share reached 70% in recent assessments, underscoring its early-mover advantage and aggressive pricing strategies. Other notable manufacturers include Sonos, which emphasizes high-fidelity audio without built-in assistants in core models, and Bose, often partnering with Amazon for Alexa integration.

Competition centers on advancing voice AI, expanding smart home interoperability via standards like Matter, and differentiating through hardware innovations such as improved microphones and speakers. Amazon's scale enables lower prices and vast content ecosystems, challenging Google's data-driven personalization and Apple's closed-system appeal; however, saturation in developed markets has shifted focus to emerging regions and multifunctional devices combining speakers with displays or hubs. Industry analysts project continued consolidation among top players, with the global sector valued at USD 13.71 billion in 2024 and forecast to reach USD 15.10 billion in 2025.
| Manufacturer | Approximate Global Market Share (2024) | Key Products |
| --- | --- | --- |
| Amazon | 23% | Echo series |
| Apple | 15% | HomePod series |
| Google | Significant (exact % varies by source) | Nest Audio, etc. |
The global smart speaker market has demonstrated strong economic growth, expanding from an estimated USD 7.1 billion in 2020 to USD 13.71 billion in 2024, driven primarily by rising consumer demand for voice-activated assistants and integration with IoT devices. This trajectory reflects a compound annual growth rate (CAGR) of around 17% during that period, fueled by economies of scale in manufacturing and software improvements in voice recognition, though growth has moderated post-2020 due to market saturation in developed regions.

Pricing has trended downward since the category's inception, enabling wider accessibility and contributing to volume-driven revenue gains. The original Amazon Echo launched at USD 199 in November 2014, while the Google Home debuted at USD 129 in 2016; by early 2018, competitive pressures prompted Amazon and Google to slash entry-level prices, reducing the Echo Dot and Home Mini from USD 50 to as low as USD 29. Premium models like the Apple HomePod, introduced at USD 349 in 2018, have maintained higher price points but faced limited uptake partly due to cost. Overall, average selling prices have declined by approximately 20-30% over the decade through iterative product generations and promotional discounting, with current entry-level units often retailing below USD 50 during sales periods.

Forecasts indicate continued expansion, with the market projected to reach USD 16.59 billion in 2025 and USD 33.17 billion by 2030, at a CAGR of 14.86%, supported by emerging applications in healthcare, automotive integration, and developing markets. Alternative projections anticipate revenue approaching USD 30 billion by 2029, though actual outcomes may vary based on regulatory hurdles for data privacy and supply chain disruptions. In the U.S., the segment is expected to add USD 6.41 billion in value from 2024 to 2029 at a 23.2% CAGR, highlighting regional disparities in growth potential.
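The growth rates quoted above follow from the standard compound annual growth rate relation; as a check, applying it to the 2020-2024 figures reproduces the cited rate:

```latex
\mathrm{CAGR} = \left(\frac{V_{\text{end}}}{V_{\text{start}}}\right)^{1/n} - 1
             = \left(\frac{13.71}{7.1}\right)^{1/4} - 1 \approx 0.179
```

That is, roughly 17.9% per year over the four-year span, in line with the "around 17%" figure; the same relation applied to the 2025-2030 forecast, (33.17/16.59)^(1/5) - 1, yields the stated 14.86%.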

Societal Impacts

Benefits in Convenience, Accessibility, and Productivity

Smart speakers enable hands-free control of connected devices such as lights, thermostats, and appliances through voice commands, allowing users to perform tasks without physical interaction or visual interfaces, which enhances daily convenience. This capability supports multitasking, as individuals can issue commands while engaged in other activities, such as cooking or exercising, thereby reducing the time required for routine operations compared to manual controls. Empirical surveys indicate that users consistently report greater perceived time savings from voice assistants than non-users, with convenience emerging as a primary adoption driver ahead of other factors like technological trends.

For accessibility, smart speakers provide voice-based interfaces that assist individuals with visual impairments, mobility limitations, or cognitive challenges by enabling independent task execution without reliance on screens or fine motor skills. Among older adults, long-term integration of these devices has been associated with improved quality of life through simplified communication and environmental control, particularly for those living alone or in care settings. Studies in care environments demonstrate that smart speakers reduce staff workload by automating routine assistance, such as reminders for medication or scheduling, while fostering resident independence. For the elderly and disabled, this translates to decreased dependency on aides for basic functions, with community implementations showing potential to alleviate isolation via interactive features.

In terms of productivity, smart speakers facilitate efficient task management by integrating with calendars, setting timers, and providing instant access to information such as weather or news, which streamlines workflows and minimizes disruptions from device switching. Users leverage these tools for accuracy in time-sensitive activities, such as recipe guidance during cooking or quick calculations, contributing to functional gains in daily routines. Research on AI assistants, including those embedded in smart speakers, reveals heightened user perceptions of productivity and time affluence, with one 2018 experiment linking their use to elevated well-being through perceived efficiency improvements. In professional or caregiving contexts, they handle non-complex queries to offload cognitive burdens, allowing focus on higher-value tasks, though benefits accrue most reliably from habitual integration rather than sporadic use.

Criticisms Including Dependency, Surveillance, and Cultural Shifts

Smart speakers have drawn criticism for fostering user dependency, as prolonged reliance on voice-activated devices for routine tasks may erode cognitive skills such as memory retention and independent problem-solving. A 2019 Ofcom study of UK users found that some participants expressed concern over becoming overly dependent on smart speakers for information and home control, potentially diminishing personal initiative in daily activities. Similarly, research on AI assistants, including smart speakers, indicates that over-reliance can impair critical thinking and decision-making processes, with users offloading mental effort to devices, leading to reduced analytical abilities over time.

Surveillance risks stem from the always-on microphones inherent to smart speaker design, which continuously listen for wake words but can inadvertently capture and transmit unintended audio to cloud servers. A 2019 study analyzing user attitudes revealed that smart speaker users perceive heightened risks from persistent audio monitoring in private home environments, with recordings often processed by third-party contractors for quality improvement purposes. Amazon's Alexa devices, for instance, faced a $25 million FTC penalty in 2023 for violating children's privacy laws through improper retention and sharing of voice recordings, highlighting failures in safeguards. Independent testing in 2020 confirmed instances where devices like the Amazon Echo and Google Home continued listening beyond activation, exposing users to unauthorized audio collection. These vulnerabilities extend to hacking potential, as always-listening interfaces enable eavesdropping if compromised, though manufacturers claim rapid microphone muting upon error detection.

Cultural shifts induced by smart speakers include altered interpersonal dynamics, particularly among children, where interaction with non-human entities may hinder social and emotional development. A 2022 analysis linked frequent use of voice assistants like Alexa to stunted communication skills in children, as device conversations substitute for human exchanges, potentially reducing empathy-building opportunities. Longitudinal studies of household usage, such as a Syracuse University examination from 2020, noted parental worries that children's device dialogues could impair real-world social interactions, fostering isolation or unnatural relational patterns. Broader societal effects encompass the normalization of parasocial bonds with AI, where users anthropomorphize speakers, blurring boundaries between technology and authentic relationships, as explored in a 2025 scoping review. Critics argue this promotes a passive consumption culture, diminishing active engagement in learning or interpersonal communication.
