Recent from talks
Contribute something
Nothing was collected or created yet.
Voice over IP
View on Wikipedia
Voice over Internet Protocol (VoIP),[a] also known as IP telephony, is a set of technologies used primarily for voice communication sessions over Internet Protocol (IP) networks, such as the Internet.[2] VoIP enables voice calls to be transmitted as data packets, facilitating various methods of voice communication, including traditional applications like Skype, Microsoft Teams, Google Voice, and VoIP phones. Regular telephones can also be used for VoIP by connecting them to the Internet via analog telephone adapters (ATAs), which convert traditional telephone signals into digital data packets that can be transmitted over IP networks.
The broader terms Internet telephony, broadband telephony, and broadband phone service specifically refer to the delivery of voice and other communication services, such as fax, SMS, and voice messaging, over the Internet, in contrast to the traditional public switched telephone network (PSTN), commonly known as plain old telephone service (POTS).
VoIP technology has evolved to integrate with mobile telephony, including Voice over LTE (VoLTE) and Voice over NR (Vo5G), enabling seamless voice communication over mobile data networks. These advancements have extended VoIP's role beyond its traditional use in Internet-based applications. It has become a key component of modern mobile infrastructure, as 4G and 5G networks rely entirely on this technology for voice transmission.
Overview
[edit]The steps and principles involved in originating VoIP telephone calls are similar to traditional digital telephony and involve signaling, channel setup, digitization of the analog voice signals, and encoding. Instead of being transmitted over a circuit-switched network, the digital information is packetized and transmission occurs as IP packets over a packet-switched network. They transport media streams using special media delivery protocols that encode audio and video with audio codecs and video codecs. Various codecs exist that optimize the media stream based on application requirements and network bandwidth; some implementations rely on narrowband and compressed speech, while others support high-fidelity stereo codecs.
The most widely used speech coding standards in VoIP are based on the linear predictive coding (LPC) and modified discrete cosine transform (MDCT) compression methods. Popular codecs include the MDCT-based AAC-LD (used in FaceTime), the LPC/MDCT-based Opus (used in WhatsApp), the LPC-based SILK (used in Skype), μ-law, A-law versions of G.711, G.722, an open source voice codec known as iLBC, and a codec that uses only 8 kbit/s each way called G.729.
Early providers of voice-over-IP services used business models and offered technical solutions that mirrored the architecture of the legacy telephone network. Second-generation providers, such as Skype, built closed networks for private user bases, offering the benefit of free calls and convenience while potentially charging for access to other communication networks, such as the PSTN. This limited the freedom of users to mix-and-match third-party hardware and software. Third-generation providers, such as Google Talk, adopted the concept of federated VoIP.[3] These solutions typically allow dynamic interconnection between users in any two domains of the Internet, when a user wishes to place a call.
In addition to VoIP phones, VoIP is also available on many personal computers and other Internet access devices. Calls and SMS text messages may be sent via Wi-Fi or the carrier's mobile data network.[4] VoIP provides a framework for consolidation of all modern communications technologies using a single unified communications system.
Integration of VoIP in mobile networks
[edit]VoIP technology has been adapted for use in mobile networks, leading to the development of advanced systems designed to support voice communication over modern data infrastructures. Among these are Voice over LTE (VoLTE) and Voice over 5G (Vo5G), which enable voice communication over IP-based mobile infrastructures. In contrast to traditional VoIP services, which often function independently of global telephone numbering systems, VoLTE and Vo5G are directly connected to mobile operators' infrastructures, providing seamless connectivity to the international telephone network.[5][6]
VoLTE, introduced as part of 4G LTE networks, enables voice communication over an IP-based infrastructure initially developed for data transmission. It offers features such as high-definition voice (HD Voice) and faster call setup times compared to circuit-switched networks.[7]
Vo5G, the 5G equivalent of VoLTE, utilizes the increased speed, reduced latency, and greater capacity of 5G networks to further enhance these capabilities.[8] Both VoLTE and Vo5G maintain compatibility with traditional public switched telephone networks (PSTNs), allowing users to make and receive calls to and from any telephone number worldwide.
These technologies differ from standalone VoIP services by being fully integrated with mobile network operators. This integration ensures additional features such as emergency call support and quality-of-service guarantees, making them a central part of modern mobile telecommunication systems.
Protocols
[edit]Voice over IP has been implemented with proprietary protocols and protocols based on open standards in applications such as VoIP phones, mobile applications, and web-based communications.
A variety of functions are needed to implement VoIP communication. Some protocols perform multiple functions, while others perform only a few and must be used in concert. These functions include:
- Network and transport – Creating reliable transmission over unreliable protocols, which may involve acknowledging receipt of data and retransmitting data that wasn't received.
- Session management – Creating and managing a session (sometimes glossed as simply a "call"), which is a connection between two or more peers that provides a context for further communication.
- Signaling – Performing registration (advertising one's presence and contact information) and discovery (locating someone and obtaining their contact information), dialing (including reporting call progress), negotiating capabilities, and call control (such as hold, mute, transfer/forwarding, dialing DTMF keys during a call [e.g. to interact with an automated attendant or IVR], etc.).
- Media description – Determining what type of media to send (audio, video, etc.), how to encode/decode it, and how to send/receive it (IP addresses, ports, etc.).
- Media – Transferring the actual media in the call, such as audio, video, text messages, files, etc.
- Quality of service – Providing out-of-band content or feedback about the media such as synchronization, statistics, etc.
- Security – Implementing access control, verifying the identity of other participants (computers or people), and encrypting data to protect the privacy and integrity of the media contents and/or the control messages.
VoIP protocols include:
- Matrix, open standard for online chat, voice over IP, and videotelephony
- Session Initiation Protocol (SIP),[9] connection management protocol developed by the IETF
- H.323, one of the first VoIP call signaling and control protocols that found widespread implementation.[10] Since the development of newer, less complex protocols such as MGCP and SIP, H.323 deployments are increasingly limited to carrying existing long-haul network traffic.[11]
- Media Gateway Control Protocol (MGCP), connection management for media gateways
- H.248, control protocol for media gateways across a converged internetwork consisting of the traditional PSTN and modern packet networks
- Real-time Transport Protocol (RTP), transport protocol for real-time audio and video data
- Real-time Transport Control Protocol (RTCP), sister protocol for RTP providing stream statistics and status information
- Secure Real-time Transport Protocol (SRTP), encrypted version of RTP
- Session Description Protocol (SDP), a syntax for session initiation and announcement for multi-media communications and WebSocket transports.
- Inter-Asterisk eXchange (IAX), protocol used between Asterisk PBX instances
- Extensible Messaging and Presence Protocol (XMPP), instant messaging, presence information, and contact list maintenance
- Jingle, for peer-to-peer session control in XMPP
- Skype protocol, proprietary Internet telephony protocol suite based on peer-to-peer architecture
Adoption
[edit]Consumer market
[edit]
Mass-market VoIP services use existing broadband Internet access, by which subscribers place and receive telephone calls in much the same manner as they would via the PSTN. Full-service VoIP phone companies provide inbound and outbound service with direct inbound dialing. Many offer unlimited domestic calling and sometimes international calls for a flat monthly subscription fee. Phone calls between subscribers of the same provider are usually free when flat-fee service is not available.[12]
A VoIP phone is necessary to connect to a VoIP service provider. This can be implemented in several ways:
- Dedicated VoIP phones connect directly to the IP network using technologies such as wired Ethernet or Wi-Fi. These are typically designed in the style of traditional digital business telephones.
- An analog telephone adapter connects to the network and implements the electronics and firmware to operate a conventional analog telephone attached through a modular phone jack. Some residential Internet gateways and cable modems have this function built in.
- Softphone application software installed on a networked computer that is equipped with a microphone and speaker, or headset. The application typically presents a dial pad and display field to the user to operate the application by mouse clicks or keyboard input.[13]
PSTN and mobile network providers
[edit]It is increasingly common for telecommunications providers to use VoIP telephony over dedicated and public IP networks as a backhaul to connect switching centers and to interconnect with other telephony network providers; this is often referred to as IP backhaul.[14][15]
Smartphones may have SIP clients built into the firmware or available as an application download.[16][17]
Corporate use
[edit]Because of the bandwidth efficiency and low costs that VoIP technology can provide, businesses are migrating from traditional copper-wire telephone systems to VoIP systems to reduce their monthly phone costs. In 2008, 80% of all new Private branch exchange (PBX) lines installed internationally were VoIP.[18] For example, in the United States, the Social Security Administration is converting its field offices of 63,000 workers from traditional phone installations to a VoIP infrastructure carried over its existing data network.[19][20]
VoIP allows both voice and data communications to be run over a single network, which can significantly reduce infrastructure costs. The prices of extensions on VoIP are lower than for PBX and key systems. VoIP switches may run on commodity hardware, such as personal computers. Rather than closed architectures, these devices rely on standard interfaces.[21] VoIP devices have simple, intuitive user interfaces, so users can often make simple system configuration changes. Dual-mode phones enable users to continue their conversations as they move between an outside cellular service and an internal Wi-Fi network, so that it is no longer necessary to carry both a desktop phone and a cell phone. Maintenance becomes simpler as there are fewer devices to oversee.[21]
VoIP solutions aimed at businesses have evolved into unified communications services that treat all communications—phone calls, faxes, voice mail, e-mail, web conferences, and more—as discrete units that can all be delivered via any means and to any handset, including cellphones. Two kinds of service providers are operating in this space: one set is focused on VoIP for medium to large enterprises, while another is targeting the small-to-medium business (SMB) market.[22]
Skype, which originally marketed itself as a service among friends, began to cater to businesses in 2009, providing free-of-charge connections between any users on the Skype network and connecting to and from ordinary PSTN telephones for a charge.[23]
Delivery mechanisms
[edit]In general, the provision of VoIP telephony systems to organizational or individual users can be divided into two primary delivery methods: private or on-premises solutions, or externally hosted solutions delivered by third-party providers. On-premises delivery methods are more akin to the classic PBX deployment model for connecting an office to local PSTN networks.
While many use cases still remain for private or on-premises VoIP systems, the wider market has been gradually shifting toward Cloud or Hosted VoIP solutions. Hosted systems are also generally better suited to smaller or personal use VoIP deployments, where a private system may not be viable for these scenarios.
Hosted VoIP systems
[edit]Hosted or Cloud VoIP solutions involve a service provider or telecommunications carrier hosting the telephone system as a software solution within their own infrastructure.
Typically this will be one or more data centers with geographic relevance to the end-user(s) of the system. This infrastructure is external to the user of the system and is deployed and maintained by the service provider.
Endpoints, such as VoIP telephones or softphone applications (apps running on a computer or mobile device), will connect to the VoIP service remotely. These connections typically take place over public internet links, such as local fixed WAN breakout or mobile carrier service.
Private VoIP systems
[edit]
In the case of a private VoIP system, the primary telephony system itself is located within the private infrastructure of the end-user organization. Usually, the system will be deployed on-premises at a site within the direct control of the organization. This can provide numerous benefits in terms of QoS control (see below), cost scalability, and ensuring privacy and security of communications traffic. However, the responsibility for ensuring that the VoIP system remains performant and resilient is predominantly vested in the end-user organization. This is not the case with a Hosted VoIP solution.
Private VoIP systems can be physical hardware PBX appliances, converged with other infrastructure, or they can be deployed as software applications. Generally, the latter two options will be in the form of a separate virtualized appliance. However, in some scenarios, these systems are deployed on bare metal infrastructure or IoT devices. With some solutions, such as 3CX, companies can attempt to blend the benefits of hosted and private on-premises systems by implementing their own private solution but within an external environment. Examples can include data center collocation services, public cloud, or private cloud locations.
For on-premises systems, local endpoints within the same location typically connect directly over the LAN. For remote and external endpoints, available connectivity options mirror those of Hosted or Cloud VoIP solutions.
However, VoIP traffic to and from the on-premises systems can often also be sent over secure private links. Examples include personal VPN, site-to-site VPN, private networks such as MPLS and SD-WAN, or via private SBCs (Session Border Controllers). While exceptions and private peering options do exist, it is generally uncommon for those private connectivity methods to be provided by Hosted or Cloud VoIP providers.
Quality of service
[edit]Communication on the IP network is perceived as less reliable in contrast to the circuit-switched public telephone network because it does not provide a network-based mechanism to ensure that data packets are not lost, and are delivered in sequential order. It is a best-effort network without fundamental quality of service (QoS) guarantees. Voice, and all other data, travels in packets over IP networks with fixed maximum capacity. This system may be more prone to data loss in the presence of congestion[b] than traditional circuit switched systems; a circuit switched system of insufficient capacity will refuse new connections while carrying the remainder without impairment, while the quality of real-time data such as telephone conversations on packet-switched networks degrades dramatically.[25] Therefore, VoIP implementations may face problems with latency, packet loss, and jitter.[25][26]
By default, network routers handle traffic on a first-come, first-served basis. Fixed delays cannot be controlled as they are caused by the physical distance the packets travel. They are especially problematic when satellite circuits are involved because of the long distance to a geostationary satellite and back; delays of 400–600 ms are typical. Latency can be minimized by marking voice packets as being delay-sensitive with QoS methods such as DiffServ.[25]
Network routers on high volume traffic links may introduce latency that exceeds permissible thresholds for VoIP. Excessive load on a link can cause congestion and associated queueing delays and packet loss. This signals a transport protocol like TCP to reduce its transmission rate to alleviate the congestion. But VoIP usually uses UDP not TCP because recovering from congestion through retransmission usually entails too much latency.[25] So QoS mechanisms can avoid the undesirable loss of VoIP packets by immediately transmitting them ahead of any queued bulk traffic on the same link, even when the link is congested by bulk traffic.
VoIP endpoints usually have to wait for the completion of transmission of previous packets before new data may be sent. Although it is possible to preempt (abort) a less important packet in mid-transmission, this is not commonly done, especially on high-speed links where transmission times are short even for maximum-sized packets.[27] An alternative to preemption on slower links, such as dialup and digital subscriber line (DSL), is to reduce the maximum transmission time by reducing the maximum transmission unit. But since every packet must contain protocol headers, this increases relative header overhead on every link traversed.[27]
The receiver must resequence IP packets that arrive out of order and recover gracefully when packets arrive too late or not at all. Packet delay variation results from changes in queuing delay along a given network path due to competition from other users for the same transmission links. VoIP receivers accommodate this variation by storing incoming packets briefly in a playout buffer, deliberately increasing latency to improve the chance that each packet will be on hand when it is time for the voice engine to play it. The added delay is thus a compromise between excessive latency and excessive dropout, i.e. momentary audio interruptions.
Although jitter is a random variable, it is the sum of several other random variables that are at least somewhat independent: the individual queuing delays of the routers along the Internet path in question. Motivated by the central limit theorem, jitter can be modeled as a Gaussian random variable. This suggests continually estimating the mean delay and its standard deviation and setting the playout delay so that only packets delayed more than several standard deviations above the mean will arrive too late to be useful. In practice, the variance in latency of many Internet paths is dominated by a small number (often one) of relatively slow and congested bottleneck links. Most Internet backbone links are now so fast (e.g. 10 Gbit/s) that their delays are dominated by the transmission medium (e.g. optical fiber) and the routers driving them do not have enough buffering for queuing delays to be significant.[28]
A number of protocols have been defined to support the reporting of quality of service (QoS) and quality of experience (QoE) for VoIP calls. These include RTP Control Protocol (RTCP) extended reports,[29] SIP RTCP summary reports, H.460.9 Annex B (for H.323), H.248.30 and MGCP extensions.
The RTCP extended report VoIP metrics block specified by RFC 3611 is generated by an VoIP phone or gateway during a live call and contains information on packet loss rate, packet discard rate (because of jitter), packet loss/discard burst metrics (burst length/density, gap length/density), network delay, end system delay, signal/noise/echo level, mean opinion scores (MOS) and R factors and configuration information related to the jitter buffer. VoIP metrics reports are exchanged between IP endpoints on an occasional basis during a call, and an end of call message sent via SIP RTCP summary report or one of the other signaling protocol extensions. VoIP metrics reports are intended to support real-time feedback related to QoS problems, the exchange of information between the endpoints for improved call quality calculation and a variety of other applications.
DSL and ATM
[edit]DSL modems typically provide Ethernet connections to local equipment, but inside they may actually be Asynchronous Transfer Mode (ATM) modems.[c] They use ATM Adaptation Layer 5 (AAL5) to segment each Ethernet packet into a series of 53-byte ATM cells for transmission, reassembling them back into Ethernet frames at the receiving end.
Using a separate virtual circuit identifier (VCI) for voice over IP has the potential to reduce latency on shared connections. ATM's potential for latency reduction is greatest on slow links because worst-case latency decreases with increasing link speed. A full-size (1500 byte) Ethernet frame takes 94 ms to transmit at 128 kbit/s but only 8 ms at 1.5 Mbit/s. If this is the bottleneck link, this latency is probably small enough to ensure good VoIP performance without MTU reductions or multiple ATM VCs. The latest generations of DSL, VDSL and VDSL2, carry Ethernet without intermediate ATM/AAL5 layers, and they generally support IEEE 802.1p priority tagging so that VoIP can be queued ahead of less time-critical traffic.[25]
ATM has substantial header overhead: 5/53 = 9.4%, roughly twice the total header overhead of a 1500 byte Ethernet frame. This "ATM tax" is incurred by every DSL user whether or not they take advantage of multiple virtual circuits – and few can.[25]
Layer 2
[edit]Several protocols are used in the data link layer and physical layer for quality-of-service mechanisms that help VoIP applications work well even in the presence of network congestion. Some examples include:
- IEEE 802.11e is an approved amendment to the IEEE 802.11 standard that defines a set of quality-of-service enhancements for wireless LAN applications through modifications to the media access control (MAC) layer. The standard is considered of critical importance for delay-sensitive applications, such as voice over wireless IP.
- IEEE 802.1p defines 8 different classes of service (including one dedicated to voice) for traffic on layer-2 wired Ethernet.
- The ITU-T G.hn standard, which provides a way to create a high-speed (up to 1 gigabit per second) Local area network (LAN) using existing home wiring (power lines, phone lines and coaxial cables). G.hn provides QoS by means of Contention-Free Transmission Opportunities (CFTXOPs) which are allocated to flows (such as a VoIP call) that require QoS and which have negotiated a contract with the network controllers
Performance metrics
[edit]The quality of voice transmission is characterized by several metrics that may be monitored by network elements and by the user agent hardware or software. Such metrics include network packet loss, packet jitter, packet latency (delay), post-dial delay, and echo. The metrics are determined by VoIP performance testing and monitoring.[30][31][32][33][34][35]
PSTN integration
[edit]This section needs additional citations for verification. (November 2019) |
A VoIP media gateway controller (aka Class 5 Softswitch) works in cooperation with a media gateway (aka IP Business Gateway) and connects the digital media stream, so as to complete the path for voice and data. Gateways include interfaces for connecting to standard PSTN networks. Ethernet interfaces are also included in the modern systems which are specially designed to link calls that are passed via VoIP.[36]
E.164 is a global numbering standard for both the PSTN and public land mobile network (PLMN). Most VoIP implementations support E.164 to allow calls to be routed to and from VoIP subscribers and the PSTN/PLMN.[37] VoIP implementations can also allow other identification techniques to be used. For example, Skype allows subscribers to choose Skype names (usernames)[38] whereas SIP implementations can use Uniform Resource Identifier (URIs) similar to email addresses.[39] Often VoIP implementations employ methods of translating non-E.164 identifiers to E.164 numbers and vice versa, such as the Skype-In service provided by Skype[40] and the E.164 number to URI mapping (ENUM) service in IMS and SIP.[41]
Echo can also be an issue for PSTN integration.[42] Common causes of echo include impedance mismatches in analog circuitry and an acoustic path from the receive to transmit signal at the receiving end.
Number portability
[edit]Local number portability (LNP) and mobile number portability (MNP) also impact VoIP business. Number portability is a service that allows a subscriber to select a new telephone carrier without requiring a new number to be issued. Typically, it is the responsibility of the former carrier to "map" the old number to the undisclosed number assigned by the new carrier. This is achieved by maintaining a database of numbers. A dialed number is initially received by the original carrier and quickly rerouted to the new carrier. Multiple porting references must be maintained even if the subscriber returns to the original carrier. The Federal Communications Commission (FCC) mandates carrier compliance with these consumer-protection stipulations. In November 2007, the FCC in the United States released an order extending number portability obligations to interconnected VoIP providers and carriers that support VoIP providers.[43]
A voice call originating in the VoIP environment also faces least-cost routing (LCR) challenges to reach its destination if the number is routed to a mobile phone number on a traditional mobile carrier. LCR is based on checking the destination of each telephone call as it is made, and then sending the call via the network that will cost the customer the least. This rating is subject to some debate given the complexity of call routing created by number portability. With MNP in place, LCR providers can no longer rely on using the network root prefix to determine how to route a call. Instead, they must now determine the actual network of every number before routing the call.[44]
Therefore, VoIP solutions also need to handle MNP when routing a voice call. In countries without a central database, like the UK, it may be necessary to query the mobile network about which home network a mobile phone number belongs to. As the popularity of VoIP increases in the enterprise markets because of LCR options, VoIP needs to provide a certain level of reliability when handling calls.
Emergency calls
[edit]A telephone connected to a land line has a direct relationship between a telephone number and a physical location, which is maintained by the telephone company and available to emergency responders via the national emergency response service centers in form of emergency subscriber lists. When an emergency call is received by a center the location is automatically determined from its databases and displayed on the operator console.
In IP telephony, no such direct link between location and communications end point exists. Even a provider having wired infrastructure, such as a DSL provider, may know only the approximate location of the device, based on the IP address allocated to the network router and the known service address. Some ISPs do not track the automatic assignment of IP addresses to customer equipment.[45]
IP communication provides for device mobility. For example, a residential broadband connection may be used as a link to a virtual private network of a corporate entity, in which case the IP address being used for customer communications may belong to the enterprise, not the residential ISP. Such off-premises extensions may appear as part of an upstream IP PBX. On mobile devices, e.g., a 3G handset or USB wireless broadband adapter, the IP address has no relationship with any physical location known to the telephony service provider, since a mobile user could be anywhere in a region with network coverage, even roaming via another cellular company.
At the VoIP level, a phone or gateway may identify itself by its account credentials with a Session Initiation Protocol (SIP) registrar. In such cases, the Internet telephony service provider (ITSP) knows only that a particular user's equipment is active. Service providers often provide emergency response services by agreement with the user who registers a physical location and agrees that, if an emergency number is called from the IP device, emergency services are provided to that address only.
Such emergency services are provided by VoIP vendors in the United States by a system called Enhanced 911 (E911), based on the Wireless Communications and Public Safety Act. The VoIP E911 emergency-calling system associates a physical address with the calling party's telephone number. All VoIP providers that provide access to the public switched telephone network are required to implement E911, a service for which the subscriber may be charged. "VoIP providers may not allow customers to opt-out of 911 service."[45] The VoIP E911 system is based on a static table lookup. Unlike in cellular phones, where the location of an E911 call can be traced using assisted GPS or other methods, the VoIP E911 information is accurate only if subscribers keep their emergency address information current.[46]
Fax support
[edit]Sending faxes over VoIP networks is sometimes referred to as Fax over IP (FoIP). Transmission of fax documents was problematic in early VoIP implementations, as most voice digitization and compression codecs are optimized for the representation of the human voice and the proper timing of the modem signals cannot be guaranteed in a packet-based, connectionless network.
A standards-based solution for reliably delivering fax-over-IP is the T.38 protocol. The T.38 protocol is designed to compensate for the differences between traditional packet-less communications over analog lines and packet-based transmissions which are the basis for IP communications. The fax machine may be a standard device connected to an analog telephone adapter (ATA), or it may be a software application or dedicated network device operating via an Ethernet interface.[47] Originally, T.38 was designed to use UDP or TCP transmission methods across an IP network.
Some newer high-end fax machines have built-in T.38 capabilities which are connected directly to a network switch or router. In T.38 each packet contains a portion of the data stream sent in the previous packet. Two successive packets have to be lost to actually lose data integrity.
Power requirements
[edit]Telephones for traditional residential analog service are usually connected directly to telephone company phone lines which provide direct current to power most basic analog handsets independently of locally available electrical power. The susceptibility of phone service to power failures is a common problem even with traditional analog service where customers purchase telephone units that operate with wireless handsets to a base station, or that have other modern phone features, such as built-in voicemail or phone book features.
VoIP phones and VoIP telephone adapters connect to routers or cable modems which typically depend on the availability of mains electricity or locally generated power.[48] Some VoIP service providers use customer premises equipment (e.g., cable modems) with battery-backed power supplies to assure uninterrupted service for up to several hours in case of local power failures. Such battery-backed devices typically are designed for use with analog handsets. Some VoIP service providers implement services to route calls to other telephone services of the subscriber, such a cellular phone, in the event that the customer's network device is inaccessible to terminate the call.
Security
[edit]Secure calls are possible using standardized protocols such as Secure Real-time Transport Protocol. Most of the facilities of creating a secure telephone connection over traditional phone lines, such as digitizing and digital transmission, are already in place with VoIP. It is necessary only to encrypt and authenticate the existing data stream. Automated software, such as a virtual PBX, may eliminate the need for personnel to greet and switch incoming calls.
The security concerns for VoIP telephone systems are similar to those of other Internet-connected devices. This means that hackers with knowledge of VoIP vulnerabilities can perform denial-of-service attacks, harvest customer data, record conversations, and compromise voicemail messages. Compromised VoIP user account or session credentials may enable an attacker to incur substantial charges from third-party services, such as long-distance or international calling.
The technical details of many VoIP protocols create challenges in routing VoIP traffic through firewalls and network address translators, used to interconnect to transit networks or the Internet. Private session border controllers are often employed to enable VoIP calls to and from protected networks. Other methods to traverse NAT devices involve assistive protocols such as STUN and Interactive Connectivity Establishment (ICE).
Standards for securing VoIP are available in the Secure Real-time Transport Protocol (SRTP) and the ZRTP protocol for analog telephony adapters, as well as for some softphones. IPsec is available to secure point-to-point VoIP at the transport level by using opportunistic encryption. Though many consumer VoIP solutions do not support encryption of the signaling path or the media, securing a VoIP phone is conceptually easier to implement using VoIP than on traditional telephone circuits. A result of the lack of widespread support for encryption is that it is relatively easy to eavesdrop on VoIP calls when access to the data network is possible.[49] Free open-source solutions, such as Wireshark, facilitate capturing VoIP conversations.
Government and military organizations use various security measures to protect VoIP traffic, such as voice over secure IP (VoSIP), secure voice over IP (SVoIP), and secure voice over secure IP (SVoSIP).[50] The distinction lies in whether encryption is applied in the telephone endpoint or in the network.[51] Secure voice over secure IP may be implemented by encrypting the media with protocols such as SRTP and ZRTP. Secure voice over IP uses Type 1 encryption on a classified network, such as SIPRNet.[52][53][54][55] Public Secure VoIP is also available with free GNU software and in many popular commercial VoIP programs via libraries, such as ZRTP.[56]
In June 2021, the National Security Agency (NSA) released comprehensive documents describing the four attack planes of a communications system – the network, perimeter, session controllers and endpoints – and explaining security risks and mitigation techniques for each of them.[57][58]
Caller ID
[edit]Voice over IP protocols and equipment provide caller ID support that is compatible with the PSTN. Many VoIP service providers also allow callers to configure custom caller ID information.[59]
Hearing aid compatibility
[edit]Wireline telephones which are manufactured in, imported to, or intended to be used in the US with Voice over IP service, on or after February 28, 2020, are required to meet the hearing aid compatibility requirements set forth by the Federal Communications Commission.[60]
Operational cost
[edit]VoIP has drastically reduced the cost of communication by sharing network infrastructure between data and voice.[61][62] A single broadband connection has the ability to transmit multiple telephone calls.
Regulatory and legal issues
[edit]This section needs to be updated. (April 2022) |
As the popularity of VoIP grows, governments are becoming more interested in regulating VoIP in a manner similar to PSTN services.[63]
Throughout the developing world, particularly in countries where regulation is weak or captured by the dominant operator, restrictions on the use of VoIP are often imposed, including in Panama where VoIP is taxed, Guyana where VoIP is prohibited.[64] In Ethiopia, where the government is nationalizing telecommunication service, it is a criminal offense to offer services using VoIP. The country has installed firewalls to prevent international calls from being made using VoIP. These measures were taken after the popularity of VoIP reduced the income generated by the state-owned telecommunications company.[citation needed][65]
Canada
[edit]In Canada, the Canadian Radio-television and Telecommunications Commission regulates telephone service, including VoIP telephony service. VoIP services operating in Canada are required to provide 9-1-1 emergency service.[66]
European Union
[edit]In the European Union, the treatment of VoIP service providers is a decision for each national telecommunications regulator, which must use competition law to define relevant national markets and then determine whether any service provider on those national markets has "significant market power" (and so should be subject to certain obligations). A general distinction is usually made between VoIP services that function over managed networks (via broadband connections) and VoIP services that function over unmanaged networks (essentially, the Internet).[citation needed]
The relevant EU Directive is not clearly drafted concerning obligations that can exist independently of market power (e.g., the obligation to offer access to emergency calls), and it is impossible to say definitively whether VoIP service providers of either type are bound by them.[citation needed][67]
Oman
[edit]In Oman, it is illegal to provide or use unauthorized VoIP services, to the extent that web sites of unlicensed VoIP providers have been blocked.[citation needed] Violations may be punished with fines of 50,000 Omani Rial (about 130,317 US dollars), a two-year prison sentence or both. In 2009, police raided 121 Internet cafes throughout the country and arrested 212 people for using or providing VoIP services.[68]
Saudi Arabia
[edit]In September 2017, Saudi Arabia lifted the ban on VoIPs, in an attempt to reduce operational costs and spur digital entrepreneurship.[69][70]
United Arab Emirates
[edit]In the United Arab Emirates (UAE), it is illegal to provide or use unauthorized VoIP services. Web sites of unlicensed VoIP providers have been blocked. Some VoIP services such as Skype were allowed.[71] In January 2018, internet service providers in UAE blocked all VoIP apps, including Skype, but permitting only 2 government-approved VoIP apps (C’ME and BOTIM).[72][73] In opposition, a petition on Change.org garnered over 5000 signatures, in response to which the website was blocked in UAE.[74]
On March 24, 2020, the United Arab Emirates loosened restriction on VoIP services earlier prohibited in the country, to ease communication during the COVID-19 pandemic. However, popular instant messaging applications such as WhatsApp, Skype, and FaceTime remained blocked from being used for voice and video calls, constricting residents to use paid services from the country's state-owned telecom providers.[75]
India
[edit]In India, it is legal to use VoIP, but it is illegal to have VoIP gateways inside India.[76] This effectively means that people who have PCs can use them to make a VoIP call to other computers but not to a normal phone number. Foreign-based VoIP server services are illegal to use in India.[76]
Internet telephony is permitted to the ISP with restrictions. The following services are permitted:[77]
- PC to PC; within or outside India
- PC / a device / Adapter conforming to the standard of any international agencies like- ITU or IETF etc. in India to PSTN/PLMN abroad.
- Any device / Adapter conforming to standards of International agencies like ITU, IETF etc. connected to ISP node with static IP address to similar device / Adapter; within or outside India.
- Except whatever is described in condition (ii) above[clarification needed], no other form of Internet Telephony is permitted.
- In India no Separate Numbering Scheme is provided to the Internet Telephony. Presently the 10 digit Numbering allocation based on E.164 is permitted to the Fixed Telephony, GSM, CDMA wireless service. For Internet Telephony, the numbering scheme shall only conform to IP addressing Scheme of Internet Assigned Numbers Authority (IANA). Translation of E.164 number / private number to IP address allotted to any device and vice versa, by ISP to show compliance with IANA numbering scheme is not permitted.
- The Internet Service Licensee is not permitted to have PSTN/PLMN connectivity. Voice communication to and from a telephone connected to PSTN/PLMN and following E.164 numbering is prohibited in India.
South Korea
[edit]In South Korea, only providers registered with the government are authorized to offer VoIP services. Unlike many VoIP providers, most of whom offer flat rates, Korean VoIP services are generally metered and charged at rates similar to terrestrial calling. Foreign VoIP providers encounter high barriers to government registration. This issue came to a head in 2006 when Internet service providers providing personal Internet services by contract to United States Forces Korea (USFK) members residing on USFK bases threatened to block off access to VoIP services used by USFK members as an economical way to keep in contact with their families in the United States, on the grounds that the service members' VoIP providers were not registered. A compromise was reached between USFK and Korean telecommunications officials in January 2007, wherein USFK service members arriving in Korea before June 1, 2007, and subscribing to the ISP services provided on base could continue to use their US-based VoIP subscription, but later arrivals are required to use a Korean-based VoIP provider, which by contract will offer pricing similar to the flat rates offered by US VoIP providers.[78]
United States
[edit]In the United States, the FCC requires all interconnected VoIP service providers to comply with requirements comparable to those for traditional telecommunications service providers.[79] VoIP operators in the US are required to support local number portability; make service accessible to people with disabilities; pay regulatory fees, universal service contributions, and other mandated payments; and enable law enforcement authorities to conduct surveillance pursuant to the Communications Assistance for Law Enforcement Act (CALEA).
Operators of Interconnected VoIP (fully connected to the PSTN) are mandated to provide Enhanced 911 service without special request, provide for customer location updates, clearly disclose any limitations on their E-911 functionality to their consumers, obtain affirmative acknowledgements of these disclosures from all consumers,[80] and may not allow their customers to opt-out of 911 service.[81] VoIP operators also receive the benefit of certain US telecommunications regulations, including an entitlement to interconnection and exchange of traffic with incumbent local exchange carriers via wholesale carriers. Providers of nomadic VoIP service—those who are unable to determine the location of their users—are exempt from state telecommunications regulation.[82]
Another legal issue that the US Congress is debating concerns changes to the Foreign Intelligence Surveillance Act. The issue in question is calls between Americans and foreigners. The NSA is not authorized to tap Americans' conversations without a warrant—but the Internet, and specifically VoIP does not draw as clear a line to the location of a caller or a call's recipient as the traditional phone system does. As VoIP's low cost and flexibility convinces more and more organizations to adopt the technology, surveillance for law enforcement agencies becomes more difficult. VoIP technology has also increased federal security concerns because VoIP and similar technologies have made it more difficult for the government to determine where a target is physically located when communications are being intercepted, and that creates a whole set of new legal challenges.[83]
History
[edit]The early developments of packet network designs by Paul Baran and other researchers were motivated by a desire for a higher degree of circuit redundancy and network availability in the face of infrastructure failures than was possible in the circuit-switched networks in telecommunications of the mid-twentieth century. Danny Cohen first demonstrated a form of packet voice in 1973 which was developed into Network Voice Protocol which operated across the early ARPANET.[84][85]
On the early ARPANET, real-time voice communication was not possible with uncompressed pulse-code modulation (PCM) digital speech packets, which had a bit rate of 64 kbps, much greater than the 2.4 kbps bandwidth of early modems. The solution to this problem was linear predictive coding (LPC), a speech coding data compression algorithm that was first proposed by Fumitada Itakura of Nagoya University and Shuzo Saito of Nippon Telegraph and Telephone (NTT) in 1966. LPC was capable of speech compression down to 2.4 kbps, leading to the first successful real-time conversation over ARPANET in 1974, between Culler-Harrison Incorporated in Goleta, California, and MIT Lincoln Laboratory in Lexington, Massachusetts.[86] LPC has since been the most widely used speech coding method.[87] Code-excited linear prediction (CELP), a type of LPC algorithm, was developed by Manfred R. Schroeder and Bishnu S. Atal in 1985.[88] LPC algorithms remain an audio coding standard in modern VoIP technology.[86]
In the two decades following the 1974 demo, various forms of packet telephony were developed and industry interest groups formed to support the new technologies. Following the termination of the ARPANET project, and expansion of the Internet for commercial traffic, IP telephony was tested and deemed infeasible for commercial use until the introduction of VocalChat in the early 1990s and then in Feb 1995 the official release of Internet Phone (or iPhone for short) commercial software by VocalTec, based on a patent by Lior Haramaty and Alon Cohen,[89] and followed by other VoIP infrastructure components such as telephony gateways and switching servers. Soon after it became an established area of interest in commercial labs of the major IT concerns, notably at AT&T, where Marian Croak and her team filed many patents related to the technology.[citation needed] By the late 1990s, the first softswitches became available, and new protocols, such as H.323, MGCP and Session Initiation Protocol (SIP) gained widespread attention. In the early 2000s, the proliferation of high-bandwidth always-on Internet connections to residential dwellings and businesses, spawned an industry of Internet telephony service providers (ITSPs). The development of open-source telephony software, such as Asterisk PBX, fueled widespread interest and entrepreneurship in voice-over-IP services, applying new Internet technology paradigms, such as cloud services to telephony.
Milestones
[edit]- 1966: Linear predictive coding (LPC) proposed by Fumitada Itakura of Nagoya University and Shuzo Saito of Nippon Telegraph and Telephone (NTT).[86]
- 1973: Packet voice application by Danny Cohen.
- 1974: The Institute of Electrical and Electronics Engineers (IEEE) publishes a paper entitled "A Protocol for Packet Network Interconnection".[90]
- 1974: Network Voice Protocol (NVP) tested over ARPANET in August 1974, carrying barely intelligible 16 kpbs CVSD encoded voice.[86]
- 1974: The first successful real-time conversation over ARPANET achieved using 2.4 kpbs LPC, between Culler-Harrison Incorporated in Goleta, California, and MIT Lincoln Laboratory in Lexington, Massachusetts.[86]
- 1977: Danny Cohen and Jon Postel of the USC Information Sciences Institute, and Vint Cerf of the Defense Advanced Research Projects Agency (DARPA), agree to separate IP from TCP, and create UDP for carrying real-time traffic.
- 1981: IPv4 is described in RFC 791.
- 1985: The National Science Foundation commissions the creation of NSFNET.[91]
- 1985: Code-excited linear prediction (CELP), a type of LPC algorithm, developed by Manfred R. Schroeder and Bishnu S. Atal.[88]
- 1986: Proposals from various standards organizations[specify] for Voice over ATM, in addition to commercial packet voice products from companies such as StrataCom
- 1991: Speak Freely, a voice-over-IP application, was released to the public domain.[92][93]
- 1992: The Frame Relay Forum conducts development of standards for voice over Frame Relay.
- 1992: InSoft Inc. announces and launches its desktop conferencing product Communique, which includes VoIP and video.[92][94] The company is credited with developing the first generation of commercial, US-based VoIP, Internet media streaming and real-time Internet telephony/collaborative software and standards that would provide the basis for the Real Time Streaming Protocol (RTSP) standard.[citation needed]
- 1993 Release of VocalChat, a commercial packet network PC voice communication software from VocalTec.[citation needed]
- 1994: MTALK, a freeware LAN VoIP application for Linux[95]
- 1995:
- 1996:
- ITU-T begins development of standards for the transmission and signaling of voice communications over Internet Protocol networks with the H.323 standard.[99]
- US telecommunications companies petition the US Congress to ban Internet phone technology.[100]
- G.729 speech codec introduced, using CELP (LPC) algorithm.[101]
- 1997: Level 3 began development of its first softswitch, a term they coined in 1998.[102]
- 1999:
- The Session Initiation Protocol (SIP) specification RFC 2543 is released.[103]
- Mark Spencer of Digium develops Asterisk, the first open source private branch exchange (PBX) software.[104]
- A discrete cosine transform (DCT) variant called the modified discrete cosine transform (MDCT) is adopted for the Siren codec, used in the G.722.1 wideband audio coding standard.[105][106]
- The MDCT is adapted into the LD-MDCT algorithm, used in the AAC-LD standard.[107]
- 2001: INOC-DBA, the first inter-provider SIP network is deployed; this is also the first voice network to reach all seven continents.[108]
- 2003: Skype released in August 2003. This was the creation of Niklas Zennström and Janus Friis, in cooperation with four Estonian developers. It quickly became a popular program that helped democratize VoIP.
- 2004: Early commercial VoIP service providers proliferate.[citation needed]
- 2005: PhoneGnome VoIP service is launched by TelEvolution, Inc. of California.[109]
- 2006: G.729.1 wideband codec introduced, using MDCT and CELP (LPC) algorithms.[110]
- 2007: VoIP device manufacturers and sellers boom in Asia, specifically in the Philippines where many families of overseas workers reside.[111]
- 2009: SILK codec introduced, using LPC algorithm,[112] and used for voice calling in Skype.[113]
- 2010: Apple introduces FaceTime, which uses the LD-MDCT-based AAC-LD codec.[114]
- 2011:
- 2012: Opus codec introduced, using MDCT and LPC algorithms.[116]
See also
[edit]- Audio over IP
- Call-through telecom
- Comparison of audio network protocols
- Comparison of VoIP software
- Differentiated services
- High Bit Rate Media Transport
- Integrated services
- Internet fax
- IP Multimedia Subsystem
- List of VoIP companies
- Mobile VoIP
- RTP payload formats
- SIP trunking
- UNIStim
- Voice over LTE
- VoiceXML
- VoIP VPN
- VoIP recording
Notes
[edit]References
[edit]- ^ "VoIP". Cambridge Dictionaries Online.
- ^ Arora, Rakesh (November 23, 1999). "Voice Over IP : Protocols and Standards". cse.wustl.edu. Archived from the original on January 19, 2024. Retrieved March 19, 2024.
Voice over IP (VOIP) uses the Internet Protocol (IP) to transmit voice as packets over an IP network. ...in the Internet, anyone can capture the packets meant for someone else. Some security can be provided by using encryption and tunneling. The common tunneling protocol used is Layer 2 Tunneling Protocol and the common encryption mechanism used is Secure Sockets Layer (SSL).
- ^ "XMPP Federation". Google Talkabout. 2006. Retrieved May 11, 2012.
- ^ Booth, C (2010). "Chapter 2: IP Phones, Software VoIP, and Integrated and Mobile VoIP". Library Technology Reports. 46 (5): 11–19.
- ^ Wright, Arol (October 12, 2023). "What Is Vo5G (VoNR) and How Does It Compare to VoLTE?". MUO. Retrieved November 30, 2024.
- ^ "Communication services (VoLTE/VoNR)". www.3gpp.org. Retrieved November 30, 2024.
- ^ "What is Voice Over LTE (VoLTE)? Definition from WhatIs.com". Search Networking. Retrieved November 30, 2024.
- ^ "What Is 5G VoNR | Unveiling the Future of Wireless Communication". www.voicenter.com. Retrieved November 30, 2024.
- ^ Montazerolghaem, Ahmadreza; Moghaddam, Mohammad Hossein Yaghmaee; Leon-Garcia, Alberto (March 2018). "OpenSIP: Toward Software-Defined SIP Networking". IEEE Transactions on Network and Service Management. 15 (1): 184–199. arXiv:1709.01320. Bibcode:2018ITNSM..15..184M. doi:10.1109/TNSM.2017.2741258. ISSN 1932-4537. S2CID 3873601.
- ^ "H.323 and SIP Integration". Retrieved January 24, 2020.
- ^ Omar, Ahmed. "Voice OVER IP (VOIP)".
- ^ "Voice Over Internet Protocol (VoIP)". Federal Communications Commission. November 18, 2010. Archived from the original on October 20, 2023. Retrieved July 19, 2022.
- ^ "VoIP (v77)" (PDF).
- ^ "WIRELESS: Carriers look to IP for backhaul". www.eetimes.com. EE Times. Archived from the original on August 9, 2011. Retrieved April 8, 2015.
- ^ "Mobile's IP challenge". www.totaltele.com. Total Telecom Online. Archived from the original on February 17, 2006. Retrieved April 8, 2015.
- ^ "Android SIP Client". Retrieved January 30, 2018.
- ^ "Learn to Make Free or Inexpensive Calls Using SIP on Android". Retrieved January 30, 2018.
- ^ Michael Dosch and Steve Church. "VoIP in the Broadcast Studio". Axia Audio. Archived from the original on October 7, 2011. Retrieved June 21, 2011.
- ^ Jackson, William (May 27, 2009). "SSA goes big on VOIP". Government Computer News. Archived from the original on July 28, 2011. Retrieved May 28, 2009.
- ^ "Social Security to Build "World's Largest VOIP"". Government Technology. Archived from the original on June 2, 2009. Retrieved May 29, 2009.
- ^ a b Korzeniowski, Peter (January 8, 2009). "Three Technologies You Need In 2009". Forbes. Retrieved March 2, 2009.
- ^ Callahan, Renee (December 9, 2008). "Businesses Move To Voice-Over-IP". Forbes. Retrieved March 3, 2009.
- ^ "Skype For Business". skype.com. Retrieved March 16, 2009.
- ^ "VoIP - Vulnerability over Internet Protocol?". www.continuitycentral.com.
- ^ a b c d e f "Quality of Service for Voice over IP". Retrieved May 3, 2011.
- ^ Prabhakar, G.; Rastogi, R.; Thotton, M (2005). "OSS Architecture & Requirements for VoIP Networks". Bell Labs Technical Journal. 10 (1): 31–45. doi:10.1002/bltj.20077. ISSN 1089-7089. S2CID 12336090.
- ^ a b "Quality of Service for Voice over IP". Retrieved May 3, 2011.
- ^ "Optical Packet Buffers for Backbone Internet Routers | Request PDF".
- ^ Caceres, Ramon. RTP Control Protocol Extended Reports (RTCP XR). doi:10.17487/RFC3611. RFC 3611.
- ^ CableLabs, PacketCable Residential SIP Telephony Feature Definition, Technical Report, PKT-TR-RST-V03-071106 (2007)
- ^ "VoIP performance measurement using QoS parameters" (PDF). A.H.Muhamad Amin. August 14, 2016.
- ^ "Methodology for SIP Infrastructure Performance Testing" (PDF). Miroslav Voznak, Jan Rozhon. August 14, 2016.
- ^ "Voice over IP (VoIP) Performance Evaluation on VMware vSphere® 5" (PDF). VMware. August 14, 2016. Archived from the original (PDF) on April 23, 2023. Retrieved August 13, 2016.
- ^ "Performance and Stress Testing of SIP Servers, Clients and IP Networks". StarTrinity. August 13, 2016.
- ^ "Testing Voice over IP (VolP) Networks" (PDF). IXIA. August 14, 2016.
- ^ "Importance of Softswitch VoIP Technology". ixc.ua. May 20, 2011. Archived from the original on November 11, 2012. Retrieved October 4, 2012.
{{cite web}}: CS1 maint: bot: original URL status unknown (link) - ^ Peterson, Jon; Liu, Hong; Campbell, Ben; Yu, James (June 1, 2004). "RFC 3824– Using E.164 numbers with the Session Initiation Protocol (SIP)". The Internet Society. Retrieved January 21, 2009.
- ^ "Create a Skype Name". Skype. Retrieved January 21, 2009.
- ^ Camarillo, Gonzalo (December 1, 2004). "RFC 3969– The Internet Assigned Number Authority (IANA) Uniform Resource Identifier (URI) Parameter Registry for the Session Initiation Protocol (SIP)". The Internet Society. Retrieved January 21, 2009.
- ^ "Your personal online number". Skype. Retrieved January 21, 2009.
- ^ "Application-level Network Interoperability and the Evolution of IMS". TMCnet.com. May 24, 2006. Retrieved January 21, 2009.
- ^ Jeff Riddel (2007). Packetcable Implementation. Cisco Press. p. 557. ISBN 978-1-58705-181-4.
- ^ "Keeping your telephone number when you change your service provider". FCC. Archived from the original on December 12, 2009. Retrieved January 20, 2009.
- ^ "TelePassport takes the sting out of MNP". ITWeb. November 13, 2006. Archived from the original on July 19, 2022. Retrieved July 19, 2022.
- ^ a b "FCC Consumer Advisory VoIP and 911 Service" (PDF). FCC. Archived from the original (PDF) on August 10, 2010. Retrieved May 2, 2011.
- ^ Noworatzky, Daniel (February 6, 2019). "Keep your VoIP E911 service in compliance with this checklist". TeleDynamics. Retrieved July 19, 2022.
- ^ "Faxing over IP networks". Soft-Switch.org. Archived from the original on December 18, 2023.
- ^ "4.4 VOIP – Regulatory Issues – Universal Service". ICT Regulation Tool Kit. Archived from the original on June 4, 2009. Retrieved September 21, 2017.
- ^ Thermos, Peter (April 5, 2006). "Examining Two Well-Known Attacks on VoIP". CircleID. Retrieved April 5, 2006.
- ^ "Internet Protocol Telephony & Voice over Internet Protocol Security Technical Implementation Guide Version 2, Release 2" (PDF). DISA. April 21, 2006. Archived from the original (PDF) on August 25, 2009.
- ^ "Secure Voice over IP (SVoIP) vs. Voice over Secure IP (VoSIP) Installations" (PDF). General Dynamics C4 Systems. Archived from the original (PDF) on September 24, 2015.
- ^ Dunte, Markus; Ruland, Christoph (June 2007). "Secure Voice-over-IP" (PDF). International Journal of Computer Science and Network Security. 7 (6): 63–68. Archived (PDF) from the original on April 19, 2023.
- ^ Stringfellow, Brian (August 15, 2001). "Secure Voice Over IP". SANS Institute. Archived from the original on June 1, 2023.
- ^ White, C.M.; Teague, K.A.; Daniel, E.J. (November 7–10, 2004). "Packet loss concealment in a secure voice over IP environment" (PDF). Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004. Vol. 1. pp. 415–419. CiteSeerX 10.1.1.219.633. doi:10.1109/ACSSC.2004.1399165. ISBN 978-0-7803-8622-8. S2CID 402760. Archived from the original (PDF) on May 17, 2006. Retrieved June 12, 2009.
- ^ "Cellcrypt secure VOIP heading to BlackBerry". Networkworld.com. Archived from the original on April 24, 2009. Retrieved June 12, 2009.
- ^ "Secure VOIP calling, free software, and the right to privacy". Free Software Magazine.
- ^ "NSA Releases Guidance on Securing Unified Communications and Voice and Video over IP Systems". National Security Agency/Central Security Service. Retrieved September 26, 2022.
- ^ "Deploying Secure Unified Communications/Voice and Video over IP Systems" (PDF). media.defense.gov. Retrieved September 27, 2023.
- ^ VOIPSA.org, Blog: "Hello Mom, I'm a Fake!" (Telespoof and Fakecaller).
- ^ "Hearing Aid Compatibility for Wireline and Wireless Telephones". Federal Communications Commission. October 30, 2014. Retrieved July 9, 2019.
- ^ FCC.gov, What are some advantages of VoIP?
- ^ "network infrastructure: getting started with VoIP" (PDF). Archived from the original (PDF) on September 17, 2011.
- ^ "Global VOIP Policy Status Matrix". Global IP Alliance. 2005. Retrieved November 23, 2006.
- ^ Proenza, Francisco J. "The Road to Broadband Development in Developing Countries is through Competition Driven by Wireless and VOIP" (PDF). Archived from the original (PDF) on June 2, 2012. Retrieved April 7, 2008.
- ^ "VOICE OVER INTERNET PROTOCOL".
- ^ "Telecom Decision CRTC 2005-21". Canadian Radio-television and Telecommunications Commission. Government of Canada. April 4, 2005. Retrieved April 29, 2017.
- ^ "Voice over IP".
- ^ Metz, Cade. "Oman cuffs 212 for selling VoIP calls". The Register. Retrieved September 20, 2016.
- ^ "Saudi Arabia to lift ban on internet calls". BBC News. September 20, 2017. Retrieved January 10, 2018.
- ^ "Saudi Arabia to lift ban on internet calls". Reuters. September 20, 2017. Retrieved January 10, 2018.
- ^ "Don't worry, Skype is working in UAE". Khaleejtimes. June 26, 2017. Retrieved January 11, 2018.
- ^ Debusmann, Bernd Jr. (January 9, 2018). "Etisalat launches new unlimited calling plan with VoIP apps". Arabian Business. Retrieved January 9, 2018.
- ^ Maceda, Cleofe (January 8, 2018). "No Skype? Pay Dh50 monthly for video calls". Gulf News. Retrieved January 9, 2018.
- ^ Zacharias, Anna \ (January 8, 2018). "Etisalat launches new calling app plan days after Skype disruptions". The National. Retrieved January 9, 2018.
- ^ "UAE loosens some VoIP restrictions as residents in lockdown call for end to WhatsApp and Skype ban". CNBC. March 26, 2020. Retrieved March 26, 2020.
- ^ a b Mahanagar Doorsanchar Bhawan and Jawahar Lal Nehru Marg (May 2008). "Telecom Regulatory Authority of India (TRAI) Consultation paper on Issues related to Internet Telephony. Consultation Paper No. 11/2008" (PDF). New Delhi India: Telecom Regulatory Authority of India (TRAI). p. 16 (Section 2.2.1.2 PC–to–Phone Internet telephony). Archived from the original (PDF) on October 6, 2014. Retrieved September 19, 2012.
An end-user is allowed to make PC–to-Phone Internet Telephony calls only on PSTN/PLMN abroad.
- ^ Harish Kumar Gangwar Technical Note on Illegal International Long Distance telephone Exchange in India
- ^ Stars and Stripes: USFK deal keeps VoIP access for troops, archived from the original on January 13, 2010
- ^ Pershing, Genny. "Cybertelecom :: VoIP :: FCC". www.cybertelecom.org. Retrieved September 21, 2017.
- ^ GPO.gov Archived June 8, 2010, at the Wayback Machine, 47 C.F.R. pt. 9 (2007)
- ^ "VoIP and 911 Service". FCC. May 26, 2011. Retrieved August 16, 2014.
- ^ "Voice Over Internet Protocol (VoIP)". November 18, 2010. Retrieved September 21, 2017.
- ^ Greenberg, Andy (May 15, 2008). "The State Of Cybersecurity Wiretapping's Fuzzy Future". Forbes. Retrieved March 2, 2009.
- ^ "Danny Cohen". INTERNET HALL of FAME. Retrieved December 6, 2014.
- ^ Advanced Content Delivery, Streaming, and Cloud Services (Pg 34). Willey. September 19, 2014. ISBN 9781118909706. Retrieved December 6, 2014.
- ^ a b c d e Gray, Robert M. (2010). "A History of Realtime Digital Speech on Packet Networks: Part II of Linear Predictive Coding and the Internet Protocol" (PDF). Found. Trends Signal Process. 3 (4): 203–303. doi:10.1561/2000000036. ISSN 1932-8346.
- ^ Gupta, Shipra (May 2016). "Application of MFCC in Text Independent Speaker Recognition" (PDF). International Journal of Advanced Research in Computer Science and Software Engineering. 6 (5): 805–810 (806). ISSN 2277-128X. S2CID 212485331. Archived from the original (PDF) on October 18, 2019. Retrieved October 18, 2019.
- ^ a b M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 10, pp. 937–940, 1985.
- ^ Audio Transceiver
- ^ Cerf, V.; Kahn, R. (May 1974). "A Protocol for Packet Network Intercommunication" (PDF). IEEE Transactions on Communications. 22 (5): 637–648. doi:10.1109/TCOM.1974.1092259.
- ^ "The Launch of NSFNET". The National Science Foundation. Archived from the original on May 7, 2006. Retrieved January 21, 2009.
- ^ a b Dua, Amit (July 29, 2021). "VoIP Basics: Everything Beginners Should Know!". business2community.com. Business 2 Community. Retrieved September 14, 2021.
- ^ McCraw, Corey (October 12, 2022). "The History of VoIP Over the Last 55 Years (1966 to 2021)". fitsmallbusiness.com. Fits Small Business.
- ^ IDG Network World Inc; Eckerson, Wayne (September 21, 1992). "Network World - Startup targets desktop Videoconferencing arena". Network World. IDG Network World Inc: 39–. ISSN 0887-7661. Retrieved February 10, 2012.
- ^ "MTALK-Readme" (TXT). Sunsite.edu. Retrieved April 29, 2012.
- ^ Keating, Tom. "Internet Phone Release 4" (PDF). Computer Telephony Interaction Magazine. Retrieved November 7, 2007.
- ^ "The 10 that Established VOIP (Part 1: VocalTec)". iLocus. Retrieved January 21, 2009.
- ^ The free Library RADVision and Intel Target Compatibility Between RADVision's H.323/320 Videoconferencing Gateway And Intel's Business Video Conferencing And TeamStation Products. Archived October 30, 2013, at the Wayback Machine June 2, 1997 VoiP Developer Solutions Archived June 16, 2011, at the Wayback Machine
- ^ "H.323 Visual telephone systems and equipment for local area networks which provide a non-guaranteed quality of service". ITU-T. Retrieved January 21, 2009.
- ^ "RFC 2235". R. Zakon. Retrieved January 21, 2009.
- ^ International Telecommunication Union, Standardization Sector (ITU-T), Study Group 15 (1993-1996), Recommendation G.729, March 1996.
- ^ "The 10 that Established VOIP (Part 2: Level 3)". iLocus. July 13, 2007. Retrieved November 7, 2007.
- ^ "RFC 2543, SIP: Session Initiation Protocol". Handley, Schulzrinne, Schooler, Rosenberg. Retrieved January 21, 2009.
- ^ "What is Asterisk". Asterisk.org. Archived from the original on January 23, 2009. Retrieved January 21, 2009.
- ^ Hersent, Olivier; Petit, Jean-Pierre; Gurle, David (2005). Beyond VoIP Protocols: Understanding Voice Technology and Networking Techniques for IP Telephony. John Wiley & Sons. p. 55. ISBN 9780470023631.
- ^ Lutzky, Manfred; Schuller, Gerald; Gayer, Marc; Krämer, Ulrich; Wabnik, Stefan (May 2004). A guideline to audio codec delay (PDF). 116th AES Convention. Fraunhofer IIS. Audio Engineering Society. Retrieved October 24, 2019.
- ^ Schnell, Markus; Schmidt, Markus; Jander, Manuel; Albert, Tobias; Geiger, Ralf; Ruoppila, Vesa; Ekstrand, Per; Bernhard, Grill (October 2008). MPEG-4 Enhanced Low Delay AAC - A New Standard for High Quality Communication (PDF). 125th AES Convention. Fraunhofer IIS. Audio Engineering Society. Retrieved October 20, 2019.
- ^ Stapleton-Gray, Ross (2009). Inter-Network Operations Center Dial-by-ASN (INOC-DBA), a Resource for the Network Operator Community. Los Alamitos: IEEE Computer Society Press. ISBN 978-0-7695-3568-5.
- ^ Pogue, David (August 2, 2007). "State of the Art: Get Your Free Net Phone Calls Here". The New York Times. Retrieved January 20, 2009.
- ^ Nagireddi, Sivannarayana (2008). VoIP Voice and Fax Signal Processing. John Wiley & Sons. p. 69. ISBN 9780470377864.
- ^ Remo, Michelle V. (August 27, 2007). "Prospects bright for voice calls over internet". Philippine Daily Inquirer. Retrieved January 1, 2015.
- ^ Audio-Mitschnitt Archived February 10, 2013, at the Wayback Machine vom Treffen der IETF-Codec-Arbeitsgruppe auf der Konferenz IETF79 in Peking, China mit einer Darstellung der grundlegenden Funktionsprinzipien durch Koen Vos (MP3, ~70 MiB)
- ^ "Skype's new super wideband codec". Wirevolution.com. January 13, 2009. Retrieved March 31, 2009.
- ^ Daniel Eran Dilger (June 8, 2010). "Inside iPhone 4: FaceTime video calling". AppleInsider. Retrieved June 9, 2010.
- ^ Presentation of the CELT codec Archived August 7, 2011, at the Wayback Machine by Timothy B. Terriberry (65 minutes of video, see also presentation slides Archived August 10, 2011, at the Wayback Machine in PDF)
- ^ Valin, Jean-Marc; Maxwell, Gregory; Terriberry, Timothy B.; Vos, Koen (October 2013). High-Quality, Low-Delay Music Coding in the Opus Codec. 135th AES Convention. Audio Engineering Society. arXiv:1602.04845.
External links
[edit]
The dictionary definition of VoIP at Wiktionary
Internet telephony travel guide from Wikivoyage
Voice over IP
View on GrokipediaFundamentals
Definition and Core Principles
Voice over Internet Protocol (VoIP) is a technology that enables the transmission of voice communications as digital data packets over packet-switched IP networks, such as the Internet, rather than dedicated analog or circuit-switched telephone lines.[9] [1] This approach leverages broadband connections to convert analog voice signals into digital format, allowing for efficient multiplexing of multiple calls on shared network resources.[10] At its core, VoIP operates by sampling analog audio from a microphone at rates typically between 8 kHz and 48 kHz, quantizing the samples, and encoding them using codecs such as G.711 or G.729 to compress the data for transmission.[4] These encoded payloads are then packetized into Real-time Transport Protocol (RTP) packets, encapsulated in UDP/IP datagrams, and routed independently across the network to the destination.[10] Upon arrival, the packets are reordered, decoded, and converted back to analog signals for playback, with jitter buffers mitigating variations in packet arrival times to ensure smooth audio reproduction.[11] Unlike traditional telephony, which employs circuit switching to establish a fixed, end-to-end path reserving bandwidth for the call's duration—resulting in underutilized resources during silence periods—VoIP utilizes packet switching, where voice data is fragmented into variable-length packets that share bandwidth dynamically and may traverse different routes.[12] [13] This principle enables higher network efficiency and scalability but introduces challenges like latency, packet loss, and jitter, necessitating quality-of-service mechanisms for real-time performance.[14] Standards from bodies such as ITU-T, including H.323 for multimedia signaling over packet networks, underpin interoperable VoIP implementations.[6]Comparison to Traditional Telephony
Traditional telephony, primarily the Public Switched Telephone Network (PSTN), relies on circuit switching, establishing a dedicated end-to-end path for the duration of a call, ensuring consistent bandwidth allocation regardless of network load.[15] In contrast, Voice over IP (VoIP) employs packet switching, digitizing voice into data packets transmitted over shared IP networks, which optimizes bandwidth usage but introduces variability in transmission paths.[16] This fundamental difference means PSTN provides predictable latency and minimal jitter inherent to its fixed-circuit design, while VoIP call quality can degrade due to network congestion, with acceptable thresholds typically below 150 ms for one-way latency and 30 ms for jitter to maintain intelligible audio.[17] VoIP systems generally incur lower operational costs than PSTN, with per-user monthly fees ranging from $15 to $40, encompassing features like unlimited long-distance calling that traditional setups charge separately for, alongside reduced need for dedicated copper wiring and hardware.[18][19] Deployment of VoIP leverages existing internet infrastructure, minimizing physical cabling expenses, whereas PSTN requires extensive analog or digital line installations that escalate with scale.[20] However, VoIP's dependency on stable broadband introduces reliability risks absent in PSTN; traditional lines often function during power outages via line-powered handsets, but VoIP fails without electricity for endpoints or internet access, potentially disrupting service entirely.[21][22] In terms of features and scalability, VoIP enables advanced integrations such as video conferencing, call routing based on presence, and mobility across devices without location constraints, capabilities limited in PSTN's analog framework.[9] PSTN offers superior inherent security through physical isolation, with fewer vulnerabilities to interception or denial-of-service attacks compared to VoIP's exposure to IP-based threats like eavesdropping or spoofing.[23][24] Emergency services present another divergence: PSTN reliably routes 911 calls with automatic location via fixed lines, while interconnected VoIP may require manual address registration and can fail to transmit precise location data during outages.[25]| Aspect | PSTN (Traditional Telephony) | VoIP |
|---|---|---|
| Switching Method | Circuit-switched: Dedicated path | Packet-switched: Shared IP packets |
| Cost Structure | Higher per-line fees, wiring expenses | Lower monthly rates ($15-40/user), scalable |
| Reliability | Operates in power outages, consistent QoS | Internet/power dependent, prone to jitter |
| Features | Basic voice, limited scalability | Advanced (video, mobility), integrable |
| Security | Physically secure, low cyber risk | Vulnerable to network attacks |
Technical Protocols and Standards
Signaling and Transport Protocols
Signaling protocols in VoIP systems handle the establishment, modification, maintenance, and termination of sessions, including endpoint registration, location discovery, and capability negotiation. These protocols operate independently of the media streams they control, enabling separation of call control from data transport to support scalability and interoperability across IP networks. The two dominant standards are the Session Initiation Protocol (SIP), developed by the Internet Engineering Task Force (IETF), and H.323, standardized by the International Telecommunication Union (ITU).[26] SIP functions as an application-layer signaling protocol using text-based messages modeled after HTTP, facilitating peer-to-peer communication for multimedia sessions involving voice, video, or other real-time data. Defined initially in RFC 2543 and refined in subsequent updates, SIP employs methods such as INVITE for session initiation, ACK for confirmation, and BYE for termination, often complemented by the Session Description Protocol (SDP) to negotiate media parameters like codecs and ports.[27] Its lightweight, extensible design has made SIP the de facto standard for modern VoIP deployments, particularly in enterprise and carrier environments, due to its compatibility with web technologies and ease of integration with firewalls via UDP or TCP on port 5060.[28] In contrast, H.323 comprises an umbrella suite of ITU-T recommendations originating from 1996, encompassing H.225.0 for call signaling and RAS (Registration, Admission, and Status) for gatekeeper interactions, alongside H.245 for media channel negotiation. This binary-encoded protocol stack was designed for circuit-like multimedia conferencing over packet networks, supporting features like address translation and bandwidth management through a centralized gatekeeper architecture.[29][30] While H.323 enabled early VoIP adoption in legacy systems, its complexity and proprietary elements have led to declining use compared to SIP, though interworking functions exist to bridge the two via gateways compliant with RFC 4123.[31] Other signaling protocols include the Media Gateway Control Protocol (MGCP), outlined in RFC 2705, which centralizes control in a call agent for simpler gateways by decomposing traditional telephony commands into package-based instructions over UDP. MGCP suits decomposed architectures but is less flexible for endpoint-initiated features than SIP.[32] Transport protocols in VoIP primarily manage the delivery of encoded media streams, prioritizing low-latency packetization over reliability, as UDP underpins real-time flows to avoid TCP's retransmission delays. The Real-time Transport Protocol (RTP), standardized in RFC 3550 by the IETF, encapsulates audio or video payloads with headers including sequence numbers for reordering, timestamps for synchronization, and payload type indicators for codec identification, typically running over UDP on even-numbered ports starting from 16384 in many implementations.[33][34] RTP's profile extensions support diverse applications, from narrowband voice to high-definition video, but it lacks built-in congestion control or encryption, necessitating complementary mechanisms.[35] Complementing RTP, the RTP Control Protocol (RTCP) provides out-of-band feedback on transmission quality, including packet loss rates, jitter, and round-trip delay, sent periodically in the same UDP session but on odd-numbered ports adjacent to RTP. RTCP enables adaptive adjustments, such as rate limiting, and extended reports (RTCP XR) per RFC 3611 offer detailed metrics like signal-to-noise ratios for VoIP diagnostics.[36][37] This signaling-transport separation—where protocols like SIP negotiate parameters but RTP/RTCP handle actual media—optimizes VoIP for IP networks by decoupling control from data paths, though it requires quality-of-service provisions to mitigate packet loss in best-effort environments.[38]Audio Codecs and Compression Techniques
In VoIP systems, audio codecs digitize and compress voice signals to enable efficient packet transmission over IP networks, balancing bandwidth efficiency against perceptual quality and latency. Compression exploits speech redundancies, including short-term correlations via linear predictive coding (LPC), which models the vocal tract as an all-pole filter, and long-term pitch periodicity.[39] Techniques range from waveform coding, which directly quantizes time-domain samples, to source modeling of speech production parameters, and hybrid approaches that integrate both for optimal rate-distortion performance in real-time constraints. The ITU-T G.711 codec employs uncompressed pulse code modulation (PCM), sampling speech at 8 kHz with 8-bit logarithmic quantization to yield a fixed 64 kbps bit rate, supporting narrowband frequencies (300-3400 Hz) for toll-quality reproduction.[40] It features two variants—μ-law for North American systems and A-law for international use—incurring negligible algorithmic delay beyond sampling (125 μs per frame), which minimizes end-to-end latency in circuit-like VoIP deployments.[41] Compressed codecs address bandwidth limitations in packet-switched networks by reducing data rates through perceptual coding, discarding inaudible components and quantizing perceptually relevant features. G.729, standardized by ITU-T in 1996, achieves 8 kbps using conjugate-structure algebraic code-excited linear prediction (CS-ACELP), a hybrid method where LPC coefficients represent the spectral envelope, and an algebraic codebook searches for optimal excitation vectors to synthesize speech frames every 10 ms with 5 ms lookahead.[42] This CELP-based technique halves bandwidth versus G.711 but introduces 15 ms total delay and vulnerability to packet loss, yielding mean opinion scores (MOS) around 3.9 for clean channels, below toll quality (MOS >4.0).[43] Advanced compression in VoIP favors adaptive, low-complexity algorithms resilient to jitter and loss. Opus, defined in IETF RFC 6716 (2012), supports variable bit rates from 6 to 510 kbps across narrowband to fullband (up to 20 kHz), switching between SILK (LPC-based for speech) and CELT (MDCT-based for music-like audio) modes with 2.5-60 ms frames and under 30 ms delay.[44] It incorporates error concealment via packet loss hiding and dynamic switching, achieving MOS scores exceeding 4.3 in wideband modes at 24-32 kbps, surpassing G.729 in efficiency for modern applications like WebRTC.[45] Other techniques include adaptive differential PCM (ADPCM) in G.726/G.722 for wideband extension (50-7000 Hz) at 32-64 kbps with MOS >4.2, and internet low-bitrate codec (iLBC) at 13.3 or 15.2 kbps using frame-based LPC with built-in redundancy for 20-30 ms loss tolerance.[46] Codec selection hinges on causal trade-offs: higher compression lowers bandwidth (e.g., from 64 kbps to 8 kbps) but elevates CPU demands and risks quality degradation from quantization noise or modeling errors under variable network conditions.[47]| Codec | Bitrate (kbps) | Bandwidth | Core Technique | Approx. MOS (clean channel) |
|---|---|---|---|---|
| G.711 | 64 | Narrow | PCM | 4.1-4.2 [45] |
| G.729 | 8 | Narrow | CS-ACELP (CELP hybrid) | 3.9 [43] |
| Opus | 6-510 (typ. 12-40 for voice) | Narrow to Full | SILK/CELT hybrid | 4.0-4.5+ [45] |
| G.722 | 48-64 | Wide | SB-ADPCM | 4.2+ [48] |
System Architectures and Delivery
Hosted and Cloud-Based VoIP Systems
Hosted VoIP systems, also referred to as hosted PBX or virtual PBX, enable businesses to conduct voice communications over the internet without maintaining on-site telephony hardware, with the provider managing call routing, switching, and features from remote data centers.[49][50] These systems leverage broadband connections to transmit digitized voice packets, integrating with endpoints such as IP desk phones, softphone applications on computers or mobiles, and unified communications platforms for voice, video, and messaging.[51] Adoption accelerated in the mid-2000s alongside widespread broadband availability and software-as-a-service models, shifting from traditional circuit-switched networks to packet-switched IP infrastructure for cost efficiency and flexibility.[52] Cloud-based VoIP represents an evolution or synonymous implementation of hosted systems, emphasizing elastic scalability through public or hybrid cloud environments like those from AWS or Azure, where resources dynamically adjust to demand without fixed hardware investments.[53][54] Key features include auto-scaling for adding extensions, pay-per-use pricing, API integrations for CRM and collaboration tools, and advanced analytics for call monitoring, often bundled with security protocols like SRTP for encryption and failover redundancy.[55][56] Providers such as RingCentral, 8x8, and Vonage dominate segments of the market, with North America holding approximately 36.8% global share in 2025 due to high internet penetration and enterprise demand.[57][58] Advantages encompass reduced capital expenditures—eliminating PBX hardware costs estimated at $20,000–$100,000 for mid-sized firms—and operational savings of up to 50% on long-distance calls via internet routing, alongside rapid deployment in days rather than weeks.[59][18] Enhanced mobility supports remote work, with users accessing extensions from any location with internet, contributing to a projected global VoIP services market growth from $132.2 billion in 2024 to $349.1 billion by 2034 at a 10.2% CAGR.[60] However, dependency on internet quality introduces risks: latency above 150 ms or jitter exceeding 30 ms can degrade call clarity, and outages render systems inoperable without provider SLAs guaranteeing 99.99% uptime.[55][61] Security vulnerabilities, such as DDoS attacks on provider infrastructure, necessitate robust measures, though empirical data shows cloud VoIP breach rates comparable to on-premise when properly configured.[56]Private and On-Premise VoIP Deployments
Private and on-premise VoIP deployments involve installing private branch exchange (PBX) systems on local hardware within an organization's internal network, enabling voice communications without reliance on external cloud providers.[62] These systems typically use Session Initiation Protocol (SIP) for signaling and support internal calls over local area networks (LANs), with SIP trunks connecting to public switched telephone networks (PSTN) for external communications.[63] Common implementations include open-source solutions like Asterisk, which powers customizable PBX setups on commodity hardware, and proprietary systems from vendors such as Cisco and Avaya.[64] [65] Asterisk-based systems, often paired with graphical interfaces like FreePBX, allow enterprises to deploy features including call routing, voicemail, and conferencing on dedicated servers or appliances like the Grandstream UCM series.[65] Cisco systems emphasize integration with unified communications platforms, supporting IP phones and gateways for hybrid environments.[66] Advantages of on-premise deployments include greater control over hardware and software configurations, enabling tailored customization and reduced dependency on internet bandwidth for intra-site calls.[63] They offer enhanced data sovereignty and compliance for regulated industries, as voice traffic remains isolated on private networks.[67] Security benefits arise from physical access controls and network segmentation, mitigating risks like eavesdropping compared to internet-exposed cloud services; recommended practices include firewalls, VPNs for remote access, and regular firmware updates.[68] [24] Challenges encompass high initial capital expenditures for servers, phones, and setup, alongside ongoing maintenance requiring in-house IT expertise.[69] Scalability demands hardware upgrades, unlike cloud models, and power outages can disrupt service without redundant infrastructure.[69] Despite these, enterprises in sectors like finance and manufacturing favor on-premise VoIP for stable, high-volume internal communications, such as call centers handling proprietary data.[67]Integration with Mobile Networks and 5G
The integration of Voice over IP (VoIP) with mobile networks relies on the IP Multimedia Subsystem (IMS), a 3GPP-defined architectural framework that enables multimedia services, including voice, over packet-switched domains rather than traditional circuit-switched voice channels.[70] IMS handles signaling via Session Initiation Protocol (SIP) and supports interoperability between fixed and mobile VoIP, facilitating handover and quality assurance across access networks.[71] In 4G LTE networks, VoIP manifests as Voice over LTE (VoLTE), which supplants circuit-switched fallback by routing voice traffic entirely over the evolved packet core (EPC) using IMS for call control and media transport.[70] VoLTE deployments began commercially around 2012, with global subscriptions reaching approximately 6.3 billion by the end of 2024, representing a shift from legacy 2G/3G voice as operators decommission circuit-switched infrastructure.[72] This integration improves spectral efficiency and enables advanced codecs like Adaptive Multi-Rate Wideband (AMR-WB) for higher audio quality, though it requires device certification and network provisioning for IMS registration.[70] With 5G New Radio (NR), VoIP evolves to Voice over NR (VoNR), standardized in 3GPP Release 15 and enhanced in subsequent releases, delivering voice services natively over the 5G core (5GC) and radio access network (RAN) while leveraging IMS for end-to-end control.[70] In standalone (SA) 5G deployments, VoNR supports ultra-low latency below 20 ms end-to-end and enhanced voice services (EVS) codec for super-wideband audio up to 20 kHz, surpassing VoLTE capabilities.[73] Non-standalone (NSA) configurations often fallback to VoLTE via EPS interworking until full SA coverage matures, with global VoLTE/VoNR adoption projected to exceed 70% of mobile connections by 2030.[74] Key enablers include 5G's enhanced QoS frameworks, such as 5QI (5G QoS Identifier) profiles tailored for conversational voice (e.g., 5QI=1 for guaranteed bit rate), ensuring prioritized packet handling and minimal jitter.[75] Integration challenges persist in hybrid environments, including seamless mobility between 5G, LTE, and Wi-Fi via IP flow mobility, and regulatory mandates for emergency calling support.[71] Operators like Verizon and AT&T initiated VoNR trials in 2020, with commercial rollout accelerating post-2023 as 5G SA networks expand.[70]Quality of Service and Performance
Measurement Metrics
The quality of Voice over IP (VoIP) communications is quantified through a combination of objective network performance indicators and subjective perceptual assessments, enabling systematic evaluation of audio fidelity, reliability, and user experience. Objective metrics focus on transport-layer impairments such as packet delay, variability, and loss, while subjective metrics aggregate human listener judgments to correlate network conditions with perceived quality. These metrics are standardized primarily by the International Telecommunication Union (ITU) and inform service level agreements (SLAs) in commercial deployments.[76] Latency, or end-to-end delay, measures the time required for voice packets to traverse the network, including encoding, transmission, and decoding phases; excessive latency (>150 ms one-way) introduces noticeable talker overlap or echo, degrading conversational flow. The ITU-T G.114 recommendation specifies that delays below 150 ms support satisfactory real-time voice interactions, with thresholds tightening to under 100 ms for optimal toll-quality equivalence.[77] Jitter, the variation in packet arrival intervals, disrupts smooth playback and requires buffering to compensate, typically targeting values below 30 ms after jitter buffer application to minimize audio artifacts like choppiness. Packet loss, expressed as a percentage of transmitted packets not received, directly causes audible gaps or distortions; VoIP systems tolerate less than 1% loss for acceptable quality, as higher rates exceed human auditory thresholds for discontinuity.[76] Subjective quality is often captured via the Mean Opinion Score (MOS), a scale from 1 (poor) to 5 (excellent) derived from listener ratings of speech naturalness and intelligibility under ITU-T P.800 methodologies. MOS scores above 4.0 indicate toll-quality equivalence to public switched telephone network (PSTN) calls, while objective predictors like the ITU-T P.862 Perceptual Evaluation of Speech Quality (PESQ) algorithm map network impairments to estimated MOS values for automated testing. The R-factor, computed via the ITU-T G.107 E-model, integrates multiple factors (delay, loss, codec performance) into a transmission rating score from 0 to 100, where values exceeding 90 correlate with MOS >4.0.[78]| Metric | Acceptable Threshold | Impact if Exceeded |
|---|---|---|
| Latency | <150 ms (one-way) | Echo, talker overlap, reduced interactivity |
| Jitter | <30 ms (post-buffering) | Choppiness, buffering delays |
| Packet Loss | <1% | Audible gaps, distortion |
| MOS | >4.0 | Perceived degradation from toll quality |
| R-Factor | >90 | Overall transmission impairment |
