G.729 is a royalty-free[1] narrow-band vocoder-based audio data compression algorithm using a frame length of 10 milliseconds. It is officially described as Coding of speech at 8 kbit/s using code-excited linear prediction speech coding (CS-ACELP).[2] The wide-band extension of G.729 is called G.729.1, which equals G.729 Annex J.

Because of its low bandwidth requirements, G.729 is mostly used in voice over Internet Protocol (VoIP) applications when bandwidth must be conserved, such as for conference calls. Standard G.729 operates at a bit rate of 8 kbit/s, but extensions provide rates of 6.4 kbit/s (Annex D, F, H, I, C+) and 11.8 kbit/s (Annex E, G, H, I, C+) for worse and better speech quality, respectively.

G.729 has been extended with various features, commonly designated as G.729a and G.729b:

  • G.729: This is the original codec using a high-complexity algorithm.
  • G.729A or A annex: This version has a medium complexity, and is compatible with G.729. It provides a slightly lower voice quality.
  • G.729B or B annex: This version extends G.729 with silence suppression, and is not compatible with the previous versions.
  • G.729AB: This version extends G.729A with silence suppression, and is only compatible with G.729B.
  • G.729.1 or J annex: This version extends G.729A and B with scalable variable encoding using hierarchical enhancement layers.

Dual-tone multi-frequency signaling (DTMF), fax transmissions, and high-quality audio cannot be transported reliably with this RFC 4733.

G.729 annexes

Functionality G.729 Annexes [3]
- A B C D E F G H I C+ J
Low complexity X X
Fixed-point X X X X X X X X X X
Floating-point X X
8 kbit/s X X X X X X X X X X X X
6.4 kbit/s X X X X X
11.8 kbit/s X X X X X
variable bit rate,

G.729 Annex A

G.729a is a compatible extension of G.729, but requires less computational power. This lower complexity, however, bears the cost of marginally reduced speech quality.

G.729a was developed by a consortium of organizations: France Telecom, Mitsubishi Electric Corporation, Nippon Telegraph and Telephone Corporation (NTT)

The features of G.729a are:

  • Sampling frequency 8 kHz/16-bit (80 samples for 10 ms frames)
  • Fixed bit rate (8 kbit/s 10 ms frames)
  • Fixed frame size (10 bytes (80 bits) for 10 ms frame)
  • Algorithmic delay is 15 ms per frame, with 5 ms look-ahead delay
  • G.729a is a hybrid speech coder which uses Algebraic Code Excited Linear Prediction (ACELP)
  • The complexity of the algorithm is rated at 15, using a relative scale where G.711 is 1 and G.723.1 is 25.
  • PSQM testing under ideal conditions yields Mean Opinion Scores of 4.04 for G.729a, compared to 4.45 for G.711 (μ-law)
  • PSQM testing under network stress yields Mean Opinion Scores of 3.51 for G.729a, compared to 4.13 for G.711 (μ-law)

Some VoIP phones incorrectly use the description "G729a/8000" in SDP (e.g. this affects some Cisco and Linksys phones). This is incorrect as G729a is an alternative method of encoding the audio, but still generates data decodable by either G729 or G729a - i.e. there is no difference in terms of codec negotiation. Since the SDP RFC allows static payload types to be overridden by the textual rtpmap description this can cause problems calling from these phones to endpoints adherring to the RFC unless the codec is renamed in their settings since they will not recognise 'G729a' as 'G729' without a specific workaround in place for the bug.

G.729 Annex B

G.729 has been extended in Annex B (G.729b) which provides a silence compression method that enables a voice activity detection (VAD) module. It is used to detect voice activity in the signal. It also includes a discontinuous transmission (DTX) module which decides on updating the background noise parameters for non speech (noisy frames). It uses 2-byte Silence Insertion Descriptor (SID) frames transmitted to initiate comfort noise generation (CNG). If transmission is stopped, and the link goes quiet because of no speech, the receiving side might assume that the link has been cut. By inserting comfort noise, analog hiss is simulated digitally during silence to assure the receiver that the link is active and operational.

G.729 Annex J (G.729.1)

G.729 Annex J, maintained by G.729.1, provides support for wideband speech and audio. It defines variable bit-rate wideband enhancement using up to 12 hierarchical layers. The core layer is an 8 kbit/s G.729 bitstream, the second layer is a 4 kbit/s narrowband enhancement layer, and the third 2 kibt/s layer is a bandwidth enhancement layer. Further layers provide wideband enhancement in 2 kbit/s steps. The G.729.1 uses three-stage coding: embedded Code-excited linear prediction (CELP) coding of the lower band, parametric coding of the higher band by Time-Domain Bandwidth Extension (TDBWE), and enhancement of the full band by a predictive transform coding by Time-Domain Aliasing Cancellation (TDAC). Bit rate and the obtained quality are adjustable by simple bitstream truncation.

Other Languages
català: G.729
čeština: G.729
Deutsch: G.729
español: G.729
français: G.729
한국어: G.729
italiano: G.729
日本語: G.729
polski: G.729
português: G.729
русский: G.729
українська: G.729
Tiếng Việt: G.729
Yorùbá: G.729
中文: G.729