Amr Codec in Umts

UMTS
Javier Sanchez & Mamadou Tnioune Copyright 02007, ISTE Ltd.
Appendix 1
AMR Codec in UMTS
Originally developed to be used in GSM by the ETSI, the Adaptive Multi-Rate (AMR) speech codec [TS 26.071] was approved within the 3GPP forum in 1999 to be mandatory for circuit- and packet-switched speech in UMTS networks. An AMR speech codec adapts the error protection level to the local radio channel and traffic conditions so that it always selects the optimum channel and codec mode to deliver the best combination of speech quality and system capacity. AMR uses Multi-Rate Algebraic Code Excited Linear Prediction (MR-ACELP) scheme based on two different synthesis filters. It converts a narrowband speech signal (from 300 to 3,400 Hz) to 13-bit uniform Pulse Coded Modulated (PCM) samples with 8 kHz sample rate. This leads to 20 ms AMR frames consisting of 160 encoded speech samples. This means that the codec can switch mode, i.e. source bite rate, every 20 ms. AMR has 8 coded modes in UMTS systems, whereas in GSM AMR uses either 6 or 8 modes. The eight source rates vary from 4.75 to 12.2 kbps. It also contains a low rate encoding mode, called SIlence Descriptor (SID), which operates at 1.8 kbps to produce background noise and a non-transmission mode. The AMR codec dynamically adapts its error protection level to the channel error conditions. For instance, lower speech coding bit rate and more error protection schemes are used in bad channel conditions. This principle is illustrated in Figure A1.1 where AMR strives to change to the best curve associated to a given AMR mode. It has been shown that the degradation on the audio quality caused by a lower speech coding rate is compensated by increased robustness with the channel coding. Note, however, that this channel robustness is more beneficial in GSM than in UMTS due to the embedded fast power control used in WCDMA systems. Using a variable-rate transmission scheme also makes it possible to control the transmission power of the UE, a fact that is particularly useful when the UE
376
UMTS
suddenly attains its maximum transmit power: in CDMA: lower bit rates generally need lower transmit power and vice versa.
The optimum AMR codec mode is dynamically selected as function of the channel quality
Speech quality Good
Poor Good
Channel quality Bad
Figure A1.1. AMR principle
A1.1. AMR frame structure and operating modes Figure A1.2 depicts the generic structure of the AMR frame. As observed in the figure, the frame is divided into a header, auxiliary information and core frame. The header contains the Frame Types and Frame Quality Indicator fields. The Frame Type can indicate the use of one of the eight AMR codec modes for that frame, a noise frame, or an empty frame. The Frame Quality Indicator indicates if the frame is good or bad. The auxiliary information part includes the Mode Indication, Mode Request and Codec CRC fields. The CRC field is used for the purpose of errordetection calculated over all the Class A bits in the AMR Core frame. The Core frame part is used to carry the encoded bits divided into A, B and C classes. In case of a comfort noise frame, comfort noise parameters, i.e. a SID frame, replace class A bits of the core frame while class B and class C bits are omitted.
AMR frame Frame Type (4 bits) Frame Quality Indicator (1 bits) Mode Indication (3 bits) Mode Request (3 bits) Codec CRC bits (8 bits) Class A bits Class B bits Class C bits Core frame (speech or comfort noise) Header
Auxiliary Information (for Mode Adaptation, and Error Detection)
Figure A1.2. Generic structure of the AMR frame
AMR Codec in UMTS
377
Classification of the encoded bits according to their sensitivity to errors AMR encoded bits are divided into three indicative classes according to their importance: A, B and C. The reason for dividing the speech bits into classes is that they can be subjected to different error protection in the network. Class A contains the bits that are most sensitive to errors and any kind of errors in these bits typically result in a corrupted speech frame which should not be decoded without applying appropriate error concealment. This class is protected by the CRC in auxiliary information field. Classes B and C contain bits where increasing error rates gradually reduce the speech quality, but the decoding of an erroneous speech frame is usually possible without a strongly perceptible quality degradation. AMR operating modes Table A1.1 depicts the 8 different modes (source bit rates) AMR can operate. It should be noted that some of these modes are equivalent to the speech codecs currently used in other mobile communication systems. For instance, the AMR 12.20 kbps mode is equal to the ETSI GSM called codec EFR (Enhanced Full Rate Speech [TS 06.60]). Similarly, the AMR 7.40 kbps mode is equivalent to the IS-641 codec used in the USA standard IS-136 (US TDMA). Finally, AMR 6.70 kbps mode is equivalent to the codec used in the PDC Japanese standard.
Frame content ClassA Frame type (AMR mode, comfort noise, or other) bits index 0 1 2 3 4 5 6 7 8 9 10 11 12-14 15 AMR 4.75 kbps AMR 5.15 kbps AMR 5.90 kbps AMR 6.70 kbps (PDC EFR) AMR 7.40 kbps (IS-136 EFR). AMR 7.95 kbps AMR 10.2 kbps . AMR 12.2 kbps (GSM EFR) AMR SID GSM EFR SID TDMA EFR SID PDC EFR SID Future usage No data to transmit/receive 42 49 55 58 61 75 65 81 Class B Class C bits bits 53 54 63 76 87 84 99 103 0 0 0 0 0 0 40 60
Table A1.1. AMR modes and relationship with AMR frame structure
378
UMTS
Based on the fact that voice activity in a normal conversation is about 40%, all AMR modes implement a Voice Activity Detection (VAD) algorithm that detects if each 20 ms-frame contains speech or not on the transmitting side. VAD works together with the Discontinuous Transmission (DTX) or Source Controlled Rate (SCR) [TS 26.093] techniques where RF transmission is cut during speech pauses. When the transmission is cut, comfort noise parameters are sent at a regular rate in AMR frames during discontinuous activity. These frames are known as SID (SIlence Descriptor) frames. The receiver decodes these parameters and generates locally a comfort noise. Without this background noise the participants in a conversation, might think that their connection is broken during silence periods. The SCR technique for AMR in UMTS is mandatory and aims at prolonging the battery life (UE side) and reducing the interference. A1.2. Dynamic AMR mode adaptation The AMR mode adaptation in UMTS networks means using different AMR coding for the data stream. Mode adaptation can independently be applied in the uplink and the downlink. At any point in time, a different AMR mode can be used in each direction and this can be dynamically changed during a voice conversation. Location of the AMR speech codec in UMTS networks The AMR speech codec is located in the Transcoder (TC) function defined to be in the UMTS core network and as such, logically controlled by Non-Access Stratum protocols. From the transfer point of view, this means that all AMR coded data is going to be transmitted not only via Iub and air interface but also via Iu-interfaces. Note, however, that the AMR mode control that generates the AMR mode command cannot be located in the TC, since this control entity needs information from the air interface to make a decision about valid AMR modes the AMR mode command is used to change the current AMR mode to the new one. The only element in the network which can provide this type of information is the UTRAN. Note that in GSM networks the control of the codec mode is provided by the BTS. This solution is not applicable in UTRA due to the soft-handover procedure defined for dedicated traffic channels. Therefore, the AMR mode control function is part of the RNC, and more precisely a part of layer 3 functionality. Within the radio interface, the rate on the speech connection is either decreased or increased depending on the new valid AMR mode by changing the valid Transport Format (TF) in the corresponding MAC-d entity (see Chapter 7). AMR mode adaptation in the downlink As shown in Figure A1.3, the RNC generates the AMR mode adaptation command based on existing radio conditions in the downlink as reported by the UE
AMR Codec in UMTS
379
from radio quality measurements and from traffic volume measurements. The command is sent to the encoder inside the TC via the Iu interface.
Control of AMR modes Iu-CS
UE AMR speech codec Uu
Node B Iub
RNC
TC AMR speech codec
AMR encoded speech data in UL (ongoing call) Request to modify the AMR mode in DL AMR encoded speech data in DL with new AMR mode (ongoing call) Change AMR mode Request to modify the AMR mode in UL Change AMR mode
AMR encoded speech data in UL with new AMR mode (ongoing call)
Figure A1.3. Overview of AMR codec mode control during an ongoing voice call
Uplink AMR mode adaptation Two different alternatives for the AMR mode control in the uplink have been proposed: Based on the air-interface load, the RNC decides when to request the encoder in the UE to change the valid AMR mode and a new valid AMR mode is sent to the UE inside the AMR mode command message. When received by the UE, mode adaptation is made accordingly (see Figure A1.3). Within this approach, the UE does not have any rights to request the mode adaptation from the network nor to change the used AMR mode autonomously. In the second proposed alternative, the AMR codec control is not only included into the RNC but also into the UE. This enables the UE to change the valid AMR mode of the speech connection on uplink more quickly without requesting the mode change from the RNC first. For instance, when the level of the maximum transmission power is reached, the UE may change the valid AMR mode independently. The new mode can, however, be selected only from the valid Transport Format Set (TFS), which has been communicated to the UE by RRC from the RNC side. The changed AMR mode is discovered by the RNC from the TFCI bits in the uplink dedicated physical data channel.
380
UMTS
A1.3. Resource allocation for an AMR speech connection An AMR speech connection can be initiated either by the UE or the network. When the UE requests resources from the network, a first negotiation is made based on NAS procedures in order to configure the call connection. The CN will determine the QoS, needed which will be then indicated to the UTRAN inside the RANAP RAB ASSIGNMENT REQUEST message. Based on this request, RNC can define the requested RAB and associated Radio Bearer(s) (RB). Depending on whether the requested AMR base speech connection supports the concept of Unequal Error Protection (UEP) or Equal Error Protection (EEP), the RNC assigns either one or three RBs (including one or three DCHs), respectively, for the user plane (see Figure A1.4). In the control plane, RRC may allocate one or none signaling radio bearer according to the alternative method used to change the AMR mode.
NAS
radio bearer 1 81 bits
radio bearer 2 103 bits RLC (TM) 103 bits DTCH
radio bearer 3 60 bits RLC (TM) 60 bits DTCH
RLC RLC (TM) 81 bits DTCH 81 bits DCH (Class A bits) TTI = 20 ms Conv. coding 1/3 CRC = 12 bits 303
MAC
103 bits DCH (Class B bits) TTI = 20 ms Conv. coding 1/3 CRC = 0 bits 333
60 bits DCH (Class C bits) TTI = 20 ms Conv. coding 1/2 CRC = 0 bits 136 128 bits (RM)
Physical layer
294 bits (RM)
324 bits (RM)
DPCH
Data 1 6
TPC TFCI 0 2
Data 2 28
Pilots 4
DPCH slot structure (SF = 128)
Figure A1.4. Implementation of AMR 12.2 kbps mode in the radio interface
A1.4. AMR wideband The AMR wideband (AMR-WB) codec has been standardized in 3GPP and is part of the specification in Release 5. The codec is based on the same adaptive
AMR Codec in UMTS
381
principles as the AMR narrowband. The AMR-WB comprises nine codec modes: 6.6 kbps, 8.85 kbps, 12.65 kbps, 14.25 kbps, 15.85 kbps, 18.25 kbps, 19.85 kbps, 23.05 kbps and 23.85 kbps. The encoder of the AMR-WB is able to code an audio signal with bandwidth between 50 and 7,000 Hz. A higher sampling rate is thus needed compared with the narrowband approach (16 kHz instead of 8 kHz) leading to a 14 bit samples with 16,000 samples/s. Wideband coding provides improved voice quality especially in terms of increased voice naturalness since it covers twice the audio bandwidth compared to the classical telephone voice bandwidth of 4 kHz [TS 26.190, R5].

Amr Codec in Umts

Cargado por

Información del documento

Descripción original:

Derechos de autor

Formatos disponibles

Compartir este documento

Compartir o incrustar documentos

Opciones para compartir

¿Le pareció útil este documento?

¿Este contenido es inapropiado?

Copyright:

Formatos disponibles

Amr Codec in Umts

Cargado por

Copyright:

Formatos disponibles

UMTS

Javier Sanchez & Mamadou Tnioune Copyright 02007, ISTE Ltd.

AMR Codec in UMTS

Speech quality Good

Channel quality Bad

Figure A1.1. AMR principle

Auxiliary Information (for Mode Adaptation, and Error Detection)

Figure A1.2. Generic structure of the AMR frame

AMR Codec in UMTS

AMR Codec in UMTS

UE AMR speech codec Uu

TC AMR speech codec

radio bearer 1 81 bits

radio bearer 2 103 bits RLC (TM) 103 bits DTCH

radio bearer 3 60 bits RLC (TM) 60 bits DTCH

294 bits (RM)

324 bits (RM)

DPCH slot structure (SF = 128)

AMR Codec in UMTS

También podría gustarte