Digital cellular telecommunications system (Phase 2) (GSM); Enhanced Full Rate (EFR) speech transcoding (GSM 06.60 version 4.1.1)

REN/SMG-110660PR1

Digitalni celični telekomunikacijski sistem (faza 2) – Izboljšano prekodiranje govora s polno hitrostjo (EFR) (GSM 06.60, različica 4.1.1)

General Information

Status
Published
Publication Date
23-Aug-2000
Technical Committee
Current Stage
12 - Completion
Due Date
13-Jun-1999
Completion Date
24-Aug-2000
Mandate

Buy Standard

Standard
EN 301 245 V4.1.1:2003
English language
45 pages
sale 10% off
Preview
sale 10% off
Preview
e-Library read for
1 day

Standards Content (Sample)

2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.Digital cellular telecommunications system (Phase 2) (GSM); Enhanced Full Rate (EFR) speech transcoding (GSM 06.60 version 4.1.1)33.070.50Globalni sistem za mobilno telekomunikacijo (GSM)Global System for Mobile Communication (GSM)ICS:Ta slovenski standard je istoveten z:EN 301 245 Version 4.1.1SIST EN 301 245 V4.1.1:2003en01-december-2003SIST EN 301 245 V4.1.1:2003SLOVENSKI
STANDARD



SIST EN 301 245 V4.1.1:2003



ETSIEN301245V4.1.1(2000-08)EuropeanStandard(Telecommunicationsseries)Digitalcellulartelecommunicationssystem(Phase2);EnhancedFullRate(EFR)speechtranscoding(GSM06.60version4.1.1)GLOBALSYSTEMFORMOBILECOMMUNICATIONSRSIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)2(GSM06.60version4.1.1)ReferenceREN/SMG-110660PR1KeywordsEFR,digitalcellulartelecommunicationssystem,GlobalSystemforMobilecommunications(GSM),speechETSI650RoutedesLuciolesF-06921SophiaAntipolisCedex-FRANCETel.:+33492944200Fax:+33493654716SiretN°34862356200017-NAF742CAssociationàbutnonlucratifenregistréeàlaSous-PréfecturedeGrasse(06)N°7803/88ImportantnoticeIndividualcopiesofthepresentdocumentcanbedownloadedfrom:http://www.etsi.orgThepresentdocumentmaybemadeavailableinmorethanoneelectronicversionorinprint.Inanycaseofexistingorperceiveddifferenceincontentsbetweensuchversions,thereferenceversionisthePortableDocumentFormat(PDF).Incaseofdispute,thereferenceshallbetheprintingonETSIprintersofthePDFversionkeptonaspecificnetworkdrivewithinETSISecretariat.Usersofthepresentdocumentshouldbeawarethatthedocumentmaybesubjecttorevisionorchangeofstatus.InformationonthecurrentstatusofthisandotherETSIdocumentsisavailableathttp://www.etsi.org/tb/status/Ifyoufinderrorsinthepresentdocument,sendyourcommentto:editor@etsi.frCopyrightNotificationNopartmaybereproducedexceptasauthorizedbywrittenpermission.Thecopyrightandtheforegoingrestrictionextendtoreproductioninallmedia.©EuropeanTelecommunicationsStandardsInstitute2000.Allrightsreserved.SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)3(GSM06.60version4.1.1)ContentsIntellectualPropertyRights.4Foreword.41Scope.52References.53Definitions,symbolsandabbreviations.63.1Definitions.63.2Symbols.73.3Abbreviations.124Outlinedescription.134.1Functionaldescriptionofaudioparts.134.2Preparationofspeechsamples.134.2.1PCMformatconversion.144.3PrinciplesoftheGSMenhancedfullratespeechencoder.144.4PrinciplesoftheGSMenhancedfullratespeechdecoder.154.5Sequenceandsubjectiveimportanceofencodedparameters.165Functionaldescriptionoftheencoder.165.1Pre-processing.165.2Linearpredictionanalysisandquantization.165.2.1Windowingandauto-correlationcomputation.165.2.2Levinson-Durbinalgorithm.175.2.3LPtoLSPconversion.185.2.4LSPtoLPconversion.195.2.5QuantizationoftheLSPcoefficients.205.2.6InterpolationoftheLSPs.215.3Open-looppitchanalysis.215.4Impulseresponsecomputation.225.5Targetsignalcomputation.225.6Adaptivecodebooksearch.235.7Algebraiccodebookstructureandsearch.245.8Quantizationofthefixedcodebookgain.275.9Memoryupdate.276Functionaldescriptionofthedecoder.286.1Decodingandspeechsynthesis.286.2Post-processing.306.2.1Adaptivepost-filtering.306.2.2Up-scaling.317Variables,constantsandtablesintheC-codeoftheGSMEFRcodec.317.1DescriptionoftheconstantsandvariablesusedintheCcode.318Homingsequences.348.1Functionaldescription.348.2Definitions.348.3Encoderhoming.368.4Decoderhoming.368.5Encoderhomestate.378.6Decoderhomestate.389Bibliography.43AnnexA(informative):Documentchangehistory.44History.45SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)4(GSM06.60version4.1.1)IntellectualPropertyRightsIPRsessentialorpotentiallyessentialtothepresentdocumentmayhavebeendeclaredtoETSI.TheinformationpertainingtotheseessentialIPRs,ifany,ispubliclyavailableforETSImembersandnon-members,andcanbefoundinETSISR000314:"IntellectualPropertyRights(IPRs);Essential,orpotentiallyEssential,IPRsnotifiedtoETSIinrespectofETSIstandards",whichisavailablefromtheETSISecretariat.LatestupdatesareavailableontheETSIWebserver(http://www.etsi.org/ipr).PursuanttotheETSIIPRPolicy,noinvestigation,includingIPRsearches,hasbeencarriedoutbyETSI.NoguaranteecanbegivenastotheexistenceofotherIPRsnotreferencedinETSISR000314(ortheupdatesontheETSIWebserver)whichare,ormaybe,ormaybecome,essentialtothepresentdocument.ForewordThisEuropeanStandard(Telecommunicationsseries)hasbeenproducedbyETSITechnicalCommitteeSpecialMobileGroup(SMG).Thepresentdocumentdescribesthedetailedmappingbetweeninputblocksof160speechsamplesin13-bituniformPCMformattoencodedblocksof244bitsandfromencodedblocksof244bitstooutputblocksof160reconstructedspeechsampleswithinthedigitalcellulartelecommunicationssystem.NationaltranspositiondatesDateofadoptionofthisEN:30October1998DateoflatestannouncementofthisEN(doa):31January1999DateoflatestpublicationofnewNationalStandardorendorsementofthisEN(dop/e):31July1999DateofwithdrawalofanyconflictingNationalStandard(dow):31July1999SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)5(GSM06.60version4.1.1)1ScopeThepresentdocumentdescribesthedetailedmappingbetweeninputblocksof160speechsamplesin13-bituniformPCMformattoencodedblocksof244bitsandfromencodedblocksof244bitstooutputblocksof160reconstructedspeechsamples.Thesamplingrateis8000sample/sleadingtoabitratefortheencodedbitstreamof12,2kbit/s.Thecodingschemeistheso-calledAlgebraicCodeExcitedLinearPredictionCoder,hereafterreferredtoasACELP.ThisENalsospecifiestheconversionbetweenA-lawPCMand13-bituniformPCM.Performancerequirementsfortheaudioinputandoutputpartsareincludedonlytotheextentthattheyaffectthetranscoderperformance.Thispartalsodescribesthecodecdowntothebitlevel,thusenablingtheverificationofcompliancetotheparttoahighdegreeofconfidencebyuseofasetofdigitaltestsequences.ThesetestsequencesaredescribedinGSM06.54[7]andareavailableondisks.IncaseofdiscrepancybetweentherequirementsdescribedinthisENandthefixedpointcomputationaldescription(ANSI-Ccode)oftheserequirementscontainedinGSM06.53[6],thedescriptioninGSM06.53[6]willprevail.ThetranscodingprocedurespecifiedinthisENisapplicablefortheenhancedfullratespeechtrafficchannel(TCH)intheGSMsystem.InGSM06.51[5],areferenceconfigurationforthespeechtransmissionchainoftheGSMenhancedfullrate(EFR)systemisshown.Accordingtothisreferenceconfiguration,thespeechencodertakesitsinputasa13-bituniformPCMsignaleitherfromtheaudiopartoftheMobileStationoronthenetworkside,fromthePSTNviaan8-bit/A-lawto13-bituniformPCMconversion.TheencodedspeechattheoutputofthespeechencoderisdeliveredtoachannelencoderunitwhichisspecifiedinGSM05.03[3].Inthereceivedirection,theinverseoperationstakeplace.2ReferencesThefollowingdocumentscontainprovisionswhich,throughreferenceinthistext,constituteprovisionsofthepresentdocument.• Referencesareeitherspecific(identifiedbydateofpublication,editionnumber,versionnumber,etc.)ornon-specific.• Foraspecificreference,subsequentrevisionsdonotapply.• Foranon-specificreference,thelatestversionapplies.• Anon-specificreferencetoanETSshallalsobetakentorefertolaterversionspublishedasanENwiththesamenumber.[1]GSM01.04(ETR100):"Digitalcellulartelecommunicationssystem(Phase2);Abbreviationsandacronyms".[2]GSM03.50(ETS300540):"Digitalcellulartelecommunicationssystem(Phase2);TransmissionplanningaspectsofthespeechserviceintheGSMPublicLandMobileNetwork(PLMN)system".[3]GSM05.03(ETS300575):"Digitalcellulartelecommunicationssystem(Phase2);Channelcoding".[4]GSM06.32(ETS300580-6):"Digitalcellulartelecommunicationssystem(Phase2);VoiceActivityDetection(VAD)".[5]GSM06.51(EN301243):"Digitalcellulartelecommunicationssystem(Phase2);EnhancedFullRate(EFR)speechprocessingfunctionsGeneraldescription".[6]GSM06.53(EN301244):"Digitalcellulartelecommunicationssystem(Phase2);ANSI-CcodefortheGSMEnhancedFullRate(EFR)speechcodec".[7]GSM06.54(EN301250):"Digitalcellulartelecommunicationssystem(Phase2);TestvectorsfortheGSMEnhancedFullRate(EFR)speechcodec".SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)6(GSM06.60version4.1.1)[8]ITU-TRecommendationG.711(1988):"CodingofanaloguesignalsbypulsecodemodulationPulsecodemodulation(PCM)ofvoicefrequencies".[9]ITU-TRecommendationG.726:"40,32,24,16kbit/sadaptivedifferentialpulsecodemodulation(ADPCM)".3Definitions,symbolsandabbreviations3.1DefinitionsForthepurposesofthepresentdocument,thefollowingtermsanddefinitionsapply:adaptivecodebook:Theadaptivecodebookcontainsexcitationvectorsthatareadaptedforeverysubframe.Theadaptivecodebookisderivedfromthelongtermfilterstate.Thelagvaluecanbeviewedasanindexintotheadaptivecodebook.adaptivepostfilter:Thisfilterisappliedtotheoutputoftheshorttermsynthesisfiltertoenhancetheperceptualqualityofthereconstructedspeech.IntheGSMenhancedfullratecodec,theadaptivepostfilterisacascadeoftwofilters:aformantpostfilterandatiltcompensationfilter.algebraiccodebook:Afixedcodebookwherealgebraiccodeisusedtopopulatetheexcitationvectors(innovationvectors).Theexcitationcontainsasmallnumberofnonzeropulseswithpredefinedinterlacedsetsofpositions.closed-looppitchanalysis:Thisistheadaptivecodebooksearch,i.e.,aprocessofestimatingthepitch(lag)valuefromtheweightedinputspeechandthelongtermfilterstate.Intheclosed-loopsearch,thelagissearchedusingerrorminimizationloop(analysis-by-synthesis).IntheGSMenhancedfullratecodec,closed-looppitchsearchisperformedforeverysubframe.directformcoefficients:Oneoftheformatsforstoringtheshorttermfilterparameters.IntheGSMenhancedfullratecodec,allfilterswhichareusedtomodifyspeechsamplesusedirectformcoefficients.fixedcodebook:Thefixedcodebookcontainsexcitationvectorsforspeechsynthesisfilters.Thecontentsofthecodebookarenon-adaptive(i.e.,fixed).IntheGSMenhancedfullratecodec,thefixedcodebookisimplementedusinganalgebraiccodebook.fractionallags:Asetoflagvalueshavingsub-sampleresolution.IntheGSMenhancedfullratecodecasub-sampleresolutionof1/6thofasampleisused.frame:Atimeintervalequalto20ms(160samplesatan8kHzsamplingrate).integerlags:Asetoflagvalueshavingwholesampleresolution.interpolatingfilter:AnFIRfilterusedtoproduceanestimateofsub-sampleresolutionsamples,givenaninputsampledwithintegersampleresolution.inversefilter:Thisfilterremovestheshorttermcorrelationfromthespeechsignal.Thefiltermodelsaninversefrequencyresponseofthevocaltract.lag:Thelongtermfilterdelay.Thisistypicallythetruepitchperiod,oramultipleorsub-multipleofit.LineSpectralFrequencies:(seeLineSpectralPair).LineSpectralPair:TransformationofLPCparameters.LineSpectralPairsareobtainedbydecomposingtheinversefiltertransferfunctionA(z)toasetoftwotransferfunctions,onehavingevensymmetryandtheotherhavingoddsymmetry.TheLineSpectralPairs(alsocalledasLineSpectralFrequencies)aretherootsofthesepolynomialsonthez-unitcircle).LPanalysiswindow:Foreachframe,theshorttermfiltercoefficientsarecomputedusingthehighpassfilteredspeechsampleswithintheanalysiswindow.IntheGSMenhancedfullratecodec,thelengthoftheanalysiswindowis240samples.Foreachframe,twoasymmetricwindowsareusedtogeneratetwosetsofLPcoefficients.Nosamplesofthefutureframesareused(nolookahead).SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)7(GSM06.60version4.1.1)LPcoefficients:LinearPrediction(LP)coefficients(alsoreferredasLinearPredictiveCoding(LPC)coefficients)isagenericdescriptivetermfordescribingtheshorttermfiltercoefficients.open-looppitchsearch:Aprocessofestimatingthenearoptimallagdirectlyfromtheweightedspeechinput.Thisisdonetosimplifythepitchanalysisandconfinetheclosed-looppitchsearchtoasmallnumberoflagsaroundtheopen-loopestimatedlags.IntheGSMenhancedfullratecodec,open-looppitchsearchisperformedevery10ms.residual:Theoutputsignalresultingfromaninversefilteringoperation.shorttermsynthesisfilter:Thisfilterintroduces,intotheexcitationsignal,shorttermcorrelationwhichmodelstheimpulseresponseofthevocaltract.perceptualweightingfilter:Thisfilterisemployedintheanalysis-by-synthesissearchofthecodebooks.Thefilterexploitsthenoisemaskingpropertiesoftheformants(vocaltractresonances)byweightingtheerrorlessinregionsneartheformantfrequenciesandmoreinregionsawayfromthem.subframe:Atimeintervalequalto5ms(40samplesatan8kHzsamplingrate).vectorquantization:Amethodofgroupingseveralparametersintoavectorandquantizingthemsimultaneously.zeroinputresponse:Theoutputofafilterduetopastinputs,i.e.duetothepresentstateofthefilter,giventhataninputofzerosisapplied.zerostateresponse:Theoutputofafilterduetothepresentinput,giventhatnopastinputshavebeenapplied,i.e.,giventhestateinformationinthefilterisallzeroes.3.2SymbolsForthepurposesofthepresentdocument,thefollowingsymbolsapply:()AzTheinversefilterwithunquantizedcoefficients()$AzTheinversefilterwithquantifiedcoefficients()()HzAz=1$ThespeechsynthesisfilterwithquantifiedcoefficientsaiTheunquantizedlinearpredictionparameters(directformcoefficients)$aiThequantifiedlinearpredictionparametersmTheorderoftheLPmodel1Bz()Thelong-termsynthesisfilter()WzTheperceptualweightingfilter(unquantizedcoefficients)γγ12,TheperceptualweightingfactorsFzE()Adaptivepre-filterTThenearestintegerpitchlagtotheclosed-loopfractionalpitchlagofthesubframeβTheadaptivepre-filtercoefficient(thequantifiedpitchgain)HzAzAzfnd()$(/)$(/)=γγTheformantpostfilterSIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)8(GSM06.60version4.1.1)γnControlcoefficientfortheamountoftheformantpost-filteringγdControlcoefficientfortheamountoftheformantpost-filteringHzt()TiltcompensationfilterγtControlcoefficientfortheamountofthetiltcompensationfilteringµγ=tk1'Atiltfactor,withk1'beingthefirstreflectioncoefficienthnf()ThetruncatedimpulseresponseoftheformantpostfilterLhThelengthofhnf()rih()Theauto-correlationsofhnf()$(/)AznγTheinversefilter(numerator)partoftheformantpostfilter1/$(/)AzdγThesynthesisfilter(denominator)partoftheformantpostfilter$()rnTheresidualsignaloftheinversefilter$(/)Aznγhzt()Impulseresponseofthetiltcompensationfilterβscn()TheAGC-controlledgainscalingfactoroftheadaptivepostfilterαTheAGCfactoroftheadaptivepostfilterHzh1()Pre-processinghigh-passfilterwnI(),wnII()LPanalysiswindowsLI1()LengthofthefirstpartoftheLPanalysiswindowwnI()LI2()LengthofthesecondpartoftheLPanalysiswindowwnI()LII1()LengthofthefirstpartoftheLPanalysiswindowwnII()LII2()LengthofthesecondpartoftheLPanalysiswindowwnII()rkac()Theauto-correlationsofthewindowedspeechsn'()wilag()Lagwindowfortheauto-correlations(60Hzbandwidthexpansion)f0ThebandwidthexpansioninHzfsThesamplingfrequencyinHzrkac'()Themodified(bandwidthexpanded)auto-correlations()EiLDThepredictionerrorintheithiterationoftheLevinsonalgorithmkiTheithreflectioncoefficientSIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)9(GSM06.60version4.1.1)aji()ThejthdirectformcoefficientintheithiterationoftheLevinsonalgorithmFz1'()SymmetricLSFpolynomialFz2'()AntisymmetricLSFpolynomialFz1()Polynomial()Fz1′withrootz=−1eliminatedFz2()Polynomial()Fz2′withrootz=1eliminatedqiThelinespectralpairs(LSPs)inthecosinedomainqAnLSPvectorinthecosinedomain$()qinThequantifiedLSPvectorattheithsubframeoftheframenωiThelinespectralfrequencies(LSFs)Txm()AmthorderChebyshevpolynomialfifi12(),()ThecoefficientsofthepolynomialsFz1()andFz2()fifi12''(),()Thecoefficientsofthepolynomials()Fz1′and()Fz2′fi()ThecoefficientsofeitherFz1()orFz2()Cx()SumpolynomialoftheChebyshevpolynomialsxCosineofangularfrequencyωλkRecursioncoefficientsfortheChebyshevpolynomialevaluationfiThelinespectralfrequencies(LSFs)inHz[]ftfff=1210KThevectorrepresentationoftheLSFsinHzz()()1n,z()()2nThemean-removedLSFvectorsatframenr()()1n,r()()2nTheLSFpredictionresidualvectorsatframenp()nThepredictedLSFvectoratframen$()()r21n−Thequantifiedsecondresidualvectoratthepastframe$fkThequantifiedLSFvectoratquantizationindexkELSPTheLSPquantizationerrorwii,,,,=110KLSP-quantizationweightingfactorsdiThedistancebetweenthelinespectralfrequenciesfi+1andfi−1SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)10(GSM06.60version4.1.1)hn()TheimpulseresponseoftheweightedsynthesisfilterOkThecorrelationmaximumofopen-looppitchanalysisatdelaykOiti,,,=13KThecorrelationmaximaatdelaystii,,,=13K()Mtiii,,,,=13KThenormalizedcorrelationmaximaMiandthecorrespondingdelaystii,,,=13KHzWzAzAzAz()()(/)$()(/)=γγ12TheweightedsynthesisfilterAz(/)γ1Thenumeratoroftheperceptualweightingfilter12/(/)AzγThedenominatoroftheperceptualweightingfilterT1Thenearestintegertothefractionalpitchlagoftheprevious(1stor3rd)subframesn'()Thewindowedspeechsignalsnw()Theweightedspeechsignal$()snReconstructedspeechsignal$()′snThegain-scaledpost-filteredsignal$()snfPost-filteredspeechsignal(beforescaling)xn()Thetargetsignalforadaptivecodebooksearchxn2(),x2tThetargetsignalforalgebraiccodebooksearchresnLP()TheLPresidualsignalcn()Thefixedcodebookvectorvn()Theadaptivecodebookvectorynvnhn()=()()∗Thefilteredadaptivecodebookvectorynk()Thepastfilteredexcitationun()Theexcitationsignal()$unTheemphasizedadaptivecodebookvector$'()unThegain-scaledemphasizedexcitationsignalTopThebestopen-looplagtminMinimumlagsearchvaluetmaxMaximumlagsearchvalue()RkCorrelationtermtobemaximizedintheadaptivecodebooksearchSIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)11(GSM06.60version4.1.1)b24TheFIRfilterforinterpolatingthenormalizedcorrelationterm()Rk()RktTheinterpolatedvalueof()Rkfortheintegerdelaykandfractiontb60TheFIRfilterforinterpolatingthepastexcitationsignalun()toyieldtheadaptivecodebookvectorvn()AkCorrelationtermtobemaximizedinthealgebraiccodebooksearchatindexkCkThecorrelationinthenumeratorofAkatindexkEDkTheenergyinthedenominatorofAkatindexkdHx=t2Thecorrelationbetweenthetargetsignal()xn2andtheimpulseresponse()hn,i.e.,backwardfilteredtargetHThelowertriangularToeplizconvolutionmatrixwithdiagonal()h0andlowerdiagonals()()hh139,,KΦ=HHtThematrixofcorrelationsof()hndn()Theelementsofthevectordφ(,)ijTheelementsofthesymmetricmatrixΦckTheinnovationvectorCThecorrelationinthenumeratorofAkmiThepositionoftheithpulseϑiTheamplitudeoftheithpulseNpThenumberofpulsesinthefixedcodebookexcitationEDTheenergyinthedenominatorofAk()resnLTPThenormalizedlong-termpredictionresidualbn()Thesumofthenormalized()dnvectorandnormalizedlong-termpredictionresidual()resnLTPsnb()Thesignsignalforthealgebraiccodebooksearchdn'()Signextendedbackwardfilteredtargetφ'(,)ijThemodifiedelementsofthematrixΦ,includingsigninformationzt,()znThefixedcodebookvectorconvolvedwithhn()En()Themean-removedinnovationenergy(indB)SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)12(GSM06.60version4.1.1)EThemeanoftheinnovationenergy~()EnThepredictedenergy[]bbbb1234TheMApredictioncoefficients$()RkThequantifiedpredictionerroratsubframekEIThemeaninnovationenergyRn()Thepredictionerrorofthefixed-codebookgainquantizationEQThequantizationerrorofthefixed-codebookgainquantizationen()Thestatesofthesynthesisfilter1/$()Azenw()Theperceptuallyweightederroroftheanalysis-by-synthesissearchηThegainscalingfactorfortheemphasizedexcitationgcThefixed-codebookgaingc'Thepredictedfixed-codebookgain$gcThequantifiedfixedcodebookgaingpTheadaptivecodebookgain$gpThequantifiedadaptivecodebookgainγgcccgg=/'Acorrectionfactorbetweenthegaingcandtheestimatedonegc'$γgcTheoptimumvalueforγgcγscGainscalingfactor3.3AbbreviationsForthepurposesofthepresentdocument,thefollowingabbreviationsapply.FurtherGSMrelatedabbreviationsmaybefoundinGSM01.04[1].ACELPAlgebraicCodeExcitedLinearPredictionAGCAdaptiveGainControlCELPCodeExcitedLinearPredictionFIRFiniteImpulseResponseISPPInterleavedSingle-PulsePermutationLPLinearPredictionLPCLinearPredictiveCodingLSFLineSpectralFrequencyLSPLineSpectralPairLTPLongTermPredictor(orLongTermPrediction)MAMovingAverageSIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)13(GSM06.60version4.1.1)4OutlinedescriptionThepresentdocumentisstructuredasfollows:Section4.1containsafunctionaldescriptionoftheaudiopartsincludingtheA/DandD/Afunctions.Section4.2describestheconversionbetween13-bituniformand8-bitA-lawsamples.Sections4.3and4.4presentasimplifieddescriptionoftheprinciplesoftheGSMEFRencodinganddecodingprocessrespectively.Insubclause4.5,thesequenceandsubjectiveimportanceofencodedparametersaregiven.Section5presentsthefunctionaldescriptionoftheGSMEFRencoding,whereasclause6describesthedecodingprocedures.Section7describesvariables,constantsandtablesoftheC-codeoftheGSMEFRcodec.4.1FunctionaldescriptionofaudiopartsTheanalogue-to-digitalanddigital-to-analogueconversionwillinprinciplecomprisethefollowingelements:1)AnaloguetouniformdigitalPCM−microphone;−inputleveladjustmentdevice;−inputanti-aliasingfilter;−sample-holddevicesamplingat8kHz;−analogue−to−uniformdigitalconversionto13−bitrepresentation.Theuniformformatshallberepresentedintwo'scomplement.2)UniformdigitalPCMtoanalogue−conversionfrom13−bit/8kHzuniformPCMtoanalogue;−aholddevice;−reconstructionfilterincludingx/sin(x)correction;−outputleveladjustmentdevice;−earphoneorloudspeaker.Intheterminalequipment,theA/Dfunctionmaybeachievedeither−bydirectconversionto13-bituniformPCMformat;−orbyconversionto8-bit/A-lawcompoundedformat,basedonastandardA-lawcodec/filteraccordingtoITU-TRecommendationsG.711[8]andG.714,followedbythe8-bitto13-bitconversionasspecifiedinsubclause4.2.1.FortheD/Aoperation,theinverseoperationstakeplace.InthelattercaseitshouldbenotedthatthespecificationsinITU-TG.714(supersededbyG.712)areconcernedwithPCMequipmentlocatedinthecentralpartsofthenetwork.Whenusedintheterminalequipment,thisENdoesnotonitsownensuresufficientout-of-bandattenuation.Thespecificationofout-of-bandsignalsisdefinedinGSM03.50[2]inclause2.4.2PreparationofspeechsamplesTheencoderisfedwithdatacomprisingofsampleswitharesolutionof13bitsleftjustifiedina16-bitword.Thethreeleastsignificantbitsaresetto'0'.Thedecoderoutputsdatainthesameformat.Outsidethespeechcodecfurtherprocessingmustbeappliedifthetrafficdataoccursinadifferentrepresentation.SIST EN 301 245 V4.1.1:2003



ETSIETSIEN301245V4.1.1(2000-08)14(GSM06.60version4.1.1)4.2.1PCMformatconversionTheconversionbetween8-bitA-Lawcompresseddataandlineardatawith13-bitresolutionatthespeechencoderinputshallbeasdefinedinITU-TRec.G.711[8].ITU-TRec.G.711[8]specifiestheA-Lawtolinearconversionandviceversabyprovidingtableentries.Examplesonhowtoperformtheconversionbyfixed-pointarithmeticcanbefoundinITU-TRec.G.726[9].Section4.2.1ofG.726[9]describesA-Lawtolinearexpansionandsubclause4.2.7ofG.726[9]providesasolutionforlineartoA-Lawcompression.4.3PrinciplesoftheGSMenhancedfullratespeechencoderThecodecisbasedonthecode-excitedlinearpredictive(CELP)codingmodel.A10thorderlinearprediction(LP),orshort-term,synthesisfilterisusedwhichisgivenby:HzAzazimii()$()$,==+=−∑1111(1)where$,,,,aimi=1Karethe(quantified)linearprediction(LP)parameters,andm=10isthepredictororder.Thelong-term,orpitch,synthesisfilterisgivenby:111BzgzpT(),=−−(2)whereTisthepitchdelayandgpisthepitchgain.Thepitchsynthesisfilterisimplementedusingtheso-calledadaptivecodebookapproach.TheCELPspeechsynthesismodelisshowninfigure2.Inthismodel,theexcitationsignalattheinputoftheshort-termLPsynthesisfilterisconstructedbyaddingtwoexcitationvectorsfromadaptiveandfixed(innovative)codebooks.Thespeechissynthesizedbyfeedingthetwoproperlychosenvectorsfromthesecodebooksthroughtheshort-termsynthesisfilter.Theoptimumexcitationsequenceinacodebookischosenusingananalysis-by-synthesissearchprocedureinwhichtheerrorbetweentheoriginalandsynthesizedspeechisminimizedaccordingtoaperceptuallyweighteddistortionmeasure.Theperceptualweightingfilterusedintheanalysis-by-synthesissearchtechniqueisgivenby:WzAzAz()(/)(/),=γγ12(3)where()AzistheunquantizedLPfilterand0121<<≤γγaretheperceptualweightingfactors.Thevaluesγ109=.andγ206=.areused.TheweightingfilterusestheunquantizedLPparameterswhiletheformantsynthesisfilterusesthequantifiedones.Thecoderoperatesonspeechframesof20mscorrespondingto160samplesatthesamplingfrequencyof8000sample/s.Ateach160speechsamples,thespeechsignalisanalysedtoextracttheparametersoftheCELPmodel(LPfiltercoefficients,adaptiveandfixedcodebooks'indicesandgains).Theseparametersareencodedandtransmitted.Atthedecoder,theseparametersaredecodedandspeechissynthesizedbyfilteringthereconstructedexcitationsignalthroughtheLPsynthesisfilter.Thesignalflowattheencoderisshowninfigure3.LPanalysisisperformedtwiceperframe.ThetwosetsofLPparametersareconvertedtolinespectrumpairs(LSP)andjointlyquantifiedusingsplitmatrixquantization(SMQ)with38bits.Thespeechframeisdividedinto4subframesof5mseach(40samples).Theadaptiveandfixedcodebookparametersaretransmittedeverysubframe.Thetwos
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.