对å¤å£°éé³é¢ä¿¡å·ï¼é³é¢ç¼ç 被åºç¨äºä¸åé¢çèå´çé³é¢è¾å ¥å£°éçä¸ååéãä¾å¦ï¼å¯¹å ·æäºä¸ªå¸¸è§å£°éåä¸ä¸ªä½é¢(LFE)声éç5.1ç¯ç»é³åï¼å声éæç¤ºç (BCC)å¯è¢«åºç¨å°ææå 个声éçå¤äºæä½äºæå®çæªæ¢é¢ççåé¢å¸¦ï¼èä» ä» è¢«åºç¨å°äºä¸ªé³é¢å£°é(æé¤äºLFE声é)çé«äºæªæ¢é¢ççåé¢å¸¦ãä¸åºç¨å°æ´ä¸ªé¢çèå´ä¸çææè¾å ¥å£°éçåæ°ç¼ç ææ¯ç¸æ¯ï¼è¿ç§åºäºé¢çç声éç¼ç å¯åå°ç¼ç åè§£ç å¤çæä½çè´æ å/æè¢«ç¼ç çé³é¢æ¯ç¹æµç大å°ã
For multi-channel audio signals, audio coding is applied to different subsets of the audio input channels in different frequency ranges. For example, for 5.1 surround sound with five regular channels and one low frequency (LFE) channel, Binaural Cue Codes (BCC) can be applied to all six channel subs at or below a specified cutoff frequency. frequency band, while only being applied to sub-bands above the cutoff frequency of the five audio channels (excluding the LFE channel). Such frequency-based channel coding may reduce the burden of encoding and decoding processing operations and/or the size of the encoded audio bitstream compared to parametric encoding techniques applied to all input channels over the entire frequency range.
Description Translated from Chinese 对å¤å£°éé³é¢ä¿¡å·è¿è¡ç¼ç ååæçæ¹æ³åè£ ç½®Method and device for encoding and synthesizing multi-channel audio signalsææ¯é¢å technical field
æ¬åææ¶åé³é¢ä¿¡å·çç¼ç 以åéåç±ç¼ç åçé³é¢æ°æ®å¯¹å¬è§åºæ¯çåæã The invention relates to the encoding of audio signals and the subsequent synthesis of auditory scenes from the encoded audio data. the
ç¸å ³ç³è¯·ç交åå¼ç¨Â Cross References to Related Applications
æ¬ç³è¯·è¦æ±å¨04å¹´3æ4æ¥ä»¥ä»£ç人å·å·Faller 14-2æäº¤çç¾å½ä¸´æ¶ç³è¯·å·60/549972çç³è¯·æ¥çä¼å æãæ¬ç³è¯·ç䏻颿¶å2001å¹´5æ4æ¥ä»¥ä»£ç人å·å·Faller 5(ââ877ç³è¯·â)æäº¤çç¾å½ä¸å©ç³è¯·åºå·09/848877ç主é¢ï¼2001å¹´11æ7æ¥ä»¥ä»£ç人å·å·Baumgarte 1-6-8(ââ458ç³è¯·â)æäº¤çç¾å½ä¸å©ç³è¯·åºå·10/045458ç主é¢ï¼2002å¹´5æ24æ¥ä»¥ä»£ç人å·å·Baumgarte 2-10(ââ437ç³è¯·â)æäº¤çç¾å½ä¸å©ç³è¯·åºå·10/155437ç主é¢ï¼ä»¥å2004å¹´4æ1æ¥ä»¥ä»£ç人å·å·Baumgarte 7-12(ââ591ç³è¯·â)æäº¤çç¾å½ä¸å©ç³è¯·åºå·10/815591ç主é¢ï¼è¿å个ä¸å©ç³è¯·çææå 容é½å¨æ¤å¹¶å ¥ä½ä¸ºåèã This application claims priority on the filing date of U.S. Provisional Application No. 60/549972, filed March 4, 2004, with Attorney Docket No. Faller 14-2. The subject matter of this application is related to that of U.S. Patent Application Serial No. 09/848,877, filed May 4, 2001, in Attorney Docket No. Faller 5 (the "'877 application"), and filed Nov. 7, 2001, in Attorney Docket No. Baumgarte 1 -Subject of U.S. Patent Application Serial No. 10/045,458 filed May 24, 2002 under Attorney Docket Baumgarte 2-10 ("the '437 Application") filed on 6-8 (the "'458 Application") The subject of Serial No. 10/155,437, and the subject of U.S. Patent Application Serial No. 10/815,591, filed April 1, 2004, with Attorney Docket No. Baumgarte 7-12 (the "'591 application"), all of these four patent applications All are incorporated herein by reference. the
èæ¯ææ¯ Background technique
å¤å¹´æ¥çµå½±é¢ä¸çå¤å£°éç¯ç»é³åç³»ç»å·²ç»å¾å°æ ååãéçææ¯çè¿æ¥ï¼å·²è½å¤å¶é ç¨äºå®¶åºä½¿ç¨çå¤å£°éç¯ç»ç³»ç»ãç°ä»ï¼è¿ç§ç³»ç»é常ä½ä¸ºâå®¶åºå½±é¢ç³»ç»âéå®ãæç §ITU-R建议ï¼å¤§å¤æ°è¿ç±»ç³»ç»æä¾äºäºä¸ªå¸¸è§é³é¢å£°éåä¸ä¸ªä½é¢è¶ ä½é³åå声é(表示ä½é¢æææLFE声é)ãè¿ç§å¤å£°éç³»ç»è¢«è¡¨ç¤ºä¸º5.1ç¯ç»ç³»ç»ãè¿æå ¶å®çç¯ç»ç³»ç»ï¼å¦7.1(ä¸ä¸ªå¸¸è§å£°éåä¸ä¸ªLFE声é)å10.2(å个常è§å£°éå两个LFE声é)ç¯ç»ç³»ç»ã Multichannel surround sound systems in movie theaters have been standardized for many years. As technology has advanced, it has become possible to manufacture multi-channel surround systems for home use. These days, such systems are often marketed as "home theater systems." Following ITU-R recommendations, most such systems provide five regular audio channels and a low-frequency subwoofer channel (meaning the low-frequency effects or LFE channel). Such a multi-channel system is denoted as a 5.1 surround system. There are other surround systems such as 7.1 (seven regular channels and one LFE channel) and 10.2 (ten regular channels and two LFE channels) surround systems. the
C.FalleråF.Baumgarteç论æâEfficient representation of  spatial audio coding using perceptual parametrizationâï¼IEEE workshop on Appl.of Sig.Proc.to Audio and Acoust.ï¼2001å¹´10æï¼ä»¥åC.FalleråF.Baumgartç论æâBinaural Cue Coding Applied to Stereo and Multi-Channel Audio Compressionï¼âPreprint 112th Conv.Aud.Eng.Soc.ï¼2002å¹´5æ(æ»ç§°ä¸ºâBCC论æâ)çå 容å卿¤å¹¶å ¥ä½ä¸ºåèï¼æ¥æè¿°åæ°å¤å£°éé³é¢ç¼ç ææ¯(称为BCCç¼ç )ã C.Faller and F.Baumgarte's paper "Efficient representation of spatial audio coding using perceptual parametrization", IEEE workshop on Appl. of Sig.Proc.to Audio and Acoust., October 2001, and C.Faller and F.Baumgart The contents of the paper "Binaural Cue Coding Applied to Stereo and Multi-Channel Audio Compression," Preprint 112th Conv.Aud.Eng.Soc., May 2002 (collectively "BCC Papers") are hereby incorporated by reference, to describe a parametric multi-channel audio coding technique (called BCC coding). the
å¾1示åºäºæ ¹æ®BCCè®ºææ§è¡å声éæç¤ºç¼ç (BCC)çé³é¢å¤çç³»ç»100çæ¹æ¡å¾ãBCCç³»ç»100å ·æä¸ä¸ªBCCç¼ç å¨102ï¼å®æ¥æ¶C个é³é¢è¾å ¥å£°é108ï¼æ¯ä¸ªå£°éä¾å¦æ¥èªC个ä¸å麦å é£106ä¸çæ¯ä¸ªéº¦å é£ãBCCç¼ç å¨102å ·æä¸ä¸ªä¸æ··é¢å¨110ï¼å®å°C个é³é¢è¾å ¥å£°éè½¬æ¢æåé³é¢ç¸å ä¿¡å·112ã Figure 1 shows a block diagram of an audio processing system 100 that performs binaural cue coding (BCC) according to the BCC paper. The BCC system 100 has a BCC encoder 102 that receives C audio input channels 108 , each channel eg from each of C different microphones 106 . The BCC encoder 102 has a down-mixer 110 which converts the C audio input channels into a single audio sum signal 112 . the
å¦å¤ï¼BCCç¼ç å¨102å ·æä¸ä¸ªBCCåæå¨114ï¼å®ä¸ºC个è¾å ¥å£°é产çBCCæç¤ºç æ°æ®æµ116ã该BCCæç¤ºç (ä¹è¢«ç§°ä¸ºå¬è§åºæ¯åæ°)å æ¬æ¯ä¸ªè¾å ¥å£°éç声éé´çµå¹³å·®(ICLD)å声éé´æ¶é´å·®(ICTD)æ°æ®ãBCCåæå¨114æ§è¡åºäºé¢å¸¦çå¤çï¼ä»è为é³é¢è¾å ¥å£°éçä¸ä¸ªæå¤ä¸ªä¸åçåé¢å¸¦(ä¾å¦ä¸åç临çé¢å¸¦)ä¸çæ¯ä¸ªåé¢å¸¦äº§çICLDåICTDæ°æ®ã Additionally, the BCC encoder 102 has a BCC analyzer 114 which generates a BCC hint code data stream 116 for the C input channels. The BCC hint code (also called auditory scene parameter) includes inter-channel level difference (ICLD) and inter-channel time difference (ICTD) data for each input channel. The BCC analyzer 114 performs band-based processing to generate ICLD and ICTD data for each of one or more different sub-bands (eg, different critical bands) of the audio input channel. the
BCCç¼ç å¨102æç¸å ä¿¡å·112åBCCæç¤ºç æ°æ®æµ116(ä¾å¦å¯ä»¥æ¯å ³äºè¯¥ç¸å ä¿¡å·çå¸¦å æå¸¦å¤è¾¹ä¿¡æ¯)åéå°BCCç³»ç»100çBCCè§£ç å¨104ãBCCè§£ç å¨104å ·æä¸ä¸ªè¾¹ä¿¡æ¯å¤çå¨118ï¼å®å¯¹æ°æ®æµ116è¿è¡å¤ç以æ¢å¤BCCæç¤ºç 120(ä¾å¦ICLDåICTDæ°æ®)ãBCCè§£ç å¨104ä¹å ·æä¸ä¸ªBCCåæå¨122ï¼å®ä½¿ç¨æ¢å¤çBCCæç¤ºç 120ç±ç¸å ä¿¡å·112åæC个é³é¢è¾åºå£°é124ï¼ä»èåå«éè¿C个æ¬å£°å¨126æ¥ææ¾ã BCC encoder 102 sends summed signal 112 and BCC hint code data stream 116 (which may be, for example, in-band or out-of-band side information about the summed signal) to BCC decoder 104 of BCC system 100 . BCC decoder 104 has a side information processor 118 that processes data stream 116 to recover BCC hint codes 120 (eg, ICLD and ICTD data). The BCC decoder 104 also has a BCC synthesizer 122 which uses the recovered BCC hint code 120 to synthesize C audio output channels 124 from the summed signal 112 to be played through C speakers 126 respectively. the
é³é¢å¤çç³»ç»100å¯ä»¥å¨å¦5.1ç¯ç»é³åçå¤å£°éé³é¢ä¿¡å·çç¯å¢ä¸å®ç°ãç¹å«å°ï¼BCCç¼ç å¨102ç䏿··é¢å¨110å°å¸¸è§ç5.1ç¯ç»é³åçå 个è¾å ¥å£°é(å³äºä¸ªå¸¸è§å£°éå ä¸ä¸ªLFE声é)è½¬æ¢æç¸å ä¿¡å·112ãå¦å¤ï¼ç¼ç å¨102çBCCåæå¨114å°è¿å 个è¾å ¥å£°é忢å°é¢åï¼ä»¥äº§çç¸åºçBCCæç¤ºç 116ã类似å°ï¼BCCè§£ç å¨Â 104ç边信æ¯å¤çå¨118仿¥æ¶å°çè¾¹ä¿¡æ¯æµ116䏿¢å¤BCCæç¤ºç 120ï¼ç¶åè§£ç å¨104çBCCåæå¨122(1)å°æ¥æ¶å°çç¸å ä¿¡å·112忢å°é¢åï¼(2)ææ¢å¤çBCCæç¤ºç 120åºç¨å°é¢åçç¸å ä¿¡å·ä»¥äº§çå 个é¢åä¿¡å·ï¼ç¶å(3)æè¿äºé¢åä¿¡å·åæ¢å°åæç5.1ç¯ç»é³åçå 个æ¶å声é(å³äºä¸ªåæç常è§å£°éå ä¸ä¸ªåæçLFE声é)ï¼ä»¥éè¿æ¬å£°å¨126ææ¾ã The audio processing system 100 may be implemented in the environment of multi-channel audio signals such as 5.1 surround sound. In particular, the down-mixer 110 of the BCC encoder 102 converts the six input channels of conventional 5.1 surround sound (ie, five conventional channels plus one LFE channel) into a summed signal 112 . Additionally, the BCC analyzer 114 of the encoder 102 transforms the six input channels into the frequency domain to generate corresponding BCC hint codes 116 . Similarly, the side information processor 118 of the BCC decoder 104 recovers the BCC hint code 120 from the received side information stream 116, and the BCC synthesizer 122(1) of the decoder 104 then transforms the received summed signal 112 into In the frequency domain, (2) apply the recovered BCC cue code 120 to the summed signal in the frequency domain to produce six frequency domain signals, and then (3) transform these frequency domain signals into the six time domains of the synthesized 5.1 surround sound channels (ie, five synthesized regular channels plus one synthesized LFE channel) for playback through speaker 126. the
åæå 容 Contents of the invention
æ ¹æ®æ¬åæï¼æä¾äºï¼Â According to the present invention, there is provided:
å¯¹å ·æå¤ä¸ªé³é¢è¾å ¥å£°éçå¤å£°éé³é¢ä¿¡å·è¿è¡ç¼ç çæ¹æ³ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼è¯¥æ¹æ³å æ¬ï¼åºç¨åæ°é³é¢ç¼ç ææ¯ä¸ºç¨äºç¬¬ä¸ä¸ªé¢çèå´çææé³é¢è¾å ¥å£°é产çåæ°é³é¢ç¼ç ï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥ååºç¨åæ°é³é¢ç¼ç ææ¯ä» ä» ä¸ºç¨äºç¬¬äºä¸ªé¢çèå´ç常è§å£°é产çåæ°é³é¢ç¼ç ï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ææ¯çæå¯¹åºäºææé³é¢è¾å ¥å£°éçåæ°é³é¢ç¼ç ï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ææ¯çæä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éçåæ°é³é¢ç¼ç ã A method of encoding a multi-channel audio signal having a plurality of audio input channels, the multi-channel audio signal having a plurality of conventional channels and at least one low-frequency effects channel, the method comprising: applying a parametric audio coding technique to producing parametric audio encoding for all audio input channels of a first frequency range corresponding to one or more sub-bands below a specified cutoff frequency; and applying the parametric audio encoding technique only for the first frequency range Parametric audio coding is produced by conventional channels of two frequency ranges, the second frequency range corresponding to one or more sub-bands above a specified cutoff frequency, where: for the first frequency range, parametric audio coding techniques generate parametric audio coding corresponding to Parametric audio coding of all audio input channels; and for the second frequency range, the parametric audio coding technique generates parametric audio coding corresponding only to regular channels, not to at least one low-frequency effects channel. the
å¯¹å ·æå¤ä¸ªé³é¢è¾å ¥å£°éçå¤å£°éé³é¢ä¿¡å·è¿è¡ç¼ç çè£ ç½®ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼è¯¥è£ ç½®å æ¬ï¼åºç¨åæ°é³é¢ç¼ç ææ¯ä¸ºç¨äºç¬¬ä¸ä¸ªé¢çèå´çææé³é¢è¾å ¥å£°é产çåæ°é³é¢ç¼ç çè£ ç½®ï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥ååºç¨åæ°é³é¢ç¼ç ææ¯ä» ä» ä¸ºç¨äºç¬¬äºä¸ªé¢çèå´ç常è§å£°é产çåæ°é³é¢ç¼ç çè£ ç½®ï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ææ¯çæå¯¹åºäºææé³é¢è¾å ¥å£°éçåæ°é³é¢ç¼ç ï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ææ¯çæä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éçåæ°é³é¢ç¼Â ç ã Apparatus for encoding a multi-channel audio signal having a plurality of audio input channels, said multi-channel audio signal having a plurality of conventional channels and at least one low-frequency effects channel, comprising: applying a parametric audio coding technique to means for producing parametric audio coding for all audio input channels of a first frequency range corresponding to one or more sub-bands below a specified cut-off frequency; and applying parametric audio coding techniques only for Means for producing parametric audio coding on a regular channel of a second frequency range, said second frequency range corresponding to one or more sub-bands above a specified cut-off frequency, wherein: for a first frequency range, parametric audio coding The technique generates parametric audio coding corresponding to all audio input channels; and for the second frequency range, the parametric audio coding technique generates parametric audio coding corresponding only to conventional channels, not to at least one low-frequency effects channel. the
ä¸ç§åæ°é³é¢ç¼ç å¨ï¼å æ¬ï¼éç¨äºç±å¤å£°éé³é¢ä¿¡å·çå¤ä¸ªé³é¢è¾å ¥å£°é产çä¸ä¸ªæå¤ä¸ªç»å声éç䏿··é¢å¨ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼ä»¥ååæå¨ï¼ç¨äºäº§çï¼(1)ç¨äºç¬¬ä¸ä¸ªé¢çèå´å çææé³é¢è¾å ¥å£°éçåæ°é³é¢ç¼ç ï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥å(2)ä» ä» ç¨äºç¬¬äºä¸ªé¢çèå´å ç常è§å£°éçåæ°é³é¢ç¼ç ï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæå¨çæå¯¹åºäºææé³é¢è¾å ¥å£°éçåæ°é³é¢ç¼ç ï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæå¨çæä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éçåæ°é³é¢ç¼ç ã A parametric audio encoder comprising: a down-mixer adapted to generate one or more combined channels from a plurality of audio input channels of a multi-channel audio signal having a plurality of conventional channel and at least one low-frequency effects channel; and an analyzer for generating: (1) parametric audio encoding for all audio input channels in a first frequency range corresponding to one or more subbands of a specified cutoff frequency; and (2) parametric audio coding for regular channels only in a second frequency range corresponding to one or more subbands above the specified cutoff frequency subbands, where: for a first frequency range, the analyzer generates parametric audio encodings corresponding to all audio input channels; and for a second frequency range, the analyzer generates parametric audio encodings only for regular channels, not for at least one Parametric audio encoding of the low-frequency effects channel. the
å¯¹å ·æå¤ä¸ªé³é¢è¾åºå£°éçå¤å£°éé³é¢ä¿¡å·è¿è¡åæçæ¹æ³ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼è¯¥æ¹æ³å æ¬ï¼åºç¨åæ°é³é¢è§£ç ææ¯äº§çç¨äºç¬¬ä¸ä¸ªé¢çèå´çææé³é¢è¾åºå£°éï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥ååºç¨åæ°é³é¢è§£ç ææ¯ä» ä» äº§çç¨äºç¬¬äºä¸ªé¢çèå´ç常è§å£°éï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼åæ°é³é¢è§£ç ææ¯ä½¿ç¨åæ°é³é¢ç¼ç çæé³é¢è¾åºå£°éï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç 对åºäºææé³é¢è¾åºå£°éï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éã A method of synthesizing a multi-channel audio signal having a plurality of audio output channels, the multi-channel audio signal having a plurality of conventional channels and at least one low-frequency effects channel, the method comprising: applying parametric audio decoding techniques to generate all audio output channels for a first frequency range corresponding to one or more sub-bands below a specified cutoff frequency; and applying parametric audio decoding techniques to generate only for a second frequency range , the second frequency range corresponds to one or more sub-bands above the specified cutoff frequency; where: parametric audio decoding techniques use parametric audio coding to generate audio output channels; for the first frequency range, the parametric The audio encoding corresponds to all audio output channels; and for the second frequency range, the parametric audio encoding corresponds only to the normal channels, not to the at least one low frequency effects channel. the
Â å¯¹å ·æå¤ä¸ªé³é¢è¾åºå£°éçå¤å£°éé³é¢ä¿¡å·è¿è¡åæçè£ ç½®ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼è¯¥è£ ç½®å æ¬ï¼åºç¨åæ°é³é¢è§£ç ææ¯äº§çç¨äºç¬¬ä¸ä¸ªé¢çèå´çææé³é¢è¾åºå£°éçè£ ç½®ï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥ååºç¨åæ°é³é¢è§£ç ææ¯äº§çä» ä» ç¨äºç¬¬äºä¸ªé¢çèå´ç常è§å£°éçè£ ç½®ï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼åæ°é³é¢è§£ç ææ¯ä½¿ç¨åæ°é³é¢ç¼ç çæé³é¢è¾åºå£°éï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç 对åºäºææé³é¢è¾åºå£°éï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éã A device for synthesizing a multi-channel audio signal having a plurality of audio output channels, the multi-channel audio signal having a plurality of conventional channels and at least one low-frequency effect channel, the device comprising: applying parametric audio decoding techniques to generate means for all audio output channels of a first frequency range corresponding to one or more sub-bands below a specified cutoff frequency; and applying parametric audio decoding techniques to generate means of regular channels of frequency ranges, the second frequency range corresponding to one or more sub-bands above a specified cutoff frequency; wherein: parametric audio decoding techniques use parametric audio coding to generate audio output channels; for the first For the frequency range, the parametric audio coding corresponds to all audio output channels; and for the second frequency range, the parametric audio coding corresponds to only the normal channels, not to at least one low-frequency effects channel.
ä¸ç§åæ°é³é¢è§£ç å¨ï¼ç¨äºåæå ·æå¤ä¸ªé³é¢è¾åºå£°éçå¤å£°éé³é¢ä¿¡å·ï¼æè¿°å¤å£°éé³é¢ä¿¡å·å ·æå¤ä¸ªå¸¸è§å£°éåè³å°ä¸ä¸ªä½é¢ææå£°éï¼æè¿°åæ°é³é¢è§£ç å¨å æ¬ï¼è¾¹ä¿¡æ¯å¤çå¨ï¼å ¶éç¨äºæ¢å¤åæ°é³é¢ç¼ç ï¼ä»¥ååæå¨ï¼å ¶éäºï¼åºç¨åæ°é³é¢è§£ç ææ¯äº§çç¨äºç¬¬ä¸ä¸ªé¢çèå´çææé³é¢è¾åºå£°éï¼æè¿°ç¬¬ä¸ä¸ªé¢çèå´å¯¹åºäºä½äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼ä»¥ååºç¨åæ°é³é¢è§£ç ææ¯äº§çä» ä» ç¨äºç¬¬äºä¸ªé¢çèå´ç常è§å£°éï¼æè¿°ç¬¬äºä¸ªé¢çèå´å¯¹åºäºé«äºæå®æªæ¢é¢ççä¸ä¸ªæå¤ä¸ªåé¢å¸¦ï¼å ¶ä¸ï¼åæ°é³é¢è§£ç ææ¯ä½¿ç¨åæ°é³é¢ç¼ç çæé³é¢è¾åºå£°éï¼å¯¹äºç¬¬ä¸ä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç 对åºäºææé³é¢è¾åºå£°éï¼ä»¥å对äºç¬¬äºä¸ªé¢çèå´ï¼åæ°é³é¢ç¼ç ä» ä» å¯¹åºäºå¸¸è§å£°éï¼èä¸é对è³å°ä¸ä¸ªä½é¢ææå£°éã A parametric audio decoder for synthesizing a multi-channel audio signal with a plurality of audio output channels, the multi-channel audio signal having a plurality of conventional channels and at least one low-frequency effect channel, the parametric audio decoding The processor comprises: a side information processor adapted to recover parametric audio coding, and a synthesizer adapted to: apply parametric audio decoding techniques to generate all audio output channels for a first frequency range, said first frequency range corresponds to one or more sub-bands below the specified cutoff frequency; and applying parametric audio decoding techniques produces regular channels only for the second frequency range corresponding to frequencies above the specified cutoff frequency One or more frequency subbands; where: the parametric audio decoding technique generates audio output channels using parametric audio encoding; for the first frequency range, the parametric audio encoding corresponds to all audio output channels; and for the second frequency range, the parametric audio encoding The encoding corresponds only to the regular channels, not to at least one low-frequency effects channel. the
为äºç¯ç»é³åçåºç¨ï¼æ¬åæç宿½ä¾æ¶ååºäºBCCçåæ°é³é¢ç¼ç ææ¯ï¼å ¶ä¸åºäºé¢å¸¦çBCCç¼ç å¹¶ä¸åºç¨äºä½é¢è¶ ä½é³åå(LFE)声éçé«äºæªæ¢é¢ççåé¢å¸¦ãä¾å¦ï¼å¯¹5.1ç¯ç»é³åèè¨ï¼BCCç¼ç åºç¨äºææçå 个声é(å³äºä¸ªå¸¸è§å£°éå ä¸ä¸ªLFE声é)çä½äºæªæ¢é¢ççåé¢å¸¦ï¼èBCCç¼ç ä» ä» åºç¨äºäºä¸ªå¸¸è§å£°é(å³ä¸åºç¨äºLFE声é)çé«äºæªæ¢é¢ççåé¢å¸¦ãéè¿é¿å BCCç¼ç å¨LFE声éçâé«âé¢ä¸çåºç¨ï¼æ¬åæçè¿äºå®æ½ä¾ä¸ç¸åºç卿æé¢çä¸å¤çææå 个声éçåºäºBCCçç³»ç»ç¸æ¯ï¼å ·æ(1)åå°çç¼ç å¨åè§£ç å¨å¤çè´æ å(2)æ´å°çBCCç æ¯ç¹æµã For surround sound applications, embodiments of the present invention relate to BCC-based parametric audio coding techniques, where band-based BCC coding is not applied to sub-bands above the cutoff frequency of the low-frequency subwoofer (LFE) channel. For example, for 5.1 surround sound, BCC coding is applied to sub-bands below the cutoff frequency of all six channels (that is, five conventional channels plus one LFE channel), while BCC coding is applied to only five conventional channels. The sub-band above the cutoff frequency of the channel (i.e. not applied to the LFE channel). By avoiding the application of BCC coding on the "high" frequencies of the LFE channel, these embodiments of the invention have (1) reduced Encoder and decoder processing burden and (2) smaller BCC coded bitstream. the
æ´ä¸è¬èè¨ï¼æ¬åææ¶åå¦BCCç¼ç çåæ°é³é¢ç¼ç ææ¯çåºç¨ï¼ä½å¹¶ä¸å±éäºBCCç¼ç ï¼å ¶ä¸å¨ä¸¤ä¸ªææ´å¤ä¸ªä¸åçé¢çèå´å å¤çè¾å ¥å£°éçä¸¤ä¸ªææ´å¤ä¸ªä¸åçåéãå¦å¨è¯¥è¯´æä¹¦ä¸æä½¿ç¨çï¼æ¯è¯âåéâ坿å 嫿æè¾å ¥å£°éçéå以åé£äºå æ¬æ¯ææè¾å ¥å£°éè¦å°å£°éçéå½çåéãæ¬åæå¨5.1åå ¶å®ç¯ç»é³åä¿¡å·çBCCç¼ç ä¸çåºç¨åªæ¯æ¬åæçä¸ç§ç¹ä¾ã More generally, the present invention relates to the application of parametric audio coding techniques such as BCC coding, but not limited to BCC coding, in which two or more input channels are processed in two or more different frequency ranges. different subsets. As used in this specification, the term "subset" may refer to a set comprising all input channels as well as those suitable subsets comprising fewer than all input channels. The application of the present invention to BCC coding of 5.1 and other surround sound signals is only a special case of the present invention. the
éå¾è¯´æDescription of drawings
æ¬åæçå ¶å®æ¹é¢ãç¹å¾åä¼ç¹å°ç±ä¸é¢çå ·ä½æè¿°ãæéçæå©è¦æ±ä»¥åéå¾èæ´å ææ¾ï¼å ¶ä¸ï¼Â Other aspects, features and advantages of the present invention will be more apparent from the following detailed description, appended claims and accompanying drawings, wherein:
å¾1示åºäºæ§è¡å声éæç¤ºç (BCC)çé³é¢å¤çç³»ç»çæ¹æ¡å¾ï¼ä»¥å Figure 1 shows a block diagram of an audio processing system implementing a binaural cue code (BCC); and
å¾2示åºäºæ ¹æ®æ¬åæçä¸ä¸ªå®æ½ä¾æ§è¡BCCç¼ç çé³é¢å¤çç³»ç»çæ¹æ¡å¾ã FIG. 2 shows a block diagram of an audio processing system performing BCC encoding according to one embodiment of the present invention. the
å ·ä½å®æ½æ¹å¼ Detailed ways
å¾2示åºäºæ ¹æ®æ¬åæçä¸ä¸ªå®æ½ä¾æ§è¡5.1ç¯ç»é³åçå声éæç¤ºç¼ç (BCC)çé³é¢å¤çç³»ç»200çæ¹æ¡å¾ãBCCç³»ç»200å ·æä¸ä¸ªBCCç¼ç å¨202ï¼å®æ¥æ¶å 个é³é¢è¾å ¥å£°é208(å³äºä¸ªå¸¸è§å£°éåä¸ä¸ªLFE声é)ãBCCç¼ç å¨202å ·æä¸ä¸ªä¸æ··é¢å¨210ï¼å®æé³é¢è¾å ¥å£°é(å æ¬LFE声é)转æ¢(ä¾å¦åå¹³å)为ä¸ä¸ªæå¤ä¸ªãä½å°äºå 个çç»å声é212ã FIG. 2 shows a block diagram of an audio processing system 200 for performing binaural cue coding (BCC) for 5.1 surround sound according to an embodiment of the present invention. The BCC system 200 has a BCC encoder 202 that receives six audio input channels 208 (ie, five regular channels and one LFE channel). The BCC encoder 202 has a down- mixer 210 that converts (eg averages) the audio input channels (including the LFE channel) into one or more, but less than six, combined channels 212 . the
å¦å¤ï¼BCCç¼ç å¨202å ·æä¸ä¸ªBCCåæå¨214ï¼å®ä¸ºè¾å ¥å£°é产çBCCæç¤ºç æ°æ®æµ216ãå¦å¾2æç¤ºï¼å¯¹å¤äºæä½äºä¸ä¸ªæå®çæªæ¢é¢çfcçåé¢å¸¦ï¼BCCåæå¨214å¨çæBCCæç¤ºç æ°æ®æ¶ä½¿ç¨ææçå 个5.1ç¯ç»é³åè¾å ¥å£°é(å æ¬LFE声é)ã对ææå ¶å®(å³é«é¢)çåé¢å¸¦ï¼BCCåæå¨214ä» ä½¿ç¨äºä¸ªå¸¸è§å£°é(ä¸ä½¿ç¨LFE声é)æ¥äº§çBCCæç¤ºç æ°æ®ãç»æï¼LFE声éä» å¯¹å¤äºæä½äºæªæ¢é¢ççBCCåé¢å¸¦ãè䏿¯å¯¹æ´ä¸ªBCCé¢çèå´è´¡ç®BCCç ï¼ä»èåå°äºè¾¹ä¿¡æ¯æ¯ç¹æµçæ´ä¸ªå¤§å°ã Additionally, the BCC encoder 202 has a BCC analyzer 214 which generates a BCC hint code data stream 216 for the input channels. As shown in FIG. 2, for subbands at or below a specified cutoff frequency f , the BCC analyzer 214 uses all six 5.1 surround sound input channels (including the LFE channel) in generating the BCC cue code data . For all other (ie, high frequency) subbands, the BCC analyzer 214 uses only the five regular channels (the LFE channel is not used) to generate the BCC hint code data. As a result, the LFE channel contributes BCC codes only to the BCC subbands at or below the cutoff frequency, rather than to the entire BCC frequency range, thereby reducing the overall size of the side information bitstream.
æªæ¢é¢çæå¥½è¢«éæ©ä¸ºä½¿å¾LFE声éçææé³é¢å¸¦å®½å°äºæçäºfc(å³LFE声éå¨è¶ è¿æªæ¢é¢çæ¶å®é 䏿²¡æè½éæè 没æå®é çé³é¢å 容)ãé¤éåé¢å¸¦è¢«è°æ´å°æªæ¢é¢çï¼å¦åæªæ¢é¢çå°è½å ¥ç¹å®çåé¢å¸¦å ãè¿ç§æ åµä¸ï¼é¨ååé¢å¸¦å°è¶ è¿æªæ¢é¢çã为äºä¾¿äºè¯´æï¼è¿æ ·çåé¢å¸¦è¢«ç§°ä¸ºâå¤äºâæªæ¢é¢çä¸ãå¨ä¼éç宿½ä¾ä¸ï¼Â LFE声éçæ´ä¸ªåé¢å¸¦é½è¢«BCCç¼ç ï¼ä¸ä¸ä¸ªè¾é«é¢ççåé¢å¸¦ä¸ºæªç»è¿BCCç¼ç ç第ä¸é«é¢åé¢å¸¦ã The cutoff frequency is preferably chosen such that the effective audio bandwidth of the LFE channel is less than or equal to fc (ie the LFE channel has practically no energy or actual audio content above the cutoff frequency). Unless the subband is tuned to the cutoff frequency, the cutoff frequency will fall within the particular subband. In this case, some subbands will exceed the cutoff frequency. For ease of illustration, such sub-bands are said to be "at" the cutoff frequency. In a preferred embodiment, the entire subband of the LFE channel is BCC coded, and the next higher frequency subband is the first high frequency subband that is not BCC coded.
å¨ä¸ç§å¯è½çå®ç°æ¹å¼ä¸ï¼BCCæç¤ºç å æ¬è¾å ¥å£°éç声éé´çµå¹³å·®(ICLD)ã声éé´æ¶é´å·®(ICTD)以å声éé´ç¸å ³(ICC)æ°æ®ãBCCåæå¨214æå¥½æ§è¡ç±»ä¼¼äºâ877åâ458ç³è¯·ä¸ææè¿°çåºäºé¢å¸¦çå¤çï¼ä»è为é³é¢è¾å ¥å£°éçä¸ååé¢å¸¦äº§çICLDåICTDæ°æ®ãå¦å¤ï¼BCCåæå¨214æå¥½ä¸ºä¸åçåé¢å¸¦äº§çä½ä¸ºICCæ°æ®çç¸å¹²æ§é度ãè¿äºç¸å¹²æ§é度å¨â437åâ591ç³è¯·ä¸åäºæ´å ·ä½çæè¿°ã In a possible implementation manner, the BCC prompt code includes inter-channel level difference (ICLD), inter-channel time difference (ICTD) and inter-channel correlation (ICC) data of input channels. The BCC analyzer 214 preferably performs band-based processing similar to that described in the '877 and '458 applications to generate ICLD and ICTD data for different sub-bands of the audio input channels. In addition, the BCC analyzer 214 preferably generates coherence measures as ICC data for the different subbands. These coherence measures are described in more detail in the '437 and '591 applications. the
BCCç¼ç å¨202å°ä¸ä¸ªæå¤ä¸ªç»å声é212以åBCCæç¤ºç æ°æ®æµ216(ä¾å¦å ³äºè¯¥ç»å声éçå¸¦å æå¸¦å¤è¾¹ä¿¡æ¯)åéå°BCCç³»ç»200çBCCè§£ç å¨204ãBCCè§£ç å¨204å ·æä¸ä¸ªè¾¹ä¿¡æ¯å¤çå¨218ï¼å®å¯¹æ°æ®æµ216è¿è¡å¤çï¼ä»¥æ¢å¤BCCæç¤ºç 220(ä¾å¦ICLDãICTDåICCæ°æ®)ãBCCè§£ç å¨204è¿å ·æä¸ä¸ªBCCåæå¨222ï¼å®ä½¿ç¨æ¢å¤çBCCæç¤ºç 220ç±ä¸ä¸ªæå¤ä¸ªç»å声é212åæå 个é³é¢è¾åºå£°é224ï¼ä»èåå«éè¿å 个ç¯ç»é³åæ¬å£°å¨226ææ¾ã BCC encoder 202 sends one or more combined channels 212 and a BCC hint code data stream 216 (eg, in-band or out-of-band side information about the combined channels) to BCC decoder 204 of BCC system 200 . BCC decoder 204 has a side information processor 218 that processes data stream 216 to recover BCC hint codes 220 (eg, ICLD, ICTD, and ICC data). The BCC decoder 204 also has a BCC synthesizer 222 which uses the recovered BCC cue code 220 to synthesize six audio output channels 224 from the one or more composite channels 212 for playback through the six surround sound speakers 226 respectively. the
å¦å¾2æç¤ºï¼BCCåæå¨222对å¤äºæä½äºæªæ¢é¢çfcçåé¢å¸¦æ§è¡å 个声éçBCCåæï¼ä»è为ææçå 个5.1ç¯ç»å£°é(å³å æ¬LFE声é)产çé¢çå 容ï¼åæ¶å¯¹é«äºæªæ¢é¢ççåé¢å¸¦æ§è¡äºå£°éBCCåæï¼ä»èä» ä» ä¸º5.1ç¯ç»é³åçäºä¸ªå¸¸è§å£°é产çé¢çå 容ãç¹å«å°ï¼BCCåæå¨222ææ¥æ¶å°çç»å声é212å解为å¤ä¸ªåé¢å¸¦(ä¾å¦ä¸´çé¢å¸¦)ãå¨è¿äºåé¢å¸¦ä¸åºç¨ä¸åçå¤ç以è·åè¾åºé³é¢å£°éçç¸åºåé¢å¸¦ãç»æï¼å¯¹äºLFE声éä» è·åå ¶é¢çå¤äºæä½äºæªæ¢é¢ççåé¢å¸¦ãæ¢å¥è¯è¯´ï¼LFE声éä» å ·æå¤äºæä½äºæªæ¢é¢ççåé¢å¸¦çé¢çå 容ãLFE声éçè¾é«çåé¢å¸¦(å³é£äºé«äºæªæ¢é¢ççåé¢å¸¦)å¯ä»¥ç¨é¶ä¿¡å·æ¥å¡«å (å¦æå¿ è¦çè¯)ã As shown in FIG. 2 , BCC synthesizer 222 performs six-channel BCC synthesis of the subbands at or below the cutoff frequency f , resulting in frequencies for all six 5.1 surround channels (i.e., including the LFE channel). content while performing five-channel BCC synthesis on subbands above the cutoff frequency, resulting in frequency content for only the five regular channels of 5.1 surround sound. In particular, the BCC synthesizer 222 decomposes the received combined channel 212 into a plurality of sub-bands (eg, critical bands). Different processing is applied in these subbands to obtain the corresponding subbands of the output audio channels. As a result, only subbands whose frequencies are at or below the cutoff frequency are acquired for the LFE channel. In other words, the LFE channel only has the frequency content of subbands at or below the cutoff frequency. The upper subbands of the LFE channel (ie those above the cutoff frequency) can be filled with null signals (if necessary).
æ ¹æ®ç¹å«çå®ç°æ¹å¼ï¼BCCç¼ç å¨å¯ä»¥è¢«è®¾è®¡ä¸ºå¯¹ææçé¢ç产çBCCæç¤ºç ï¼ä¸ç®åå°ä¸å¯¹ç¹å«çåé¢å¸¦(ä¾å¦é«äºæªæ¢é¢ççåé¢å¸¦å/æå®é ä¸å ·æé¶è½éçåé¢å¸¦)åéè¿äºæç¤ºç ã类似å°ï¼Â ç¸åºçBCCè§£ç å¨å¯ä»¥è¢«è®¾è®¡ä¸ºå¯¹ææçé¢çæ§è¡å¸¸è§çBCCåæï¼å ¶ä¸BCCè§£ç å¨å¯¹é£äºä¸å ·ææç¡®ä¼ éçç¼ç çåé¢å¸¦åºç¨åéçBCCæç¤ºç å¼ã Depending on the particular implementation, the BCC encoder can be designed to generate BCC hint codes for all frequencies, and simply not for specific sub-bands (e.g. sub-bands above the cut-off frequency and/or sub-bands with practically zero energy ) to send these prompt codes. Similarly, a corresponding BCC decoder can be designed to perform conventional BCC synthesis for all frequencies, where the BCC decoder applies appropriate BCC hint code values for those subbands that do not have an explicitly transmitted code. the
è½ç¶æ¬åæå·²ç»å¨åºç¨äºâ877åâ458ç³è¯·æè¿°çææ¯æ¥åæå¬è§åºæ¯çBCCè§£ç å¨çæ åµä¸è¿è¡äºæè¿°ï¼ä½æ¯æ¬åæä¹å¯å¨åºç¨å ¶å®ææ¯æ¥åæå¬è§åºæ¯çBCCè§£ç å¨çæ åµä¸å®ç°ï¼èä¸å¿ ä¾èµäºâ877åâ458ç³è¯·æè¿°çææ¯ãä¾å¦ï¼æ¬åæçBCCå¤ççå®ç°å¯æ éICTDãICLDå/æICCæ°æ®ï¼å¯ä½¿ç¨æä¸ä½¿ç¨å ¶å®åéçä¾å¦ä¸å¤´é¨ç¸å ³ä¼ é彿°ç¸å ³èçæç¤ºç ã Although the present invention has been described in the context of a BCC decoder that applies the techniques described in the '877 and '458 applications to synthesize auditory scenes, the invention is also applicable to BCC decoders that employ other techniques to synthesize auditory scenes. case, without having to rely on the techniques described in the '877 and '458 applications. For example, the BCC processing of the present invention may be implemented without ICTD, ICLD and/or ICC data, with or without other suitable hint codes such as those associated with head-related transfer functions. the
å¨å¾2ç宿½ä¾ä¸ï¼éè¿æå 声éBCCåæåºç¨å°å¤äºæä½äºæªæ¢é¢ççåé¢å¸¦ä»¥åæäºå£°éBCCåæåºç¨å°é«äºæªæ¢é¢ççåé¢å¸¦æ¥å¯¹5.1ç¯ç»é³åè¿è¡ç¼ç ãå¨å¦ä¸ä¸ªå®æ½ä¾ä¸ï¼æ¬åæå¯åºç¨äº7.1ç¯ç»é³åï¼å ¶ä¸å «å£°éBCCåæè¢«åºç¨å°å¤äºæä½äºæå®çæªæ¢é¢ççåé¢å¸¦ï¼ä¸å£°éBCCåæ(æé¤äºå个LFE声é)被åºç¨å°é«äºæªæ¢é¢ççåé¢å¸¦ã In the embodiment of Figure 2, 5.1 surround sound is encoded by applying six-channel BCC analysis to sub-bands at or below the cut-off frequency and five-channel BCC analysis to sub-bands above the cut-off frequency. In another embodiment, the invention is applicable to 7.1 surround sound, where eight-channel BCC analysis is applied to sub-bands at or below a specified cutoff frequency, seven-channel BCC analysis (excluding the single LFE channel) is applied to subbands above the cutoff frequency. the
æ¬åæè¿å¯åºç¨äºå ·æè¶ è¿ä¸ä¸ªLFE声éçç¯ç»é³åãä¾å¦å¯¹äº10.2ç¯ç»é³åï¼å¯å°åäºå£°éBCCåæåºç¨å°å¤äºæä½äºæå®çæªæ¢é¢ççåé¢å¸¦ï¼èå°å声éBCCåæ(æé¤äºä¸¤ä¸ªLFE声é)åºç¨å°é«äºæªæ¢é¢ççåé¢å¸¦ãä½ä¸ºæ¿ä»£ï¼ä¹å¯ä»¥æå®ä¸¤ä¸ªä¸åçæªæ¢é¢çï¼ç¨äº10.2ç¯ç»é³åç第ä¸ä¸ªLFE声éçç¬¬ä¸æªæ¢é¢ç以åç¨äºç¬¬äºä¸ªLFE声éçç¬¬äºæªæ¢é¢çãå¨è¿ç§æ åµä¸ï¼åå®ç¬¬ä¸æªæ¢é¢çä½äºç¬¬äºæªæ¢é¢çï¼å¯å°åäºå£°éBCCåæåºç¨å°å¤äºæä½äºç¬¬ä¸æªæ¢é¢ççåé¢å¸¦ï¼èå°åä¸å£°éBCCåæ(æé¤äºç¬¬ä¸ä¸ªLFE声é)åºç¨å°(1)é«äºç¬¬ä¸æªè³é¢çä¸(2)å¤äºæä½äºç¬¬äºæªæ¢é¢ççåé¢å¸¦ï¼å¹¶å°å声éBCCåæ(æé¤äºä¸¤ä¸ªLFE声é)åºç¨å°é«äºç¬¬äºæªæ¢é¢ççåé¢å¸¦ã The invention is also applicable to surround sound with more than one LFE channel. For 10.2 surround sound for example, twelve-channel BCC analysis can be applied to subbands at or below the specified cutoff frequency, while ten-channel BCC analysis (with the two LFE channels excluded) can be applied above the cutoff frequency sub-band. Alternatively, two different cutoff frequencies can also be specified: a first cutoff frequency for the first LFE channel of 10.2 surround sound and a second cutoff frequency for the second LFE channel. In this case, assuming that the first cutoff frequency is lower than the second cutoff frequency, twelve-channel BCC analysis can be applied to subbands at or below the first cutoff frequency, while eleven-channel BCC analysis (excluding excluding the first LFE channel) to subbands that are (1) above the first cutoff frequency and (2) at or below the second cutoff frequency, and apply the ten-channel BCC analysis (excluding the two LFE channels ) is applied to subbands above the second cutoff frequency. the
类似å°ï¼ä¸äºæ¶è´¹ç±»å¤å£°é设å¤è¢«ç¹æè®¾è®¡ä¸ºå ·æä¸åé¢çèå´çä¸åè¾åºå£°éãä¾å¦ï¼ä¸äº5.1ç¯ç»é³å设å¤å ·æä¸¤ä¸ªåæ¹å£°éï¼è¿äºåæ¹å£°éè¢«è®¾è®¡ä¸ºä» åç°ä½äº7kHzçé¢çãæ¬åæéè¿æå®ä¸¤Â ä¸ªæªæ¢é¢çå¯åºç¨äºè¿ç±»ç³»ç»ï¼ä¸ä¸ªæªæ¢é¢çç¨äºLFE声éï¼èä¸ä¸ªè¾é«çç¨äºåæ¹å£°éãå¨è¿ç§æ åµä¸ï¼å 声éBCCåæå¯è¢«åºç¨å°å¤äºæä½äºLFEæªæ¢é¢ççåé¢å¸¦ï¼äºå£°éBCCåæ(æé¤äºLFE声é)å¯è¢«åºç¨å°(1)é«äºLFEæªæ¢é¢çå(2)å¤äºæä½äºåæ¹å£°éæªæ¢é¢ççåé¢å¸¦ï¼ä¸å£°éBCCåæ(æé¤äºLFE声éåä¸¤ä¸ªåæ¹å£°é)å¯è¢«åºç¨å°é«äºåæ¹å£°éæªæ¢é¢ççåé¢å¸¦ã Similarly, some consumer multi-channel devices are purposely designed to have different output channels with different frequency ranges. For example, some 5.1 surround sound equipment has two rear channels that are designed to reproduce only frequencies below 7kHz. The present invention is applicable to such systems by specifying two cutoff frequencies: one for the LFE channel and one higher for the rear channels. In this case, six-channel BCC analysis can be applied to subbands at or below the LFE cutoff frequency, and five-channel BCC analysis (excluding the LFE channel) can be applied to (1) above the LFE cutoff frequency and (2) sub-bands at or below the cut-off frequency of the rear channels, the three-channel BCC analysis (excluding the LFE channel and the two rear channels) can be applied to the sub-bands above the cut-off frequency of the rear channels. the
æ¬åæè¿å¯è¿ä¸æ¥æ¨å¹¿ç¨äºæåæ°é³é¢ç¼ç åºç¨äºä¸¤ä¸ªææ´å¤ä¸ªä¸åé¢çèå´å çè¾å ¥å£°éçä¸¤ä¸ªææ´å¤ä¸ªä¸åçåéï¼å ¶ä¸åæ°é³é¢ç¼ç å¯ä»¥ä¸åäºBCCç¼ç ï¼ä¸åé¢çèå´è¢«éæ©ä¸ºä½¿å¾ä¸åè¾å ¥å£°éçé¢çå 容å¨è¿äºèå´å åæ åºæ¥ãæ ¹æ®ç¹å®çåºç¨ï¼ä¸åç声éå¯ä»¥ä»¥ä»»ä½éå½çç»å被æé¤å¨ä¸åé¢çèå´ä¹å¤ãä¾å¦ï¼ä½é¢å£°éå¯ä»¥è¢«æé¤å¨é«é¢åºåä¹å¤å/æé«é¢å£°éå¯ä»¥è¢«æé¤å¨ä½é¢åºåä¹å¤ãçè³å¯ä»¥æ¯è¿ç§æ åµï¼å³ä»»ä½å个é¢çèå´é½ä¸è½å 嫿æçè¾å ¥å£°éã The invention can be further extended to apply parametric audio coding to two or more different subsets of input channels in two or more different frequency ranges, where parametric audio coding can be different from BCC coding, The different frequency ranges are chosen such that the frequency content of the different input channels is reflected within these ranges. Depending on the particular application, different channels may be excluded from different frequency ranges in any suitable combination. For example, low frequency channels may be excluded from the high frequency region and/or high frequency channels may be excluded from the low frequency region. It may even be the case that any single frequency range cannot encompass all input channels. the
å¦å åæè¿°ï¼å°½ç®¡è¾å ¥å£°é208å¯ä»¥è¢«ä¸æ··é¢ä»¥å½¢æå个ç»å(ä¾å¦å)声é212ï¼å¨å¯éçå®ç°æ¹å¼ä¸ï¼æ ¹æ®ç¹å®çé³é¢å¤çåºç¨ï¼å¤ä¸ªè¾å ¥å£°éå¯ä»¥è¢«ä¸æ··é¢ä»èå½¢æä¸¤ä¸ªæå¤ä¸ªä¸åçâç»åâ声éãè¿ç§ææ¯çæ´å¤ä¿¡æ¯å¯è§04å¹´1æ20æ¥æäº¤çç¾å½ä¸å©ç³è¯·å·10/762100ï¼å ¶å 容卿¤å¹¶å ¥ä½ä¸ºåèã As previously stated, while input channels 208 may be down-mixed to form a single combined (e.g., mono) channel 212, in alternative implementations, multiple input channels may be down-mixed, depending on the particular audio processing application. Mixing to form two or more different "combined" channels. More information on this technique can be found in US Patent Application No. 10/762,100, filed January 20, 04, the contents of which are incorporated herein by reference. the
å¨ä¸äºå®ç°æ¹å¼ä¸ï¼å½è¿è¡ä¸æ··é¢ä»¥äº§çå¤ä¸ªç»å声鿶ï¼ç»å声éçæ°æ®å¯ä½¿ç¨å¸¸è½¨çé³é¢ä¼ è¾ææ¯æ¥ä¼ éãä¾å¦ï¼å¨äº§ç两个ç»å声鿶ï¼è½å¤ä½¿ç¨å¸¸è½¨çç«ä½å£°ä¼ è¾ææ¯ãå¨è¿ç§æ åµä¸ï¼BCCè§£ç å¨å¯æå并使ç¨BCCç¼ç ä»ä¸¤ä¸ªç»å声éåæå¤å£°éä¿¡å·(ä¾å¦5.1ç¯ç»é³å)ãæ¤å¤ï¼è¿å¯ä»¥æä¾åä¸çå ¼å®¹æ§ï¼å ¶ä¸ä¸¤ä¸ªBCCåæå£°é使ç¨å¸¸è½¨ç(å³ä¸åºäºBCCç)ç«ä½å£°è§£ç 卿¥åæ¾è忽ç¥BCCç¼ç ã类似å°ï¼åä¸å ¼å®¹æ§çå®ç°å¯å¨äº§çå个BCCç»å声鿶ç¨äºå¸¸è½¨çåè§£ç å¨ãè¦æ³¨æçæ¯ï¼ç论ä¸è®²ï¼å½æå¤ä¸ªâç»åâ声鿶ï¼è¿äºç»å声éä¸çä¸ä¸ªæå¤ä¸ªå®é ä¸å¯åºäºåç¬çè¾å ¥å£°éã In some implementations, when downmixing is performed to generate multiple combined channels, data for the combined channels may be transmitted using conventional audio transmission techniques. For example, when generating two combined channels, a conventional stereo transmission technique can be used. In this case, a BCC decoder can extract and use BCC encoding to synthesize a multi-channel signal (eg 5.1 surround sound) from two combined channels. Furthermore, this can provide backward compatibility where the two BCC synthesis channels are played back using a regular (ie non-BCC based) stereo decoder ignoring the BCC encoding. Similarly, a backwards-compatibility implementation can be used with a single decoder for conventional tracks when producing a single BCC composite channel. Note that, theoretically, when there are multiple "combined" channels, one or more of these combined channels could actually be based on separate input channels. the
è½ç¶BCCç³»ç»200å¯å ·æä¸é³é¢è¾åºå£°éç¸åæ°ç®çé³é¢è¾å ¥Â 声éï¼ä½å¨ä½ä¸ºæ¿ä»£ç宿½ä¾ä¸ï¼è¾å ¥å£°éçæ°ç®å¯æ ¹æ®ç¹å®çåºç¨å¤§äºæå°äºè¾åºå£°éçæ°ç®ãä¾å¦ï¼è¾å ¥é³é¢å¯å¯¹åºäº7.1ç¯ç»é³åï¼èåæçè¾åºé³é¢å¯å¯¹åºäº5.1ç¯ç»é³åï¼åä¹äº¦ç¶ã While the BCC system 200 may have the same number of audio input channels as audio output channels, in alternative embodiments, the number of input channels may be greater or less than the number of output channels depending on the particular application. For example, the input audio may correspond to 7.1 surround sound, while the synthesized output audio may correspond to 5.1 surround sound, or vice versa. the
ä¸è¬æ¥è®²ï¼æ¬åæçBCCç¼ç å¨çå®ç°å¯åºäºè¿æ ·ä¸ç§æ åµï¼å³æM个è¾å ¥é³é¢å£°éè½¬æ¢æN个ç»å声é以åä¸ä¸ªæå¤ä¸ªç¸åºçBCCç¼ç åéï¼å ¶ä¸Mï¼Nâ¥1ã类似å°ï¼æ¬åæçBCCè§£ç å¨çå®ç°è¿å¯åºäºè¿æ ·ä¸ç§æ åµï¼å³ä»N个ç»åé³é¢å£°éä¸äº§çP个è¾åºå£°é以åç¸åºçBCCç¼ç åéï¼å ¶ä¸Pï¼Nï¼ä¸På¯ä»¥åMç¸åä¹å¯ä»¥ä¸åã In general, the implementation of the BCC encoder of the present invention can be based on the case of converting M input audio channels into N combined channels and one or more corresponding BCC coded subsets, where M>N â¥1. Similarly, the realization of the BCC decoder of the present invention can also be based on the fact that P output channels and corresponding BCC coding subsets are generated from N combined audio channels, where P>N, and P can It can be the same as M or different. the
æ ¹æ®ç¹å®çå®ç°æ¹å¼ï¼å¾2çBCCç¼ç å¨202åBCCè§£ç å¨204äºè æ¥æ¶å°ç以å产ççå¤ç§ä¸åä¿¡å·å¯ä»¥æ¯æ¨¡æå/æ°åä¿¡å·çä»»ä½éå½çç»åï¼å æ¬ææç模æä¿¡å·æææçæ°åä¿¡å·ãè½ç¶å¾2䏿ªä½æ¾ç¤ºï¼ä½æ¬é¢åçææ¯äººåå¯ä»¥çè§£ï¼ä¸ä¸ªæå¤ä¸ªç»å声é212以åBCCæç¤ºç æ°æ®æµ116å¯ç±BCCç¼ç å¨202è¿è¡è¿ä¸æ¥çç¼ç ï¼å¹¶ç±BCCè§£ç å¨204è¿è¡ç¸åºçè§£ç ï¼ä¾å¦åºäºä¸äºéå½çå缩æºå¶(ä¾å¦ADPCM)è¿ä¸æ¥åå°ä¼ éçæ°æ®ç大å°ã Depending on the particular implementation, the various signals received by and generated by both BCC encoder 202 and BCC decoder 204 of FIG. 2 may be any suitable combination of analog and/or digital signals, including all analog signals or all digital signal. Although not shown in FIG. 2 , those skilled in the art can understand that one or more composite channels 212 and the BCC prompt code data stream 116 can be further encoded by the BCC encoder 202 and correspondingly encoded by the BCC decoder 204. The decoding of , eg based on some suitable compression mechanism (eg ADPCM), further reduces the size of the transmitted data. the
ä»BCCç¼ç å¨202å°BCCè§£ç å¨204çæ°æ®ä¼ è¾çå®ä¹åå³äºé³é¢å¤çç³»ç»200çç¹å®åºç¨ãä¾å¦ï¼å¨ä¸äºå®æ½ä¾ä¸ï¼å¦é³ä¹ä¼çç°åºå¹¿æï¼ä¼ è¾å¯æ¶åæ°æ®ç宿¶ä¼ è¾ä»¥ä¾¿å¨è¿ç«¯ä½ç½®ç«å³ææ¾ãå¨å ¶å®çåºç¨ä¸ï¼âä¼ è¾â坿¶åæ°æ®å°CDæå ¶å®éå½çåå¨ä»è´¨çåå¨ä»¥ç¨äºä»¥åç(å³é宿¶ç)ææ¾ãå½ç¶ï¼å ¶å®çåºç¨ä¹æ¯æå¯è½çã The definition of data transfer from BCC encoder 202 to BCC decoder 204 depends on the particular application of audio processing system 200 . For example, in some embodiments, such as a live broadcast of a concert, transmission may involve real-time transmission of data for immediate playback at a remote location. In other applications, "transfer" may involve storage of data to a CD or other suitable storage medium for later (ie, non-real-time) playback. Of course, other applications are also possible. the
æ ¹æ®ç¹å®çå®ç°æ¹å¼ï¼ä¼ è¾ä¿¡éå¯ä»¥æ¯æçº¿çææ çº¿çï¼å¯ä»¥ä½¿ç¨å®å¶çææ ååçåè®®(ä¾å¦IP)ãå¦CDãDVDãæ°åç£å¸¦å½é³æºä»¥ååºæåå¨å¨çä»è´¨å¯ç¨äºåå¨ãæ¤å¤ï¼ä¼ è¾å/æåå¨å¯ä»¥å æ¬ãä½å¹¶ä¸å¿ é¡»å æ¬ä¿¡éç¼ç ã类似å°ï¼è½ç¶æ¬åæå·²ç»åºäºæ°åé³é¢ç³»ç»åäºæè¿°ï¼ä½æ¬é¢åçææ¯äººåå¯ä»¥çè§£çæ¯ï¼æ¬åæè¿å¯åºäºæ¨¡æé³é¢ç³»ç»çæ 嵿¥å®ç°ï¼å¦AMæ 线çµãFMæ 线çµä»¥å模æçµè§å¹¿æçé³é¢é¨åï¼ä»¥ä¸åæ¯æéå ç带å 使¯ç¹çä¼ è¾ä¿¡éçå¼å ¥ã Depending on the particular implementation, the transport channel can be wired or wireless, and custom or standardized protocols (eg, IP) can be used. Media such as CDs, DVDs, digital tape recorders, and solid-state memory can be used for storage. Furthermore, transmission and/or storage may, but need not, include channel coding. Similarly, although the present invention has been described in terms of digital audio systems, those skilled in the art will appreciate that the present invention can also be practiced in the context of analog audio systems, such as AM radio, FM radio, and analog television broadcasting. In the audio part, all of the above support the introduction of additional in-band low bit rate transmission channels. the
æ¬åæçå®ç°è¿å¯ç¨äºè®¸å¤ç§ä¸åçåºç¨ï¼å¦é³ä¹åç°ã广æå çµè¯ææ¯ãä¾å¦ï¼æ¬åæçå®ç°è¿å¯ç¨äºæ°åæ 线çµ/çµè§/å ç¹ç½(ä¾å¦ç½ç«æ·æ)广æï¼å¦å¤©ç¼æå«æå¹¿æå ¬å¸æXMå«æå¹¿æå ¬å¸ãå ¶å®çåºç¨å æ¬IPè¯é³ãPSTNæå ¶å®è¯é³ç½ç»ãæ¨¡ææ çº¿çµå¹¿æåå ç¹ç½å¹¿æã Implementations of the present invention can also be used in many different applications, such as music reproduction, broadcasting and telephony. For example, implementations of the present invention may also be used with digital radio/television/Internet (eg, webcast) broadcasts, such as Sirius Satellite Broadcasting or XM Satellite Broadcasting. Other applications include Voice over IP, PSTN or other voice networks, analog radio broadcasting and Internet broadcasting. the
æ ¹æ®ç¹å®çåºç¨ï¼å¯éç¨ä¸åçææ¯æBCCç¼ç çéååµå ¥å°ç»å声éä¸ä»èè·å¾æ¬åæçBCCä¿¡å·ãä»»ä½ç¹å®ææ¯çå¯è¡æ§å¯è½è³å°é¨åå°ä¾èµäºç¨äºBCCä¿¡å·çç¹å®çä¼ è¾/åå¨ä»è´¨ãä¾å¦ï¼æ°åæ 线çµå¹¿æåè®®éå¸¸æ¯æå æ¬è¢«å¸¸è§æ¥æ¶æºå¿½ç¥çéå å¢å¼ºæ¯ç¹(ä¾å¦ï¼å¨æ°æ®å çå 头é¨å)ãè¿äºéå çæ¯ç¹å¯ç¨æ¥è¡¨ç¤ºå¬è§åºæ¯åæ°çéå以æä¾BCCä¿¡å·ãä¸è¬æ¥è¯´ï¼æ¬åæçå®ç°ä¸å¯ä½¿ç¨ä»»ä½åéçææ¯ä¸ºé³é¢ä¿¡å·åæ è®°ï¼å ¶ä¸å¯¹åºäºå¬è§åºæ¯åæ°éåçæ°æ®è¢«åµå ¥å°é³é¢ä¿¡å·ä¸ä»èå½¢æBCCä¿¡å·ãä¾å¦ï¼è¿äºææ¯å¯æ¶åéè卿ç¥é®è½æ²çº¿ä¸çæ°æ®æéèå¨ä¼ªéæºåªå£°ä¸çæ°æ®ãä¼ªéæºåªå£°æè§èµ·æ¥åå¹³ç¼çåªå£°ãæ°æ®åµå ¥çå®ç°è¿å¯ä½¿ç¨ç±»ä¼¼äºTDM(æ¶åå¤ç¨)ä¼ è¾ä¸éç¨çä½å æ¢å¤ºæ¹æ³ï¼ä»èç¨äºå¸¦å 信令ãå¦ä¸ç§å¯è½çææ¯æ¯Î¼å¾LSBä½ç¿»è½¬ï¼å ¶ä¸æä½ææä½ç¨æ¥ä¼ éæ°æ®ã Depending on the specific application, different techniques can be used to embed the set of BCC codes into the combined channel to obtain the BCC signal of the present invention. The feasibility of any particular technique may depend, at least in part, on the particular transmission/storage medium used for the BCC signal. For example, digital radio broadcast protocols often support the inclusion of additional enhancement bits (eg, in the header portion of data packets) that are ignored by conventional receivers. These additional bits can be used to represent the set of auditory scene parameters to provide a BCC signal. In general, any suitable technique may be used in the implementation of the present invention to mark the audio signal, wherein data corresponding to a set of auditory scene parameters is embedded into the audio signal to form a BCC signal. For example, these techniques may involve data hidden under a perceptual masking curve or data hidden in pseudorandom noise. Pseudorandom noise feels like smooth noise. Data embedding can also be implemented for in-band signaling using a bit-snatching approach similar to that employed in TDM (Time Division Multiplexing) transmission. Another possible technique is μ-law LSB bit flipping, where the least significant bit is used to transmit data. the
æ¬åæå¯ç¨åºäºçµè·¯çå¤çæ¹å¼æ¥å®ç°ï¼å æ¬å¨å个éæçµè·¯ä¸çå¯è½çå®ç°ã对æ¬é¢åææ¯äººåæ¥è¯´å¾ææ¾ï¼çµè·¯å ä»¶çå¤ç§åè½è¿å¯ç¨è½¯ä»¶ç¨åºä¸çå¤çæ¥éª¤æ¥å®ç°ãè¿æ ·ç软件ä¾å¦å¯ç¨äºæ°åä¿¡å·å¤çå¨ãå¾®æ§å¶å¨æéç¨è®¡ç®æºä¸ã The invention may be implemented in circuit-based processing, including possible implementations on a single integrated circuit. It will be apparent to those skilled in the art that various functions of circuit elements may also be implemented by processing steps in a software program. Such software can be used, for example, in digital signal processors, microcontrollers or general purpose computers. the
æ¬åæå¯ä»¥ç¨è¿äºæ¹æ³ä»¥åå®ç°è¿äºæ¹æ³çè£ ç½®ç形弿¥ä½ç°ãæ¬åæè¿å¯ä»¥ç¨å å«å¨å¦è½¯çãCD-ROMã硬çæå ¶å®ä»»ä½æºå¨å¯è¯»åå¨ä»è´¨ççæå½¢ä»è´¨ä¸çç¨åºä»£ç ç形弿¥ä½ç°ï¼å ¶ä¸å½ç¨åºä»£ç 被å è½½å°å¦è®¡ç®æºççæºå¨ä¸å¹¶è¢«æ§è¡æ¶ï¼è¯¥æºå¨å°±æä¸ºå®ç°æ¬åæçè£ ç½®ãæ¬åæè¿å¯ä»¥ç¨ç¨åºä»£ç ç形弿¥ä½ç°ï¼è¿äºç¨åºä»£ç ä¾å¦åå¨å¨åå¨ä»è´¨ä¸ãå è½½å°æºå¨ä¸å/æç±æºå¨æ§è¡ï¼æè éè¿æäºä¼ è¾ä»è´¨æè½½ä½æ¥ä¼ éï¼å¦å¨çµçº¿æçµç¼ä¸ãéè¿å 纤ãæç»ç±çµç£è¾å°è¿è¡ä¼ éï¼å ¶ä¸å½ç¨åºä»£ç 被å è½½å°å¦è®¡ç®æºççæºå¨ä¸å¹¶è¢«æ§è¡æ¶ï¼è¯¥æºå¨å°±æä¸ºå®ç°æ¬åæçè£ ç½®ãå½å¨éç¨å¤çå¨ä¸å®ç°æ¶ï¼ç¨åºä»£ç 段ä¸è¯¥å¤çå¨å ±åæä¾ä¸ç§å¯ç±»ä¼¼äºä¸ç¨é»è¾çµè·¯å·¥ä½çå¯ä¸çè£ ç½®ã The present invention can be embodied in the form of these methods and apparatuses for carrying out these methods. The present invention can also be embodied in the form of program code contained in a tangible medium such as a floppy disk, CD-ROM, hard disk or any other machine-readable storage medium, wherein when the program code is loaded into a machine such as a computer and When executed, the machine becomes a means for implementing the invention. The present invention can also be embodied in the form of program code, for example, stored in a storage medium, loaded into a machine and/or executed by a machine, or transmitted through some transmission medium or carrier, such as on an electric wire or cable. , through optical fiber, or via electromagnetic radiation, wherein when the program code is loaded into a machine such as a computer and executed, the machine becomes a means for implementing the present invention. When implemented on a general-purpose processor, the program code segments cooperate with the processor to provide a unique device that operates like specific logic circuits. the
è¿åºçè§£çæ¯ï¼ä¸ºäºä¾¿äºéè¿°æ¬åæçæ¬è´¨èå·²ç»æè¿°åå±ç¤ºçé¨ä»¶çç»èãææä»¥åæåï¼é½å¯ç±æ¬é¢åææ¯äººåå¨ä¸è¿èå¦ä¸é¢çæå©è¦æ±æè¡¨è¿°çæ¬åæçèå´çæ åµä¸å 以æ¹åã It should also be understood that details, materials and arrangements of parts which have been described and shown for the purpose of illustrating the nature of the invention can be changed by those skilled in the art without departing from the scope of the invention as expressed in the following claims Change it below. the
Claims (16)1. the multi-channel audio signal with a plurality of audio frequency input sound channels is carried out Methods for Coding, said multi-channel audio signal has a plurality of conventional sound channels and at least one low-frequency effect sound channel, and this method comprises:
Application parameter audio coding technology produces parametric audio coding for all audio frequency input sound channels that are used for first frequency range, and said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
Application parameter audio coding technology only produces parametric audio coding for the conventional sound channel that is used for second frequency range, and said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency, wherein:
For first frequency range, the parametric audio coding technology generates the parametric audio coding corresponding to all audio frequency input sound channels; And
For second frequency range, the parametric audio coding technology generates only corresponding to conventional sound channel, and is not directed against the parametric audio coding of at least one low-frequency effect sound channel.
2. according to the process of claim 1 wherein that the parametric audio coding technology is a dual track keying BCC coding, and said parametric audio coding is the BCC sign indicating number.
3. according to the process of claim 1 wherein that cut-off frequency is effective audio bandwidth of low-frequency effect sound channel at least.
4. according to the process of claim 1 wherein that multi-channel audio signal is 5.1 surround sound signals.
5. according to the method for claim 1, also comprise and transmit the parametric audio coding that is used for first and second frequency ranges.
6. the multi-channel audio signal with a plurality of audio frequency input sound channels is carried out apparatus for encoding, said multi-channel audio signal has a plurality of conventional sound channels and at least one low-frequency effect sound channel, and this device comprises:
Application parameter audio coding technology is the device that is used for all audio frequency input sound channels generation parametric audio codings of first frequency range, and said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
Application parameter audio coding technology only produces the device of parametric audio coding for the conventional sound channel that is used for second frequency range, said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency, wherein:
For first frequency range, the parametric audio coding technology generates the parametric audio coding corresponding to all audio frequency input sound channels; And
For second frequency range, the parametric audio coding technology generates only corresponding to conventional sound channel, and is not directed against the parametric audio coding of at least one low-frequency effect sound channel.
7. parametric audio coders comprises:
Be applicable to the down-conversion mixer that is produced one or more combined channels by a plurality of audio frequency input sound channels of multi-channel audio signal, said multi-channel audio signal has a plurality of conventional sound channels and at least one low-frequency effect sound channel; And
Analyzer is used for producing:
(1) be used for the parametric audio coding of all the audio frequency input sound channels in first frequency range, said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
(2) only be used for the parametric audio coding of the conventional sound channel in second frequency range, said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency, wherein:
For first frequency range, analyzer generates the parametric audio coding corresponding to all audio frequency input sound channels; And
For second frequency range, analyzer generates only corresponding to conventional sound channel, and is not directed against the parametric audio coding of at least one low-frequency effect sound channel.
8. according to the parametric audio coders of claim 7, wherein parametric audio coding is the BCC coding.
9. according to the parametric audio coders of claim 7, also comprise being applicable to the parametric audio coding that transmits first and second frequency ranges.
10. the method for the multi-channel audio signal with a plurality of audio frequency output channels being synthesized, said multi-channel audio signal have a plurality of conventional sound channels and at least one low-frequency effect sound channel, and this method comprises:
Application parameter audio decoder technology produces all audio frequency output channels that are used for first frequency range, and said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
Application parameter audio decoder technology only produces the conventional sound channel that is used for second frequency range, and said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency; Wherein:
Parametric audio decoding technique operation parameter audio coding generates the audio frequency output channels;
For first frequency range, parametric audio coding is corresponding to all audio frequency output channels; And
For second frequency range, parametric audio coding is only corresponding to conventional sound channel, and at least one low-frequency effect sound channel.
11. according to the method for claim 10, wherein the parametric audio decoding technique is the BCC decoding.
12. according to the method for claim 10, wherein cut-off frequency is effective audio bandwidth of low-frequency effect sound channel at least.
13. according to the method for claim 10, wherein multi-channel audio signal is 5.1 surround sound signals.
14. to the device that the multi-channel audio signal with a plurality of audio frequency output channels synthesizes, said multi-channel audio signal has a plurality of conventional sound channels and at least one low-frequency effect sound channel, this device comprises:
Application parameter audio decoder technology produces the device of all audio frequency output channels that are used for first frequency range, and said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
Application parameter audio decoder technology produces the device of the conventional sound channel that only is used for second frequency range, and said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency; Wherein:
Parametric audio decoding technique operation parameter audio coding generates the audio frequency output channels;
For first frequency range, parametric audio coding is corresponding to all audio frequency output channels; And
For second frequency range, parametric audio coding is only corresponding to conventional sound channel, and at least one low-frequency effect sound channel.
15. a parametric audio decoder is used for synthetic multi-channel audio signal with a plurality of audio frequency output channels, said multi-channel audio signal has a plurality of conventional sound channels and at least one low-frequency effect sound channel, and said parametric audio decoder comprises:
The side information processor, it is applicable to the recovery parametric audio coding, and
Synthesizer, it is suitable for:
Application parameter audio decoder technology produces all audio frequency output channels that are used for first frequency range, and said first frequency range is corresponding to being lower than one or more sub-bands of specifying cut-off frequency; And
Application parameter audio decoder technology produces the conventional sound channel that only is used for second frequency range, and said second frequency range is corresponding to being higher than one or more sub-bands of specifying cut-off frequency; Wherein:
Parametric audio decoding technique operation parameter audio coding generates the audio frequency output channels;
For first frequency range, parametric audio coding is corresponding to all audio frequency output channels; And
For second frequency range, parametric audio coding is only corresponding to conventional sound channel, and at least one low-frequency effect sound channel.
16. according to the parametric audio decoder of claim 15, wherein parametric audio coding is the BCC coding.
CN2005800070361A 2004-03-04 2005-02-23 Frequency-based coding of audio channels in parametric multi-channel coding systems Expired - Lifetime CN1930914B (en) Applications Claiming Priority (5) Application Number Priority Date Filing Date Title US54997204P 2004-03-04 2004-03-04 US60/549,972 2004-03-04 US10/827,900 US7805313B2 (en) 2004-03-04 2004-04-20 Frequency-based coding of channels in parametric multi-channel coding systems US10/827,900 2004-04-20 PCT/US2005/005605 WO2005094125A1 (en) 2004-03-04 2005-02-23 Frequency-based coding of audio channels in parametric multi-channel coding systems Publications (2) Family ID=37859620 Family Applications (1) Application Number Title Priority Date Filing Date CN2005800070361A Expired - Lifetime CN1930914B (en) 2004-03-04 2005-02-23 Frequency-based coding of audio channels in parametric multi-channel coding systems Country Status (2) Families Citing this family (12) * Cited by examiner, â Cited by third party Publication number Priority date Publication date Assignee Title EP2144229A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Efficient use of phase information in audio encoding and decoding WO2010013450A1 (en) * 2008-07-29 2010-02-04 ããã½ããã¯æ ªå¼ä¼ç¤¾ Sound coding device, sound decoding device, sound coding/decoding device, and conference system RU2505941C2 (en) * 2008-07-31 2014-01-27 ФÑаÑÐ½Ñ Ð¾ÑеÑ-ÐезеллÑÑаÑÑ ÑÑÑ Ð¤ÑÑдеÑÑнг Ð´ÐµÑ Ð°Ð½Ð³ÐµÐ²Ð°Ð½Ð´Ñен ФоÑÑÑнг Ð.Ф. Generation of binaural signals CN102227769A (en) 2008-10-01 2011-10-26 Gvbbæ§è¡è¡ä»½æéå ¬å¸ Decoding apparatus, decoding method, encoding apparatus, encoding method, and editing apparatus EP2175670A1 (en) * 2008-10-07 2010-04-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Binaural rendering of a multi-channel audio signal EP4528726A1 (en) * 2008-10-29 2025-03-26 Dolby International AB Signal clipping protection using pre-existing audio gain metadata EP2214161A1 (en) * 2009-01-28 2010-08-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method and computer program for upmixing a downmix audio signal US12002476B2 (en) 2010-07-19 2024-06-04 Dolby International Ab Processing of audio signals during high frequency reconstruction CA3072785C (en) 2010-07-19 2020-09-01 Dolby International Ab Processing of audio signals during high frequency reconstruction US8675719B2 (en) * 2010-09-28 2014-03-18 Tektronix, Inc. Multi-domain test and measurement instrument PT2880654T (en) * 2012-08-03 2017-12-07 Fraunhofer Ges Forschung Decoder and method for a generalized spatial-audio-object-coding parametric concept for multichannel downmix/upmix cases US9607624B2 (en) * 2013-03-29 2017-03-28 Apple Inc. Metadata driven dynamic range control Family Cites Families (5) * Cited by examiner, â Cited by third party Publication number Priority date Publication date Assignee Title US4132859A (en) * 1977-12-02 1979-01-02 Egils Ranga Sound reproducing apparatus US4382157A (en) * 1978-07-17 1983-05-03 Kenneth P. Wert, Sr. Multiple speaker type sound producing system US5265166A (en) * 1991-10-30 1993-11-23 Panor Corp. Multi-channel sound simulation system DE4135977C2 (en) * 1991-10-31 1996-07-18 Fraunhofer Ges Forschung Method for the simultaneous transmission of signals from N signal sources RU2193827C2 (en) * 1997-11-14 2002-11-27 Ð. ÐÐµÐ¹Ð²Ñ (СÑа) Ðнк. Post-amplifying stereo-to-ambient sound decoding circuitRef country code: HK
Ref legal event code: DE
Ref document number: 1101634
Country of ref document: HK
2012-06-27 C14 Grant of patent or utility model 2012-06-27 GR01 Patent grant 2012-12-07 REG Reference to a national codeRef country code: HK
Ref legal event code: GR
Ref document number: 1101634
Country of ref document: HK
2025-03-11 CX01 Expiry of patent term 2025-03-11 CX01 Expiry of patent termGranted publication date: 20120627
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4