ãï¼ï¼ï¼ï¼ã[0001]
ãçºæã®å±ããæè¡åéãæ¬çºæã¯ããã«ããã£ãã«ã®
é³å£°ä¿¡å·ãäºæ¸¬ç¬¦å·åããããã®é³å£°ç¬¦å·åè£
ç½®ãå
è¨
é²åªä½ãé³å£°å¾©å·è£
ç½®ãé³å£°ä¼éæ¹æ³åã³ä¼éåªä½ã«é¢
ãããBACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio encoding device for predictively encoding a multi-channel audio signal, an optical recording medium, an audio decoding device, an audio transmission method, and a transmission medium.
ãï¼ï¼ï¼ï¼ã[0002]
ã徿¥ã®æè¡ãé³å£°ä¿¡å·ãäºæ¸¬ç¬¦å·åããæ¹æ³ã¨ãã¦ã
æ¬çºæè
ã¯å
ã®åºé¡ï¼ç¹é¡å¹³ï¼âï¼ï¼ï¼ï¼ï¼ï¼å·ï¼ã«ã
ãã¦ï¼ãã£ãã«ã®åãã¸ã¿ã«é³å£°ä¿¡å·ã«å¯¾ãã¦ãç¹æ§ã
ç°ãªãè¤æ°ã®äºæ¸¬å¨ã«ããæéé åã«ãããéå»ã®ä¿¡å·
ããç¾å¨ã®ä¿¡å·ã®è¤æ°ã®ç·å½¢äºæ¸¬å¤ãç®åºããåãã¸ã¿
ã«é³å£°ä¿¡å·ã¨ããã®è¤æ°ã®ç·å½¢äºæ¸¬å¤ããäºæ¸¬å¨æ¯ã®äº
測æ®å·®ãç®åºããäºæ¸¬æ®å·®ã®æå°å¤ã鏿ããæ¹æ³ãæ
æ¡ãã¦ããã2. Description of the Related Art As a method of predictive encoding of a speech signal,
In the prior application (Japanese Patent Application No. 9-289159), the inventor of the present invention applied a plurality of linearizers of a current signal from a past signal in the time domain to a one-channel original digital audio signal using a plurality of predictors having different characteristics. A method is proposed in which a prediction value is calculated, a prediction residual for each predictor is calculated from the original digital audio signal and the plurality of linear prediction values, and a minimum value of the prediction residual is selected.
ãï¼ï¼ï¼ï¼ã[0003]
ãçºæã解決ãããã¨ãã課é¡ãããããªãããä¸è¨æ¹
æ³ã§ã¯åãã¸ã¿ã«é³å£°ä¿¡å·ããµã³ããªã³ã°å¨æ³¢æ°ï¼ï¼ï¼
ï½ï¼¨ï½ãéååãããæ°ï¼ï¼ï¼ãããç¨åº¦ã®å ´åã«ãã
ç¨åº¦ã®å§ç¸®å¹æãå¾ããã¨ãã§ããããè¿å¹´ã®ï¼¤ï¼¶ï¼¤ãª
ã¼ãã£ãªãã£ã¹ã¯ã§ã¯ãã®ï¼åã®ãµã³ããªã³ã°å¨æ³¢æ°
ï¼ï¼ï¼ï¼ï¼ï½ï¼¨ï½ï¼ã使ç¨ãããã¾ããéååãããæ°
ãï¼ï¼ãããã使ç¨ãããå¾åãããã®ã§ãå§ç¸®çãæ¹
åããå¿
è¦ããããããã«ãå
¥åãã«ããã£ã³ãã«é³å£°
ä¿¡å·ãã¹ãã¬ãªï¼ãã£ãã«ã§åçãããããã«ããã£ã³
ãã«ã§åçããããããã¨ãèæ
®ããªããã°ãªããªããHowever, in the above method, the original digital audio signal has a sampling frequency = 96.
Although a certain compression effect can be obtained when the kHz and the quantization bit number are about 20 bits, recent DVD audio discs use twice the sampling frequency (= 192 kHz). Since 24 bits also tend to be used, the compression ratio needs to be improved. In addition, it must be considered that the input multi-channel audio signal is reproduced on two stereo channels or reproduced on multiple channels.
ãï¼ï¼ï¼ï¼ãããã§æ¬çºæã¯ããã«ããã£ãã«ã®é³å£°ä¿¡
å·ãäºæ¸¬ç¬¦å·åããå ´åã«å§ç¸®çãæ¹åãããã¨ãã§ã
ãã¨ã¨ãã«ãã°ã«ã¼ãåããã¦ã¹ãã¬ãªï¼ãã£ãã«ãã
ã«ããã£ã³ãã«ã§åçãããã¨ãã§ããé³å£°ç¬¦å·åè£
ç½®ãå
è¨é²åªä½ãé³å£°å¾©å·è£
ç½®ãé³å£°ä¼éæ¹æ³åã³ä¼é
åªä½ãæä¾ãããã¨ãç®çã¨ããã[0004] Accordingly, the present invention provides an audio encoding apparatus which can improve the compression ratio when predictive encoding of a multi-channel audio signal is performed, and which can be reproduced in two stereo channels or multi channels by grouping. It is an object to provide an optical recording medium, an audio decoding device, an audio transmission method, and a transmission medium.
ãï¼ï¼ï¼ï¼ã[0005]
ã課é¡ã解決ããããã®ææ®µãæ¬çºæã¯ä¸è¨ç®çãéæ
ããããã«ã以ä¸ã®ï¼ï¼ãï¼ï¼ã«è¨è¼ã®ææ®µããæãã
ã®ã§ãããããªãã¡ãMeans for Solving the Problems In order to achieve the above object, the present invention comprises the following means 1) to 5). That is,
ãï¼ï¼ï¼ï¼ãï¼ï¼å
ã®ãã«ããã£ãã«ã®é³å£°ä¿¡å·ãå°ãª
ãã¨ãåæ¹ï¼ãã£ãã«ãå«ã第ï¼ã®ã°ã«ã¼ãã¨ä»ã®ãã£
ãã«ãå«ã第ï¼ã®ã°ã«ã¼ãã«åé¡ãã¦ãå°ãªãã¨ãåè¨
第ï¼ã®ã°ã«ã¼ãã®ãã£ãã«ã®é³å£°ä¿¡å·ãç¸é¢æ§ã®ããé³
声信å·ã«å¤æããç¸é¢ææ®µã¨ãåè¨ç¬¬ï¼ã®ã°ã«ã¼ãã¨å
è¨ç¬¬ï¼ã®ã°ã«ã¼ãã®ç¸é¢æ§ã®ããé³å£°ä¿¡å·ã®è¤æ°ã®äºæ¸¬
å¤ããã£ãã«æ¯ã«ç®åºãããã®è¤æ°ã®äºæ¸¬å¤ã®åäºæ¸¬æ®
å·®ãç®åºãããã®è¤æ°ã®äºæ¸¬æ®å·®ã®æå°å¤ã鏿ããè¤
æ°ã®äºæ¸¬ç¬¦å·åææ®µã¨ãåè¨äºæ¸¬ç¬¦å·åææ®µã«ãã鏿
ãããäºæ¸¬æ®å·®ãå«ãäºæ¸¬ç¬¦å·åãã¼ã¿ãåè¨ç¬¬ï¼ã第
ï¼ã®ã°ã«ã¼ãæ¯ã«åé¡ãããããã¹ããªã¼ã ã«ãã©ã¼ã
ããåããææ®µã¨ããæããé³å£°ç¬¦å·åè£
ç½®ã ï¼ï¼è«æ±é
ï¼è¨è¼ã®é³å£°ç¬¦å·åè£
ç½®ã«ããã¦é¸æããã
äºæ¸¬æ®å·®ãå«ãäºæ¸¬ç¬¦å·åãã¼ã¿ãåè¨åæ¹ï¼ãã£ãã«
ã®ã°ã«ã¼ãã¨ç¬¬ï¼ã®ã°ã«ã¼ãæ¯ã«åé¡ããããããã¹ã
ãªã¼ã ã«ãã©ã¼ãããåããã¦è¨é²ãããå
è¨é²åªä½ã ï¼ï¼è«æ±é
ï¼è¨è¼ã®é³å£°ç¬¦å·åè£
ç½®ã«ããã¦é¸æããã
äºæ¸¬æ®å·®ãå«ãäºæ¸¬ç¬¦å·åãã¼ã¿ããäºæ¸¬å¤ãç®åºãã
äºæ¸¬å¤ããå
ã®ãã«ããã£ãã«ã®é³å£°ä¿¡å·ã復å
ããã
ã¨ãç¹å¾´ã¨ããé³å£°å¾©å·è£
ç½®ã ï¼ï¼è«æ±é
ï¼è¨è¼ã®é³å£°ç¬¦å·åè£
ç½®ã«ããã¦é¸æããã
äºæ¸¬æ®å·®ãå«ãäºæ¸¬ç¬¦å·åãã¼ã¿ãéä¿¡åç·ãä»ãã¦ä¼
éãããã¨ãç¹å¾´ã¨ããé³å£°ä¼éæ¹æ³ã ï¼ï¼è«æ±é
ï¼è¨è¼ã®é³å£°ç¬¦å·åè£
ç½®ã«ããã¦é¸æããã
äºæ¸¬æ®å·®ãå«ãäºæ¸¬ç¬¦å·åãã¼ã¿ãä¼éãããã¨ãç¹å¾´
ã¨ããä¼éåªä½ã[0006] 1) The original multi-channel audio signals are classified into a first group including at least the front two channels and a second group including other channels, and the audio signals of at least the channels of the second group are classified. Correlating means for converting into a correlated audio signal; and calculating a plurality of predicted values of the correlated audio signal of the first group and the second group for each channel, A plurality of predictive encoding means for calculating a prediction residual and selecting a minimum value of the plurality of prediction residuals; and predictive coded data including the prediction residual selected by the predictive encoding means, in the first, Means for formatting into a bit stream classified for each second group. 2) In the speech encoding apparatus according to claim 1, the prediction coded data including the prediction residual selected is formatted and recorded in the bit stream classified into the group of the front two channels and the second group. Optical recording medium. 3) calculating a prediction value from the prediction coded data including the prediction residual selected in the speech coding apparatus according to claim 1;
An audio decoding device for restoring an original multi-channel audio signal from a predicted value. 4) A speech transmission method, characterized in that the speech encoding apparatus according to claim 1, wherein the prediction coded data including the prediction residual selected is transmitted via a communication line. (5) A transmission medium for transmitting predicted coded data including a prediction residual selected in the speech coding apparatus according to (1).
ãï¼ï¼ï¼ï¼ã[0007]
ãçºæã®å®æ½ã®å½¢æ
ã以ä¸ãå³é¢ãåç
§ãã¦æ¬çºæã説
æãããå³ï¼ã¯æ¬çºæã«ä¿ãé³å£°ç¬¦å·åè£
ç½®åã³é³å£°å¾©
å·è£
ç½®ã®ç¬¬ï¼ã®å®æ½å½¢æ
ã示ããããã¯å³ãå³ï¼ã¯å³ï¼
ã®ç¬¦å·åé¨ã詳ãã示ããããã¯å³ãå³ï¼ã¯å³ï¼ãå³ï¼
ã®ç¬¦å·åé¨ã«ãã符å·åããããããã¹ããªã¼ã ã示ã
説æå³ãå³ï¼ã¯å³ï¼ã®å¾©å·åé¨ã詳ãã示ããããã¯
å³ãå³ï¼ã¯ï¼¤ï¼¶ï¼¤ã®ããã¯ã®ãã©ã¼ãããã示ã説æ
å³ãå³ï¼ã¯ï¼¤ï¼¶ï¼¤ã®ãªã¼ãã£ãªããã¯ã®ãã©ã¼ãããã
示ã説æå³ãå³ï¼ãå³ï¼ã¯é³å£°ä¼éæ¹æ³ã示ãããã¼ã
ã£ã¼ãã§ãããDETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a first embodiment of a speech encoding apparatus and a speech decoding apparatus according to the present invention, and FIG.
FIG. 3 is a block diagram showing the encoding unit of FIG.
FIG. 4 is a block diagram showing the decoding unit in FIG. 1 in detail, FIG. 5 is an explanatory diagram showing a DVD pack format, and FIG. 6 is a DVD audio system. FIGS. 7 and 8 are explanatory diagrams showing the format of a pack, and FIGS. 7 and 8 are flowcharts showing an audio transmission method.
ãï¼ï¼ï¼ï¼ãããã§ããã«ããã£ãã«æ¹å¼ã¨ãã¦ã¯ãä¾
ãã°æ¬¡ã®ï¼ã¤ã®æ¹å¼ãç¥ããã¦ããã ï¼ï¼ï¼ï¼ãã£ãã«æ¹å¼ ãã«ãã¼ãµã©ã¦ã³ãæ¹å¼ã®
ããã«ãåæ¹ï¼¬ãï¼£ãï¼²ã®ï¼ãã£ãã«ï¼å¾æ¹ï¼³ã®ï¼ãã£
ãã«ã®åè¨ï¼ãã£ãã« ï¼ï¼ï¼ï¼ãã£ãã«æ¹å¼ ãã«ãã¼ï¼¡ï¼£âï¼æ¹å¼ã®ï¼³
ï¼·ãã£ãã«ãªãã®ããã«ãåæ¹ï¼¬ãï¼£ãï¼²ã®ï¼ãã£ãã«
ï¼å¾æ¹ï¼³ï¼¬ãSRã®ï¼ãã£ãã«ã®åè¨ï¼ãã£ãã« ï¼ï¼ï¼ï¼ãã£ãã«æ¹å¼ DTSï¼Digital Theater
Systemï¼æ¹å¼ãããã«ãã¼ï¼¡ï¼£âï¼æ¹å¼ã®ããã«ï¼ãã£
ãã«ï¼ï¼¬ãï¼£ãï¼²ãSWï¼ï¼¬ï½ï½
ï¼ãSLãï¼³ï¼²ï¼ ï¼ï¼ï¼ï¼ãã£ãã«æ¹å¼ SDDSï¼Sony Dynamic D
igital Soundï¼æ¹å¼ã®ããã«ãåæ¹ï¼¬ãLCãï¼£ãï¼²
ï¼£ãï¼²ãSWã®ï¼ãã£ãã«ï¼å¾æ¹ï¼³ï¼¬ãSRã®ï¼ãã£ã
ã«ã®åè¨ï¼ãã£ãã«Here, for example, the following four systems are known as multi-channel systems. (1) Four-channel system Like the Dolby surround system, a total of four channels of three channels of front L, C, and R + one channel of rear S. (2) Five-channel system S of Dolby AC-3 system
As without W channel, 3 channels of front L, C and R + 2 channels of rear SL and SR, total 5 channels (3) 6 channel system DTS (Digital Theater)
6) (L, C, R, SW (Lfe), SL, SR) such as the Dolby AC-3 system (4) 8-channel system SDDS (Sony Dynamic D
digital sound), forward L, LC, C, R
6 channels of C, R, SW + 2 channels of rear SL, SR, total 8 channels
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ç¤ºã符å·åå´ã®ï¼ãã£ãã«ï¼chï¼ã
ã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼âã¯ããã«ããã£ãã«ä¿¡å·ã®ä¸
ä¾ã¨ãã¦ããã³ãã¬ããï¼ï¼¬ï½ï¼ãã»ã³ã¿ï¼ï¼£ï¼ããã
ã³ãã©ã¤ãï¼ï¼²ï½ï¼ããµã©ã¦ã³ãã¬ããï¼ï¼¬ï½ï¼ããµã©
ã¦ã³ãã©ã¤ãï¼ï¼²ï½ï¼åã³ï¼¬ï½ï½
ï¼Low Frequency Effe
ctï¼ã®ï¼chã®ï¼°ï¼£ï¼ãã¼ã¿ãä¿æ°ï½ijï¼ï½ï¼ï¼ï¼ï¼ï¼ï½
ï¼ï¼ï¼ï¼ãï¼ï¼ãç¨ãã¦æ¬¡å¼ï¼ï¼ï¼ã«ããã¹ãã¬ãªï¼ã
ã£ãã«ï¼ï¼¬ãï¼²ï¼ã«ãã¦ã³ãã¯ã¹ããã Lï¼ï½11ã»ï¼¬ï½ï¼ï½12ã»ï¼²ï½ï¼ï½13ã»ï¼£ ï¼ï½14ã»ï¼¬ï½ï¼ï½15ã»ï¼²ï½ï¼ï½16ã»ï¼¬ï½ï½
ï¼²ï¼ï½21ã»ï¼¬ï½ï¼ï½22ã»ï¼²ï½ï¼ï½23ã»ï¼£ ï¼ï½24ã»ï¼¬ï½ï¼ï½25ã»ï¼²ï½ï¼ï½26ã»ï¼¬ï½ï½
â¦ï¼ï¼ï¼The 6-channel (ch) mix and matrix circuit 1 'on the encoding side shown in FIG. 1 includes a front left (Lf), a center (C), a front right (Rf), a surround left ( Ls), surround light (Rs) and Lfe (Low Frequency Effe)
ct) of the 6-channel PCM data by coefficients mij (i = 1, 2, j)
= 1, 2 to 6), and downmixes to two stereo channels (L, R) by the following equation (1). L = m11 · Lf + m12 · Rf + m13 · C + m14 · Ls + m15 · Rs + m16 · Lfe R = m21 · Lf + m22 · Rf + m23 · C + m24 · Ls + m25 · Rs + m26 · Lfe (1)
ãï¼ï¼ï¼ï¼ãã¾ããã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼âã¯ãå
ã®
ï¼chï¼ï¼¬ï½ãï¼£ãï¼²ï½ãLï½ãï¼²ï½ãLï½ï½
ï¼ãåæ¹ã°
ã«ã¼ãã«é¢ããï¼chã¨ä»ã®ã°ã«ã¼ãã«é¢ããï¼chã«åé¡
ãã¦ï¼chãæ¬¡å¼ï¼ï¼ï¼ã®ããã«ãç¸é¢æ§ã®ããä¿¡å·
ãï¼ãããï¼ãã«å¤æããï¼chï¼ï¼¬ãï¼²ï¼ã第ï¼ç¬¦å·å
é¨ï¼ââï¼ã«ãã¾ããï¼chãï¼ãããï¼ãã第ï¼ç¬¦å·å
é¨ï¼ââï¼ã«åºåããã ãï¼ãï¼ï¼¬ ãï¼ãï¼ï¼² ãï¼ãï¼ï¼£âï¼ï¼¬ï½ï¼ï¼²ï½ï¼ï¼ï¼ ãï¼ãï¼ï¼¬ï½ï¼ï¼²ï½ ãï¼ãï¼ï¼¬ï½âï¼²ï½ ãï¼ãï¼ï¼¬ï½ï½
âï¼£ â¦ï¼ï¼ï¼The mix & matrix circuit 1 'classifies the original 6 channels (Lf, C, Rf, Ls, Rs, Lfe) into 2 channels for the front group and 4 channels for the other groups, and divides the 4 channels into the following equation (2). , And 2ch (L, R) to the first encoder 2â²-1, and 4ch â3â to â6â to the third signal â3â to â6â. And outputs the result to the 2 coding unit 2â²-2. â1â = L â2â = R â3â = Câ (Ls + Rs) / 2 â4â = Ls + Rs â5â = LsâRs â6â = LfeâC (2)
ãï¼ï¼ï¼ï¼ã符å·åé¨ï¼âãæ§æãã第ï¼åã³ç¬¬ï¼ç¬¦å·
åé¨ï¼ââï¼ãï¼ââï¼ã¯ãããããå³ï¼ã«è©³ãã示ã
ããã«ï¼chãï¼ãããï¼ãã¨ï¼chãï¼ãããï¼ãã®ï¼°ï¼£
ï¼ãã¼ã¿ããã£ãã«æ¯ã«äºæ¸¬ç¬¦å·åããäºæ¸¬ç¬¦å·åãã¼
ã¿ãå³ï¼ã«ç¤ºããããªãããã¹ããªã¼ã ã§è¨é²åªä½ï¼ã
è¡æåç·ãé»è©±åç·çã®éä¿¡åªä½ï¼ãä»ãã¦å¾©å·å´ã«ä¼
éããã復å·å´ã§ã¯å¾©å·åé¨ï¼âãæ§æãã第ï¼åã³ç¬¬
ï¼å¾©å·åé¨ï¼ââï¼ãï¼ââï¼ã«ãããå³ï¼ã«è©³ãã示
ãããã«ããããåæ¹ã°ã«ã¼ãã«é¢ããï¼chãï¼ãã
ãï¼ãã¨ä»ã®ã°ã«ã¼ãã«é¢ããï¼chãï¼ãããï¼ãã®äº
測符å·åãã¼ã¿ããã£ãã«æ¯ã«ï¼°ï¼£ï¼ãã¼ã¿ã«å¾©å·ã
ããæ¬¡ãã§ãã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼âã«ããå¼
ï¼ï¼ï¼ãï¼ï¼ï¼ã«åºã¥ãã¦å
ã®ï¼chï¼ï¼¬ï½ãï¼£ãï¼²ï½ã
Lï½ãï¼²ï½ãLï½ï½
ï¼ã復å
ããã¨ã¨ãã«ãã¹ãã¬ãªï¼
chãã¼ã¿ï¼ï¼¬ãï¼²ï¼ããã®ã¾ã¾åºåãããAs shown in detail in FIG. 2, the first and second encoders 2'-1 and 2'-2 which constitute the encoder 2 'respectively have 2ch "1", "2" and 4ch "3". "~" 6 "PC
The M data is predictively encoded for each channel, and the encoded prediction data is transmitted as a bit stream as shown in FIG. 3 to the decoding side via a recording medium 5 or a communication medium 6 such as a satellite line or a telephone line. On the decoding side, the first and second decoding units 3'-1 and 3'-2 constituting the decoding unit 3 'respectively perform 2ch "1" for the forward group, as shown in detail in FIG.
The prediction coded data of "2" and 4ch "3" to "6" relating to the other groups are decoded into PCM data for each channel. Next, the original 6 ch (Lf, C, Rf,
Ls, Rs, Lfe) and restore stereo 2
The ch data (L, R) is output as it is.
ãï¼ï¼ï¼ï¼ãå³ï¼ãåç
§ãã¦ç¬¦å·åé¨ï¼ââï¼ãï¼ââ
ï¼ã«ã¤ãã¦è©³ãã説æãããåchãï¼ãããï¼ãã®ï¼°ï¼£
ï¼ãã¼ã¿ã¯ï¼ãã¬ã¼ã æ¯ã«ï¼ãã¬ã¼ã ãããã¡ï¼ï¼ã«æ ¼
ç´ããããããã¦ãï¼ãã¬ã¼ã ã®åchãï¼ãããï¼ãã®
ãµã³ãã«ãã¼ã¿ãããããäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤
ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ã«å°å ãããã¨ã¨ãã«ãåch
ãï¼ãããï¼ãã®åãã¬ã¼ã ã®å
é ãµã³ãã«ãã¼ã¿ï¼å¾
è¿°ã®ãªã¹ã¿ã¼ããããå
ã«æ ¼ç´ãããï¼ãã¢ã³ãããã³
ã°åè·¯ï¼åã³ãã©ã¼ãããååè·¯ï¼ï¼ã«å°å ããããã¾
ããï¼°ï¼£ï¼ãã¼ã¿ãAï¼ï¼¤å¤æãããã¨ãã®ãµã³ããªã³
ã°å¨æ³¢æ°ï¼ï½ï½ï¼ã¨éååãããæ°ï¼ï¼±ï½ï¼ããããã³
ã°åè·¯ï¼ï¼åã³ãã©ã¼ãããååè·¯ï¼ï¼ã«å°å ãããã
äºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ã¯ã
ããããåchãï¼ãããï¼ãã®ï¼°ï¼£ï¼ãã¼ã¿ã«å¯¾ãã¦ã
ç¹æ§ãç°ãªãè¤æ°ã®äºæ¸¬å¨ï¼ä¸å³ç¤ºï¼ã«ããæéé åã«
ãããéå»ã®ä¿¡å·ããç¾å¨ã®ä¿¡å·ã®è¤æ°ã®ç·å½¢äºæ¸¬å¤ã
ç®åºããæ¬¡ãã§åï¼°ï¼£ï¼ãã¼ã¿ã¨ããã®è¤æ°ã®ç·å½¢äºæ¸¬
å¤ããäºæ¸¬å¨æ¯ã®äºæ¸¬æ®å·®ãç®åºãããç¶ããããã¡ã»
鏿å¨ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ã¯ãã
ãããäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤
ï¼ã«ããç®åºãããåäºæ¸¬æ®å·®ãä¸æè¨æ¶ãã¦ãé¸æä¿¡
å·ï¼ï¼¤ï¼´ï¼³ï¼ãã³ã¼ãã£ã³ã°ã»ã¿ã¤ã ã»ã¹ã¿ã³ãï¼çæ
å¨ï¼ï¼ã«ããæå®ããããµããã¬ã¼ã æ¯ã«äºæ¸¬æ®å·®ã®æ
å°å¤ã鏿ãããReferring to FIG. 2, encoding sections 2'-1, 2'-
2 will be described in detail. PC for each channel "1" to "6"
The M data is stored in one frame buffer 10 for each frame. Then, the sample data of each of the channels â1â to â6â of one frame are respectively supplied to the prediction circuits 13D1 and 13D.
2, 15D1 to 15D4 and each channel
First sample data (stored in a restart header described later) of each frame of â1â to â6â is applied to the unpacking circuit 8 and the formatting circuit 19. The sampling frequency (fs) and the number of quantization bits (Qb) when the PCM data is A / D converted are applied to the packing circuit 18 and the formatting circuit 19.
The prediction circuits 13D1, 13D2, and 15D1 to 15D4 respectively calculate the PCM data of each channel â1â to â6â.
A plurality of predictors (not shown) having different characteristics calculate a plurality of linear prediction values of the current signal from a past signal in the time domain, and then perform prediction for each predictor from the original PCM data and the plurality of linear prediction values. Calculate the residual. The following buffer
The selectors 14D1, 14D2, 16D1 to 16D4 are prediction circuits 13D1, 13D2, 15D1 to 15D, respectively.
4 is temporarily stored, and the minimum value of the prediction residual is selected for each subframe specified by the selection signal / DTS (decoding time stamp) generator 17.
ãï¼ï¼ï¼ï¼ãé¸æä¿¡å·çæå¨ï¼ï¼ã¯äºæ¸¬æ®å·®ã®ãããæ°
ãã©ã°ããããã³ã°åè·¯ï¼ï¼ã¨ãã©ã¼ãããååè·¯ï¼ï¼
ã«å¯¾ãã¦å°å ããã¾ããäºæ¸¬æ®å·®ãæå°ã®äºæ¸¬å¨ã示ã
äºæ¸¬å¨é¸æãã©ã°ã¨ãå¾è¿°ãããããªç¸é¢ä¿æ°ããã©ã¼
ãããååè·¯ï¼ï¼ã«å¯¾ãã¦å°å ããããããã³ã°åè·¯ï¼
ï¼ã¯ãããã¡ã»é¸æå¨ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ã
ï¼ï¼ï¼¤ï¼ã«ãã鏿ãããï¼chåã®äºæ¸¬æ®å·®ããé¸æä¿¡
å·çæå¨ï¼ï¼ã«ããæå®ããããããæ°ãã©ã°ã«åºã¥ã
ã¦æå®ãããæ°ã§ãããã³ã°ãããThe selection signal generator 17 outputs a bit number flag of the prediction residual to a packing circuit 18 and a formatting circuit 19.
, And a predictor selection flag indicating the predictor with the smallest prediction residual and a correlation coefficient as described later are applied to the formatting circuit 19. Packing circuit 1
8 is a buffer / selector 14D1, 14D2, 16D1.
The prediction residual for 6 ch selected by 16D4 is packed with the specified number of bits based on the bit number flag specified by the selection signal generator 17.
ãï¼ï¼ï¼ï¼ãç¶ããã©ã¼ãããååè·¯ï¼ï¼ã¯å³ï¼ã«ç¤ºã
ãããªã¦ã¼ã¶ãã¼ã¿ã«ãã©ã¼ãããåããããã®ã¦ã¼ã¶
ãã¼ã¿ã¯åæ¹ã°ã«ã¼ãã«é¢ããï¼chï¼ï¼ï¼ãï¼ï¼ï¼ã®äº
測符å·åãã¼ã¿ãå«ãå¯å¤ã¬ã¼ããããã¹ããªã¼ã BS
ï¼ã¨ãä»ã®ã°ã«ã¼ãã«é¢ããï¼chï¼ï¼ï¼ãï¼ï¼ï¼ã®äºæ¸¬
符å·åãã¼ã¿ãå«ãå¯å¤ã¬ã¼ããããã¹ããªã¼ã BSï¼
ã¨ãã¹ããªã¼ã BSï¼ãBSï¼ã®åã«è¨ããããããã
ã¹ããªã¼ã ãããã«ããæ§æããã¦ãããã¾ããï¼ãã¬
ã¼ã åã®ã¹ããªã¼ã BSï¼ãBSï¼ã¯ ã»ãã¬ã¼ã ãããã¨ã ã»åchï¼ï¼ï¼ãï¼ï¼ï¼ã®ï¼ãã¬ã¼ã ã®å
é ãµã³ãã«ãã¼
ã¿ã¨ã ã»åchï¼ï¼ï¼ãï¼ï¼ï¼ã®ãµããã¬ã¼ã æ¯ã®äºæ¸¬å¨é¸æã
ã©ã°ã¨ã ã»åchï¼ï¼ï¼ãï¼ï¼ï¼ã®ãµããã¬ã¼ã æ¯ã®ãããæ°ãã©
ã°ã¨ã ã»åchï¼ï¼ï¼ãï¼ï¼ï¼ã®äºæ¸¬æ®å·®ãã¼ã¿åï¼å¯å¤ããã
æ°ï¼ã¨ã ã»å¾è¿°ããç¸é¢ä¿æ° ãå¤éåããã¦ããããã®ãããªäºæ¸¬ç¬¦å·åã«ããã°ã
åä¿¡å·ãä¾ãã°ãµã³ããªã³ã°å¨æ³¢æ°ï¼ï¼ï¼ï½ï¼¨ï½ãéå
åãããæ°ï¼ï¼ï¼ããããï¼ãã£ãã«ã®å ´åãï¼ï¼ï¼
ã®
å§ç¸®çãå®ç¾ãããã¨ãã§ãããThe following formatting circuit 19 formats the data into user data as shown in FIG. This user data is a variable rate bit stream BS including 2ch (1) and (2) prediction coded data for the front group.
0 and the variable rate bit stream BS1 including the prediction coded data of 4ch (3) to (6) regarding other groups.
And a bit stream header provided before the streams BS0 and BS1. The streams BS0 and BS1 for one frame include: a frame header; first sample data of one frame of each channel (1) to (6); and each subframe of each channel (1) to (6). A predictor selection flag; a bit number flag for each subframe of each channel (1) to (6); a prediction residual data sequence (variable number of bits) for each channel (1) to (6); The correlation coefficient described later is multiplexed. According to such predictive coding,
When the original signal has, for example, a sampling frequency of 96 kHz, the number of quantization bits = 24 bits, and six channels, a compression ratio of 71% can be realized.
ãï¼ï¼ï¼ï¼ã次ã«å³ï¼ãåç
§ãã¦å¾©å·åé¨ï¼ââï¼ã
ï¼ââï¼ã«ã¤ãã¦èª¬æãããä¸è¨ãã©ã¼ãããã®å¯å¤ã¬
ã¼ããããã¹ããªã¼ã ãã¼ã¿ï¼¢ï¼³ï¼ãBSï¼ã¯ãããã©
ã¼ãããååè·¯ï¼ï¼ã«ããã¹ããªã¼ã ãã¼ã¿ã¨ãã¬ã¼ã
ãããã«åºã¥ãã¦åé¢ããããããã¦ãåï½ï½ãï¼ãã
ãï¼ãã®ï¼ãã¬ã¼ã ã®å
é ãµã³ãã«ãã¼ã¿ã¨äºæ¸¬å¨é¸æ
ãã©ã°ã¯ããããäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤
ï¼ãï¼ï¼ï¼¤ï¼ã«å°å ãããåï½ï½ãï¼ãããï¼ãã®ãã
ãæ°ãã©ã°ã¨äºæ¸¬æ®å·®ãã¼ã¿åã¯ã¢ã³ãããã³ã°åè·¯ï¼
ï¼ã«å°å ããããããã§ãäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤
ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼å
ã®è¤æ°ã®äºæ¸¬å¨ï¼ä¸å³ç¤ºï¼ã¯
ããããã符å·åå´ã®äºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼
ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼å
ã®è¤æ°ã®äºæ¸¬å¨ã¨åä¸ã®ç¹æ§ã§ã
ããäºæ¸¬å¨é¸æãã©ã°ã«ããåä¸ç¹æ§ã®ãã®ã鏿ãã
ããNext, referring to FIG. 4, the decoding units 3'-1,
3â²-2 will be described. The variable-rate bit stream data BS0 and BS1 in the above format are separated by the deformatting circuit 21 based on the stream data and the frame header. And each channel "1" ~
The head sample data of one frame of â6â and the predictor selection flag are stored in the prediction circuits 24D1, 24D2, and 23D, respectively.
1 to 23D4, and the bit number flags of each channel â1â to â6â and the prediction residual data sequence are stored in the unpacking circuit 2
2 is applied. Here, the prediction circuits 24D1, 24D
2, a plurality of predictors (not shown) in 23D1 to 23D4 are prediction circuits 13D1, 13D2, and 1 on the encoding side, respectively.
The characteristics are the same as those of the plurality of predictors in 5D1 to 15D4, and those having the same characteristics are selected by the predictor selection flag.
ãï¼ï¼ï¼ï¼ãã¢ã³ãããã³ã°åè·¯ï¼ï¼ã¯åï½ï½ãï¼ãã
ãï¼ãã®äºæ¸¬æ®å·®ãã¼ã¿åããããæ°ãã©ã°æ¯ã«åºã¥ã
ã¦åé¢ãã¦ããããäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼
Dï¼ãï¼ï¼ï¼¤ï¼ã«åºåãããäºæ¸¬åè·¯ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤
ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ã§ã¯ãããããã¢ã³ãããã³ã°
åè·¯ï¼ï¼ããã®åï½ï½ãï¼ãããï¼ãã®ä»åã®äºæ¸¬æ®å·®
ãã¼ã¿ã¨ãå
é¨ã®è¤æ°ã®äºæ¸¬å¨ã®å
ãäºæ¸¬å¨é¸æãã©ã°
ã«ãã鏿ãããåï¼ã¤ã«ããäºæ¸¬ãããååã®äºæ¸¬å¤
ãå ç®ããã¦ä»åã®äºæ¸¬å¤ãç®åºãããæ¬¡ãã§ï¼ãã¬ã¼
ã ã®å
é ãµã³ãã«ãã¼ã¿ãåºæºã¨ãã¦åãµã³ãã«ã®ï¼°ï¼£
ï¼ãã¼ã¿ãç®åºããããThe unpacking circuit 22 is provided for each channel "1" to
The prediction residual data string of â6â is separated based on each bit number flag, and is divided into prediction circuits 24D1, 24D2, and 23, respectively.
It outputs to D1-23D4. Prediction circuits 24D1, 24D
2, 23D1 to 23D4, the current prediction residual data of each of the channels â1â to â6â from the unpacking circuit 22 and each of the plurality of internal predictors selected by the predictor selection flag. The previous predicted value predicted by one frame is added to calculate the current predicted value, and then the PC of each sample is determined based on the first sample data of one frame.
M data is calculated.
ãï¼ï¼ï¼ï¼ãããã§ãå³ï¼ã«ç¤ºã符å·åé¨ï¼ââï¼ã
ï¼ââï¼ã«ããäºæ¸¬ç¬¦å·åãããå¯å¤ã¬ã¼ããããã¹ã
ãªã¼ã ãã¼ã¿ããè¨é²åªä½ã®ä¸ä¾ã¨ãã¦ï¼¤ï¼¶ï¼¤ãªã¼ãã£
ãªãã£ã¹ã¯ã«è¨é²ããå ´åã«ã¯ãå³ï¼ã«ç¤ºããªã¼ãã£ãª
ï¼ï¼¡ï¼ããã¯ã«ãããã³ã°ãããããã®ããã¯ã¯ï¼ï¼ï¼
ï¼ãã¤ãã®ã¦ã¼ã¶ãã¼ã¿ï¼ï¼¡ãã±ãããï¼¶ãã±ããï¼ã«
対ãã¦ï¼ãã¤ãã®ããã¯ã¹ã¿ã¼ãæ
å ±ã¨ãï¼ãã¤ãã®ï¼³
CRï¼System Clock Referenceï¼ã·ã¹ãã æå»åºæºåç
§
å¤ï¼æ
å ±ã¨ãï¼ãã¤ãã®Mux ã¬ã¼ãï¼rateï¼æ
å ±ã¨ï¼ã
ã¤ãã®ã¹ã¿ããã£ã³ã°ã®åè¨ï¼ï¼ãã¤ãã®ããã¯ããã
ãä»å ããã¦æ§æããã¦ããï¼ï¼ããã¯ï¼åè¨ï¼ï¼ï¼ï¼
ãã¤ãï¼ããã®å ´åãã¿ã¤ã ã¹ã¿ã³ãã§ããSCRæ
å ±
ããACBã¦ãããå
ã®å
é ããã¯ã§ã¯ãï¼ãã¨ãã¦å
ä¸ã¿ã¤ãã«å
ã§é£ç¶ã¨ãããã¨ã«ããåä¸ã¿ã¤ãã«å
ã®
Aããã¯ã®æéã管çãããã¨ãã§ãããHere, the coding units 2'-1 shown in FIG.
When recording the variable-rate bit stream data predictively encoded by 2â²-2 on a DVD audio disc as an example of a recording medium, the data is packed in an audio (A) pack shown in FIG. This pack is 203
For 4 bytes of user data (A packet, V packet), 4 bytes of pack start information and 6 bytes of S
It is configured by adding a 14-byte pack header of CR (System Clock Reference: system time reference value) information, 3-byte Mux rate (rate) information, and 1-byte stuffing (1 pack = 2048 total).
Part-Time Job). In this case, the time of the A pack in the same title can be managed by setting the SCR information as the time stamp to be â1â in the first pack in the ACB unit so as to be continuous in the same title.
ãï¼ï¼ï¼ï¼ãå§ç¸®ï¼°ï¼£ï¼ã®ï¼¡ãã±ããã¯å³ï¼ã«è©³ãã示
ãããã«ãï¼ãï¼ï¼ãã¤ãã®ãã±ãããããã¨ãå§ç¸®ï¼°
ï¼£ï¼ã®ãã©ã¤ãã¼ããããã¨ãå³ï¼ã«ç¤ºããã©ã¼ããã
ã®ï¼ãªããï¼ï¼ï¼ï¼ãã¤ãã®ãªã¼ãã£ãªãã¼ã¿ï¼å§ç¸®ï¼°
ï¼£ï¼ï¼ã«ããæ§æããã¦ãããå§ç¸®ï¼°ï¼£ï¼ã®ãã©ã¤ãã¼
ããããã¯ã ã»ï¼ãã¤ãã®ãµãã¹ããªã¼ã IDã¨ã ã»ï¼ãã¤ãã®ï¼µï¼°ï¼£ï¼ï¼¥ï¼¡ï¼®âISRCï¼Universal Pr
oduct Code/European Article Number-International S
tandard Recording Codeï¼çªå·ãåã³ï¼µï¼°ï¼£ï¼ï¼¥ï¼¡ï¼®â
ISRCãã¼ã¿ã¨ã ã»ï¼ãã¤ãã®ãã©ã¤ãã¼ããããé·ã¨ã ã»ï¼ãã¤ãã®ç¬¬ï¼ã¢ã¯ã»ã¹ã¦ããããã¤ã³ã¿ã¨ã ã»ï¼ãã¤ãã®ãªã¼ãã£ãªãã¼ã¿æ
å ±ï¼ï¼¡ï¼¤ï¼©ï¼ã¨ã ã»ï¼ãï¼ãã¤ãã®ã¹ã¿ããã£ã³ã°ãã¤ãã¨ã«ã ããæ§æããã¦ãããThe A packet of the compressed PCM has a packet header of 9 to 22 bytes and a compressed P
The CM private header and 1 to 2015 bytes of audio data (compressed P
CM). The private header of the compressed PCM is: 1-byte substream ID, 2 bytes of UPC / EAN-ISRC (Universal Prism).
oduct Code / European Article Number-International S
tandard Recording Code) number and UPC / EAN-
ISRC data, 1-byte private header length, 2-byte first access unit pointer, 4-byte audio data information (ADI), and 0-7 byte stuffing bytes. ing.
ãï¼ï¼ï¼ï¼ãããã¦ãADIå
ã«ï¼ç§å¾ã®ã¢ã¯ã»ã¹ã¦ã
ããããµã¼ãããããã®åæ¹ã¢ã¯ã»ã¹ã¦ãããã»ãµã¼ã
ãã¤ã³ã¿ã¨ãï¼ç§åã®ã¢ã¯ã»ã¹ã¦ãããããµã¼ãããã
ãã®å¾æ¹ã¢ã¯ã»ã¹ã¦ãããã»ãµã¼ããã¤ã³ã¿ãã¨ãã«ï¼
ãã¤ãã§ã»ããããããå
·ä½çã«ã¯ãADIã®ï¼ãã¤ã
ç®ã«åæ¹ã¢ã¯ã»ã¹ã¦ãããã»ãµã¼ããã¤ã³ã¿ããï¼ãã¤
ãç®ã«å¾æ¹ã¢ã¯ã»ã¹ã¦ãããã»ãµã¼ããã¤ã³ã¿ãã»ãã
ãããããã®ããã«ï¼¡ï¼¤ï¼©ã¯ãå§ç¸®ï¼°ï¼£ï¼ã§ã¯ï¼ãã¤ã
ã«æ¸å°ããããããªã¼ãã£ãªãã¼ã¿ãï¼ï¼ï¼ï¼ãã¤ãã¾
ã§åç´ã§ãããThe forward access unit search pointer for searching for the access unit one second later and the backward access unit search pointer for searching for the access unit one second earlier in the ADI are both 1
Set in bytes. Specifically, the forward access unit search pointer is set in the first byte of the ADI, and the backward access unit search pointer is set in the eighth byte. Thus, ADI can store up to 2015 bytes of audio data in order to reduce it to 4 bytes in compressed PCM.
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ç¤ºãå§ç¸®ï¼°ï¼£ï¼ï¼ï¼°ï¼°ï¼£ï¼ï¼ã®ãªã¼
ãã£ãªãã±ããã«ããããªã¼ãã£ãªãã¼ã¿ã¨ãªã¢ã¯ãå³
ï¼ã«ç¤ºãããã«è¤æ°ã®ï¼°ï¼°ï¼£ï¼ã¢ã¯ã»ã¹ã¦ãããã«ãã
æ§æãããï¼°ï¼°ï¼£ï¼ã¢ã¯ã»ã¹ã¦ãããã¯ï¼°ï¼°ï¼£ï¼ã·ã³ã¯
æ
å ±ã¨ãµããã±ããã«ããæ§æããã¦ãããæåã®ï¼°ï¼°
ï¼£ï¼ã¢ã¯ã»ã¹ã¦ãããå
ã®ãµããã±ããã¯ããã£ã¬ã¯ã
ãªã¨ããµãã¹ããªã¼ã ãBSï¼ãã¨ãCRCï¼ï¼ãã¤ã
åã¯ï¼ãã¤ãï¼ã¨ããµãã¹ããªã¼ã ãBSï¼ãã¨ãCR
ï¼£ã¨ã¨ã¯ã¹ãã©æ
å ±ã«ããæ§æããããµãã¹ããªã¼ã
ãBSï¼ãããBSï¼ãã¯ï¼°ï¼°ï¼£ï¼ãããã¯ã®ã¿ã«ãã
æ§æããã¦ãããï¼çªç®ä»¥éã®ï¼°ï¼°ï¼£ï¼ã¢ã¯ã»ã¹ã¦ãã
ãå
ã®ãµããã±ãããããã£ã¬ã¯ããªã¨ããµãã¹ããªã¼
ã ãBSï¼ãã¨ãCRCã¨ããµãã¹ããªã¼ã ãBSï¼ã
ã¨ãCRCã¨ã¨ã¯ã¹ãã©æ
å ±ã«ããæ§æããããµãã¹ã
ãªã¼ã ãBSï¼ãããBSï¼ãã¯ãªã¹ã¿ã¼ããããã¨ï¼°
ï¼°ï¼£ï¼ãããã¯ã«ããæ§æããã¦ãããThe audio data area in the compressed PCM (PPCM) audio packet shown in FIG. 6 is composed of a plurality of PPCM access units as shown in FIG. 7, and the PPCM access unit is composed of PPCM sync information and sub-packets. I have. First PP
The subpacket in the CM access unit includes a directory, a substream âBS0â, a CRC (1 byte or 2 bytes), a substream âBS1â,
C and extra information, and the sub-streams âBS0â and âBS1â are composed of only PPCM blocks. Sub-packets in the second and subsequent PPCM access units also include a directory, a sub-stream âBS0â, a CRC, and a sub-stream âBS1â.
, CRC and extra information, and the sub-streams âBS0â and âBS1â have a restart header and P
It is composed of PCM blocks.
ãï¼ï¼ï¼ï¼ãã¾ããå³ï¼ã«ç¤ºã符å·åé¨ï¼ââï¼ãï¼â
âï¼ã«ããäºæ¸¬ç¬¦å·åãããå¯å¤ã¬ã¼ããããã¹ããªã¼
ã ãã¼ã¿ããããã¯ã¼ã¯ãä»ãã¦ä¼éããå ´åã«ã¯ã符
å·åå´ã§ã¯å³ï¼ç¤ºãããã«ä¼éç¨ã«ãã±ããåãï¼ã¹ã
ããï¼³ï¼ï¼ï¼ã次ãã§ãã±ããããããä»ä¸ãï¼ã¹ãã
ãï¼³ï¼ï¼ï¼ã次ãã§ãã®ãã±ããããããã¯ã¼ã¯ä¸ã«é
ãåºãï¼ã¹ãããï¼³ï¼ï¼ï¼ã復å·å´ã§ã¯å³ï¼ã«ç¤ºããã
ã«ããããé¤å»ãï¼ã¹ãããï¼³ï¼ï¼ï¼ã次ãã§ãã¼ã¿ã
復å
ãï¼ã¹ãããï¼³ï¼ï¼ï¼ã次ãã§ãã®ãã¼ã¿ãã¡ã¢ãª
ã«æ ¼ç´ãã¦å¾©å·ãå¾
ã¤ï¼ã¹ãããï¼³ï¼ï¼ï¼ãThe encoding units 2'-1, 2 'shown in FIG.
When the variable rate bit stream data predicted and coded according to -2 is transmitted through a network, the coding side packetizes the data for transmission as shown in FIG. 8 (step S41), and then attaches a packet header (step S41). (Step S42) Then, the packet is sent out onto the network (Step S43). On the decoding side, the header is removed as shown in FIG. 9 (step S51), the data is restored (step S52), and the data is stored in a memory and decoding is waited (step S53).
ãï¼ï¼ï¼ï¼ããªããä¸è¨å®æ½å½¢æ
ã§ã¯ãã¹ãã¬ãªï¼chã
ã¼ã¿ï¼ï¼¬ãï¼²ï¼ããã®ã¾ã¾ä¼éãããã ãï¼ãï¼ï¼¬ï¼ï¼² ãï¼ãï¼ï¼¬âï¼² ãï¼ãããï¼ãã¯åã ãï¼ãï¼ï¼¬ï½ï½
âï½Ãï¼£ ãã ããï¼â¦ï½â¦ï¼ â¦ï¼ï¼ï¼â ã«ããï¼ãã£ãã«ãï¼ãããï¼ãã¨å
±ã«ãç¸é¢ã®ããä¿¡
å·ã«å¤æãã¦äºæ¸¬ç¬¦å·åããããã«ãã¦ãããï¼ç¬¬ï¼ã®
宿½å½¢æ
ï¼ããã®å ´åã«ã¯ã復å·åå´ã®ãã¯ã¹ï¼ãããª
ã¯ã¹åè·¯ï¼âã¯ãã£ãã«ãï¼ãããï¼ããå ç®ãããã¨
ã«ãããã£ãã«ï¼¬ããæ¸ç®ãããã¨ã«ãããã£ãã«ï¼²ã
çæãããã¨ãã§ããããªããä¸è¨å®æ½ä¾ã§ã¯ããã«ã
ãã£ã³ãã«ï¼ï¼ï½ï½ï¼ã¨ã¹ãã¬ãªï¼ï¼ï½ï½ï¼ã¨å¾©å
ãã
ããã«ãã¦ãããããããã䏿¹ã§ããããã¨ã¯è¨ãã¾
ã§ããªããIn the above embodiment, the stereo 2ch data (L, R) is transmitted as it is, but â1â = L + R â2â = LR â3â to â5â are the same â6â = Lfe âa à C However, 0 ⦠a ⦠1... (2) â² may be converted into a correlated signal together with the six channels â1â to â6â for predictive coding (second embodiment). Form). In this case, the mix & matrix circuit 4 â² on the decoding side can generate the channel L by adding the channels â1â and â2â, and generate the channel R by subtracting the channel L. In the above embodiment, the multi-channel (6ch) and the stereo (2ch) are restored, but it goes without saying that either one may be restored.
ãï¼ï¼ï¼ï¼ãã¾ããå³ï¼ï¼ã¯ç¬¬ï¼ã®å®æ½ã®å½¢æ
ã示ãå³
ã§ããã®å ´åã«ã¯ãã¦ã³ããã¯ã¹ãããã¨ãªããåæ¹ã°
ã«ã¼ãã«é¢ããï¼chãï¼ãããï¼ãã ãï¼ãï¼ï¼¬ï½ï¼ï¼²ï½ ãï¼ãï¼ï¼¬ï½âï¼²ï½ ã¨ãã¦ä¼éãããããã¦ãåçå´ã§ã¯ãææã«å¿ãã¦å¾
段å´ã®ããã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼âããåºåãããã
ã¦ã³ããã¯ã¹ãããªãä¿¡å·ï¼¬ï½ï¼ï¼²ï½ãã¹ãã¬ãªï¼ãã£
ã³ãã«ã¨ãã¦ä½¿ç¨ãããããã®åè·¯ï¼âå
ã§ãã¦ã³ãã
ã¯ã¹ããã¦åãåºãããã¹ãã¬ãªï¼ãã£ã³ãã«ä¿¡å·ï¼¬ï¼
ï¼²ã使ç¨ãããã¨ãã§ãããFIG. 10 is a diagram showing the third embodiment. In this case, without downmixing, 2ch â1â and â2â for the front group are changed to â1â = Lf + Rf â2â = It is transmitted as Lf-Rf. Then, on the reproduction side, the signals Lf and Rf which are not downmixed and output from the subsequent mix & matrix circuit 4 'are used as two stereo channels as required, or are downmixed and taken out in this circuit 4'. Stereo 2-channel signal L,
R can also be used.
ãï¼ï¼ï¼ï¼ã次ã«å³ï¼ï¼ãå³ï¼ï¼ãå³ï¼ï¼ãåç
§ãã¦ç¬¬
ï¼ã®å®æ½å½¢æ
ã«ã¤ãã¦èª¬æãããä¸è¨ã®å®æ½å½¢æ
ã§ã¯ã
ï¼ã°ã«ã¼ãã®ç¸é¢æ§ã®ä¿¡å·ãï¼ãããï¼ããäºæ¸¬ç¬¦å·å
ããããã«æ§æããã¦ãããããã®ç¬¬ï¼ã®å®æ½å½¢æ
ã§ã¯
è¤æ°ã°ã«ã¼ãã®ç¸é¢æ§ã®ããä¿¡å·ãçæãã¦äºæ¸¬ç¬¦å·å
ããå§ç¸®çãæãé«ãã°ã«ã¼ãã®äºæ¸¬ç¬¦å·åãã¼ã¿ãé¸
æããããã«æ§æããã¦ãããã¾ãããã®å®æ½ä¾ã§ã¯ã
ã®ï¼ã°ã«ã¼ãå
ã«ããã符å·åã¯ãåè¿°ã®å宿½ä¾ã®å ´
åã®ããã«åæ¹ã°ã«ã¼ãã«é¢ããï¼ï½ï½ã¨ä»ã®ã°ã«ã¼ã
ã«é¢ããï¼ï½ï½ã«åé¡ãã¦å¤æãããããªãã¨ã¯ãã
ã«ãä¸ã¤ã«ã¾ã¨ãã符å·åå¦çãè¡ãããæ§æã§ãå³ï¼
ï¼ã¯åè¿°ã®å³ï¼ã«å¯¾å¿ããå³ã¨ãã¦ç¤ºãã¦ãããã¾ãã
å³ï¼ï¼ã¯ç¬¦å·åé¨ã®è©³ç´°ãããã¯ã示ããã®ã§ãããã
æ¬å®æ½ä¾ã®å ´åã«ã¯ï½åã®ç¸é¢åè·¯ï¼âï¼ãï¼âï½ã¾ã§
ãããã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼âå´ã«è¨ãããã¦ããã
ãããï½åã®ç¸é¢åè·¯ï¼âï¼ãï¼âï½ã¯ä¾ãã°ï¼chï¼ï¼¬
ï½ãï¼£ãï¼²ï½ãLï½ãï¼²ï½ãLï½ï½
ï¼ã®ï¼°ï¼£ï¼ãã¼ã¿
ããç¸é¢æ§ãç°ãªãï½ç¨®é¡ã®ï¼chä¿¡å·ãï¼ãããï¼ãã«
夿ãããNext, a fourth embodiment will be described with reference to FIG. 11, FIG. 12, and FIG. In the above embodiment,
Although one group of correlated signals â1â to â6â are configured to be predictively coded, in the fourth embodiment, a plurality of groups of correlated signals are generated and predicted and coded. It is configured to select the prediction coded data of the group having the highest compression ratio. Further, in this embodiment, the encoding in one group is not classified and converted into 2ch for the front group and 4ch for the other group as in each of the above-described embodiments. FIG. 1 shows a configuration in which the combined encoding process is performed.
Reference numeral 1 denotes a diagram corresponding to FIG. Also,
FIG. 12 shows a detailed block of the encoding unit.
In this embodiment, n correlation circuits 1-1 to 1-n are provided on the mix & matrix circuit 1 'side.
These n correlation circuits 1-1 to 1-n are, for example, 6 ch (L
f, C, Rf, Ls, Rs, Lfe) are converted into n types of 6-channel signals â1â to â6â having different correlations.
ãï¼ï¼ï¼ï¼ãä¾ãã°ç¬¬ï¼ã®ç¸é¢åè·¯ï¼âï¼ã¯ä»¥ä¸ã®ãã
ã«å¤æãã ãï¼ãï¼ï¼¬ï½ ãï¼ãï¼ï¼£âï¼ï¼¬ï½ï¼ï¼²ï½ï¼ï¼ï¼ ãï¼ãï¼ï¼²ï½âï¼¬ï½ ãï¼ãï¼ï¼¬ï½âï½ÃLï½ï½
ãï¼ãï¼ï¼²ï½âï½Ãï¼²ï½ ãï¼ãï¼ï¼¬ï½ï½
ã¾ãã第ï½ã®ç¸é¢åè·¯ï¼âï½ã¯ä»¥ä¸ã®ããã«å¤æãã ãï¼ãï¼ï¼¬ï½ï¼ï¼²ï½ ãï¼ãï¼ï¼£âï¼¬ï½ ãï¼ãï¼ï¼²ï½âï¼¬ï½ ãï¼ãï¼ï¼¬ï½âï¼¬ï½ ãï¼ãï¼ï¼²ï½âï¼¬ï½ ãï¼ãï¼ï¼¬ï½ï½
âï¼£ ã¾ããä»ã®ç¸é¢åè·¯ã¯ç¬¬ï¼ã®å®æ½å½¢æ
ã®ããã«å¤æã
ããFor example, the first correlation circuit 1-1 converts as follows: "1" = Lf "2" = C- (Ls + Rs) / 2 "3" = Rf-Lf "4" = Ls-a à Lfe â5â = Rsâb à Rf â6â = Lfe Further, the n-th correlation circuit 1-n converts as follows: â1â = Lf + Rf â2â = CâLf â3â = RfâLf â4â = LsâLf â5â = RsâLf â6â = LfeâC Further, other correlation circuits perform conversion as in the first embodiment.
ãï¼ï¼ï¼ï¼ãã¾ããç¸é¢åè·¯ï¼âï¼ãï¼âï½æ¯ã«äºæ¸¬å
è·¯ï¼ï¼ã¨ãããã¡ã»é¸æå¨ï¼ï¼ãè¨ããããã°ã«ã¼ãæ¯
ã®äºæ¸¬æ®å·®ã®æå°å¤ã®ãã¼ã¿éã«åºã¥ãã¦å§ç¸®çãæã
é«ãã°ã«ã¼ããç¸é¢é¸æä¿¡å·çæå¨ï¼ï¼ï½ã«ãã鏿ã
ããããã®ã¨ãããã©ã¼ãããååè·¯ï¼ï¼ã¯ãã®é¸æã
ã©ã°ï¼ç¸é¢åè·¯é¸æãã©ã°ããã®ç¸é¢åè·¯ã®ç¸é¢ä¿æ°
ï½ãï½ï¼ã追å ãã¦å¤éåãããFurther, a prediction circuit 15 and a buffer / selector 16 are provided for each of the correlation circuits 1-1 to 1-n, and the group having the highest compression ratio is determined based on the data amount of the minimum value of the prediction residual for each group. Are selected by the correlation selection signal generator 17b. At this time, the formatting circuit 19 adds and multiplexes the selection flag (correlation circuit selection flag, correlation coefficients a and b of the correlation circuit).
ãï¼ï¼ï¼ï¼ãããã¦ãå³ï¼ï¼ã¯åè¿°ã®å³ï¼ã«å¯¾å¿ããã
ã¼ã¿ã¨ãªã¢ã示ãããã®å®æ½ä¾ã§ã¯ãµãã¹ããªã¼ã ãï¼¢
ï¼³ï¼ããç¨ããããµãã¹ããªã¼ã ãBSï¼ãã®ã¿ã§æ§æ
ãããã¨ã«ãªããFIG. 13 shows a data area corresponding to FIG. 6 described above.
Instead of using âS1â, the sub-stream âBS0â alone is used.
ãï¼ï¼ï¼ï¼ãã¾ããå³ï¼ï¼ã«ç¤ºã復å·åå´ã§ã¯ã符å·å
å´ã®ç¸é¢åè·¯ï¼âï¼ãï¼âï½ã«å¯¾ãã¦ï½åã®ç¸é¢åè·¯ï¼
âï¼ãï¼âï½ï¼åã¯ä¿æ°ï½ãï½ã夿´å¯è½ãªï¼ã¤ã®ç¸é¢
åè·¯ï¼ï¼ãè¨ããããããªããå³ï¼ï¼ã«ç¤ºãï½ã°ã«ã¼ã
ã®äºæ¸¬åè·¯ãåä¸ã®æ§æã§ããå ´åã復å·è£
ç½®ã§ã¯å³ï¼
ï¼ã«ç¤ºãããã«ï½ã°ã«ã¼ãåã®äºæ¸¬åè·¯ãè¨ããå¿
è¦ã¯
ãªããï¼ã¤ã®ã°ã«ã¼ãåã®äºæ¸¬åè·¯ã§ãããããã¦ã符
å·åè£
ç½®ããä¼éããã鏿ãã©ã°ã«åºã¥ãã¦ç¸é¢åè·¯
ï¼âï¼ãï¼âï½ã®ï¼ã¤ã鏿ãåã¯ä¿æ°ï½ãï½ãè¨å®ã
ã¦å
ã®ï¼chï¼ï¼¬ï½ãï¼£ãï¼²ï½ãLï½ãï¼²ï½ãLï½ï½
ï¼ã
復å
ããã¾ããå¼ï¼ï¼ï¼ã«ãããã«ããã£ãã«ããã¦ã³
ãã¯ã¹ãã¦ã¹ãã¬ãªï¼chãã¼ã¿ï¼ï¼¬ãï¼²ï¼ãçæããã
ã¾ãããã£ã³ãã«æ°ããï¼ãããï¼ãã®ï¼ãã£ã³ãã«æ¹
å¼ã®ãã®ã¯ãä¸ä¾ã§ãã£ã¦ï¼ãã£ã³ãã«æ¹å¼çä»ã®æ¹å¼
ã®ãã®ã§ãã£ã¦ããããFurther, on the decoding side shown in FIG. 14, n correlation circuits 4 are provided for the correlation circuits 1-1 to 1-n on the encoding side.
â1 to 4-n (or one correlation circuit 4 whose coefficients a and b can be changed) are provided. Note that when the prediction circuits of the n groups shown in FIG. 12 have the same configuration,
As shown in FIG. 4, there is no need to provide prediction circuits for n groups, and prediction circuits for one group are sufficient. Then, one of the correlation circuits 4-1 to 4-n is selected based on the selection flag transmitted from the encoding device, or coefficients a and b are set and the original 6 ch (Lf, C, Rf, Ls, Rs, Lfe) are restored, and the multi-channel is downmixed according to equation (1) to generate stereo 2-ch data (L, R).
Further, the 6-channel system having the number of channels â1â to â6â is an example, and another system such as a 5-channel system may be used.
ãï¼ï¼ï¼ï¼ãã¾ããä¸è¨ã®ç¬¬ï¼ã®å®æ½å½¢æ
ã§ã¯ãï¼ç¨®é¡
ã®ç¸é¢æ§ã®ä¿¡å·ãï¼ãããï¼ããäºæ¸¬ç¬¦å·åããããã«
æ§æããã¦ãããããã®ä¿¡å·ãï¼ãããï¼ãã®ã°ã«ã¼ã
ã¨åä¿¡å·ï¼ï¼¬ï½ãï¼£ãï¼²ï½ãLï½ãï¼²ï½ãLï½ï½
ï¼ã®ã°
ã«ã¼ããäºæ¸¬ç¬¦å·åããå§ç¸®çãé«ãæ¹ã®ã°ã«ã¼ããé¸
æããããã«ãã¦ããããIn the first embodiment, one kind of correlation signal "1" to "6" is configured to be predictively coded. However, the signals "1" to "6" are encoded. And the group of the original signals (Lf, C, Rf, Ls, Rs, Lfe) may be predictively coded and the group with the higher compression ratio may be selected.
ãï¼ï¼ï¼ï¼ã[0030]
ãçºæã®å¹æã以ä¸èª¬æããããã«æ¬çºæã«ããã°ãå
ã®ãã«ããã£ãã«ã®é³å£°ä¿¡å·ãã¹ãã¬ãªï¼ãã£ãã«ã®é³
声信å·ã«å¤æããã¨å
±ã«ãå¾ãããã¹ãã¬ãªï¼ãã£ãã«
ãå«ã第ï¼ã®ã°ã«ã¼ãã¨ä»ã®ãã£ãã«ãå«ã第ï¼ã®ã°ã«
ã¼ãã«åé¡ãã¦ãå°ãªãã¨ãåè¨ç¬¬ï¼ã®ã°ã«ã¼ãã®ãã£
ãã«ã®é³å£°ä¿¡å·ãç¸é¢æ§ã®ããé³å£°ä¿¡å·ã«å¤æãã¦åã
ã£ãã«ãäºæ¸¬ç¬¦å·åããããã«ããã®ã§ããã«ããã£ã
ã«ã®é³å£°ä¿¡å·ãäºæ¸¬ç¬¦å·åããå ´åã«å§ç¸®çãæ¹åãã
ãã¨ãã§ããã¨ã¨ãã«ã第ï¼ã®ã°ã«ã¼ãã®ã¿ãç¨ãã¦å¾©
å·ãã¦ã¹ãã¬ãªï¼ãã£ã³ãã«ãåçãããã¨ãã§ãããAs described above, according to the present invention, the original multi-channel audio signal is converted into a stereo two-channel audio signal, and the obtained first group including the stereo two channels and other two-channel audio signals are converted. Since the channels are classified into a second group including channels, and at least the audio signals of the channels of the second group are converted into correlated audio signals to predictively encode each channel, multi-channel audio is obtained. The compression rate can be improved when a signal is predictively coded, and two stereo channels can be reproduced by decoding using only the first group.
ãå³ï¼ãæ¬çºæã«ä¿ãé³å£°ç¬¦å·åè£
ç½®åã³é³å£°å¾©å·è£
ç½®
ã®ç¬¬ï¼ã®å®æ½å½¢æ
ã示ããããã¯å³ã§ãããFIG. 1 is a block diagram illustrating a first embodiment of a speech encoding device and a speech decoding device according to the present invention.
ãå³ï¼ãå³ï¼ã®ç¬¦å·åé¨ã詳ãã示ããããã¯å³ã§ã
ããFIG. 2 is a block diagram illustrating an encoding unit of FIG. 1 in detail.
ãå³ï¼ãå³ï¼ãå³ï¼ã®ç¬¦å·åé¨ã«ãã符å·åããããã
ãã¹ããªã¼ã ã示ã説æå³ã§ãããFIG. 3 is an explanatory diagram showing a bit stream encoded by an encoding unit shown in FIGS. 1 and 2;
ãå³ï¼ãå³ï¼ã®å¾©å·åé¨ã詳ãã示ããããã¯å³ã§ã
ããFIG. 4 is a block diagram illustrating a decoding unit of FIG. 1 in detail;
ãå³ï¼ãDVDã®ããã¯ã®ãã©ã¼ãããã示ã説æå³ã§
ãããFIG. 5 is an explanatory diagram showing a format of a DVD pack.
ãå³ï¼ãDVDã®ãªã¼ãã£ãªããã¯ã®ãã©ã¼ãããã示
ã説æå³ã§ãããFIG. 6 is an explanatory diagram showing a format of a DVD audio pack.
ãå³ï¼ãå³ï¼ã®ãªã¼ãã£ãªãã¼ã¿ã¨ãªã¢ã®ãã©ã¼ããã
ã詳ãã示ã説æå³ã§ãããFIG. 7 is an explanatory diagram showing a format of an audio data area in FIG. 6 in detail;
ãå³ï¼ãé³å£°ä¼éæ¹æ³ã示ãããã¼ãã£ã¼ãã§ãããFIG. 8 is a flowchart showing a voice transmission method.
ãå³ï¼ãé³å£°ä¼éæ¹æ³ã示ãããã¼ãã£ã¼ãã§ãããFIG. 9 is a flowchart showing a voice transmission method.
ãå³ï¼ï¼ã第ï¼ã®å®æ½å½¢æ
ã®é³å£°ç¬¦å·åè£
ç½®åã³é³å£°å¾©
å·è£
ç½®ã示ããããã¯å³ã§ãããFIG. 10 is a block diagram illustrating a speech encoding device and a speech decoding device according to a third embodiment.
ãå³ï¼ï¼ãæ¬çºæã«ä¿ãé³å£°ç¬¦å·åè£
ç½®åã³é³å£°å¾©å·è£
ç½®ã®ç¬¬ï¼ã®å®æ½å½¢æ
ã示ããããã¯å³ã§ãããFIG. 11 is a block diagram showing a fourth embodiment of the speech encoding device and the speech decoding device according to the present invention.
ãå³ï¼ï¼ã第ï¼ã®å®æ½å½¢æ
ã®é³å£°ç¬¦å·åè£
ç½®ã示ããã
ãã¯å³ã§ãããFIG. 12 is a block diagram illustrating a speech encoding device according to a fourth embodiment.
ãå³ï¼ï¼ãå³ï¼ã«å¯¾å¿ããå¥ã®å®æ½ä¾ã®èª¬æå³ã§ãããFIG. 13 is an explanatory diagram of another embodiment corresponding to FIG. 7;
ãå³ï¼ï¼ã第ï¼ã®å®æ½å½¢æ
ã®é³å£°å¾©å·è£
ç½®ã示ãããã
ã¯å³ã§ãããFIG. 14 is a block diagram illustrating a speech decoding device according to a fourth embodiment.
ï¼â ï¼chãã¯ã¹ï¼ãããªã¯ã¹åè·¯ï¼ç¸é¢ææ®µããã¦ã³
ãã¯ã¹ææ®µï¼ ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ äºæ¸¬åè·¯
ï¼ãããã¡ã»é¸æå¨ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼
ï¼ï¼¤ï¼ã¨å
±ã«äºæ¸¬ç¬¦å·åææ®µãæ§æãããï¼ ï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ãï¼ï¼ï¼¤ï¼ ãããã¡ã»
é¸æå¨ ï¼ï¼ ãã©ã¼ãããååè·¯ï¼ãã©ã¼ãããåææ®µï¼1 '6ch mix & matrix circuit (correlation means, downmix means) 13D1, 13D2, 15D1-15D4 Prediction circuit (buffer / selector 14D1, 14D2, 16D1-1)
Together with 6D4, it constitutes a predictive coding means. 14D1, 14D2, 16D1-16D4 buffer
Selector 19 Formatting circuit (Formatting means)
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4