RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from https://patents.google.com/patent/JPH09102742A/en below:

JPH09102742A - Encoding method and device, decoding method and device and recording medium

ãçºæã®è©³ç´°ãªèª¬æãDETAILED DESCRIPTION OF THE INVENTION

ãï¼ï¼ï¼ï¼ã[0001]

ãçºæã®å±ããæè¡åéãæ¬çºæã¯ãç¬¦å·åæ¹æ³ããã³ è£ç½®ãå¾©å·åæ¹æ³ããã³è£ç½®ãä¸¦ã³ã«è¨é²åªä½ã«é¢ãã ä¾ãã°ããããªãã¼ãã¬ã³ã¼ãããããªãã£ã¹ã¯ãã¬ã¼ ã¤çã®ã¹ãã¬ãªãããããããã«ããµã¦ã³ãé³é¿ã·ã¹ã ã ã«ããã¦ããã«ããã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ãå§ç¸®ç¬¦ å·åããå ´åã«ç¨ãã¦å¥½é©ãªç¬¦å·åæ¹æ³ããã³è£ç½®ãå¾© å·åæ¹æ³ããã³è£ç½®ãä¸¦ã³ã«è¨é²åªä½ã«é¢ãããTECHNICAL FIELD The present invention relates to an encoding method and apparatus, a decoding method and apparatus, and a recording medium, For example, a stereo such as a video tape recorder, a video disc player, or a so-called multi-sound acoustic system, which is suitable for use in compressing and encoding a multi-channel digital signal, an encoding method and apparatus, a decoding method and apparatus, and Recording medium

ãï¼ï¼ï¼ï¼ã[0002]

ãå¾æ¥ã®æè¡ãå¾æ¥ããããªã¼ãã£ãªãããã¯é³å£°çã® ä¿¡å·ã®é«è½çç¬¦å·åã®ææ³ããã³è£ç½®ã«ã¯ç¨®ããããã ä¾ãã°ãæéé åã®ãªã¼ãã£ãªä¿¡å·çãåä½æéæ¯ã«ã ããã¯åãããã®ãããã¯æ¯ã®æéè»¸ã®ä¿¡å·ãå¨æ³¢æ°è»¸ ä¸ã®ä¿¡å·ã«å¤æï¼ç´äº¤å¤æï¼ãã¦è¤æ°ã®å¨æ³¢æ°å¸¯åã«å å²ããåå¸¯åæ¯ã«ç¬¦å·åãããããã¯åå¨æ³¢æ°å¸¯ååå² æ¹å¼ãããããå¤æç¬¦å·åï¼ãã©ã³ã¹ãã©ã¼ã ã³ã¼ãã£ ã³ã°ï¼ããæéé åã®ãªã¼ãã£ãªä¿¡å·çãåä½æéæ¯ã« ãããã¯åããªãã§ãè¤æ°ã®å¨æ³¢æ°å¸¯åã«åå²ãã¦ç¬¦å· åããéãããã¯åå¨æ³¢æ°å¸¯ååå²æ¹å¼ã§ããå¸¯ååå² ç¬¦å·åï¼ãµããã³ãã³ã¼ãã£ã³ã°ï¼ï¼³ï¼¢ï¼£ï¼çãæãã ãã¨ãã§ããã2. Description of the Related Art Conventionally, there are various techniques and apparatuses for highly efficient encoding of signals such as audio or voice. For example, an audio signal in the time domain is divided into blocks for each unit time, the time axis signal of each block is converted into a signal on the frequency axis (orthogonal conversion), and divided into a plurality of frequency bands. Blocking frequency band division method for encoding, so-called transform coding (transform coding), or non-blocking for dividing into a plurality of frequency bands and encoding without dividing the time domain audio signal into blocks for each unit time Band division coding (sub-band coding: SBC), which is a generalized frequency band division method, can be used.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¿°ã®å¸¯ååå²ç¬¦å·åã¨å¤æç¬¦å·å ã¨ãçµã¿åãããé«è½çç¬¦å·åã®ææ³ããã³è£ç½®ãèã ããã¦ããããã®å ´åã«ã¯ãä¾ãã°ãå¸¯ååå²ç¬¦å·åã§ å¸¯ååå²ãè¡ã£ãå¾ãåå²ãããåå¸¯åæ¯ã®ä¿¡å·ãå¨æ³¢ æ°é åã®ä¿¡å·ã«ç´äº¤å¤æããç´äº¤å¤æãããä¿¡å·ãåé åæ¯ã«ç¬¦å·åããããFurther, a high-efficiency coding method and apparatus combining the above-described band-division coding and transform coding has been considered. In this case, for example, band-division is performed by band-division coding. After that, the divided signal for each band is orthogonally transformed into a signal in the frequency domain, and the orthogonally transformed signal is encoded for each region.

ãï¼ï¼ï¼ï¼ãããã§ãä¸è¿°ããå¸¯ååå²ç¬¦å·åã«ããã¦ ç¨ããããå¸¯ååå²ç¨ãã£ã«ã¿ã¨ãã¦ã¯ãä¾ãã°ï¼±ï¼ï¼¦ ï¼ç´äº¤ãã©ã¼ãã£ã«ã¿ï¼Quadrature Mirror Filterï¼ãª ã©ã®ãã£ã«ã¿ãããããã®ï¼±ï¼ï¼¦ãã£ã«ã¿ã¯ãæç®ãã ã£ã¸ã¿ã«ã»ã³ã¼ãã£ã³ã°ã»ãªãã»ã¹ãã¼ãã»ã¤ã³ã»ãµã ãã³ãºã("Digital coding of speech in subbands"R. E.Crochiere, BellSyst.Tech. J., Vol.55,No.8 1976) ã«è¿°ã¹ããã¦ããããã®ï¼±ï¼ï¼¦ãã£ã«ã¿ã¯ãå¸¯åãçã ã³ãå¹ã«ï¼åå²ãããã®ã§ããããã®ãã£ã«ã¿ã«ããã¦ ã¯ãåå²ããå¸¯åãå¾ã«åæããéã«ããããã¨ãªã¢ã· ã³ã°ãçºçããªããã¨ãç¹å¾´ã¨ãªã£ã¦ãããHere, as the band division filter used in the above band division encoding, for example, QMF is used. There is a filter such as (Quadrature Mirror Filter), and this QMF filter is referred to as "Digital coding of speech in subbands" R. E. Crochiere, BellSyst.Tech. J., Vol.55, No.8 1976) It is described in. This QMF filter divides the band into two equal bandwidths, and this filter is characterized in that so-called aliasing does not occur when the divided bands are combined later.

ãï¼ï¼ï¼ï¼ãã¾ããæç®ãããªãã§ã¤ãºã»ã¯ã¡ãã©ãã¥ ã¢ã»ãã£ã«ã¿ã¼ãºâæ°ããå¸¯ååå²ç¬¦å·åæè¡ã("Poly phase Quadrature filters -A newsubband coding tech nique", oseph H. Rothweiler ICASSP 83, BOSTON) ã« ã¯ãçå¸¯åå¹ã®ãã£ã«ã¿åå²æ¹æ³ãè¿°ã¹ããã¦ãããã ã®ããªãã§ã¤ãºã»ã¯ã¡ãã©ãã¥ã¢ã»ãã£ã«ã¿ã«ããã¦ ã¯ãä¿¡å·ãçãã³ãå¹ã®è¤æ°ã®å¸¯åã«åå²ããéã«ä¸åº¦ ã«åå²ã§ãããã¨ãç¹å¾´ã¨ãªã£ã¦ãããIn addition, the document "Polyphase Quadrature Filters-New Band Division Coding Technique"("Poly phase Quadrature filters -A newsubband coding tech nique ", oseph H. Rothweiler ICASSP 83, BOSTON) describes a method for dividing equal-bandwidth filters. In this polyphase quadrature filter, the signal is divided into multiple bands of equal bandwidth. The feature is that it can be divided at one time.

ãï¼ï¼ï¼ï¼ãããã«ãä¸è¿°ããç´äº¤å¤æã®ã¹ãã¯ãã«å¤ æã¨ãã¦ã¯ãä¾ãã°ãå¥åãªã¼ãã£ãªä¿¡å·ãæå®åä½æ éï¼ãã¬ã¼ã ï¼ã§ãããã¯åãããããã¯æ¯ã«é¢æ£ãã¼ ãªã¨å¤æï¼ï¼¤ï¼¦ï¼´ï¼ãé¢æ£ã³ãµã¤ã³å¤æï¼ï¼¤ï¼£ï¼´ï¼ãã¾ ãã¯ã¢ãã£ãã¡ã¤ãé¢æ£ã³ãµã¤ã³å¤æï¼ï¼ï¼¤ï¼£ï¼´ï¼çã è¡ããã¨ã§æéè»¸ãå¨æ³¢æ°è»¸ã«å¤æãããã®ãããããª ããä¸è¨ï¼ï¼¤ï¼£ï¼´ã«ã¤ãã¦ã¯ãæç®ãæéé åã¨ãªã¢ã· ã³ã°ã»ãã£ã³ã»ã«ãåºç¤ã¨ãããã£ã«ã¿ã»ãã³ã¯è¨è¨ã ç¨ãããµããã³ãï¼å¤æç¬¦å·åã("Subband/Transform Coding Using Filter Bank Designs Based on Time Dom ain Aliasing Cancellation", J.P.Princen A.B.Bradla y, Univ.of Surrey Royal Melbourne Inst.of Tech. IC ASSP 1987) ã«è¿°ã¹ããã¦ãããFurther, as the spectrum transformation of the above-mentioned orthogonal transformation, for example, an input audio signal is divided into blocks in a predetermined unit time (frame), and discrete Fourier transform (DFT), discrete cosine transform (DCT), or modified for each block. There is one that transforms a time axis into a frequency axis by performing discrete cosine transform (MDCT) or the like. Regarding the MDCT, reference is made to the document "Subband / Transform Coding Using Filter Bank Design Based on Time Domain Aliasing Cancellation"("Subband / Transform Coding Using Filter Bank Designs Based on Time Dom ain Aliasing Cancellation ", JPPrincen ABBradla y, Univ.of Surrey Royal Melbourne Inst.of Tech. IC ASSP 1987).

ãï¼ï¼ï¼ï¼ããã®ããã«ãã£ã«ã¿ãã¹ãã¯ãã«å¤æã«ã ã£ã¦å¸¯åæ¯ã«åå²ãããä¿¡å·ãéååãããã¨ã«ããã éååéé³ãçºçããå¸¯åãå¶å¾¡ãããã¨ãã§ãããã ãããã¹ãã³ã°å¹æãªã©ã®æ§è³ªãå©ç¨ãã¦è´è¦çã«ãã é«è½çãªç¬¦å·åãè¡ããã¨ãã§ãããã¾ããããã§éå åãè¡ãåã«ãåå¸¯åæ¯ã«ä¾ãã°ãã®å¸¯åã«ãããä¿¡å· æåã®çµ¶å¯¾å¤ã®æå¤§å¤ã§æ£è¦åãè¡ãããã«ããã°ãã ãã«é«è½çãªç¬¦å·åãè¡ããã¨ãã§ãããIn this way, by quantizing the signal divided for each band by the filter and the spectrum conversion, It is possible to control the band in which the quantization noise is generated, and it is possible to perform auditory and more efficient encoding by utilizing the properties such as the so-called masking effect. Further, if the normalization is performed for each band, for example, with the maximum absolute value of the signal component in that band before the quantization is performed here, more efficient encoding can be performed.

ãï¼ï¼ï¼ï¼ãããã§ãå¨æ³¢æ°å¸¯ååå²ãããåå¨æ³¢æ°æ åãéååããå ´åã®å¨æ³¢æ°åå²å¹ã¨ãã¦ã¯ãä¾ãã°äºº éã®è´è¦ç¹æ§ãèæ®ããå¸¯åå¹ãç¨ãããã¨ãå¤ããã ãªãã¡ãä¸è¬ã«é«åã»ã©å¸¯åå¹ãåºããªããããªè¨çå¸¯ åï¼ã¯ãªãã£ã«ã«ãã³ãï¼ã¨å¼ã°ãã¦ããå¸¯åå¹ã§ããª ã¼ãã£ãªä¿¡å·ãè¤æ°ï¼ä¾ãã°ï¼ï¼ãã³ãï¼ã®å¸¯åã«åå² ãããã¨ããããã¾ãããã®æã®åå¸¯åæ¯ã®ãã¼ã¿ãç¬¦ å·åããéã«ã¯ãåå¸¯åæ¯ã«æå®ã®ãããéåãããã ã¯ãåå¸¯åæ¯ã«é©å¿çãªãããå²å½ã¦ï¼ãããã¢ãã±ã¼ ã·ã§ã³ï¼ã«ããç¬¦å·åãè¡ããããä¾ãã°ãä¸è¨ï¼ï¼¤ï¼£ ï¼´å¦çããã¦å¾ãããä¿æ°ãã¼ã¿ãä¸è¨ãããéåã«ã ã£ã¦ç¬¦å·åããéã«ã¯ãä¸è¨åãããã¯æ¯ã®ï¼ï¼¤ï¼£ï¼´å¦ çã«ããå¾ãããåå¸¯åæ¯ã®ï¼ï¼¤ï¼£ï¼´ä¿æ°ãã¼ã¿ã«å¯¾ã ã¦ãé©å¿çãªéåãããæ°ï¼é©å¿çãªéåãããæ°ï¼ã§ ç¬¦å·åãè¡ããããã¨ã«ãªããHere, as the frequency division width in the case of quantizing each frequency component divided into frequency bands, for example, a bandwidth considering human auditory characteristics is often used. That is, the audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth generally called a critical band in which the higher the band, the wider the bandwidth. Further, at the time of encoding the data for each band at this time, encoding is performed by predetermined bit allocation for each band or adaptive bit allocation (bit allocation) for each band. For example, the MDC When the coefficient data obtained by the T processing is encoded by the bit allocation, the number of adaptive allocation bits (adaptation to the MDCT coefficient data for each band obtained by the MDCT processing for each block) The encoding is performed with the number of distributed bits).

ãï¼ï¼ï¼ï¼ãä¸è¨ãããå²å½ææ³ï¼ãããéåææ³ï¼ã¨ ãã¦ã¯ãæ¬¡ã®ï¼ã¤ã®ææ³ãç¥ããã¦ãããThe following two methods are known as the bit allocation method (bit allocation method).

ãï¼ï¼ï¼ï¼ãä¾ãã°ãæç®ãé³å£°ä¿¡å·ã®é©å¿å¤æç¬¦å· åã("Adaptive Transform Coding ofSpeech Signals", IEEE Transactions of Acoustics, Speech, and Signa l Processing, vol.ASSP-25, No.4, August 1977) ã§ ã¯ãåå¸¯åæ¯ã®ä¿¡å·ã®å¤§ããã«åºã¥ãã¦ããããå²å½ã è¡ã£ã¦ãããFor example, the document "Adaptive Transform Coding of Speech Signals", IEEE Transactions of Acoustics, Speech, and Signa Processing, vol.ASSP-25, No.4, August 1977), bit allocation is performed based on the signal size of each band.

ãï¼ï¼ï¼ï¼ãã¾ããä¾ãã°æç®ãè¨çå¸¯åç¬¦å·åå¨ â ãã£ã¸ã¿ã«ã»ã¨ã³ã³ã¼ãã£ã³ã°ã»ãªãã»ãã¼ã»ããã¥ã¢ ã«ã»ãªã¯ã¯ã¤ã¢ã¡ã³ãã»ãªãã»ã¸ã»ãªã¼ãã£ããªã£ã»ã· ã¹ãã ã("The critical band coder --digital encodi ng of the perceptual requirements of the auditory system", M.A. Kransner MIT, ICASSP 1980) ã§ã¯ãè´ è¦ãã¹ãã³ã°ãå©ç¨ãããã¨ã§ãåå¸¯åæ¯ã«å¿è¦ãªä¿¡å· å¯¾éé³æ¯ãå¾ã¦åºå®çãªãããå²å½ãè¡ãææ³ãè¿°ã¹ã ãã¦ãããIn addition, for example, the document "Critical band encoder- Digital Encoding of Perceptual Requirements of the Auditory System "(" The critical band coder --digital encodi ng of the perceptual requirements of the auditory system ", MA Kransner MIT, ICASSP 1980) describes a method that uses auditory masking to obtain a necessary signal-to-noise ratio for each band and perform fixed bit allocation.

ãï¼ï¼ï¼ï¼ã[0012]

ãçºæãè§£æ±ºãããã¨ããèª²é¡ãã¨ããã§ãä¾ãã°ä¸è¿° ãããããªãµããã³ãã³ã¼ãã£ã³ã°çãç¨ãããªã¼ãã£ ãªä¿¡å·ã®é«è½çå§ç¸®ç¬¦å·åæ¹å¼ã«ããã¦ã¯ãäººéã®è´è¦ ä¸ã®ç¹æ§ãå©ç¨ãããªã¼ãã£ãªãã¼ã¿ãç´ï¼ï¼ï¼ã«å§ç¸® ãããããªæ¹å¼ãæ¢ã«å®ç¨åããã¦ããããªãããã®ãª ã¼ãã£ãªãã¼ã¿ãç´ï¼ï¼ï¼ã«å§ç¸®ããé«è½çç¬¦å·åæ¹å¼ ã¨ãã¦ã¯ãä¾ãã°ï¼ï¼¤ï¼SONYç¤¾åæ¨ãMini Discï¼è¦æ ¼ ã«ä½¿ç¨ããã¦ãããï¼¡ï¼´ï¼²ï¼¡ï¼£ï¼SONYç¤¾åæ¨ãAdaptive TRansform Acoustic Codingï¼ã¨å¼ã°ããæ¹å¼ããããBy the way, for example, in the high-efficiency compression encoding system for audio signals using the above-mentioned sub-band coding or the like, the human auditory characteristic is utilized to convert the audio data into about 1 A method of compressing to / 5 has already been put into practical use. As a high-efficiency encoding method for compressing the audio data to about 1/5, for example, ATRAC (a trademark of Sony Corporation, Adaptive, which is used in MD (a trademark of Sony Corporation, Mini Disc) standard). There is a method called TRansform Acoustic Coding).

ãï¼ï¼ï¼ï¼ãã¾ããéå¸¸ã®ãªã¼ãã£ãªæ©å¨ã®å ´åã®ã¿ãª ãããä¾ãã°æ ç»ãã£ã«ã æ åã·ã¹ãã ãé«åä½ãã¬ã ã¸ã§ã³ããããªãã¼ãã¬ã³ã¼ãããããªãã£ã¹ã¯ãã¬ã¼ ã¤çã®ã¹ãã¬ãªè¥ããã¯ãã«ããµã¦ã³ãé³é¿ã·ã¹ãã ã« ããã¦ã¯ãä¾ãã°ï¼ä¹è³ï¼ãã£ãã«çã®è¤æ°ãã£ãã«ã® ãªã¼ãã£ãªä¿¡å·ãããã¯é³å£°ä¿¡å·ãæ±ãããã«ãªãã¤ã¤ ããããã®å ´åã«ããã¦ãããããã¬ã¼ããåæ¸ããé« è½çç¬¦å·åãè¡ããã¨ãæã¾ãã¦ãããNot only in the case of ordinary audio equipment but also in stereo or multi-sound sound systems such as movie film projection systems, high-definition televisions, video tape recorders, video disc players, etc., for example, 4 to 8 channels, etc. In this case, audio signals or voice signals of a plurality of channels are being handled, and even in this case, it is desired to perform high efficiency coding that reduces the bit rate.

ãï¼ï¼ï¼ï¼ãããã¦ãããã«å ãã¦ãè¤æ°ãã£ãã«ã®ãª ã¼ãã£ãªä¿¡å·ãããã¯é³å£°ä¿¡å·ã¨ã¯å¥ã«ãæ¢åã®ã¹ãã¬ ãªè¥ããã¯é³é¿ã·ã¹ãã ã«ããã¦ãåçãå¯è½ã¨ããã ãã«ãä¾ãã°ï¼ä¹è³ï¼ãã£ãã«çã®è¤æ°ãã£ãã«ã®ãªã¼ ãã£ãªä¿¡å·ãããã¯é³å£°ä¿¡å·ããäºããã¦ã³ããã·ã³ã° ãªã©ã®ææ³ã«ãããä¾ãã°ï¼ãã£ãã«ã®ãã¼ã¿ã«å¤æ ããå¤æããããã®ï¼ãã£ãã«ã®ãã¼ã¿ãåã®è¤æ°ãã£ ãã«ã®ãªã¼ãã£ãªä¿¡å·ã¨ã¯å¥ã«è¨é²ããææ³ãåããã ãã¨ããããIn addition to this, in addition to audio signals or audio signals of a plurality of channels, in order to enable reproduction even in an existing stereo or acoustic system, audio signals of a plurality of channels such as 4 to 8 channels or In some cases, the audio signal may be converted into, for example, 2-channel data by a method such as down-mixing, and the converted 2-channel data may be recorded separately from the audio signals of the plurality of channels.

ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã¯ãåãã£ãã«ã®ä¿¡å·ãå§ç¸®ç¬¦å·å ããã¨ã¨ãã«ãåãã£ãã«ã®ä¿¡å·ã«å¯¾ãã¦ããã·ã³ã°ãª ã©ã®å¦çãæ½ãã¦å¾ãããä¿¡å·ãåæã«ç¬¦å·åãããã« ããã£ãã«ã®ç¬¦å·åè£ç½®ã®æ§æä¾ãç¤ºãã¦ãããFIG. 10 shows an example of the configuration of a multi-channel coding apparatus for compressing and coding the signals of the respective channels, and simultaneously coding the signals obtained by subjecting the signals of the respective channels to processing such as mixing. Is shown.

ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã«ç¤ºããç¬¦å·åè£ç½®ã¯ãå¥åç«¯åï¼ ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããå¥åãããåãã£ãã«ã®ä¿¡å·ã æ··åããï¼ãã£ãã«ã®ä¿¡å·ã«å¤æããæ··åå¨ï¼ï¼ï¼ãå ãã£ãã«ã®ä¿¡å·ãããããç¬¦å·åããç¬¦å·å¨ï¼ï¼ï¼ï½ä¹ è³ï¼ï¼ï¼ï½ãæ··åå¨ï¼ï¼ï¼ããä¾çµ¦ãããæ··åãããä¿¡ å·ãç¬¦å·åããç¬¦å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ãç¬¦å·å¨ï¼ï¼ ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããã®ç¬¦å·åãããä¿¡å·ããããã¹ã ãªã¼ã ã«å¤æãããã«ããã¬ã¯ãµï¼ï¼ï¼ãããã³ãã«ã ãã¬ã¯ãµï¼ï¼ï¼ããã®ãããã¹ããªã¼ã ãåºåããåºå ç«¯åï¼ï¼ï¼ããæ§æããã¦ãããThe encoder shown in FIG. 10 has an input terminal 1 Mixers 102 for mixing the signals of the respective channels input from 01a to 101e and converting them into signals of the two channels, encoders 105a to 105e for encoding the signals of the respective channels, and mixers supplied by the mixer 102. Encoders 105f and 105g for encoding the generated signal, encoder 10 It comprises a multiplexer 106 for converting the coded signals from 5a to 105g into a bit stream, and an output terminal 107 for outputting the bit stream from the multiplexer 106.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ãä»ãã¦ä¾ çµ¦ããããä¾ãã°ãã»ã³ã¿ï¼ï¼£ï¼ãã£ãã«ãã¬ãã ï¼ï¼¬ï¼ãã£ãã«ãã©ã¤ãï¼ï¼²ï¼ãã£ãã«ãã¬ãããµã©ã¦ ã³ãï¼ï¼³ï¼¬ï¼ãã£ãã«ãããã³ã©ã¤ããµã©ã¦ã³ãï¼ï¼³ ï¼²ï¼ãã£ãã«ã®åãªã¼ãã£ãªãã¼ã¿ã¯ãã·ã³ã°ã«ãã£ã ã«ç¨ç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ãã ãããããç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã§ã¯ãå¥åä¿¡å· ã«å¯¾ãã¦ç¬¦å·åãè¡ãããç¬¦å·åããããã¼ã¿ããã«ã ãã¬ã¯ãµï¼ï¼ï¼ã«ä¾çµ¦ãããããã«ããã¬ã¯ãµï¼ï¼ï¼ã§ ã¯åãã£ãã«ã®ç¬¦å·åããããã¼ã¿ãï¼ã¤ã®ãããã¹ã ãªã¼ã ã«ããããã®ãããã¹ããªã¼ã ãåºåç«¯åï¼ï¼ï¼ ããåºåããããFor example, a center (C) channel, a left (L) channel, a right (R) channel, a left surround (SL) channel, and a right surround (S) supplied through the input terminals 101a to 101e. Each audio data of the R) channel is supplied to each of the single-channel encoders 105a to 105e. In these encoders 105a to 105e, the input signal is encoded, and the encoded data is supplied to the multiplexer 106. The multiplexer 106 converts the encoded data of each channel into one bit stream, and this bit stream is output to the output terminal 107. Output from

ãï¼ï¼ï¼ï¼ãä¸æ¹ãå¥åç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ãã å¥åããããªã¼ãã£ãªãã¼ã¿ã¯ãæ··åå¨ï¼ï¼ï¼ã«ãä¾çµ¦ ãããä¾ãã°ãæ¬¡ã®ãããªå²åã§æ··åå¦çãæ½ãããï¼ ãã£ãã«ã®ãã¼ã¿ã¨ãã¦åæ§æããããOn the other hand, the audio data input from the input terminals 101a to 101e is also supplied to the mixer 102, and, for example, the mixing processing is performed at the following ratio, Reconstructed as channel data.

ãï¼ï¼ï¼ï¼ãããªãã¡ãä¸æ¹ã®ãã£ãã«ï¼ï¼¬_mixãã£ã ã«ï¼ã¯ãï¼¬ãã£ãã«ãï¼²ãã£ãã«ãï¼£ãã£ãã«ãï¼³ï¼¬ã ã£ãã«ãããã³ï¼³ï¼²ãã£ãã«ããï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ããã³ï¼ï¼ï¼ï¼ ï¼ï¼ã®å²åã§æ··åããããä»æ¹ã®ãã£ãã«ï¼ï¼²_mixãã£ ãã«ï¼ã¯ãï¼¬ãã£ãã«ãï¼²ãã£ãã«ãï¼£ãã£ãã«ãï¼³ï¼¬ ãã£ãã«ãããã³ï¼³ï¼²ãã£ãã«ããï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ããã³ï¼ï¼ï¼ ï¼ï¼ï¼ã®å²åã§æ··åããããThat is, in one channel (L _mix channel), the L channel, the R channel, the C channel, the SL channel, and the SR channel are 1.0000 and 0.0. 000, 0.7071, 0.7071, and 0.00 It is mixed in the ratio of 00. The other channel (R _mix channel) is L channel, R channel, C channel, SL Channels and SR channels are 0.0000, 1. 0000, 0.7071, 0.0000, and 0.7 071 mixed.

ãï¼ï¼ï¼ï¼ããããæ··åå¦çã«ãã£ã¦åæ§æãããï¼ã ã£ãã«ã®ãã¼ã¿ã¯ãç¬¦å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ããã ãä¾çµ¦ãããç¬¦å·åããããç¬¦å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ ã«ããã¦ç¬¦å·åããããã¼ã¿ã¯ãã«ããã¬ã¯ãµï¼ï¼ï¼ã« ä¾çµ¦ãããä¸è¨ï¼ãã£ãã«ã®ç¬¦å·åããããã¼ã¿ãï¼ã¤ ã®ãããã¹ããªã¼ã ã«ãããå¾ãåºåç«¯åï¼ï¼ï¼ããåº åããããThe two-channel data reconstructed by these mixing processes are supplied to the encoders 105f and 105g and encoded. Encoders 105f and 105g The encoded data in 1 is supplied to the multiplexer 106, and the encoded data of the above two channels is converted into one bit stream, and then output from the output terminal 107.

ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã¯ãåãã£ãã«æ¯ã«å¾©å·åãè¡ãã ã«ããã£ãã«ã®å¾©å·åè£ç½®ã®æ§æä¾ãç¤ºãã¦ãããã¾ ããå³ï¼ï¼ã¯ãè¤æ°ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ ãã¯å¨é¨ãæ··åå¦çãããä¿¡å·ã®å¾©å·åãè¡ãï¼ãã£ã ã«ã®å¾©å·åè£ç½®ã®æ§æä¾ãç¤ºãã¦ãããFIG. 11 shows an example of the configuration of a multi-channel decoding device that performs decoding for each channel. Further, FIG. 12 shows a configuration example of a two-channel decoding device that decodes a signal in which a part or all of digital signals of a plurality of channels are mixed.

ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã«ç¤ºããå¾©å·åè£ç½®ã¯ãå¥åç«¯åï¼ ï¼ï¼ããå¥åããããããã¹ããªã¼ã ãåãã£ãã«ã®ç¬¦ å·åãã¼ã¿ã«åå²ããããã«ããã¬ã¯ãµï¼ï¼ï¼ãããã« ããã¬ã¯ãµï¼ï¼ï¼ããã®åãã£ãã«ã«å¯¾å¿ããç¬¦å·åã ã¼ã¿ãããããå¾©å·ããå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã ããã³å¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦å¾©å·åãã ãä¿¡å·ãåºåããåºåç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããæ§ æããã¦ãããThe decoding device shown in FIG. 11 has an input terminal 1 Demultiplexer 132 that divides the bit stream input from 31 into encoded data of each channel, decoders 133a to 133e that respectively decode encoded data corresponding to each channel from demultiplexer 132, And output terminals 136a to 136e for outputting the signals decoded by the decoders 133a to 133e.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ãä»ãã¦ä¾çµ¦ãããç¬¦å·å ããããããã¹ããªã¼ã ãã¼ã¿ã¯ãããã«ããã¬ã¯ãµï¼ ï¼ï¼ã«ããã¦åãã£ãã«ã«å¯¾å¿ããç¬¦å·åãã¼ã¿ã«åå² ãããå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ãã ããå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ä¾çµ¦ãããç¬¦å·åã ã¼ã¿ã¯ãå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦ãããã å¾©å·åãããå¾©å·åããããªã¼ãã£ãªãã¼ã¿ã¯ãåºåç«¯ åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããããããåºåããããThe encoded bit stream data supplied through the input terminal 131 is the demultiplexer 1 At 32, the data is divided into encoded data corresponding to each channel and is supplied to each of the decoders 133a to 133e. The encoded data supplied to the decoders 133a to 133e are decoded in the decoders 133a to 133e, respectively, and the decoded audio data are output from the output terminals 136a to 136e, respectively.

ãï¼ï¼ï¼ï¼ãã¾ããå³ï¼ï¼ã«ç¤ºããå¾©å·åè£ç½®ã¯ãå¥å ç«¯åï¼ï¼ï¼ããå¥åãããç¬¦å·åããããããã¹ããªã¼ ã ãã¼ã¿ãæ··åå¦çãããï¼ãã£ãã«ã®ç¬¦å·åãã¼ã¿ã« åå²ããããã«ããã¬ã¯ãµï¼ï¼ï¼ãæ··åå¦çãããï¼ã ã£ãã«ã®ç¬¦å·åãã¼ã¿ãããããå¾©å·åããå¾©å·å¨ï¼ï¼ ï¼ï½ï¼ï¼ï¼ï¼ï½ããæ§æããããIn the decoding apparatus shown in FIG. 12, the demultiplexer 132 for dividing the encoded bit stream data input from the input terminal 131 into the mixed two-channel encoded data is subjected to the mixing processing. Decoder 13 for decoding each of the two-channel encoded data It is composed of 3f and 133g.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ãä»ãã¦ä¾çµ¦ãããç¬¦å·å ããããããã¹ããªã¼ã ãã¼ã¿ã¯ãããã«ããã¬ã¯ãµï¼ ï¼ï¼ã«ããã¦æ··åå¦çãããï¼ãã£ãã«ã®ç¬¦å·åãã¼ã¿ ã«åå²ããããåå²ãããï¼ãã£ãã«ã®ç¬¦å·åãã¼ã¿ ã¯ãå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ãããã å¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ä¾çµ¦ãããç¬¦å·åãã¼ã¿ã¯ ããããå¾©å·åãããå¾©å·åããããªã¼ãã£ãªãã¼ã¿ ã¯ãåºåç«¯åï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ããããããåºåãã ããThe encoded bit stream data supplied through the input terminal 131 is the demultiplexer 1 In 32, the mixed data is divided into two-channel encoded data. The divided 2-channel encoded data is supplied to the decoders 133f and 133g, respectively. The encoded data supplied to the decoders 133f and 133g are decoded, and the decoded audio data is output from the output terminals 136f and 136g, respectively.

ãï¼ï¼ï¼ï¼ããã®ããã«ãè¤æ°ãã£ãã«ã®ãªã¼ãã£ãªä¿¡ å·ãããã¯é³å£°ä¿¡å·ã¨ã¯å¥ã«ããããã®ä¿¡å·ãæ··åå¦ç ãããä¾ãã°ï¼ãã£ãã«ã®ä¿¡å·ãè¨é²ãããããªå ´åã« ããã¦ã¯ãããã«ãããã¬ã¼ããåæ¸ããé«è½çç¬¦å·å ãè¡ããã¨ãæã¾ãã¦ãããAs described above, in the case of recording, for example, a two-channel signal in which these signals are mixed and processed separately from the audio signal or the voice signal of a plurality of channels, a high efficiency code for further reducing the bit rate. It is desired to implement

ãï¼ï¼ï¼ï¼ãããããªãããå³ï¼ï¼ï¼å³ï¼ï¼ãããã³å³ ï¼ï¼ã«ç¤ºãããããªæ§æã®ç¬¦å·åè£ç½®ããã³å¾©å·åè£ç½® ã«ããã¦ã¯ãä¸è¨ãã£ã¸ã¿ã«ãªã¼ãã£ãªãã¼ã¿ãç´ï¼ï¼ ï¼ã«å§ç¸®ããé«è½çç¬¦å·åæ¹å¼ã¯ãã·ã³ã°ã«ãã£ãã«ç¨ ã®ç¬¦å·åæ¹å¼ã§ããããããç¨ãã¦ãã«ããã£ãã«ãªã¼ ãã£ãªãã¼ã¿ãç¬¦å·åããå ´åã«ã¯ããã£ãã«éã®ãã¼ ã¿ã®ä¾åé¢ä¿ããåãã£ãã«ã®ãã¼ã¿ç¹æ§ããã©ã¼ãã ãç¹æ§ã¨ãã£ãè¦ç´ ãç¨ããå¹æçãªãã¼ã¿ç¬¦å·åå¦ç ããããã¨ãã§ããªããããªãã¡ãï¼£ãã£ãã«ã¨ï¼¬ãã£ ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãä¼¼ã¦ããã¨ãããããã¯ãã ã©ã¼ãããçã«å·¦å³ã®ãµã©ã¦ã³ããã£ãã«ã®ãªã¼ãã£ãª ãã¼ã¿ãä¼¼ã¦ããã¨ãã£ãè¦ç´ ãå©ç¨ãããã¨ãã§ããª ããHowever, in the encoding device and the decoding device having the configurations shown in FIGS. 10, 11 and 12, the digital audio data is reduced to about 1 / The high-efficiency coding method of compressing to 5 is a coding method for a single channel, and when coding multi-channel audio data using this, data dependence between channels and data of each channel It is not possible to perform effective data coding processing using elements such as characteristics and format characteristics. That is, it is impossible to use an element that the audio data of the C channel and the audio data of the L channel are similar to each other, or the audio data of the left and right surround channels are similar in terms of format.

ãï¼ï¼ï¼ï¼ãæ¬çºæã¯ããã®ãããªç¶æ³ã«éã¿ã¦ãªãã ããã®ã§ããããã«ããã£ãã«ã®ä¿¡å·ã®å§ç¸®ç¬¦å·åã«ã ãã¦ããã«ããã£ãã«éã®ãã£ã¸ã¿ã«ãã¼ã¿ã®ç¸é¢é¢ä¿ ã®ç¨åº¦ã«é©ããé«å§ç¸®ããæ¢åã®ç¬¦å·å¨ããã³å¾©å·å¨ã å©ç¨ãã¦å®ç¾å¯è½ã«ãããã®ã§ãããThe present invention has been made in view of such a situation, and in the compression coding of multi-channel signals, high compression suitable for the degree of correlation of digital data between multi-channels has been achieved. It can be realized by using an encoder and a decoder.

ãï¼ï¼ï¼ï¼ã[0029]

ãèª²é¡ãè§£æ±ºããããã®ææ®µãè«æ±é ï¼ã«è¨è¼ã®ç¬¦å·å æ¹æ³ã¯ããã£ã¸ã¿ã«ä¿¡å·ã®ç¹æ§ããã³åçç°å¢ã«å¯¾å¿ã ã¦ãå°ãªãã¨ãï¼ã¤ã®ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ ã¾ãã¯å¨é¨ã®å¨æ³¢æ°å¸¯åãå°ãªãã¨ãï¼ã¤ã®æ··åãã£ã ã«ã«æ··åãããã£ã¸ã¿ã«ä¿¡å·ãããæ··åãã£ãã«ã®æ··å ãã£ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦åç¾ããä¿¡å·ãé¤ãããåå¥ã« ç¬¦å·åããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãæ½åºããæ··åã ã£ã¸ã¿ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·å ãããã¨ãç¹å¾´ã¨ãããAccording to a first aspect of the present invention, there is provided at least one frequency band of at least one part of a digital signal of at least one channel corresponding to a characteristic of the digital signal and a reproduction environment. Individual mixed coded digital signals that are mixed into two mixed channels and are not encoded by the mixed digital signals of the mixed channels are reproduced. It is characterized by

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ã«è¨è¼ã®ç¬¦å·åè£ç½®ã¯ããã£ã¸ã¿ ã«ä¿¡å·ã®ç¹æ§ããã³åçç°å¢ã«å¯¾å¿ãã¦ãå°ãªãã¨ãï¼ ã¤ã®ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ã®å¨æ³¢ æ°å¸¯åãå°ãªãã¨ãï¼ã¤ã®æ··åãã£ãã«ã«æ··åããæ··å ææ®µã¨ããã£ã¸ã¿ã«ä¿¡å·ãããæ··åãã£ãã«ã®æ··åãã£ ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦åç¾ããä¿¡å·ãé¤ãããåå¥ã«ç¬¦å· åããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãæ½åºããæ½åºææ®µ ã¨ãæ··åãã£ã¸ã¿ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡ å·ãç¬¦å·åããç¬¦å·åææ®µã¨ãåãããã¨ãç¹å¾´ã¨ã ããAccording to a second aspect of the present invention, the encoding device has at least 1 in accordance with the characteristics of the digital signal and the reproduction environment. Mixing means for mixing part or all of the frequency bands of the digital signals of one channel into at least one mixing channel, and individually encoding the digital signal excluding the signal reproduced by the mixed digital signal of the mixing channel It is characterized in that it comprises extraction means for extracting the encoded digital signal and encoding means for encoding the mixed digital signal and the individual encoded digital signal.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åæ¹æ³ã¯ãç¬¦å·å æå ±ã«åºã¥ãã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ãå¾©å·åããå¾©å· åãããæ··åãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ãç¨ã ã¦ããã£ã¸ã¿ã«ä¿¡å·ãå¾©åããããã®å¾©åç¨ãã¼ã¿ãä½ æããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ã¨å¾©åç¨ãã¼ã¿ãåæ ãããã£ã¸ã¿ã«ä¿¡å·ãå¾©åãããã¨ãç¹å¾´ã¨ãããThe decoding method according to claim 11 is for decoding the mixed digital signal based on the coded information and for restoring the digital signal by using a part or all of the decoded mixed digital signal. It is characterized in that the data for restoration is created, the individually encoded digital signal and the data for restoration are combined, and the digital signal is restored.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åè£ç½®ã¯ãç¬¦å·å æå ±ã«åºã¥ãã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ãå¾©å·åããå¾©å· åææ®µã¨ãå¾©å·åææ®µã«ããå¾©å·åãããæ··åãã£ã¸ã¿ ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ãç¨ãã¦ããã£ã¸ã¿ã«ä¿¡å·ãå¾© åããããã®å¾©åç¨ãã¼ã¿ãä½æããå¾©åç¨ãã¼ã¿ä½æ ææ®µã¨ãå¾©åç¨ãã¼ã¿ä½æææ®µã«ããä½æãããå¾©åç¨ ãã¼ã¿ã¨ãåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãåæãããã£ã¸ ã¿ã«ä¿¡å·ãå¾©åããå¾©åææ®µã¨ãåãããã¨ãç¹å¾´ã¨ã ããAccording to a twelfth aspect of the present invention, there is provided a decoding device for decoding the mixed digital signal based on the coded information, and a part or all of the mixed digital signal decoded by the decoding means. Using the restoring data creating means for creating the restoring data for restoring the digital signal, the restoring data created by the restoring data creating means, and the individual encoded digital signal are combined to restore the digital signal. And a restoring means for performing the restoration.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®è¨é²åªä½ã¯ãè¤æ°ãã£ ãã«ã®å°ãªãã¨ãï¼ã¤ã®ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸ é¨ã¾ãã¯å¨é¨ã®å¨æ³¢æ°å¸¯åããå°ãªãã¨ãï¼ã¤ã®æ··åã ã£ãã«ã«æ··åããããã£ã¸ã¿ã«ä¿¡å·ã®å¨æ³¢æ°ç¹æ§ããã³ åçç°å¢ã«å¯¾å¿ãã¦ããã£ã¸ã¿ã«ä¿¡å·ãããæ··åãã£ã ã«ã®æ··åãã£ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦åç¾ããä¿¡å·ãé¤ã ããåå¥ã«ç¬¦å·åããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãæ½åº ãããæå®ã®ç¬¦å·åæå ±ã«åºã¥ãã¦ç¬¦å·åãããæ··åã ã£ã¸ã¿ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãè¨é²ã ãããã¨ãç¹å¾´ã¨ãããIn the recording medium according to the eighteenth aspect, a part or all of the frequency bands of the digital signals of at least one channel of the plurality of channels are mixed in at least one mixing channel, and the frequency characteristics of the digital signal and the reproduction environment. Corresponding to, the individual coded digital signal to be individually coded is extracted from the digital signal excluding the signal reproduced by the mixed digital signal of the mixed channel, and is mixed based on the predetermined coded information. A digital signal and an individually encoded digital signal are recorded.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ã«è¨è¼ã®ç¬¦å·åæ¹æ³ã«ããã¦ã¯ã ãã£ã¸ã¿ã«ä¿¡å·ã®ç¹æ§ããã³åçç°å¢ã«å¯¾å¿ãã¦ãå°ãª ãã¨ãï¼ã¤ã®ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨ é¨ã®å¨æ³¢æ°å¸¯åãå°ãªãã¨ãï¼ã¤ã®æ··åãã£ãã«ã«æ··å ããããã£ã¸ã¿ã«ä¿¡å·ãããæ··åãã£ãã«ã®æ··åãã£ã¸ ã¿ã«ä¿¡å·ã«ãã£ã¦åç¾ããä¿¡å·ãé¤ããããåå¥ã«ç¬¦å· åããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãæ½åºãããæ··åãã£ ã¸ã¿ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·åã ãããå¾ã£ã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦åã®ãã£ ã¸ã¿ã«ä¿¡å·ãåç¾ãããã¨ãã§ãããã£ã¸ã¿ã«ä¿¡å·ã®å§ ç¸®çãé«ãããã¨ãã§ãããIn the encoding method according to claim 1, Depending on the characteristics of the digital signal and the reproduction environment, some or all of the frequency bands of the digital signal of the at least one channel are mixed into the at least one mixed channel and reproduced from the digital signal by the mixed digital signal of the mixed channel. The individual coded digital signals to be coded separately from which the signals have been removed are extracted, and the mixed digital signal and the individual coded digital signals are coded. Therefore, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ã«è¨è¼ã®ç¬¦å·åè£ç½®ã«ããã¦ã¯ã ãã£ã¸ã¿ã«ä¿¡å·ã®ç¹æ§ããã³åçç°å¢ã«å¯¾å¿ãã¦ãæ··å ææ®µã«ãããå°ãªãã¨ãï¼ã¤ã®ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡ å·ã®ä¸é¨ã¾ãã¯å¨é¨ã®å¨æ³¢æ°å¸¯åãå°ãªãã¨ãï¼ã¤ã®æ·· åãã£ãã«ã«æ··åãããæ½åºææ®µã«ããããã£ã¸ã¿ã«ä¿¡ å·ãããæ··åãã£ãã«ã®æ··åãã£ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦å ç¾ããä¿¡å·ãé¤ããããåå¥ã«ç¬¦å·åããåå¥ç¬¦å·åã ã£ã¸ã¿ã«ä¿¡å·ãæ½åºãããç¬¦å·åææ®µã«ãããæ··åãã£ ã¸ã¿ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·åã ãããå¾ã£ã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦åã®ãã£ ã¸ã¿ã«ä¿¡å·ãåç¾ãããã¨ãã§ãããã£ã¸ã¿ã«ä¿¡å·ã®å§ ç¸®çãé«ãããã¨ãã§ãããIn the encoding device according to the second aspect, Depending on the characteristics of the digital signal and the reproduction environment, some or all of the frequency bands of the digital signal of the at least one channel are mixed by the mixing means into the at least one mixing channel, and the mixing means mixes the digital signal by the extraction means. The individual coded digital signal to be individually coded is extracted by removing the signal reproduced by the mixed digital signal of the channel, and the mixed digital signal and the individual coded digital signal are coded by the coding means. Therefore, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åæ¹æ³ã«ããã¦ ã¯ãç¬¦å·åæå ±ã«åºã¥ãã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ãå¾©å· åãããå¾©å·åãããæ··åãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯ å¨é¨ãç¨ãã¦ããã£ã¸ã¿ã«ä¿¡å·ãå¾©åããããã®å¾©åç¨ ãã¼ã¿ãä½æãããåå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ã¨å¾©åç¨ ãã¼ã¿ãåæããããã£ã¸ã¿ã«ä¿¡å·ãå¾©åããããå¾ã£ ã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦åã®ãã£ã¸ã¿ã«ä¿¡å· ãå¾©åãããã¨ãã§ãããIn the decoding method according to the eleventh aspect, the mixed digital signal is decoded based on the coded information, and the digital signal is restored by using a part or all of the decoded mixed digital signal. Restoration data is created, and the individually encoded digital signal and the restoration data are combined to restore the digital signal. Therefore, the original digital signal can be restored based on the mixed digital signal.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åè£ç½®ã«ããã¦ ã¯ãå¾©å·åææ®µã«ãããç¬¦å·åæå ±ã«åºã¥ãã¦ãæ··åã ã£ã¸ã¿ã«ä¿¡å·ãå¾©å·åãããå¾©åç¨ãã¼ã¿ä½æææ®µã«ã ããæ··åãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ãç¨ãã¦ãã ã£ã¸ã¿ã«ä¿¡å·ãå¾©åããããã®å¾©åç¨ãã¼ã¿ãä½æã ããå¾©åææ®µã«ãããå¾©åç¨ãã¼ã¿ã¨åå¥ç¬¦å·åãã£ã¸ ã¿ã«ä¿¡å·ãåæããããã£ã¸ã¿ã«ä¿¡å·ãå¾©åããããå¾ ã£ã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦ãåã®ãã£ã¸ã¿ã« ä¿¡å·ãå¾©åãããã¨ãã§ãããIn the decoding device according to the twelfth aspect of the present invention, the decoding means decodes the mixed digital signal based on the coded information, and the restoration data creating means makes part or all of the mixed digital signal. Is used to create restoration data for restoring the digital signal, and the restoration means synthesizes the restoration data and the individual encoded digital signal to restore the digital signal. Therefore, the original digital signal can be restored based on the mixed digital signal.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®è¨é²åªä½ã«ããã¦ã¯ã æå®ã®ç¬¦å·åæå ±ã«åºã¥ãã¦ç¬¦å·åãããæ··åãã£ã¸ã¿ ã«ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãè¨é²ãããã å¾ã£ã¦ãé«è½çç¬¦å·åããããã£ã¸ã¿ã«ä¿¡å·ãè¨é²ãã ãããåçãããã¨ãã§ãããIn the recording medium according to claim 18, The mixed digital signal and the individually encoded digital signal encoded based on predetermined encoded information are recorded. Therefore, a high efficiency coded digital signal is recorded, You can play it.

ãï¼ï¼ï¼ï¼ã[0039]

ãçºæã®å®æ½ã®å½¢æãä»¥ä¸ãæ¬çºæã®å®æ½ä¾ã«ã¤ãã¦å³ é¢ãåç§ããªããèª¬æãããBEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings.

ãï¼ï¼ï¼ï¼ãå³ï¼ã¯ãæ¬çºæã®ç¬¦å·åæ¹æ³ãé©ç¨ããã ç¬¦å·åè£ç½®ã®ä¸å®æ½ä¾ã®æ§æãç¤ºããããã¯å³ã§ããã åå³ã«ç¤ºããããã«ãæ¬çºæã®ç¬¦å·åè£ç½®ã¯ãã·ã³ã°ã« ãã£ãã«ç¨ã®å§ç¸®ç¬¦å·åå¨ï¼ä¾ãã°ä¸è¿°ããããããï¼¡ ï¼´ï¼²ï¼¡ï¼£æ¹å¼ã®ç¬¦å·åå¨ï¼ãè¤æ°ç¨ãã¦ããã«ããã£ã ã«ã®å§ç¸®ç¬¦å·åãå®ç¾ãããã®ã§ãããFIG. 1 is a block diagram showing the configuration of an embodiment of an encoding device to which the encoding method of the present invention is applied. As shown in the figure, the encoding apparatus of the present invention is a single-channel compression encoder (for example, the so-called A described above). A plurality of TRAC encoders are used to realize multi-channel compression encoding.

ãï¼ï¼ï¼ï¼ãããªãã¡ãå³ï¼ã«ç¤ºããç¬¦å·åè£ç½®ã¯ãè¤ æ°ãã£ãã«ã®ãã£ã¸ã¿ã«ãªã¼ãã£ãªä¿¡å·ãç¬¦å·åããç¬¦ å·åããããã£ã¸ã¿ã«ãªã¼ãã£ãªä¿¡å·ã¨å±ã«ç¬¦å·åã®ã ã©ã¡ã¼ã¿æå ±ãåºåããè£ç½®ã§ãããæ··åå¨ï¼ï¼ï¼ï¼æ·· åææ®µï¼ã¯ãè¤æ°ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ã ã¯å¨é¨ãæ··åãããå¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ ï¼ï½ï¼æ½åºææ®µï¼ã¯ãæ··åå¨ï¼ï¼ï¼ããä¾çµ¦ãããæ··å ããããã£ã¸ã¿ã«ä¿¡å·ï¼æ··åå¦çãã¼ã¿ï¼ã¨ãå¥åç«¯å ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ããããªã¼ãã£ãªãã¼ã¿ ãæ¯è¼ãããã®ãªã¼ãã£ãªãã¼ã¿ãããæ··åå¦çãã¼ã¿ ã¨ã¯å¥ã«ãåå¥ã«ç¬¦å·åããæ¹ãé©åã§ããã¨å¤æãã ãªã¼ãã£ãªãã¼ã¿ï¼åå¥ç¬¦å·åãã¼ã¿ï¼ãæ½åºããç¬¦å· åå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ï¼ç¬¦å·åææ®µï¼ã«ä¾çµ¦ããã ã¾ããæ··åå¨ï¼ï¼ï¼ããä¾çµ¦ãããæ··åå¦çãã¼ã¿ã«ã ã£ã¦åã®ãªã¼ãã£ãªãã¼ã¿ãåç¾ããæ¹ãæå¹ã§ããã¨ å¤æãããæ®ãã®ãªã¼ãã£ãªãã¼ã¿ã«ã¤ãã¦ã¯ãããã åç¾ããããã®åç¾ç¨ã®ãã©ã¡ã¼ã¿ãçæãããã«ãã ã¬ã¯ãµï¼ï¼ï¼ã«ä¾çµ¦ããããã«ãªããã¦ãããThat is, the encoding device shown in FIG. 1 is a device for encoding digital audio signals of a plurality of channels and outputting encoding parameter information together with the encoded digital audio signals. Means) mixes some or all of the digital signals of the plurality of channels. Processed data extractors 103a to 10 3e (extracting means) compares the mixed digital signal (mixing processing data) supplied from the mixer 102 with the audio data supplied from the input terminals 101a to 101e, and from this audio data, the mixing processing data is obtained. Separately, audio data (individually encoded data) determined to be more suitable for individual encoding is extracted and supplied to the encoders 105a to 105e (encoding means). Further, for the remaining audio data for which it is judged that it is more effective to reproduce the original audio data by the mixed processing data supplied from the mixer 102, a reproduction parameter for reproducing the remaining audio data is generated, It is adapted to be supplied to the multiplexer 106.

ãï¼ï¼ï¼ï¼ãç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãå¦çãã¼ ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ãããåå¥ç¬¦å· åãã¼ã¿ãç¬¦å·åããåºåãããç¬¦å·å¨ï¼ï¼ï¼ï½ããã³ ï¼ï¼ï¼ï½ã¯ãæ··åå¨ï¼ï¼ï¼ããä¾çµ¦ãããæ··åå¦çãã¼ ã¿ãç¬¦å·åããåºåããããã«ãªããã¦ãããThe encoders 105a to 105e encode the individual encoded data supplied from the processed data extractors 103a to 103e and output the encoded data. The encoders 105f and 105g are configured to encode and output the mixed processing data supplied from the mixer 102.

ãï¼ï¼ï¼ï¼ããã«ããã¬ã¯ãµï¼ï¼ï¼ã¯ãç¬¦å·å¨ï¼ï¼ï¼ï½ ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ãããç¬¦å·åãããç¬¦å·åãã¼ã¿ ããã³åç¾ç¨ã®ãã©ã¡ã¼ã¿ãï¼ã¤ã®ãããã¹ããªã¼ã ã« ããå¾ãåºåç«¯åï¼ï¼ï¼ããåºåããããã«ãªããã¦ã ããThe multiplexer 106 has an encoder 105a. To 105g, the encoded data and the reproduction parameter supplied from the first to 105 g are converted into one bit stream, which is then output from the output terminal 107.

ãï¼ï¼ï¼ï¼ãããã§ã¯ãã»ã³ã¿ï¼ï¼£ï¼ãã£ãã«ãã¬ãã ï¼ï¼¬ï¼ãã£ãã«ãã©ã¤ãï¼ï¼²ï¼ãã£ãã«ãã¬ãããµã©ã¦ ã³ãï¼ï¼³ï¼¬ï¼ãã£ãã«ãããã³ã©ã¤ããµã©ã¦ã³ãï¼ï¼³ ï¼²ï¼ãã£ãã«ã®ï¼ãã£ãã«ãªã¼ãã£ãªãã¼ã¿ããå¥åç«¯ åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããããããå¥åããããã®ã¨ ãã¦èª¬æãããã¾ããæ··åå¨ï¼ï¼ï¼ã¯ããããï¼ãã£ã ã«ãªã¼ãã£ãªãã¼ã¿ã«æ··åå¦çãæ½ãã¦ãï¼ãã£ãã«ãª ã¼ãã£ãªãã¼ã¿ãä½æããåºåãããã®ã¨ãããHere, the center (C) channel, the left (L) channel, the right (R) channel, the left surround (SL) channel, and the right surround (S). The description will be made assuming that the R channel 5-channel audio data is input from the input terminals 101a to 101e, respectively. Further, the mixer 102 performs a mixing process on these 5-channel audio data to create 2-channel audio data, and outputs the 2-channel audio data.

ãï¼ï¼ï¼ï¼ãå³ï¼ã«ç¤ºããå®æ½ä¾ã«ããã¦ã¯ãå¥åç«¯å ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ãä»ãã¦ä¾çµ¦ãããã»ã³ã¿ ï¼ï¼£ï¼ãã¬ããï¼ï¼¬ï¼ãã©ã¤ãï¼ï¼²ï¼ãã¬ãããµã©ã¦ã³ ãï¼ï¼³ï¼¬ï¼ãããã³ã©ã¤ããµã©ã¦ã³ãï¼ï¼³ï¼²ï¼ã®åãã£ ãã«ã®ãªã¼ãã£ãªãã¼ã¿ã¯ãã¾ãæ··åå¨ï¼ï¼ï¼ã«å¥åã ãããæ··åå¨ï¼ï¼ï¼ã¯ãåºæ¬çã«ã¯å³ï¼ï¼ã«ç¤ºããæ··å å¨ï¼ï¼ï¼ã¨åæ§ã®ãã®ã§ããã®ã§ãã®è©³ç´°ãªèª¬æã¯çç¥ ããããå³ï¼ï¼ãåç§ãã¦ä¸è¿°ããããã«ãå¥åç«¯åï¼ ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããå¥åãããåãã£ãã«ã®ãªã¼ã ã£ãªãã¼ã¿ãæ··åãããï¼ãã£ãã«ã®ãã¼ã¿ã¨ãã¦åæ§ æããããæ··åå¨ï¼ï¼ï¼ã®åºåã¯ãç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ ï¼ï¼ï¼ï½ã«ä¾çµ¦ãããã¨ã¨ãã«ãå¦çãã¼ã¿æ½åºå¨ï¼ï¼ ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ãä¾çµ¦ããããIn the embodiment shown in FIG. 1, the center (C), the left (L), the right (R), the left surround (SL), and the right surround (SR) supplied via the input terminals 101a to 101e. The audio data of each channel is first input to the mixer 102. The mixer 102 is basically the same as the mixer 102 shown in FIG. 10, and a detailed description thereof will be omitted. However, as described above with reference to FIG. The audio data of each channel input from 01a to 101e are mixed and reconfigured as two-channel data. The output of the mixer 102 is supplied to the encoders 105f to 105g and the processed data extractor 10 It is also supplied to 3a to 103e.

ãï¼ï¼ï¼ï¼ãå¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã« ããã¦ã¯ãåãã£ãã«æ¯ã«æ··åå¨ï¼ï¼ï¼ããã®åºåã¨å¥ åç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ããããªã¼ãã£ãª ãã¼ã¿ã¨ãæ¯è¼ãããåå¥ã«ç¬¦å·åããã®ãé©åã¨å¤æ ããããªã¼ãã£ãªãã¼ã¿ãæ½åºãããç¬¦å·å¨ï¼ï¼ï¼ï½ä¹ è³ï¼ï¼ï¼ï½ã«åºåããããä¸æ¹ãæ··åå¨ï¼ï¼ï¼ããã®åº åã«ãã£ã¦åã®ãªã¼ãã£ãªãã¼ã¿ãåç¾ããæ¹ãæå¹ã¨ å¤æãããæ®ãã®ãªã¼ãã£ãªãã¼ã¿ã«ã¤ãã¦ã¯ãåç¾ç¨ ã®ãã©ã¡ã¼ã¿ãçæããããã«ããã¬ã¯ãµï¼ï¼ï¼ã«ä¾çµ¦ ããããå¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã®è©³ç´° ãªæ§æããã³åä½ã«ã¤ãã¦ã¯å¾è¿°ãããIn the processed data extractors 103a to 103e, the output from the mixer 102 is compared with the audio data supplied from the input terminals 101a to 101e for each channel, and it is judged appropriate to individually encode. The audio data thus extracted is extracted and output to the encoders 105a to 105e. On the other hand, for the remaining audio data for which it is determined that it is more effective to reproduce the original audio data by the output from the mixer 102, a reproduction parameter is generated and supplied to the multiplexer 106. Detailed configurations and operations of the processed data extractors 103a to 103e will be described later.

ãï¼ï¼ï¼ï¼ãç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦ã¯ã å¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ããã åå¥ã«ç¬¦å·åãã¹ããªã¼ãã£ãªãã¼ã¿ï¼åå¥ç¬¦å·åãã¼ ã¿ï¼ãããããç¬¦å·åããããã«ããã¬ã¯ãµï¼ï¼ï¼ã«ä¾ çµ¦ããããç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦ã¯ãæ·· åå¨ï¼ï¼ï¼ããã®åºåãã¼ã¿ï¼æ··åå¦çãã¼ã¿ï¼ããã ããç¬¦å·åããããã«ããã¬ã¯ãµï¼ï¼ï¼ã«ä¾çµ¦ãããã ç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãå³ï¼ï¼ã«ç¤ºããç¬¦å·å¨ ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¨åºæ¬çã«åæ§ã®ãã®ã§ãããã ã®è©³ç´°ãªæ§æããã³åä½ã«ã¤ãã¦ã¯å¾è¿°ãããIn the encoders 105a to 105e, The audio data (individually encoded data) to be encoded individually supplied from the processed data extractors 103a to 103e are each encoded and supplied to the multiplexer 106. In the encoders 105f to 105g, the output data (mixing processing data) from the mixer 102 is encoded and supplied to the multiplexer 106. The encoders 105a to 105g are basically the same as the encoders 102a to 102g shown in FIG. The detailed configuration and operation will be described later.

ãï¼ï¼ï¼ï¼ããã«ããã¬ã¯ãµï¼ï¼ï¼ã«ããã¦ã¯ãåç¬¦å· å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ãããç¬¦å·åãããç¬¦ å·åãã¼ã¿ã¨ãå¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ ããä¾çµ¦ãããåç¾ç¨ã®ãã©ã¡ã¼ã¿ãï¼ã¤ã®ãããã¹ã ãªã¼ã ã«ãããåºåç«¯åï¼ï¼ï¼ããåºåããããIn the multiplexer 106, the encoded data supplied from the respective encoders 105a to 105g and the processed data extractors 103a to 103e. The parameters for reproduction supplied by the above are converted into one bit stream and output from the output terminal 107.

ãï¼ï¼ï¼ï¼ãå³ï¼ã¯ãå³ï¼ã®å®æ½ä¾ã«ãããå¦çãã¼ã¿ æ½åºå¨ï¼ï¼ï¼ï½ã®åé¨æ§æä¾ãç¤ºãã¦ãããå¦çãã¼ã¿ æ½åºå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã®æ§æããã³åä½ã¯ãå¦ç ãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ã®å ´åã¨åºæ¬çã«åæ§ã§ããã® ã§ããã®èª¬æã¯çç¥ãããFIG. 2 shows an internal configuration example of the processed data extractor 103a in the embodiment of FIG. The configurations and operations of the processed data extractors 103b to 103e are basically the same as those of the processed data extractor 103a, and therefore description thereof will be omitted.

ãï¼ï¼ï¼ï¼ãå¦çãã¼ã¿æ½åºå¨ï¼ï¼ï¼ï½ãæ§æããå¸¯å åå²ãã£ã«ã¿ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãå¥åç«¯åï¼ï¼ï¼ ï½ããå¥åãããã»ã³ã¿ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ ãããããåæããå¨æ³¢æ°å¸¯åæ¯ã«åå²ãããå¦çãã¼ ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãæ··åå¦çãæ½ããã æ··åå¦çãã¼ã¿ããåã®ãªã¼ãã£ãªãã¼ã¿ãåç¾ããã ãã®åç¾ç¨ä¿¡å·ã¨ãã¦å©ç¨ããåå¨æ³¢æ°å¸¯åæ¯ã«ããã å½è©²ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ã¨æ··åå¦çãã¼ã¿ãæ¯ è¼ãã¦ãæ··åå¦çãã¼ã¿ããå¾©åããå¨æ³¢æ°å¸¯åã«ã¤ã ã¦ãå½è©²ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãå¾©åããã®ã«æ å¹ãªãã©ã¡ã¼ã¿ï¼ä¾ãã°ã¹ã±ã¼ã«ãã¡ã¯ã¿ï¼ãåæãã åæçµæãå¦çãã¼ã¿ä½æå¨ï¼ï¼ï¼ã«åºåããããã«ãª ããã¦ãããThe band division filters 202a and 202b constituting the processed data extractor 103a are connected to the input terminal 201. The center channel audio data input from a is divided for each frequency band in which it is analyzed. The processed data analyzers 204a and 204b are provided for each frequency band that uses the mixed processed data as a reproduction signal for reproducing the original audio data, The audio data of the channel is compared with the mixed processing data, and for the frequency band restored from the mixed processing data, an effective parameter (for example, a scale factor) for restoring the audio data of the channel is analyzed, The analysis result is output to the processed data generator 205.

ãï¼ï¼ï¼ï¼ããã©ã¡ã¼ã¿è¨é²ã¡ã¢ãªï¼ï¼ï¼ï½ã¯ãå¦çã ã¼ã¿åæå¨ï¼ï¼ï¼ï½ããä¾çµ¦ãããåæçµæã¨ãã¦ã®ã ã©ã¡ã¼ã¿ãè¨æ¶ãããåæ§ã«ããã©ã¡ã¼ã¿è¨é²ã¡ã¢ãªï¼ ï¼ï¼ï½ã¯ãå¦çãã¼ã¿åæå¨ï¼ï¼ï¼ï½ããä¾çµ¦ãããå æçµæã¨ãã¦ã®ãã©ã¡ã¼ã¿ãè¨æ¶ããããã«ãªããã¦ã ããThe parameter recording memory 203a stores the parameter as the analysis result supplied from the processed data analyzer 204a. Similarly, the parameter recording memory 2 03b stores parameters as analysis results supplied from the processed data analyzer 204b.

ãï¼ï¼ï¼ï¼ãå¦çãã¼ã¿ä½æå¨ï¼ï¼ï¼ã¯ãå¦çãã¼ã¿å æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããã®åºåã«åºã¥ãã¦ãå¥å ç«¯åï¼ï¼ï¼ï½ããå¥åãããå½è©²ãã£ãã«ã®ãªã¼ãã£ãª ãã¼ã¿ãããç¬¦å·åã«å¿è¦ãªå¨æ³¢æ°å¸¯åæåã®ã¿ãæã åºããåºåç«¯åï¼ï¼ï¼ï½ããåºåãããã¾ããå¦çãã¼ ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ããããå½è©²ã ã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãå¾©åããããã®ãã©ã¡ã¼ã¿ ãï¼ã¤ã«ã¾ã¨ããåºåç«¯åï¼ï¼ï¼ï½ããåºåããããã« ãªããã¦ãããBased on the outputs from the processed data analyzers 204a and 204b, the processed data generator 205 extracts only the frequency band component required for encoding from the audio data of the channel input from the input terminal 201a, Output from the output terminal 206a. Further, the parameters for restoring the audio data of the channel, which are supplied from the processed data analyzers 204a and 204b, are put together into one and output from the output terminal 206b.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ï½ã«ãå³ï¼ã«ãããå¥åç«¯ åï¼ï¼ï¼ï½ããã®ã»ã³ã¿ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ã å¥åãããå¥åç«¯åï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ãæ··åå¨ï¼ï¼ ï¼ããã®æ··åå¦çãã¼ã¿ãå¥åãããã¨ãå¥åç«¯åï¼ï¼ ï¼ï½ï¼ï¼ï¼ï¼ï½ãããã³ï¼ï¼ï¼ï½ãä»ãã¦å¥åãããã ã¼ã¿ã¯ãå¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ãã ããä¾çµ¦ããããã®ãªã¼ãã£ãªãã¼ã¿ãåæããå¨æ³¢æ° å¸¯åï¼ä¾ãã°ã¯ãªãã£ã«ã«ãã³ãï¼æ¯ã«åå²ãããå¦ç ãã¼ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ä¾çµ¦ããããThe center channel audio data from the input terminal 101a in FIG. 1 is input to the input terminal 201a, and the mixer 10 is input to the input terminals 201b and 201c. When the mixed processing data from 2 is input, the input terminal 20 The data input via 1a, 201b, and 201c are respectively supplied to band division filters 202a to 202b, divided into frequency bands (for example, critical bands) for analyzing the audio data, and processed data analyzers 204a to 204a. It is supplied to 204b.

ãï¼ï¼ï¼ï¼ãå¦çãã¼ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ ã¯ãæ··åå¦çãæ½ãããæ··åå¦çãã¼ã¿ããåã®ãªã¼ã ã£ãªãã¼ã¿ãåç¾ããããã®åç¾ç¨ä¿¡å·ã¨ãã¦å©ç¨ãã åå¨æ³¢æ°å¸¯åæ¯ã«ãããå½è©²ãã£ãã«ã®ãªã¼ãã£ãªãã¼ ã¿ã¨æ··åå¦çãã¼ã¿ãæ¯è¼ãã¦ãæ··åå¦çãã¼ã¿ããå½ è©²ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãå¾©åããã®ã«æå¹ãªã ã©ã¡ã¼ã¿ï¼ä¾ãã°ã¹ã±ã¼ã«ãã¡ã¯ã¿ï¼ãåæãããåæ çµæãå¦çãã¼ã¿ä½æå¨ï¼ï¼ï¼ã«åºåããããProcessed data analyzers 204a-204b Is for each frequency band that uses the mixed processed data as a reproduction signal for reproducing the original audio data, compares the audio data of the channel and the mixed processed data, and mixes them. A parameter (for example, a scale factor) effective for restoring the audio data of the channel from the processed data is analyzed, and the analysis result is output to the processed data generator 205.

ãï¼ï¼ï¼ï¼ããã®ãã©ã¡ã¼ã¿ã®åæã«ã¯ãããä»¥åã®ã ã¬ã¼ã ã«ãããåæçµæãç¨ãããããããåæçµæã¯ ãããä¿åããããã®ãã©ã¡ã¼ã¿è¨é²ã¡ã¢ãªï¼ï¼ï¼ï½ä¹ è³ï¼ï¼ï¼ï½ã«ãä¾çµ¦ãããè¨é²ããããå¦çãã¼ã¿åæ å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦ã¯ããã©ã¡ã¼ã¿è¨é²ã¡ ã¢ãªï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«è¨é²ãããåæçµæãå¿è¦ ã«å¿ãã¦é©å®ç¨ãããããã©ã¡ã¼ã¿ãåæããããSince the analysis result of the previous frame is used for the analysis of this parameter, the analysis result is also supplied to and recorded in the parameter recording memories 203a and 203b for storing the analysis result. In the processed data analyzers 204a and 204b, the analysis results recorded in the parameter recording memories 203a and 203b are appropriately used as needed to analyze the parameters.

ãï¼ï¼ï¼ï¼ãå¦çãã¼ã¿ä½æå¨ï¼ï¼ï¼ã«ããã¦ã¯ãå¦ç ãã¼ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããã®åºåã«åºã¥ã ã¦ãå¥åç«¯åï¼ï¼ï¼ï½ããå¥åãããå½è©²ãã£ãã«ã®ãª ã¼ãã£ãªãã¼ã¿ãããç¬¦å·åã«å¿è¦ãªå¨æ³¢æ°å¸¯åæåã® ã¿ãæãåºãããåºåç«¯åï¼ï¼ï¼ï½ããåºåããããã¾ ããå¦çãã¼ã¿åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ã ãããå½è©²ãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãå¾©åãããã ã®ãã©ã¡ã¼ã¿ãï¼ã¤ã«ã¾ã¨ããããåºåç«¯åï¼ï¼ï¼ï½ã ãåºåããããIn the processed data generator 205, based on the outputs from the processed data analyzers 204a and 204b, only the frequency band component necessary for encoding is extracted from the audio data of the channel input from the input terminal 201a. And is output from the output terminal 206a. Further, the parameters for restoring the audio data of the channel, which are supplied from the processed data analyzers 204a and 204b, are put together into one and output from the output terminal 206b.

ãï¼ï¼ï¼ï¼ãå¾ã£ã¦ãä»¥åã®ãã¬ã¼ã ã®ãã©ã¡ã¼ã¿ãå ç§ããªãããåç¾ç¨ä¿¡å·ã¨ãã¦ä½¿ç¨ããæ··åå¦çããã ã£ãã«ãã¾ãã¯ãã®ä½¿ç¨å²åãããã¬ã¼ã åä½ã§å¤æ´ã ãããå½è©²ãã£ãã«ã®ãã¼ã¿ãç¬¦å·åããå¨æ³¢æ°å¸¯åã å¤æ´ããããåå¨æ³¢æ°å¸¯åæ¯ã«é©åãªãã©ã¡ã¼ã¿ã®å¤ã å¤æ´ãããã¨ãã§ããå¾©åããã¨ãã®éåæãè»½æ¸ãã ãã¨ãå¯è½ã¨ãªããTherefore, with reference to the parameters of the previous frame, the mixed-processed channel to be used as the reproduction signal, or the ratio of its use, can be changed on a frame-by-frame basis, or the frequency band for coding the data of the channel can be changed. It is possible to change or change the value of an appropriate parameter for each frequency band, and it is possible to reduce the discomfort when restored.

ãï¼ï¼ï¼ï¼ãå³ï¼ã¯ãæ¬çºæã®å¾©å·åæ¹æ³ãé©ç¨ããã å¾©å·åè£ç½®ã®ä¸å®æ½ä¾ã®æ§æãç¤ºããããã¯å³ã§ããã å³ï¼ã«ç¤ºããå¾©å·åè£ç½®ã¯ãã·ã³ã°ã«ãã£ãã«ç¨ã®å§ç¸® ç¬¦å·å¾©å·åå¨ï¼ä¾ãã°ä¸è¨ã®ããããï¼¡ï¼´ï¼²ï¼¡ï¼£æ¹å¼ã« å¯¾å¿ããå¾©å·åå¨ï¼ãè¤æ°ç¨ãã¦ããã«ããã£ãã«ã®å¾© å·åãå®ç¾ãããã®ã§ãããFIG. 3 is a block diagram showing the configuration of an embodiment of a decoding device to which the decoding method of the present invention is applied. The decoding apparatus shown in FIG. 3 realizes multi-channel decoding by using a plurality of single-channel compression code decoders (for example, decoders corresponding to the so-called ATRAC system described above).

ãï¼ï¼ï¼ï¼ãããã«ããã¬ã¯ãµï¼ï¼ï¼ã¯ãå¥åç«¯åï¼ï¼ ï¼ããå¥åããããããã¹ããªã¼ã ã«å«ã¾ããç¬¦å·åã ããåãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ã¨ãæ··åå¦çããã æ··åå¦çãã¼ã¿ããåã®ãªã¼ãã£ãªãã¼ã¿ãåç¾ããã ãã®åç¾ç¨ã®ãã©ã¡ã¼ã¿ããå¯¾å¿ãããã£ãã«æ¯ã«åå² ããç¬¦å·åããããªã¼ãã£ãªãã¼ã¿ããå¯¾å¿ãããã£ã ã«ã®å¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ï¼å¾©å·ææ®µï¼ã«ããã ãä¾çµ¦ããåç¾ç¨ã®ãã©ã¡ã¼ã¿ãå¯¾å¿ãããã£ãã«ã®å æãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ï¼å¾©åç¨ãã¼ã¿ä½ æææ®µï¼ã«ããããä¾çµ¦ãããã¾ããç¬¦å·åãããæ··å å¦çãã¼ã¿ãå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ä¾çµ¦ãããã ã«ãªããã¦ãããThe demultiplexer 132 has the input terminal 13 The encoded audio data of each channel included in the bit stream input from 1 and the reproduction parameter for reproducing the original audio data from the mixed processed data are divided for each corresponding channel. Then, the encoded audio data is supplied to the decoders 133a to 133e (decoding means) of the corresponding channels, and the reproduction parameters are combined data generators 134a to 134e (restoring data generating means) of the corresponding channels. ) Respectively. Further, the encoded mixed processing data is supplied to the decoders 133f and 133g.

ãï¼ï¼ï¼ï¼ãå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãããã«ã ãã¬ã¯ãµï¼ï¼ï¼ããä¾çµ¦ãããç¬¦å·åããããªã¼ãã£ãª ãã¼ã¿ãå¾©å·ããåæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ï¼å¾©åæ æ®µï¼ã«ä¾çµ¦ãããå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã¯ãããã« ããã¬ã¯ãµï¼ï¼ï¼ããä¾çµ¦ãããæ··åå¦çãã¼ã¿ãå¾©å· ããå¯¾å¿ããåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã« ããããä¾çµ¦ããããã«ãªããã¦ãããThe decoders 133a to 133e decode the encoded audio data supplied from the demultiplexer 132, and supply the decoded audio data to the synthesizers 135a to 135e (restoring means). The decoders 133f and 133g are configured to decode the mixed processing data supplied from the demultiplexer 132 and supply it to the corresponding combined data generators 134a to 134e, respectively.

ãï¼ï¼ï¼ï¼ãåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ ã¯ãå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ããä¾çµ¦ãããæ··åå¦ç ãã¼ã¿ãããããã«ããã¬ã¯ãµï¼ï¼ï¼ããä¾çµ¦ãããå ç¾ç¨ã®ãã©ã¡ã¼ã¿ã«åºã¥ãã¦ãå½è©²ãã£ãã«ã®ãªã¼ãã£ ãªãã¼ã¿ãåç¾ããããã®åç¾ç¨ãã¼ã¿ãä½æããåæ å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ããããã«ãªã ãã¦ãããCombined data generators 134a to 134e Is based on the reproduction parameters supplied from the demultiplexer 132 from the mixed processing data supplied from the decoders 133f and 133g, and creates reproduction data for reproducing the audio data of the relevant channel. It is adapted to be supplied to each of 135a to 135e.

ãï¼ï¼ï¼ï¼ãåæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãåæãã¼ ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããä¾çµ¦ãããåç¾ç¨ã ã¼ã¿ã¨ãããã«ããã¬ã¯ãµï¼ï¼ï¼ããä¾çµ¦ãããå¾©å·å ããããªã¼ãã£ãªãã¼ã¿ãåæããåãã£ãã«ã®åã®ãª ã¼ãã£ãªãã¼ã¿ãå¾©åããåºåç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ ï½ããåºåããããã«ãªããã¦ãããThe synthesizers 135a to 135e synthesize the reproduction data supplied from the synthesized data generators 134a to 134e and the decoded audio data supplied from the demultiplexer 132, and generate the original audio data of each channel. And output terminals 136a through 136 It is designed to be output from e.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ããããã«ããã¬ã¯ãµï¼ï¼ ï¼ã«ãããã¹ããªã¼ã ãå¥åãããã¨ããããã¹ããªã¼ ã åã«ã¯ãåãã£ãã«ã®ãªã¼ãã£ãªãã¼ã¿ã¨å±ã«ãæ··å å¦çããæ··åå¦çãã¼ã¿ããåã®ãã¼ã¿ãåç¾ãããã ã®ãã©ã¡ã¼ã¿ãå«ã¾ãã¦ããã®ã§ãããã«ããã¬ã¯ãµï¼ ï¼ï¼ã«ããã¦ã¯ãç¬¦å·åããããªã¼ãã£ãªãã¼ã¿ãæ··å å¦çãããæ··åå¦çãã¼ã¿ãããã³åç¾ç¨ãã©ã¡ã¼ã¿ã ãã£ãã«æ¯ã«ããããåå²ãããç¬¦å·åããããªã¼ãã£ ãªãã¼ã¿ã¨æ··åå¦çãããæ··åå¦çãã¼ã¿ãå¾©å·å¨ï¼ï¼ ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ãããåç¾ç¨ãã©ã¡ã¼ ã¿ã¯åæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ãã ããFrom the input terminal 131 to the demultiplexer 13 When the bitstream is input to the demultiplexer 1, the demultiplexer 1 receives the audio data of each channel and the parameters for reproducing the original data from the mixed processing data. In 32, the encoded audio data, the mixed processing data subjected to the mixing processing, and the reproduction parameter are divided for each channel, and the encoded audio data and the mixed processing data subjected to the mixing processing are decoded by the decoder 13 3a to 133g, respectively, and the reproduction parameters are supplied to the combiners 135a to 135e, respectively.

ãï¼ï¼ï¼ï¼ãæ··åå¦çãããã£ãã«ã«å¯¾å¿ããå¾©å·å¨ï¼ ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ããã¦ã¯ãç¬¦å·åãããæ··åå¦çã ã¼ã¿ãå¾©å·ãããåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ ï½ã«ããããä¾çµ¦ããããåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹ è³ï¼ï¼ï¼ï½ã«ããã¦ã¯ãå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ãã ä¾çµ¦ãããå¾©å·åãããæ··åå¦çãã¼ã¿ã¨ãããã«ãã ã¬ã¯ãµï¼ï¼ï¼ããä¾çµ¦ããããæ··åå¦çããããã¼ã¿ã ãåã®ãã¼ã¿ãåç¾ããããã®ãã©ã¡ã¼ã¿ãç¨ãã¦ãå½ è©²ãã£ãã«åç¾ç¨ã®ãã¼ã¿ãä½æãããåæå¨ï¼ï¼ï¼ï½ ä¹è³ï¼ï¼ï¼ï½ã«ä¾çµ¦ããããåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ ä¹è³ï¼ï¼ï¼ï½ã®è©³ç´°ãªæ§æãããã³åä½ã«ã¤ãã¦ã¯å¾è¿° ãããDecoder 1 corresponding to the mixed-processed channel In 33f and 133g, the encoded mixed processing data is decoded, and combined data generators 134a to 134 e respectively. In the synthesized data generators 134a to 134e, the original data is reproduced from the decoded mixed processing data supplied from the decoders 133f and 133g and the mixed processing data supplied from the demultiplexer 132. Data for reproducing the channel is created using the parameters, and the synthesizer 135a To 135e. Synthetic data generator 134a The detailed configurations and operations of the to 134e will be described later.

ãï¼ï¼ï¼ï¼ãåæå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããã¦ã¯ã å¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ã¦å¾©å·åãããåãã£ã ã«ã®ãªã¼ãã£ãªãã¼ã¿ã¨ãåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹ è³ï¼ï¼ï¼ï½ã«ã¦ä½æãããåãã£ãã«åç¾ç¨ãã¼ã¿ãã ãããåæãããåãã£ãã«ã®å¾©å·åããããªã¼ãã£ãª ãã¼ã¿ãããããå¾©åãããåºåç«¯åï¼ï¼ï¼ï½ä¹è³ï¼ï¼ ï¼ï½ããåºåããããIn the synthesizers 135a to 135e, The audio data of each channel decoded by the decoders 133a to 133e and the data for reproducing each channel created by the combined data creating units 134a to 134e are respectively combined to obtain the decoded audio data of each channel. The output terminals 136a to 13a are respectively restored. It is output from 6e.

ãï¼ï¼ï¼ï¼ãå³ï¼ã¯ãå³ï¼ã®åæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ ã®åé¨æ§æä¾ãç¤ºãã¦ãããåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ ä¹è³ï¼ï¼ï¼ï½ã®æ§æããã³åä½ã¯ãåæãã¼ã¿ä½æå¨ï¼ ï¼ï¼ï½ã®å ´åã¨åºæ¬çã«åæ§ã§ããã®ã§ããã®èª¬æã¯ç ç¥ãããFIG. 4 shows the combined data generator 134a of FIG. 2 shows an example of the internal configuration of FIG. Composite data generator 134b To 134e are the same as those of the synthetic data generator 1 Since it is basically similar to the case of 34a, its description is omitted.

ãï¼ï¼ï¼ï¼ãåæãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ãæ§æããå¸¯å åå²ãã£ã«ã¿ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã¯ãå¥åç«¯åï¼ï¼ï¼ ï½ï¼ï¼ï¼ï¼ï½ããå¥åãããï¼ãã£ãã«ã®æ··åå¦çãã¼ ã¿ãåç¾ãã¼ã¿ãä½æããå¨æ³¢æ°å¸¯åæ¯ã«åå²ããåºå ããããã©ã¡ã¼ã¿è§£æå¨ï¼ï¼ï¼ã¯ãå¥åç«¯åï¼ï¼ï¼ï½ã ãå¥åãããåç¾ç¨ãã©ã¡ã¼ã¿ãè§£æããå¨æ³¢æ°å¸¯åæ¯ ã«åå²ããåºåããããã«ãªããã¦ãããThe band division filters 212a and 212b constituting the combined data generator 134a are connected to the input terminal 211. The mixed processing data of the two channels input from a and 211b is divided for each frequency band in which reproduction data is created and output. The parameter analyzer 213 analyzes the reproduction parameter input from the input terminal 211c, divides it for each frequency band, and outputs it.

ãï¼ï¼ï¼ï¼ãåç¾ãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ ã¯ãåç¾ç¨ãã©ã¡ã¼ã¿ãåã«ãåå¨æ³¢æ°å¸¯åæ¯ã«ãå¾©å· åãããæ··åå¦çãã¼ã¿ãå¤æããåºåãããåç¾ãã¼ ã¿åæå¨ï¼ï¼ï¼ã¯ãåå¨æ³¢æ°å¸¯åæ¯ã®åç¾ãã¼ã¿ä½æå¨ ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ããã®åºåãåæããåºåç«¯åï¼ ï¼ï¼ããåºåããããã«ãªããã¦ãããReproduction data generators 214a and 214b Converts the decoded mixed processing data for each frequency band based on the reproduction parameter and outputs the converted mixed processing data. The reproduction data synthesizer 215 synthesizes the outputs from the reproduction data generators 214a to 214e for each frequency band, and outputs the output terminal 2 It is designed to output from 16.

ãï¼ï¼ï¼ï¼ãå¥åç«¯åï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ã«ã¯ãå³ï¼ã« ç¤ºããå¾©å·å¨ï¼ï¼ï¼ï½ï¼ï¼ï¼ï¼ï½ããã®å¾©å·åãããæ·· åå¦çãã¼ã¿ãå¥åãããå¥åç«¯åï¼ï¼ï¼ï½ã«ã¯å³ï¼ã« ç¤ºããããã«ããã¬ã¯ãµï¼ï¼ï¼ããã®å½è©²ãã£ãã«ã®å ç¾ç¨ãã©ã¡ã¼ã¿ãå¥åããããå¥åç«¯åï¼ï¼ï¼ï½ï¼ï¼ï¼ ï¼ï½ãä»ãã¦ä¾çµ¦ãããå¾©å·åãããåæ··åå¦çãã¼ã¿ ã¯ãå¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ãããã ä¾çµ¦ãããåç¾ãã¼ã¿ãä½æããå¨æ³¢æ°å¸¯åï¼ä¾ãã°ã¯ ãªãã£ã«ã«ãã³ãï¼æ¯ã«åå²ãããå¾ãåç¾ãã¼ã¿ä½æ å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ããããDecoded mixed processed data from the decoders 133f and 133g shown in FIG. 3 are input to the input terminals 211a and 211b, and the input mixed data from the demultiplexer 132 shown in FIG. 3 is input to the input terminal 211c. Channel reproduction parameters are input. Input terminals 211a, 21 The decoded mixed processing data supplied via 1b are supplied to the band division filters 212a to 212b, respectively, and are divided into frequency bands (for example, critical bands) for creating reproduction data, and then reproduction data creation is performed. To the containers 214a and 214b, respectively.

ãï¼ï¼ï¼ï¼ãä¸æ¹ãå¥åç«¯åï¼ï¼ï¼ï½ããå¥åãããå½ è©²ãã£ãã«ã®åç¾ç¨ãã©ã¡ã¼ã¿ã¯ããã©ã¡ã¼ã¿è§£æå¨ï¼ ï¼ï¼ã«ä¾çµ¦ããããã©ã¡ã¼ã¿ãè§£æãã¦åæããå¨æ³¢æ° å¸¯åæ¯ã«åå²ãããè©²å½å¸¯åã®åç¾ãã¼ã¿ä½æå¨ï¼ï¼ï¼ ï½ä¹è³ï¼ï¼ï¼ï½ã«ããããä¾çµ¦ããããOn the other hand, the parameter for reproduction of the channel input from the input terminal 211c is the parameter analyzer 2. 13, the parameters are analyzed and divided into frequency bands for analysis, and a reproduction data generator 214 for the corresponding band is generated. a to 214b, respectively.

ãï¼ï¼ï¼ï¼ãåç¾ãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã« ããã¦ã¯ããã©ã¡ã¼ã¿è§£æå¨ï¼ï¼ï¼ããä¾çµ¦ãããåç¾ ç¨ãã©ã¡ã¼ã¿ãåã«ãåå¨æ³¢æ°å¸¯åæ¯ã«ãå¾©å·åããã æ··åå¦çãã¼ã¿ãå¤æãããåç¾ãã¼ã¿åæå¨ï¼ï¼ï¼ã« ä¾çµ¦ããããIn the reproduction data generators 214a and 214b, the decoded mixed processing data is converted for each frequency band based on the reproduction parameters supplied from the parameter analyzer 213, and the reproduction data synthesizer 215 is used. Is supplied to.

ãï¼ï¼ï¼ï¼ãåç¾ãã¼ã¿åæå¨ï¼ï¼ï¼ã«ããã¦ã¯ãåå¨ æ³¢æ°å¸¯åæ¯ã®åç¾ãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã ãã®åºåãåæãããè©²å½ãã£ãã«ã®åç¾ç¨ãã¼ã¿ãä½ æãããå¾ãåºåç«¯åï¼ï¼ï¼ããåºåããããIn the reproduction data synthesizer 215, the outputs from the reproduction data generators 214a to 214b for each frequency band are synthesized, the reproduction data of the corresponding channel is generated, and then output from the output terminal 216.

ãï¼ï¼ï¼ï¼ããã®ããã«ãä¸è¨å®æ½ä¾ã«ããã¦ã¯ãåæ ãã¼ã¿ä½æå¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã«ãããç¬¦å·åè£ç½® ã§æ±ºå®ãããæé©ãªåç¾æ¹æ³ã«åããã¦ãæ··åå¦çãã¼ ã¿ãåç¾ç¨ãã¼ã¿ã«å¤æãããåæå¨ï¼ï¼ï¼ã«ããå½è©² ãã£ãã«ã®ãã¼ã¿ã¨åæãããåã®ãªã¼ãã£ãªãã¼ã¿ã å¾©åããããå¾ã£ã¦ãæ··åå¦çã«ãã£ã¦å¾ãããæ··åå¦ çãã¼ã¿ãç¨ãããã¨ã«ãããå½è©²ãã£ãã«ã®ãããã¬ ã¼ããåæ¸ãããã¨ãå¯è½ã¨ãªããAs described above, in the above embodiment, the combined data generators 134a to 134e convert the mixed processing data into reproduction data in accordance with the optimum reproduction method determined by the encoding device, and the combiner data is reproduced. The original audio data is restored by combining with the data of the channel by 135. Therefore, by using the mixed processing data obtained by the mixing processing, it is possible to reduce the bit rate of the channel.

ãï¼ï¼ï¼ï¼ãæ¬¡ã«ãæ¬çºæã®ä»ã®å®æ½ä¾ã«ã¤ãã¦è¿°ã¹ ããä¸è¿°ããå®æ½ä¾ã§ã¯æ··åå¦çãããã£ãã«ã¯ï¼ãã£ ãã«ã®å ´åã§è¿°ã¹ã¦ããããæ··åå¦çãããã£ãã«ã¯å° ãªãã¨ãï¼ãã£ãã«ããã°æ¬çºæãä½¿ç¨ãããã¨ãå¯è½ ã§ãããNext, another embodiment of the present invention will be described. In the above-described embodiment, the case where the number of the mixed-processed channels is two is described, but the present invention can be used as long as at least one channel is the mixed-processed channel.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¿°ããå®æ½ä¾ã®ãã£ãã«é¸æã¯ã ã»ã³ã¿ï¼ï¼£ï¼ãã¬ããï¼ï¼¬ï¼ãã©ã¤ãï¼ï¼²ï¼ãã¬ãããµ ã©ã¦ã³ãï¼ï¼³ï¼¬ï¼ãããã³ã©ã¤ããµã©ã¦ã³ãï¼ï¼³ï¼²ï¼ã® ï¼ãã£ãã«ã®å ´åã«ã¤ãã¦è¿°ã¹ã¦ããããããã«ã¬ãã ã»ã³ã¿ï¼ï¼¬ï¼£ï¼ãã£ãã«ãã©ã¤ãã»ã³ã¿ï¼ï¼²ï¼£ï¼ãã£ã ã«ãå ããï¼ãã£ãã«ã®å ´åããéã«ã»ã³ã¿ï¼ï¼£ï¼ãã£ ãã«ãæããï¼ãã£ãã«ã®å ´åãããã«ãµãã¦ã¼ãã¡ã¼ ï¼ï¼³ï¼·ï¼ãã£ãã«ãä»å ãããå ´åãªã©ãè¤æ°ãã£ãã« ã®ãã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·åãããã¤ãããã¨ã¯å¥ã«ãã ãããæ··åå¦çãããã£ãã«ãæã¤ãã«ããã£ãã«ã®ã ã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·åããå ´åã«ä½¿ç¨å¯è½ã§ãããThe channel selection of the above embodiment is The case of 5 channels of the center (C), the left (L), the right (R), the left surround (SL), and the right surround (SR) is described, and the left center (LC) channel and the right center ( For example, in the case of 7 channels including the RC) channel, in the case of 4 channels without the center (C) channel, and in the case where a subwoofer (SW) channel is further added, digital signals of a plurality of channels are encoded, In addition to this, it can be used when encoding a multi-channel digital signal having a channel obtained by mixing these.

ãï¼ï¼ï¼ï¼ãæ¬¡ã«ãä¸è¿°ããç¬¦å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ ï½ã®å·ä½çæ§æããã³åä½ã«ã¤ãã¦ãå³ï¼ä¹è³å³ï¼ãç¨ ãã¦èª¬æããããªããå³ï¼ã«ã¯ãï¼ã¤ã®ãã£ãã«ã®ç¬¦å· å¨ï¼ï¼ï¼ï½ã®æ§æä¾ãç¤ºãã¦ããããªããç¬¦å·å¨ï¼ï¼ï¼ ï½ä¹è³ï¼ï¼ï¼ï½ã®æ§æããã³åä½ã¯ãç¬¦å·å¨ï¼ï¼ï¼ï½ã® å ´åã¨åºæ¬çã«åæ§ã§ããã®ã§ãããã§ã¯ãã®èª¬æã¯ç ç¥ãããNext, the above-mentioned encoders 105a to 105 The specific configuration and operation of e will be described with reference to FIGS. Note that FIG. 5 shows a configuration example of the encoder 105a for one channel. The encoder 105 Since the configurations and operations of b to 105e are basically the same as those of the encoder 105a, the description thereof is omitted here.

ãï¼ï¼ï¼ï¼ãä¿¡å·å¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ã¯ãå¥åç«¯å ï¼ï¼ï¼ãä»ãã¦å¥åãããä¿¡å·ãï¼ã¤ã®å¨æ³¢æ°å¸¯åã«å å²ããããã«ãªããã¦ãããä½åï¼ï¼¤ï¼£ï¼´ï¼Modified D iscrete Cosine Transformï¼æ¹è¯åé¢æ£ä½å¼¦å¤æï¼åè·¯ ï¼ï¼ï¼ï¼¬ã¯ãå¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ããä¾çµ¦ãããï¼ ï½ï¼¨ï½ä¹è³ï¼ï¼ï¼ï½ï¼¨ï½ã®ä½åã®ä¿¡å·ã«å¯¾ãã¦ï¼ï¼¤ï¼£ï¼´ æ¼ç®ãè¡ããä¸åï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼ã¯ãå¸¯ååå²ã ã£ã«ã¿ï¼ï¼ï¼ããä¾çµ¦ãããï¼ï¼ï¼ï½ï¼¨ï½ä¹è³ï¼ï¼ï½ï¼¨ ï½ã®ä¸åã®ä¿¡å·ã«å¯¾ãã¦ï¼ï¼¤ï¼£ï¼´æ¼ç®ãè¡ããé«åï¼ï¼¤ ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¨ã¯ãå¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ããä¾çµ¦ ãããï¼ï¼ï½ï¼¨ï½ä»¥ä¸ï¼ï¼ï¼ï½ï¼¨ï½ä¹è³ï¼ï¼ï½ï¼¨ï½ï¼ã® é«åã®ä¿¡å·ã«å¯¾ãã¦ï¼ï¼¤ï¼£ï¼´æ¼ç®ãè¡ãããã«ãªããã¦ ãããThe signal band division filter 401 is adapted to divide the signal inputted through the input terminal 424 into three frequency bands. Low frequency MDCT (Modified D The iscrete Cosine Transform (improved discrete cosine transform) circuit 402L is 0 supplied from the band division filter 401. MDCT for low frequency signals from kHz to 5.5 kHz Perform the operation. The mid-range MDCT circuit 402M has 5.5 kHz to 11 kHz supplied from the band division filter 401. MDCT operation is performed on the signal in the middle band of z. High range MD The CT circuit 402H is configured to perform MDCT calculation on a high frequency signal of 11 kHz or higher (11 kHz to 22 kHz) supplied from the band division filter 401.

ãï¼ï¼ï¼ï¼ããããã¯ãµã¤ãºè©ä¾¡å¨ï¼ï¼ï¼ã¯ãå¾è¿°ãã æéãããã¯é·ãæ±ºå®ãããæ£è¦ååè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ ï¼ï¼ï¼¨ã¯ãä½åãä¸åãããã³é«åãããªããªã¼ãã£ãª ä¿¡å·ãä½åãä¸åãããã³é«åã®åè¨ï¼ï¼åã®ãããã¯ ããã¼ãã£ã³ã°ã¦ãããã«åããã¨ã¨ãã«ãã¦ãããæ¯ ã«è¦æ ¼åï¼æ£è¦åï¼ããããã«ãªããã¦ãããThe block size evaluator 403 determines a time block length described later. Normalization circuits 404L to 4 04H divides the audio signal consisting of low, mid, and high frequencies into a total of 52 block floating units of low, mid, and high frequencies, and normalizes (normalizes) each unit. Has been done.

ãï¼ï¼ï¼ï¼ããããéåå¨ï¼ï¼ï¼ã¯ãå¾è¿°ããéåãã ãæ°æå ±ãæ±ããåéååå¨ï¼ï¼ï¼ã«ä¾çµ¦ãããåéå åå¨ï¼ï¼ï¼ã¯ããããéåå¨ï¼ï¼ï¼ããä¾çµ¦ãããéå ãããæ°æå ±ã«åºã¥ãã¦ãåéååãè¡ãããã«ãªãã ã¦ããããã©ã¼ããã¿ï¼ï¼ï¼ã¯ãåéååå¨ï¼ï¼ï¼ãã ã®åºåä¿¡å·ããããã¹ããªã¼ã ã«å¤æããããã«ãªãã ã¦ãããThe bit allocator 405 obtains allocation bit number information, which will be described later, and supplies it to the requantizer 406. The requantizer 406 is adapted to perform requantization based on the distributed bit number information supplied from the bit distributor 405. The formatter 407 is adapted to convert the output signal from the requantizer 406 into a bitstream.

ãï¼ï¼ï¼ï¼ãå³ï¼ã«ããã¦ãå¥åç«¯åï¼ï¼ï¼ã«ã¯ããªã¼ ãã£ãªãã¼ã¿ï¼æ¨æ¬åããã³éååããããªã¼ãã£ãªã ã¼ã¿ï¼ãä¾çµ¦ããããå¥åç«¯åï¼ï¼ï¼ã«ä¾çµ¦ããããã¼ ã¿ã¯ãåãå¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ã«ãã£ã¦ï¼ä¹è³ï¼ï¼ ï¼ï½ï¼¨ï½ã®ä½åã¨ãï¼ï¼ï¼ï½ï¼¨ï½ä¹è³ï¼ï¼ï½ï¼¨ï½ã®ä¸å ã¨ãï¼ï¼ï½ï¼¨ï½ä»¥ä¸ï¼ï¼ï¼ï½ï¼¨ï½ä¹è³ï¼ï¼ï½ï¼¨ï½ï¼ã®ï¼ ã¤ã®å¨æ³¢æ°å¸¯åã«åå²ããããIn FIG. 5, the input terminal 424 is supplied with audio data (sampled and quantized audio data). The data supplied to the input terminal 424 is first supplied to the band splitting filter 401 through 0 to 5. Low frequency of 5kHz, medium frequency of 5.5kHz to 11kHz, 11kHz or more (11kHz to 22kHz) 3 It is divided into two frequency bands.

ãï¼ï¼ï¼ï¼ããããï¼ã¤ã®å¨æ³¢æ°å¸¯åã®ä¿¡å·ã®ãã¡ãå¸¯ ååå²ãã£ã«ã¿ï¼ï¼ï¼ããã®ä½åã®ä¿¡å·ã¯ï¼ï¼¤ï¼£ï¼´æ¼ç® ãè¡ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¬ã«ãä¸åã®ä¿¡å·ã¯åããï¼ ï¼¤ï¼£ï¼´æ¼ç®ãè¡ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼ã«ãã¾ããé«å ã®ä¿¡å·ã¯ï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¨ã«ä¾çµ¦ããããããï¼ï¼¤ ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã«ããã¦ãå¨æ³¢æ°æåã« ããããåè§£ãããããã®ã¨ããï¼ï¼¤ï¼£ï¼´å¦çãæ½ãã¨ ãã®æéãããã¯é·ã¯ãåå¨æ³¢æ°å¸¯åæ¯ã«å¯å¤ã§ããã ä¿¡å·ãæ¥æ¿ã«å¤åããé¨åã§ã¯ãæéãããã¯é·ãçã ãã¦ãæéåè§£è½ãé«ããä¿¡å·ãå®å¸¸çãªé¨åã§ã¯æé ãããã¯é·ãé·ããã¦ãä¿¡å·æåã®æå¹ä¼éã¨éååé é³ãå¶å¾¡ããããã«ãã¦ãããOf these three frequency band signals, the low frequency signal from the band division filter 401 is input to the MDCT circuit 402L which performs MDCT operation, and the middle frequency signal is also M. The MDCT circuit 402M that performs the DCT operation and the high-frequency signal are supplied to the MDCT circuit 402H. Each of the CT circuits 402L to 402H is decomposed into frequency components. At this time, the time block length when performing MDCT processing is variable for each frequency band, In the part where the signal changes abruptly, the time block length is shortened to improve the time resolution, and in the part where the signal is stationary, the time block length is lengthened to control the effective transmission of the signal component and the quantization noise. I have to.

ãï¼ï¼ï¼ï¼ããã®æéãããã¯é·ã¯ããããã¯ãµã¤ãºè© ä¾¡å¨ï¼ï¼ï¼ã«ã¦æ±ºå®ããããããªãã¡ãå¸¯ååå²ãã£ã« ã¿ï¼ï¼ï¼ããã®ï¼ã¤ã®å¨æ³¢æ°å¸¯åã®ä¿¡å·ã¯ããããã¯ãµ ã¤ãºè©ä¾¡å¨ï¼ï¼ï¼ã«ãä¾çµ¦ããããããã¯ãµã¤ãºè©ä¾¡å¨ ï¼ï¼ï¼ãï¼ï¼¤ï¼£ï¼´å¦çã®æéãããã¯é·ãæ±ºå®ããæ±ºå® ãããæéãããã¯é·ãç¤ºãæå ±ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã«ä¾çµ¦ãããããã«ãã¦ãããThe time block length is determined by the block size evaluator 403. That is, the signals of the three frequency bands from the band division filter 401 are also supplied to the block size evaluator 403, the block size evaluator 403 determines the time block length of MDCT processing, and indicates the determined time block length. Information is MDCT circuit 402 L to 402H are supplied.

ãï¼ï¼ï¼ï¼ããªããï¼ï¼¤ï¼£ï¼´å¦çã§ã®ï¼ç¨®é¡ã®æéãã ãã¯é·ã®ãã¡ãé·ãæéãããã¯é·ã¯ãã³ã°ã¢ã¼ãã¨å¼ ã°ããä¾ãã°ï¼ï¼ï¼ï¼ï½ï½ã®æéã«ç¸å½ãããã¾ããç ãæéãããã¯é·ã¯ã·ã§ã¼ãã¢ã¼ãã¨å¼ã°ããä¾ãã°é« åï¼ï¼ï¼ï½ï¼¨ï½ä»¥ä¸ï¼ã§ã¯ï¼ï¼ï¼ï¼ï½ï½ã¾ã§ãä½å ï¼ï¼ï¼ï¼ï½ï¼¨ï½ä»¥ä¸ï¼ããã³ä¸åï¼ï¼ï¼ï¼ï½ï¼¨ï½ããï¼ ï¼ï½ï¼¨ï½ï¼ã§ã¯ï¼ï¼ï¼ï½ï½ã¾ã§æéåè§£è½ãä¸ãããã ã«ãã¦ãããOf the two types of time block lengths in MDCT processing, the long time block length is called the long mode and corresponds to a time of 11.6 ms, for example. The short time block length is called a short mode. For example, up to 1.45 ms in the high range (11 kHz or more), low range (5.5 kHz or less) and middle range (5.5 kHz to 1). At 1 kHz), the time resolution is increased up to 2.9 ms.

ãï¼ï¼ï¼ï¼ããã®ããã«ãã¦ãæéã¨å¨æ³¢æ°ã®ï¼æ¬¡åé åï¼ããããããã¯ããã¼ãã£ã³ã°ã¦ãããï¼Block Fl oating Unit ã¨å¼ã¶ï¼ä¸ã®ä¿¡å·æåã«åè§£ããããªã¼ã ã£ãªä¿¡å·ã¯ãæ£è¦ååè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã«ãã£ã¦ ä½åãä¸åãããã³é«åã«ããã¦åè¨ï¼ï¼åã®ãããã¯ ããã¼ãã£ã³ã°ã¦ãããã«åããããã¨å±ã«ãã¦ããã æ¯ã«è¦æ ¼åï¼æ£è¦åï¼ãããï¼ã¹ã±ã¼ã«ãã¡ã¯ã¿ã®æ±ºå® ããªãããï¼ãIn this way, the two-dimensional area of time and frequency (this is a block floating unit: Block Fl The audio signal decomposed into the above signal components is divided into a total of 52 block floating units in the low band, the middle band, and the high band by the normalization circuits 404L to 404H, and standardized for each unit. Is normalized (normalized) (scale factor is determined).

ãï¼ï¼ï¼ï¼ãã¾ãããããéåå¨ï¼ï¼ï¼ã§ã¯ãäººéã®è´ è¦ã®ç¹æ§ãå©ç¨ãã¦ããã®ãªã¼ãã£ãªä¿¡å·ãã©ã®ãããª æåããæ§æããã¦ããããåæãããããã®åæçµæ ã¯ãæ£è¦ååè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ããã®åã¦ããã æ¯ã®ä¿¡å·ãä¾çµ¦ãããåéååå¨ï¼ï¼ï¼ã«ä¾çµ¦ããããFurther, in the bit allocator 405, the characteristics of the audio signal are analyzed by utilizing the characteristics of human hearing. The analysis result is supplied to the requantizer 406 to which the signals for each unit from the normalization circuits 404L to 404H are supplied.

ãï¼ï¼ï¼ï¼ãåéååå¨ï¼ï¼ï¼ã«ããã¦ã¯ãä¸è¨åæçµ æã«åºã¥ãã¦ãåã¦ããããã©ã®ç¨åº¦ã®ç²¾åº¦ã§ç¬¦å·åã ãããæ±ãããããã©ã¡ã¼ã¿åãããï¼ã¯ã¼ãã¬ã³ã°ã¹ ã®æ±ºå®ãè¡ãããï¼ã¨å±ã«ãåéååãè¡ããããIn the requantizer 406, the degree of accuracy with which each unit is coded is determined based on the above analysis result, parameterized (word length is determined), and re-quantized. Quantization is performed.

ãï¼ï¼ï¼ï¼ãæå¾ã«ããã©ã¼ããã¿ï¼ï¼ï¼ã«ããã¦ã¯ã åã¦ãããæ¯ã®åãã©ã¡ã¼ã¿æå ±ã¨åéååãããã¹ã ã¯ãã©ã ä¿¡å·ããæå®ã®ãã©ã¼ãããã«å¾ã£ã¦ãå³ï¼ã« ç¤ºãããã«ããã¬ã¯ãµï¼ï¼ï¼ã«ä¾çµ¦ãããï¼ã¤ã®ãã£ã ã«ã«ããããããã¹ããªã¼ã ã«çµã¿ç«ã¦ãããããã®ã ã©ã¼ããã¿ï¼ï¼ï¼ã®åºåãåºåç«¯åï¼ï¼ï¼ããåºåãã ããFinally, in the formatter 407, The parameter information for each unit and the requantized spectrum signal are assembled into a bit stream in one channel supplied to the multiplexer 106 shown in FIG. 1 according to a predetermined format. The output of the formatter 407 is output from the output terminal 425.

ãï¼ï¼ï¼ï¼ãããã§ãä¸è¿°ãããããªç¬¦å·åã®åä½ã¯ãµ ã¦ã³ããã¬ã¼ã ã¨ããåä½æ¯ã«è¡ããããHere, the above-described encoding operation is performed for each unit called a sound frame.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¨ãããéåå¨ï¼ï¼ï¼ã¯å·ä½çã« ã¯å³ï¼ã«ç¤ºããããªæ§æãæãããã®ã§ãããããªã ã¡ãã¨ãã«ã®ç®åºåè·¯ï¼ï¼ï¼ã¯ãå¥åç«¯åï¼ï¼ï¼ããå¥ åãããä¿¡å·ã®ã¯ãªãã£ã«ã«ãã³ãæ¯ã®ã¨ãã«ã®ãç®åº ããããã«ãªããã¦ãããç³ã¿è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ ã¯ãã¨ãã«ã®ç®åºåè·¯ï¼ï¼ï¼ããã®åºåä¿¡å·ã«æå®ã®é ã¿ä»ãé¢æ°ãæãã¦å ç®ããç³ã¿è¾¼ã¿å¦çãæ½ãããã« ãªããã¦ãããFurther, the bit allocator 405 has a concrete structure as shown in FIG. That is, the energy calculation circuit 522 is configured to calculate energy for each critical band of the signal input from the input terminal 521. Convolution filter circuit 523 Performs a convolution process of multiplying an output signal from the energy calculation circuit 522 by a predetermined weighting function and adding the product.

ãï¼ï¼ï¼ï¼ãï¼ï½âï½ï½ï¼é¢æ°çºçåè·¯ï¼ï¼ï¼ã¯ãè¨±å®¹ é¢æ°ï¼ï½âï½ï½ï¼ãçºçããåºåãããå¼ãç®å¨ï¼ï¼ï¼ ã¯ãç³ã¿è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ã®åºåãããï¼ï½âï½ ï½ï¼é¢æ°çºçåè·¯ï¼ï¼ï¼ããã®åºåãå¼ãç®ãããã®çµ æãå²ç®å¨ï¼ï¼ï¼ã«ä¾çµ¦ããããã«ãªããã¦ãããå²ç® å¨ï¼ï¼ï¼ã¯ãå¥åä¿¡å·ã«å¯¾ãã¦ãéã³ã³ããªã¥ã¼ã·ã§ã³ å¦çãè¡ãããã¹ãã³ã°ã¹ã¬ã·ã§ã¼ã«ããå¾ãããã«ãª ããã¦ãããThe (n-ai) function generating circuit 525 generates and outputs an allowable function (n-ai). Subtractor 524 From the output of the convolutional filter circuit 523, (nâa i) The output from the function generating circuit 525 is subtracted, and the result is supplied to the divider 526. The divider 526 is adapted to perform inverse convolution processing on the input signal to obtain a masking threshold.

ãï¼ï¼ï¼ï¼ãæå°å¯è´ã«ã¼ãçºçåè·¯ï¼ï¼ï¼ã¯ãæå°å¯ è´ã«ã¼ããç¤ºããã¼ã¿ãåæåè·¯ï¼ï¼ï¼ã«ä¾çµ¦ãããå æåè·¯ï¼ï¼ï¼ã¯ãæå°å¯è´ã«ã¼ãçºçåè·¯ï¼ï¼ï¼ããåº åãããæå°å¯è´ã«ã¼ããç¤ºããã¼ã¿ã¨ãå²ç®å¨ï¼ï¼ï¼ ããåºåããããã¹ãã³ã°ã¹ã¬ã·ã§ã¼ã«ããåæããåº åãããæ¸ç®å¨ï¼ï¼ï¼ã¯ãåæåè·¯ï¼ï¼ï¼ããã®åºåä¿¡ å·ã¨ãéå»¶åè·¯ï¼ï¼ï¼ãä»ãã¦ä¾çµ¦ãããã¨ãã«ã®ç®åº åè·¯ï¼ï¼ï¼ããã®åºåã«å¯¾ãã¦æ¸ç®å¦çãæ½ããåºåã ãããã«ãªããã¦ãããThe minimum audible curve generating circuit 532 supplies the data indicating the minimum audible curve to the synthesizing circuit 527. The synthesizing circuit 527 and the data indicating the minimum audible curve output from the minimum audible curve generating circuit 532 and the divider 526. The masking threshold output from the above is synthesized and output. The subtractor 528 subtracts the output signal from the synthesis circuit 527 and the output from the energy calculation circuit 522 supplied via the delay circuit 529, and outputs the result.

ãï¼ï¼ï¼ï¼ãè£æ£æå ±åºååè·¯ï¼ï¼ï¼ã¯ãæå®ã®çã©ã¦ ããã¹ã«ã¼ãã«å¯¾å¿ãããã¼ã¿ãåºåãããè¨±å®¹éé³è£ æ£åè·¯ï¼ï¼ï¼ã¯ãè£æ£æå ±åºååè·¯ï¼ï¼ï¼ããã®åºåä¿¡ å·ã«åºã¥ãã¦ãæ¸ç®å¨ï¼ï¼ï¼ããã®åºåä¿¡å·ã®è¨±å®¹éé³ ã¬ãã«ã®è£æ£ãè¡ããåºåç«¯åï¼ï¼ï¼ãä»ãã¦åºåãã ããã«ãªããã¦ãããThe correction information output circuit 533 outputs data corresponding to a predetermined equal loudness curve. The permissible noise correction circuit 530 corrects the permissible noise level of the output signal from the subtractor 528 based on the output signal from the correction information output circuit 533, and outputs it via the output terminal 531.

ãï¼ï¼ï¼ï¼ãå³ï¼ã«ããã¦ãå¥åç«¯åï¼ï¼ï¼ã«ã¯ãï¼ï¼¤ ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ãããã³ï¼ï¼ï¼ï¼¨ããã®å¨ æ³¢æ°é åã®ã¹ãã¯ãã«ãã¼ã¿ãä¾çµ¦ããã¦ãããIn FIG. 6, the input terminal 521 has an MD The spectrum data in the frequency domain from the CT circuits 402L, 402M, and 402H is supplied.

ãï¼ï¼ï¼ï¼ããã®å¨æ³¢æ°é åã®ãã¼ã¿ã¯ãå¸¯åæ¯ã®ã¨ã ã«ã®ç®åºåè·¯ï¼ï¼ï¼ã«ä¾çµ¦ããã¦ãã¯ãªãã£ã«ã«ãã³ã ï¼è¨çå¸¯åï¼æ¯ã®ã¨ãã«ã®ããä¾ãã°å½è©²ãã³ãåã§ã® åæ¯å¹å¤ï¼ä¹ï¼ä¾ãã°æ¯å¹å¤ã®ãã¼ã¯å¤ã®ï¼ä¹ï¼ã®ç·å ãè¨ç®ãããã¨çã«ããæ±ããããããã®åãã³ãæ¯ã® ã¨ãã«ã®ã®ä»£ããã«ãæ¯å¹å¤ã®ãã¼ã¯å¤ãå¹³åå¤çãç¨ ãããããã¨ãããããã®ã¨ãã«ã®ç®åºåè·¯ï¼ï¼ï¼ãã ã®åºåã¨ãã¦ã®ä¾ãã°åãã³ãã®ç·åå¤ã®ã¹ãã¯ãã« ã¯ãä¸è¬ã«ãã¼ã¯ã¹ãã¯ãã«ï¼ï¼³ï¼¢ï¼ã¨ç§°ããã¦ããã å³ï¼ã¯ãã®ãããªåã¯ãªãã£ã«ã«ãã³ãæ¯ã®ãã¼ã¯ã¹ã ã¯ãã«ãç¤ºãã¦ããããã ããå³ï¼ã§ã¯ãå³ç¤ºãç°¡ç¥å ãããããä¸è¨ã¯ãªãã£ã«ã«ãã³ãã®ãã³ãæ°ãï¼ï¼ã ã³ãï¼ãã³ãï¼¢1ä¹è³ãã³ãï¼¢12ï¼ã§è¡¨ç¾ãã¦ãããThis frequency domain data is supplied to the energy calculation circuit 522 for each band, and the energy for each critical band (critical band) is, for example, squared for each amplitude value within the band (for example, the peak of the amplitude value). It can be obtained by calculating the sum of the squares of the values. Instead of the energy for each band, a peak value, an average value, etc. of the amplitude value may be used. The spectrum of the total sum value of each band as the output from the energy calculation circuit 522 is generally called a Bark spectrum (SB). FIG. 7 shows the Bark spectrum for each such critical band. However, in FIG. 7, in order to simplify the illustration, the number of bands of the critical band is represented by 12 bands (bands B1 to B12).

ãï¼ï¼ï¼ï¼ãããã§ãä¸è¨ãã¼ã¯ã¹ãã¯ãã«ã®ãããã ãã¹ãã³ã°ã«æ¼ããå½±é¿ãèæ®ããããã«ããã®ãã¼ã¯ ã¹ãã¯ãã«ã«æå®ã®éã¿ä»ãé¢æ°ãæãã¦å ç®ãããã ãªç³è¾¼ã¿ï¼ã³ã³ããªã¥ã¼ã·ã§ã³ï¼å¦çãæ½ãããã®ã ããä¸è¨å¸¯åæ¯ã®ã¨ãã«ã®ç®åºåè·¯ï¼ï¼ï¼ã®åºåãããª ãã¡ãã¼ã¯ã¹ãã¯ãã«ã®åå¤ã¯ãç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ ï¼ï¼ã«ä¾çµ¦ããããç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ã¯ãä¾ã ã°ãå¥åãã¼ã¿ãé æ¬¡éå»¶ãããè¤æ°ã®éå»¶ç´ åã¨ãã ããéå»¶ç´ åããã®åºåã«ãã£ã«ã¿ä¿æ°ï¼éã¿ä»ãé¢ æ°ï¼ãä¹ç®ããè¤æ°ã®ä¹ç®å¨ï¼ä¾ãã°åãã³ãã«å¯¾å¿ã ãï¼ï¼åã®ä¹ç®å¨ï¼ã¨ãåä¹ç®å¨åºåã®ç·åãã¨ãç·å å ç®å¨ã¨ããæ§æããããã®ã§ãããHere, in order to consider the influence of so-called masking of the Bark spectrum, a convolution processing is performed such that the Bark spectrum is multiplied by a predetermined weighting function and added. Therefore, the convolution filter circuit 5 outputs the output of the energy calculation circuit 522 for each band, that is, each value of the Bark spectrum. 23. The convolution filter circuit 523 includes, for example, a plurality of delay elements that sequentially delay input data, and a plurality of multipliers that multiply outputs from these delay elements by a filter coefficient (weighting function) (for example, 25 pieces corresponding to each band). And a sum total adder that sums the outputs of the respective multipliers.

ãï¼ï¼ï¼ï¼ããªããä¸è¨ãã¹ãã³ã°ã¨ã¯ãäººéã®è´è¦ä¸ ã®ç¹æ§ã«ãããããä¿¡å·ã«ãã£ã¦ä»ã®ä¿¡å·ããã¹ã¯ãã ã¦èãããªããªãç¾è±¡ããããã®ã§ããããã®ãã¹ãã³ ã°å¹æã«ã¯ãæéé åã®ãªã¼ãã£ãªä¿¡å·ã«ããæéè»¸ã ã¹ãã³ã°å¹æã¨ãå¨æ³¢æ°é åã®ä¿¡å·ã«ããåæå»ãã¹ã ã³ã°å¹æã¨ãããããããã®ãã¹ãã³ã°å¹æã«ãããã ã¹ãã³ã°ãããé¨åã«ãã¤ãºããã£ãã¨ãã¦ãããã®ã ã¤ãºã¯èãããªããã¨ã«ãªãããã®ãããå®éã®ãªã¼ã ã£ãªä¿¡å·ã§ã¯ããã®ãã¹ãã³ã°ãããç¯å²åã®ãã¤ãºã¯ è¨±å®¹å¯è½ãªãã¤ãºã¨ããããThe above-mentioned masking is a phenomenon in which one signal is masked by another signal and becomes inaudible due to human auditory characteristics. The masking effect is due to the time domain audio signal. There are an axial masking effect and a simultaneous time masking effect by a signal in the frequency domain. Due to these masking effects, even if there is noise in the masked portion, this noise will not be heard. For this reason, in an actual audio signal, noise within the masked range is regarded as acceptable noise.

ãï¼ï¼ï¼ï¼ãããã§ãä¸è¨ç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ã® åä¹ç®å¨ã®ä¹ç®ä¿æ°ï¼ãã£ã«ã¿ä¿æ°ï¼ã®ä¸å·ä½ä¾ãç¤ºã ã¨ãä»»æã®ãã³ãã«å¯¾å¿ããä¹ç®å¨ï¼ã®ä¿æ°ãï¼ã¨ãã ã¨ããä¹ç®å¨ï¼âï¼ã§ä¿æ°ï¼ï¼ï¼ï¼ããä¹ç®å¨ï¼âï¼ã§ ä¿æ°ï¼ï¼ï¼ï¼ï¼ï¼ããä¹ç®å¨ï¼âï¼ã§ä¿æ°ï¼ï¼ï¼ï¼ï¼ï¼ ï¼ï¼ï¼ããä¹ç®å¨ï¼ï¼ï¼ã§ä¿æ°ï¼ï¼ï¼ããä¹ç®å¨ï¼ï¼ï¼ ã§ä¿æ°ï¼ï¼ï¼ï¼ããä¹ç®å¨ï¼ï¼ï¼ã§ä¿æ°ï¼ï¼ï¼ï¼ï¼ãå éå»¶ç´ åã®åºåã«ä¹ç®ãããã¨ã«ãããä¸è¨ãã¼ã¯ã¹ã ã¯ãã«ã®ç³è¾¼ã¿å¦çãè¡ãããããã ããï¼ã¯ï¼ä¹è³ï¼ ï¼ã®ä»»æã®æ´æ°ã§ãããHere, a specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 523 will be described. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M-1 gives a coefficient of 0.15, multiplier M-2 gives a coefficient of 0.0019, and multiplier M-3 gives a coefficient of 0.0000. 086, multiplier M + 1 gives a coefficient of 0.4, multiplier M + 2 By multiplying the output of each delay element by a coefficient of 0.06 with a coefficient of 0.007 by a multiplier M + 3, the convolution processing of the Bark spectrum is performed. However, M is 1 to 2 It is an arbitrary integer of 5.

ãï¼ï¼ï¼ï¼ãæ¬¡ã«ãä¸è¨ç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ã®åº åã¯å¼ç®å¨ï¼ï¼ï¼ã«ä¾çµ¦ããããå¼ç®å¨ï¼ï¼ï¼ã¯ãä¸è¨ ç³è¾¼ãã é åã§ã®å¾è¿°ããè¨±å®¹å¯è½ãªãã¤ãºã¬ãã«ï¼è¨± å®¹ãã¤ãºã¬ãã«ï¼ã«å¯¾å¿ããã¬ãã«Î±ãæ±ãããã®ã§ã ãããªãããã®è¨±å®¹å¯è½ãªãã¤ãºã¬ãã«ã«å¯¾å¿ããã¬ã ã«Î±ã¯ãå¾è¿°ããããã«ãéã³ã³ããªã¥ã¼ã·ã§ã³å¦çã è¡ããã¨ã«ãã£ã¦ãã¯ãªãã£ã«ã«ãã³ãã®åãã³ãæ¯ã® è¨±å®¹ãã¤ãºã¬ãã«ã¨ãªããããªã¬ãã«ã§ãããããã§ã å¼ç®å¨ï¼ï¼ï¼ã«ã¯ãã¬ãã«Î±ãæ±ããããã®è¨±å®¹é¢æ° ï¼ãã¹ãã³ã°ã¬ãã«ãè¡¨ç¾ããé¢æ°ï¼ãä¾çµ¦ããããã ã®è¨±å®¹é¢æ°ãå¢æ¸ããããã¨ã§ã¬ãã«Î±ã®å¶å¾¡ãè¡ã£ã¦ ãããè¨±å®¹é¢æ°ã¯ãæ¬¡ã«èª¬æãããããªï¼ï½âï½ï½ï¼é¢ æ°çºçåè·¯ï¼ï¼ï¼ããä¾çµ¦ããã¦ãããã®ã§ãããNext, the output of the convolution filter circuit 523 is supplied to the subtractor 524. The subtractor 524 calculates a level Î± corresponding to an allowable noise level (allowable noise level) described later in the convoluted area. It should be noted that the level Î± corresponding to this allowable noise level is a level that becomes an allowable noise level for each band of the critical band by performing inverse convolution processing, as described later. here, The subtractor 524 is supplied with an allowance function (function expressing a masking level) for obtaining the level Î±. The level Î± is controlled by increasing or decreasing this allowance function. The allowance function is supplied from the (n-ai) function generating circuit 525 as described below.

ãï¼ï¼ï¼ï¼ãããªãã¡ãè¨±å®¹ãã¤ãºã¬ãã«ã«å¯¾å¿ããã¬ ãã«Î±ã¯ãã¯ãªãã£ã«ã«ãã³ãã®ãã³ãã®ä½åããé ã« ä¸ããããçªå·ãï½ã¨ããã¨ãæ¬¡ã®å¼ã§æ±ãããã¨ãã§ ãããThat is, the level Î± corresponding to the allowable noise level can be obtained by the following equation, where i is the number given in order from the low band of the critical band.

ãï¼ï¼ï¼ï¼ãÎ±ï¼ï¼³âï¼ï½âï½ï½ï¼Î = S- (n-ai)

ãï¼ï¼ï¼ï¼ããã®å¼ã«ããã¦ãï½ï¼ï½ã¯å®æ°ã§ãããï½ ï¼ï¼ãï¼³ã¯ç³è¾¼ã¿å¦çããããã¼ã¯ã¹ãã¯ãã«ã®å¼·åº¦ã§ ãããå¼ä¸ï¼ï½âï½ï½ï¼ãè¨±å®¹é¢æ°ã¨ãªããä¾ã¨ãã¦ï½ ï¼ï¼ï¼ãï½ï¼âï¼ï¼ï¼ãç¨ãããã¨ãã§ãããIn this equation, n and a are constants, and a > 0 and S are the intensities of the bark spectrum subjected to the convolution processing, and (n-ai) in the formula is the tolerance function. N as an example = 38, a = -0.5 can be used.

ãï¼ï¼ï¼ï¼ããã®ããã«ãã¦ãä¸è¨ã¬ãã«Î±ãæ±ãã ãããã®ãã¼ã¿ã¯ãå²ç®å¨ï¼ï¼ï¼ã«ä¾çµ¦ããããå²ç®å¨ ï¼ï¼ï¼ã¯ãç³è¾¼ã¿ãããé åã§ã®ã¬ãã«Î±ãéã³ã³ããª ã¥ã¼ã·ã§ã³ããããã®ãã®ã§ããããããã£ã¦ããã®é ã³ã³ããªã¥ã¼ã·ã§ã³å¦çãè¡ããã¨ã«ãããã¬ãã«Î±ã ããã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ããå¾ãããããã«ãªãã ããªãã¡ããã®ãã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ããè¨±å®¹ãã¤ ãºã¹ãã¯ãã«ã¨ãªãããªããéã³ã³ããªã¥ã¼ã·ã§ã³å¦ç ã¯ãè¤éãªæ¼ç®ãå¿è¦ã¨ããããæ¬å®æ½ä¾ã§ã¯ç°¡ç¥åã ãå²ç®å¨ï¼ï¼ï¼ãç¨ãã¦éã³ã³ããªã¥ã¼ã·ã§ã³ãè¡ã£ã¦ ãããIn this way, the level Î± is obtained, and this data is supplied to the divider 526. The divider 526 is for deconvoluting the level Î± in the convolved region. Therefore, by performing this inverse convolution processing, the masking threshold can be obtained from the level Î±. That is, this masking threshold becomes the allowable noise spectrum. Although the inverse convolution processing requires a complicated calculation, the inverse convolution is performed using the simplified divider 526 in this embodiment.

ãï¼ï¼ï¼ï¼ãæ¬¡ã«ãä¸è¨ãã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ã ã¯ãåæåè·¯ï¼ï¼ï¼ãä»ãã¦æ¸ç®å¨ï¼ï¼ï¼ã«ä¾çµ¦ãã ããããã§ãæ¸ç®å¨ï¼ï¼ï¼ã«ã¯ãä¸è¨å¸¯åæ¯ã®ã¨ãã«ã® æ¤åºåè·¯ï¼ï¼ï¼ããã®åºåãããªãã¡åè¿°ãããã¼ã¯ã¹ ãã¯ãã«ï¼ï¼³ï¼¢ï¼ããéå»¶åè·¯ï¼ï¼ï¼ãä»ãã¦ä¾çµ¦ãã ã¦ããããããã£ã¦ããã®æ¸ç®å¨ï¼ï¼ï¼ã§ä¸è¨ãã¹ãã³ ã°ã¹ã¬ãã·ã§ã¼ã«ãã¨ãã¼ã¯ã¹ãã¯ãã«ã¨ã®æ¸ç®æ¼ç®ã è¡ããããã¨ã§ãå³ï¼ã«ç¤ºãããã«ãä¸è¨ãã¼ã¯ã¹ãã¯ ãã«ã¯ããã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ãï¼ï¼ï¼³ï¼ã®ã¬ãã« ã§ç¤ºãã¬ãã«ä»¥ä¸ããã¹ãã³ã°ããããã¨ã«ãªãããª ããéå»¶åè·¯ï¼ï¼ï¼ã¯åæåè·¯ï¼ï¼ï¼ä»¥åã®ååè·¯ã§ã® éå»¶éãèæ®ãã¦ãã¨ãã«ã®ç®åºåè·¯ï¼ï¼ï¼ããã®ãã¼ ã¯ã¹ãã¯ãã«ãéå»¶ãããããã«è¨ãããã¦ãããNext, the masking threshold is supplied to the subtractor 528 via the synthesizing circuit 527. Here, the output from the energy detection circuit 522 for each band, that is, the above-described Bark spectrum (SB) is supplied to the subtractor 528 via the delay circuit 529. Therefore, the subtractor 528 performs a subtraction operation on the masking threshold and the Bark spectrum, so that the Bark spectrum is equal to or lower than the level indicated by the masking threshold (MS) level, as shown in FIG. Will be masked. The delay circuit 529 is provided to delay the Bark spectrum from the energy calculation circuit 522 in consideration of the delay amount in each circuit before the combining circuit 527.

ãï¼ï¼ï¼ï¼ãæ¸ç®å¨ï¼ï¼ï¼ããã®åºåã¯ãè¨±å®¹éé³è£æ£ åè·¯ï¼ï¼ï¼ãä»ããããã«åºåç«¯åï¼ï¼ï¼ãä»ãã¦åã åºãããä¾ãã°éåãããæ°æå ±ãäºãè¨æ¶ãããï¼²ï¼¯ ï¼çï¼å³ç¤ºããï¼ã«ä¾çµ¦ãããããã®ï¼²ï¼¯ï¼çã¯ãä¸è¨ æ¸ç®åè·¯ï¼ï¼ï¼ããè¨±å®¹éé³è£æ£åè·¯ï¼ï¼ï¼ãä»ãã¦å¾ ãããåºåï¼ä¸è¨åãã³ãã®ã¨ãã«ã®ã¨ä¸è¨åæåè·¯ï¼ ï¼ï¼ã®åºåã¨ã®å·®åã®ã¬ãã«ï¼ã«å¿ããåãã³ãæ¯ã®é åãããæ°æå ±ãåºåãããThe output from the subtracter 528 is taken out via the allowable noise correction circuit 530 and further via the output terminal 531. For example, RO in which the distribution bit number information is stored in advance. It is supplied to M and the like (not shown). This ROM or the like outputs the energy obtained from the subtraction circuit 528 through the allowable noise correction circuit 530 (the energy of each band and the synthesis circuit 5). The distribution bit number information for each band is output according to the level of the difference from the output of 27).

ãï¼ï¼ï¼ï¼ããã®ããã«ãã¦æ±ããããéåãããæ°æ å ±ãå³ï¼ã«ç¤ºããåéååå¨ï¼ï¼ï¼ã«ä¾çµ¦ããããã¨ ã§ãåéååå¨ï¼ï¼ï¼ã«ããã¦ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ ï¼¬ï¼ï¼ï¼ï¼ï¼ãããã³ï¼ï¼ï¼ï¼¨ããã®å¨æ³¢æ°é åã®åã¹ ãã¯ãã«ãã¼ã¿ããããããã®ãã³ãæ¯ã«å²ãå½ã¦ãã ããããæ°ã§åéååãããããã§ãããThe distribution bit number information thus obtained is supplied to the requantizer 406 shown in FIG. 5, so that the MDCT circuit 404 in the requantizer 406. Each spectrum data in the frequency domain from L, 404M, and 404H is requantized with the number of bits assigned to each band.

ãï¼ï¼ï¼ï¼ãããªãã¡ãè¦ç´ããã°ãåéååå¨ï¼ï¼ï¼ ã§ã¯ãä¸è¨ã¯ãªãã£ã«ã«ãã³ãã®åãã³ãå¸¯åï¼ã¯ãªã ã£ã«ã«ãã³ãï¼æ¯ãããã¯é«åã«ããã¦ã¯ãã¯ãªãã£ã« ã«ãã³ããæ´ã«è¤æ°å¸¯åã«åå²ããå¸¯åã®ã¨ãã«ã®ãã ãã¯ãã¼ã¯å¤ã¨ãåæåè·¯ï¼ï¼ï¼ã®åºåã¨ã®å·®åã®ã¬ã ã«ã«å¿ãã¦éåããããããæ°ã§ãä¸è¨åãã³ãæ¯ã®ã¹ ãã¯ãã«ãã¼ã¿ãéååãããã¨ã«ãªããThat is, in summary, the requantizer 406 Then, in each band band (critical band) of the critical band or in the high band, depending on the level of the difference between the energy or peak value of the band obtained by further dividing the critical band into a plurality of bands and the output of the synthesizing circuit 527. The spectrum data for each band is quantized with the allocated number of bits.

ãï¼ï¼ï¼ï¼ãã¨ããã§ãä¸è¿°ããåæåè·¯ï¼ï¼ï¼ã§ã®å æã®éã«ã¯ãæå°å¯è´ã«ã¼ãçºçåè·¯ï¼ï¼ï¼ããä¾çµ¦ã ããå³ï¼ã«ç¤ºããããªäººéã®è´è¦ç¹æ§ã§ããããããæ å°å¯è´ã«ã¼ãï¼ï¼²ï¼£ï¼ãç¤ºããã¼ã¿ã¨ãä¸è¨ãã¹ãã³ã° ã¹ã¬ãã·ã§ã¼ã«ãï¼ï¼ï¼³ï¼ã¨ãåæãããã¨ãã§ããã ãã®æå°å¯è´ã«ã¼ãã«ããã¦ãéé³çµ¶å¯¾ã¬ãã«ãæå°å¯ è´ã«ã¼ãä»¥ä¸ãªãã°éé³ã¯èãããªããã¨ã«ãªããBy the way, at the time of synthesizing by the above-mentioned synthesizing circuit 527, data indicating a so-called minimum audible curve (RC) which is the human auditory characteristic as shown in FIG. 8 supplied from the minimum audible curve generating circuit 532. And the masking threshold (MS) can be combined. In this minimum audible curve, if the absolute noise level is below the minimum audible curve, no noise will be heard.

ãï¼ï¼ï¼ï¼ãæå°å¯è´ã«ã¼ãã¯ãã³ã¼ãã£ã³ã°ï¼ç¬¦å·å æ¹æ³ï¼ãåãã§ãã£ã¦ãä¾ãã°åçæã®åçããªã¥ã¼ã ã®éãã§ç°ãªããã®ã¨ãªãããç¾å®çãªãã£ã¸ã¿ã«ã·ã¹ ãã ã§ã¯ãä¾ãã°ï¼ï¼ããããã¤ãããã¯ã¬ã³ã¸ã¸ã®é³ æ¥½ã®ã¯ããæ¹ã«ã¯ãã»ã©éãããªãã®ã§ãä¾ãã°ï¼ï½ï¼¨ ï½ä»è¿ã®æãè³ã«èãããããå¨æ³¢æ°å¸¯åã®éååéé³ ãèãããªãã¨ããã°ãä»ã®å¨æ³¢æ°å¸¯åã§ã¯ãã®æå°å¯ è´ã«ã¼ãã®ã¬ãã«ä»¥ä¸ã®éååéé³ã¯èãããªãã¨èã ããããEven if the coding (coding method) is the same, the minimum audible curve is different due to the difference in the reproduction volume at the time of reproduction. However, in a realistic digital system, for example, music to a 16-bit dynamic range is generated. Since there is not much difference in how to go, for example 4kH If the quantization noise in the most audible frequency band around z is not heard, it is considered that the quantization noise below the level of this minimum audible curve is not heard in other frequency bands.

ãï¼ï¼ï¼ï¼ããããã£ã¦ããã®ããã«ä¾ãã°ã·ã¹ãã ã® æã¤ãã¤ãããã¯ã¬ã³ã¸ã®ï¼ï½ï¼¨ï½ä»è¿ã®éé³ãèãã ãªãä½¿ãæ¹ãããã¨ä»®å®ãããã®æå°å¯è´ã«ã¼ãï¼ï¼² ï¼£ï¼ã¨ãã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ãï¼ï¼ï¼³ï¼ã¨ãå±ã«å æãããã¨ã§è¨±å®¹ãã¤ãºã¬ãã«ãå¾ãããã«ããã¨ãã ã®å ´åã®è¨±å®¹ãã¤ãºã¬ãã«ã¯ãå³ï¼ä¸ã®æç·ã§ç¤ºãé¨å ã¾ã§ã¨ãããã¨ãã§ããããã«ãªãããªããæ¬å®æ½ä¾ã§ ã¯ãä¸è¨æå°å¯è´ã«ã¼ãã®ï¼ï½ï¼¨ï½ã®ã¬ãã«ããä¾ãã° ï¼ï¼ãããç¸å½ã®æä½ã¬ãã«ã«åããã¦ãããã¾ããå³ ï¼ã«ã¯ãä¿¡å·ã¹ãã¯ãã«ï¼ï¼³ï¼³ï¼ãåæã«ç¤ºããã¦ã ããTherefore, assuming that the system is used in such a manner that noise near the dynamic range of the system of 4 kHz cannot be heard, the minimum audible curve (R If the allowable noise level is obtained by synthesizing C) and the masking threshold (MS) together, the allowable noise level in this case can be up to the shaded portion in FIG. Become. In this embodiment, the level of 4 kHz of the minimum audible curve is set to the minimum level equivalent to 20 bits, for example. Further, FIG. 8 also shows the signal spectrum (SS) at the same time.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¨è¨±å®¹éé³è£æ£åè·¯ï¼ï¼ï¼ã§ã¯ã è£æ£æå ±åºååè·¯ï¼ï¼ï¼ããä¾çµ¦ãããä¾ãã°çã©ã¦ã ãã¹ã«ã¼ãã®æå ±ã«åºã¥ãã¦ãä¸è¨æ¸ç®å¨ï¼ï¼ï¼ããã® åºåã«ãããè¨±å®¹éé³ã¬ãã«ãè£æ£ãã¦ãããããã§ã çã©ã¦ããã¹ã«ã¼ãã¨ã¯ãäººéã®è´è¦ç¹æ§ã«é¢ããç¹æ§ æ²ç·ã§ãããä¾ãã°ï¼ï½ï¼¨ï½ã®ç´é³ã¨åãå¤§ããã«èã ããåå¨æ³¢æ°ã§ã®é³ã®é³å§ãæ±ãã¦æ²ç·ã§çµãã ãã® ã§ãã©ã¦ããã¹ã®çæåº¦æ²ç·ï¼çã©ã¦ããã¹æ²ç·ï¼ã¨ã å¼ã°ãããã¾ããã®çã©ã¦ããã¹æ²ç·ã¯ãå³ï¼ã«ç¤ºãã æå°å¯è´ã«ã¼ãï¼ï¼²ï¼£ï¼ã¨ç¥åãæ²ç·ãæããã®ã§ã ããIn the allowable noise correction circuit 530, The allowable noise level in the output from the subtractor 528 is corrected based on the information on the equal loudness curve supplied from the correction information output circuit 533, for example. here, The equal loudness curve is a characteristic curve relating to human auditory characteristics, and is obtained by, for example, obtaining the sound pressure of sound at each frequency heard at the same loudness as a pure tone of 1 kHz and connecting the curves, and the equal sensitivity curve of loudness ( Isoloudness curve) is also called. Further, this equal loudness curve draws a curve substantially the same as the minimum audible curve (RC) shown in FIG.

ãï¼ï¼ï¼ï¼ããã®çã©ã¦ããã¹æ²ç·ã«ããã¦ã¯ãä¾ãã° ï¼ï½ï¼¨ï½ä»è¿ã§ã¯ï¼ï½ï¼¨ï½ã®ã¨ããããé³å§ãï¼ä¹è³ï¼ ï¼ï½ï¼¢ä¸ãã£ã¦ãï¼ï½ï¼¨ï½ã¨åãå¤§ããã«èãããé ã«ãï¼ï¼ï¼¨ï½ä»è¿ã§ã¯ï¼ï½ï¼¨ï½ã§ã®é³å§ãããç´ï¼ï¼ï½ ï¼¢é«ããªãã¨åãå¤§ããã«èãããªãããã®ãããæå° å¯è´ã«ã¼ãã®ã¬ãã«ãè¶ããéé³ï¼è¨±å®¹ãã¤ãºã¬ãã«ï¼ ã¯ãçã©ã¦ããã¹æ²ç·ã«å¿ããã«ã¼ãã§ä¸ããããå¨æ³¢ æ°ç¹æ§ãæã¤ããã«ããã®ãè¯ããã¨ããããããã®ã ããªãã¨ãããä¸è¨çã©ã¦ããã¹æ²ç·ãèæ®ãã¦ä¸è¨è¨± å®¹ãã¤ãºã¬ãã«ãè£æ£ãããã¨ã¯ãäººéã®è´è¦ç¹æ§ã«é© åãã¦ãããã¨ãããããIn this equal loudness curve, for example, in the vicinity of 4 kHz, the sound pressure is 8 to 1 at 1 kHz. Even if it drops by 0 dB, it sounds the same as 1 kHz. Conversely, the sound pressure around 50 Hz is about 15 dB lower than the sound pressure at 1 kHz. If it is not high, it will not sound the same size. For this reason, noise that exceeds the level of the minimum audible curve (allowable noise level) It is understood that it is better to have a frequency characteristic given by a curve corresponding to the equal loudness curve. From this, it can be seen that correcting the allowable noise level in consideration of the equal loudness curve is suitable for human auditory characteristics.

ãï¼ï¼ï¼ï¼ãæ¬¡ã«ãå³ï¼ã«ãå³ï¼ã®ç¬¦å·å¨ï¼ï¼ï¼ï½ã«å¯¾ å¿ããå³ï¼ã®å¾©å·å¨ï¼ï¼ï¼ï½ã®å·ä½çãªæ§æä¾ãç¤ºãã ãªããå¾©å·å¨ï¼ï¼ï¼ï½ä¹è³ï¼ï¼ï¼ï½ã®æ§æããã³åä½ ã¯ãå¾©å·å¨ï¼ï¼ï¼ï½ã®å ´åã¨åºæ¬çã«åæ§ã§ããã®ã§ã ããã§ã¯ãã®èª¬æã¯çç¥ãããNext, FIG. 9 shows a concrete configuration example of the decoder 133a of FIG. 3 corresponding to the encoder 105a of FIG. Since the configurations and operations of the decoders 133b to 133e are basically the same as those of the decoder 133a, Here, the description is omitted.

ãï¼ï¼ï¼ï¼ãããªãã¡ãå³ï¼ã«ç¤ºããå¾©å·å¨ã¯ãä¾ã ã°ãæ ç»ãã£ã«ã ãããç£æ°ããããåå¦ããããªã©ã« ãã£ã¦èªã¿åã£ãåãã£ãã«ã®ãã¡ã®ï¼ãã£ãã«åã®ç¬¦ å·åãããä¿¡å·ãå¾©å·åãããã®ã§ãããThat is, the decoder shown in FIG. 9 is, for example, for decoding a coded signal for one channel out of each channel read from a motion picture film by a magnetic head or an optical head. .

ãï¼ï¼ï¼ï¼ãå³ï¼ã«ããã¦ãå¥åç«¯åï¼ï¼ï¼ã«ã¯å³ï¼ã« ç¤ºããããã«ããã¬ã¯ãµï¼ï¼ï¼ããã®ç¬¦å·åããã¦ãã ãã¼ã¿ãä¾çµ¦ããããããããã©ã¼ããã¿ï¼ï¼ï¼ã«ä¾çµ¦ ããããããã©ã¼ããã¿ï¼ï¼ï¼ã§ã¯ãå³ï¼ã«ç¤ºãããã© ã¼ããã¿ï¼ï¼ï¼ã«ããã¦å®è¡ãããå¦çã¨ã¯éã®å¦çã è¡ãããåã¦ãããæ¯ã®åãã©ã¡ã¼ã¿æå ±ã¨åéååã ããã¹ãã¯ãã©ã ä¿¡å·ï¼ããªãã¡éååãããï¼ï¼¤ï¼£ï¼´ ä¿æ°ï¼ãå¾ããããIn FIG. 9, the encoded data from the demultiplexer 132 shown in FIG. 3 is supplied to the input terminal 601, and this is supplied to the deformatter 602. The deformatter 602 performs the reverse process of the process executed by the formatter 407 shown in FIG. 5, and each parameter information of each unit and the requantized spectrum signal (that is, the quantized MDCT). Coefficient) is obtained.

ãï¼ï¼ï¼ï¼ãä¸è¨ããã©ã¼ããã¿ï¼ï¼ï¼ããã®åã¦ãã ãæ¯ã®éååãããï¼ï¼¤ï¼£ï¼´ä¿æ°ã¯ãä½åç¨ã®å¾©å·åå è·¯ï¼ï¼ï¼ï¼¬ãä¸åç¨ã®å¾©å·ååè·¯ï¼ï¼ï¼ï¼ãããã³é«å ç¨ã®å¾©å·ååè·¯ï¼ï¼ï¼ï¼¨ã«ããããä¾çµ¦ããããã¾ãã ãããå¾©å·ååè·¯ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ãããã³ï¼ï¼ï¼ï¼¨ ã«ã¯ãããã©ã¼ããã¿ï¼ï¼ï¼ãããã©ã¡ã¼ã¿æå ±ãä¸ã ããããåå¾©å·ååè·¯ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ãããã³ï¼ï¼ ï¼ï¼¨ã¯ããã®ãã©ã¡ã¼ã¿æå ±ãç¨ãã¦ãããéåãè§£é¤ ããã¨å±ã«å¾©å·åãè¡ããThe quantized MDCT coefficients for each unit from the deformatter 602 are respectively supplied to the low frequency decoding circuit 603L, the middle frequency decoding circuit 603M, and the high frequency decoding circuit 603H. Supplied. Also, These decoding circuits 603L, 603M, and 603H Is also given parameter information from the deformatter 602. Each decoding circuit 603L, 603M, and 60 3H uses this parameter information to cancel bit allocation and perform decoding.

ãï¼ï¼ï¼ï¼ããããå¾©å·ååè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã® åºåã¯ãããããå¯¾å¿ããï¼©ï¼ï¼¤ï¼£ï¼´ï¼éï¼ï¼¤ï¼£ï¼´ï¼å è·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã«ä¾çµ¦ããããã¾ããåï¼©ï¼ï¼¤ ï¼£ï¼´åè·¯ï¼ï¼ï¼ï¼¬ä¹è³ï¼ï¼ï¼ï¼¨ã«ãä¸è¨ãã©ã¡ã¼ã¿æå ± ãä¾çµ¦ãããããã§ã¯å¨æ³¢æ°é åã®ä¿¡å·ãæéé åã®ä¿¡ å·ã«å¤æãããããããã®é¨åå¸¯åã®æéé åä¿¡å·ã¯ã å¸¯ååæåè·¯ï¼ï¼ï¼ã«ãããå¨å¸¯åä¿¡å·ã«å¾©å·åããã åºåç«¯åï¼ï¼ï¼ããåºåããããThe outputs of these decoding circuits 603L to 603H are supplied to the corresponding IMDCT (inverse MDCT) circuits 604L to 604H. Also, each IMD The parameter information is also supplied to the CT circuits 604L to 604H, where the frequency domain signal is converted into the time domain signal. The time domain signals of these subbands are The band synthesis circuit 605 decodes into a full band signal, It is output from the output terminal 606.

ãï¼ï¼ï¼ï¼ãå³ï¼ã«ç¤ºãããããªç¬¦å·åè£ç½®ã«ãã£ã¦ç¬¦ å·åããããã¼ã¿ã¯ãï¼¤ï¼¶ï¼¤ï¼ãã£ã¸ã¿ã«ãããªãã£ã¹ ã¯ï¼Digital Video Discï¼ãï¼£ï¼¤âï¼²ï¼¯ï¼ï¼ã³ã³ãã¯ã ãã£ã¹ã¯ï¼Compact Disc Read Only Memoryï¼ãããã ã¯ï¼ï¼¤ï¼ãããã£ã¹ã¯ï¼MiniDiscï¼ãªã©ã®è¨é²åªä½ã«è¨ é²ãããã¨ãå¯è½ã§ãããå¾ã£ã¦ãç¬¦å·åè£ç½®ã«ããé« è½çç¬¦å·åããããã¼ã¿ããï¼¤ï¼¶ï¼¤çã®è¨é²åªä½ã«è¨é² ããããããããªãã£ã¹ã¯ãã¬ã¼ã¤ãªã©ã®å¾©å·åè£ç½®ã« ãã£ã¦åçãããã¨ãå¯è½ã¨ãªããThe data encoded by the encoding device as shown in FIG. 1 is a DVD (Digital Video Disc), a CD-ROM (Compact Disc: Read Only Memory), or an MD (Mini-disc). It is possible to record on a recording medium such as a disc: MiniDisc. Therefore, it becomes possible to record the data that has been encoded with high efficiency by the encoding device in a recording medium such as a DVD and reproduce it by a decoding device such as a video disc player.

ãï¼ï¼ï¼ï¼ãä»¥ä¸ã®ããã«ãè¤æ°ãã£ãã«ã®ãã£ã¸ã¿ã« ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ãæ··åå¦çããï¼ã¤ã¾ãã¯è¤æ°ã ã£ãã«ã®æ··åå¦çãã¼ã¿ããåç¾ç¨ãã¼ã¿ã¨ãã¦ä½¿ç¨ã ããã¨ã«ãããå§ç¸®çãé«ãããã¨ãã§ãããã¾ãããª ã¼ãã£ãªä¿¡å·ã®ç¹æ§ãåçç°å¢ã«å¯¾å¿ãã¦ãåç¾ç¨ãã¼ ã¿ã¨ãã¦ä½¿ç¨ããæ··åå¦çãã¼ã¿ã®å±ãããã£ãã«ãã ãã®ä½¿ç¨å²åããã¬ã¼ã åä½ã§å¤æ´ãããã¨ã«ãããå ã®ãªã¼ãã£ãªä¿¡å·ãåç¾ããå ´åã«ãããé³å ´ã®å¤åã æå¶ãããã¨ãã§ãããAs described above, the compression rate can be increased by using the mixed processed data of one or a plurality of channels, which is obtained by mixing a part or all of the digital signals of a plurality of channels, as the reproduction data. In addition, depending on the characteristics of the audio signal and the playback environment, the channel to which the mixed processing data used as the reproduction data belongs, By changing the usage rate in units of frames, it is possible to suppress changes in the sound field when the original audio signal is reproduced.

ãï¼ï¼ï¼ï¼ãã¾ãããã¬ã¼ã ã®åã¾ãã¯åå¾ä¸¡æ¹ã®ãã¬ ã¼ã ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ç¹æ§ã«å¯¾å¿ãã¦ãåç¾ç¨ãã¼ã¿ ã¨ãã¦ä½¿ç¨ããæ··åå¦çãã¼ã¿ã®å±ãããã£ãã«ããã® ä½¿ç¨å²åãå¤æ´ããããåã®ãã£ã¸ã¿ã«ä¿¡å·ãè¨é²ããª ãå¨æ³¢æ°å¸¯åãå¤æ´ãããã¨ã«ãããå¦çæ¹æ³ã®æ¥æ¿ãª å¤åã«ãã£ã¦ä¸å®å®ãªåçé³å ´ã¨ãªããã¨ãåé¿ããã ã¨ãã§ãããFurther, in accordance with the characteristics of the digital signals of the frames both before and after the frame, the channel to which the mixed processing data used as the reproduction data belongs and the ratio of use thereof are changed, or the original digital signal is recorded. By changing the frequency band that is not used, it is possible to avoid an unstable reproduced sound field due to a sudden change in the processing method.

ãï¼ï¼ï¼ï¼ãããã«ã¾ããè¤æ°ãã£ãã«ã®ãã£ã¸ã¿ã«ä¿¡ å·ã®ä¸é¨ã¾ãã¯å¨é¨ãæ··åå¦çããï¼ã¤ã¾ãã¯è¤æ°ãã£ ãã«ã®æ··åå¦çãã¼ã¿ã®ä¸é¨ã¾ãã¯å¨é¨ãä½¿ç¨ãã¦ãå ã®ãã£ã¸ã¿ã«ä¿¡å·ã®åç¾æ¹æ³ã®ãã¬ã¼ã åä½ã§ã®å¤æ´ã« å¯¾å¿ãããã¨ã«ãããåç¾æã®é³å ´ã®å¤åãæå¶ããã ã¨ãã§ãããFurthermore, by using a part or all of the mixed processing data of one or a plurality of channels obtained by mixing a part or all of the digital signals of a plurality of channels, in a frame unit of the method of reproducing the original digital signal. By responding to the change of, it is possible to suppress the change of the sound field at the time of reproduction.

ãï¼ï¼ï¼ï¼ããªããä¸è¿°ããæ¬çºæã®ç¬¦å·åæ¹æ³ããã³ å¾©å·åæ¹æ³ã¯ãå®æ½ä¾ã«ç¨ããããããï¼¡ï¼´ï¼²ï¼¡ï¼£æ¹å¼ ã ãã§ãªãããã®ä»ã®ç¬¦å·åæ¹å¼ã«ããã¦ãé©ç¨å¯è½ã§ ãããç¹ã«ç´äº¤å¤æã«ããå¨æ³¢æ°æå ±ã«å¤æããç¬¦å·å æ¹å¼ã«ããã¦å¹æãé«ããThe above-described coding method and decoding method of the present invention can be applied not only to the so-called ATRAC method used in the embodiments but also to other coding methods. In particular, the effect is high in a coding method in which frequency information is converted by orthogonal conversion.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¨åå®æ½ä¾ã«ããã¦ã¯ãè¤æ°ãã£ ãã«ã®ãªã¼ãã£ãªãã¼ã¿ãç¬¦å·åã¾ãã¯å¾©å·åããå ´å ã«ã¤ãã¦èª¬æãããããªã¼ãã£ãªãã¼ã¿ã«éå®ãããã ã®ã§ã¯ãªããFurther, although cases have been described with the above embodiments where the audio data of a plurality of channels are encoded or decoded, the invention is not limited to audio data.

ãï¼ï¼ï¼ï¼ãã¾ããä¸è¨åå®æ½ä¾ã«ããã¦ã¯ãæ··åå¦ç ãããã£ãã«æ°ãï¼ãã£ãã«ã¨ããããããã«éå®ãã ããã®ã§ã¯ãªããä»»æã®ãã£ãã«æ°ã¨ãããã¨ãå¯è½ã§ ããããã®å ´åãä¸è¨åå®æ½ä¾ã®æ§æã¯ãæ··åå¦çãã ãã£ãã«æ°ã«å¯¾å¿ãã¦å¤æ´ããããã¨ã«ãªããIn each of the above embodiments, the number of mixed channels is two, but the number is not limited to this and any number of channels can be used. In that case, the configuration of each of the above-described embodiments is changed in accordance with the number of channels subjected to the mixing process.

ãï¼ï¼ï¼ï¼ã[0124]

ãçºæã®å¹æãè«æ±é ï¼ã«è¨è¼ã®ç¬¦å·åæ¹æ³ãããã³è« æ±é ï¼ã«è¨è¼ã®ç¬¦å·åè£ç½®ã«ããã°ããã£ã¸ã¿ã«ä¿¡å·ã® ç¹æ§ããã³åçç°å¢ã«å¯¾å¿ãã¦ãå°ãªãã¨ãï¼ã¤ã®ãã£ ãã«ã®ãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ã®å¨æ³¢æ°å¸¯åã å°ãªãã¨ãï¼ã¤ã®æ··åãã£ãã«ã«æ··åããããã£ã¸ã¿ã« ä¿¡å·ãããæ··åãã£ãã«ã®æ··åãã£ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦ åç¾ããä¿¡å·ãé¤ããããåå¥ã«ç¬¦å·åããåå¥ç¬¦å·å ãã£ã¸ã¿ã«ä¿¡å·ãæ½åºãããæ··åãã£ã¸ã¿ã«ä¿¡å·ããã³ åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãç¬¦å·åãããããã«ããã® ã§ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦åã®ãã£ã¸ã¿ã«ä¿¡å· ãåç¾ãããã¨ãã§ãããã£ã¸ã¿ã«ä¿¡å·ã®å§ç¸®çãé«ã ããã¨ãã§ãããã¾ããæ··åãã£ã¸ã¿ã«ä¿¡å·ã«ãã£ã¦å¾© åããåã®ãã£ã¸ã¿ã«ä¿¡å·ã®å¨æ³¢æ°å¸¯åããã¬ã¼ã æ¯ã« å¤æ´ãããã¨ã«ãããåç¾æã®é³å ´ã®å¤åãæå¶ããã ã¨ãå¯è½ã¨ãªããAccording to the encoding method of the first aspect and the encoding apparatus of the second aspect, one of the digital signals of at least one channel is selected according to the characteristics of the digital signal and the reproduction environment. Partially or completely frequency bands are mixed into at least one mixing channel, and the individually coded individually coded digital signals are extracted from the digital signal, excluding the signal reproduced by the mixed digital signals of the mixing channels, and mixed. Since the digital signal and the individually encoded digital signal are encoded, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased. Further, by changing the frequency band of the original digital signal restored by the mixed digital signal for each frame, it is possible to suppress the change of the sound field at the time of reproduction.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åæ¹æ³ãããã³è« æ±é ï¼ï¼ã«è¨è¼ã®å¾©å·åè£ç½®ã«ããã°ãç¬¦å·åæå ±ã«åº ã¥ãã¦ãæ··åãã£ã¸ã¿ã«ä¿¡å·ãå¾©å·åãããå¾©å·åãã ãæ··åãã£ã¸ã¿ã«ä¿¡å·ã®ä¸é¨ã¾ãã¯å¨é¨ãç¨ãã¦ããã£ ã¸ã¿ã«ä¿¡å·ãå¾©åããããã®å¾©åç¨ãã¼ã¿ãä½æããã åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ã¨å¾©åç¨ãã¼ã¿ãåæããã ãã£ã¸ã¿ã«ä¿¡å·ãå¾©åãããããã«ããã®ã§ãæ··åãã£ ã¸ã¿ã«ä¿¡å·ã«åºã¥ãã¦åã®ãã£ã¸ã¿ã«ä¿¡å·ãå¾©åããã ã¨ãã§ãããã¾ããæ··åãã£ã¸ã¿ã«ä¿¡å·ãç¨ããåã®ã ã£ã¸ã¿ã«ä¿¡å·ã®å¾©åæ¹æ³ããã¬ã¼ã æ¯ã«å¤æ´ãããã¨ã« ãããå¾©åã«ããé³å ´ã®å¤åãæå¶ãããã¨ãå¯è½ã¨ãª ããAccording to the decoding method of the eleventh aspect and the decoding apparatus of the twelfth aspect, the mixed digital signal is decoded based on the encoded information, and the decoded mixed digital signal The restoration data for restoring the digital signal is created using part or all of the Individually encoded digital signal and restoration data are combined, Since the digital signal is restored, the original digital signal can be restored based on the mixed digital signal. Also, by changing the method of restoring the original digital signal using the mixed digital signal for each frame, it is possible to suppress the change in the sound field due to the restoration.

ãï¼ï¼ï¼ï¼ãè«æ±é ï¼ï¼ã«è¨è¼ã®è¨é²åªä½ã«ããã°ãæ å®ã®ç¬¦å·åæå ±ã«åºã¥ãã¦ç¬¦å·åãããæ··åãã£ã¸ã¿ã« ä¿¡å·ããã³åå¥ç¬¦å·åãã£ã¸ã¿ã«ä¿¡å·ãè¨é²ããããã ã«ããã®ã§ãé«è½çç¬¦å·åããããã£ã¸ã¿ã«ä¿¡å·ãè¨é² ãããããåçãããã¨ãã§ãããã¾ããããå®å®ãªå çé³å ´ãæä¾ãããã¨ãå¯è½ã¨ãªããAccording to the recording medium of the eighteenth aspect, since the mixed digital signal and the individually coded digital signal coded based on the predetermined coded information are recorded, the high efficiency coding is performed. The recorded digital signal can be recorded and reproduced. Further, it becomes possible to provide a more stable reproduced sound field.

ãå³é¢ã®ç°¡åãªèª¬æã[Brief description of the drawings]

ãå³ï¼ãæ¬çºæã®ç¬¦å·åè£ç½®ã®ä¸å®æ½ä¾ã®æ§æãç¤ºãã ããã¯å³ã§ãããFIG. 1 is a block diagram showing a configuration of an embodiment of an encoding device of the present invention.

ãå³ï¼ãå³ï¼ã®å¦çãã¼ã¿æ½åºå¨ã®æ§æä¾ãç¤ºãããã ã¯å³ã§ãããFIG. 2 is a block diagram showing a configuration example of a processed data extractor of FIG.

ãå³ï¼ãæ¬çºæã®å¾©å·åè£ç½®ã®ä¸å®æ½ä¾ã®æ§æãç¤ºãã ããã¯å³ã§ãããFIG. 3 is a block diagram showing the configuration of an embodiment of a decoding device of the present invention.

ãå³ï¼ãå³ï¼ã®åæãã¼ã¿ä½æå¨ã®æ§æä¾ãç¤ºãããã ã¯å³ã§ãããFIG. 4 is a block diagram showing a configuration example of a combined data generator of FIG.

ãå³ï¼ãå³ï¼ã®ç¬¦å·å¨ã®æ§æä¾ãç¤ºããããã¯å³ã§ã ãã5 is a block diagram showing a configuration example of the encoder of FIG. 1. FIG.

ãå³ï¼ãå³ï¼ã®ç¬¦å·å¨ãæ§æãããããéåå¨ã®æ§æä¾ ãç¤ºããããã¯å³ã§ããã6 is a block diagram showing a configuration example of a bit allocator that constitutes the encoder of FIG.

ãå³ï¼ããã¼ã¯ã¹ãã¯ãã«ï¼ï¼³ï¼¢ï¼ã¨ãã¹ãã³ã°ã¹ã¬ã ã·ã§ã¼ã«ãã¬ãã«ï¼ï¼ï¼³ï¼ã«ã¤ãã¦èª¬æããããã®å³ã§ ãããFIG. 7 is a diagram for explaining a Bark spectrum (SB) and a masking threshold level (MS).

ãå³ï¼ãä¿¡å·ã¬ãã«ï¼ï¼³ï¼³ï¼ãæå°å¯è´ã«ã¼ãï¼ï¼² ï¼£ï¼ãããã³ãã¹ãã³ã°ã¹ã¬ãã·ã§ã¼ã«ãï¼ï¼ï¼³ï¼ãç¤º ããå³ã§ãããFIG. 8: Signal level (SS), minimum audible curve (R It is a figure showing C) and a masking threshold (MS).

ãå³ï¼ãå³ï¼ã®å¾©å·å¨ã®æ§æä¾ãç¤ºããããã¯å³ã§ã ãã9 is a block diagram showing a configuration example of the decoder of FIG.

ãå³ï¼ï¼ãæ¬çºæãç¨ããªãå ´åã®ãã«ããã£ãã«ãªã¼ ãã£ãªç¬¦å·åè£ç½®ã®æ§æä¾ãç¤ºããããã¯å³ã§ãããFIG. 10 is a block diagram showing a configuration example of a multi-channel audio encoding device when the present invention is not used.

ãå³ï¼ï¼ãæ¬çºæãç¨ããªãå ´åã®ãã«ããã£ãã«ãªã¼ ãã£ãªã®å¾©å·åè£ç½®ã®æ§æä¾ãç¤ºããããã¯å³ã§ãããFIG. 11 is a block diagram showing a configuration example of a multi-channel audio decoding device when the present invention is not used.

ãå³ï¼ï¼ãæ¬çºæãç¨ããªãå ´åã®ï¼ãã£ãã«ãªã¼ãã£ ãªã®å¾©å·åè£ç½®ã®æ§æä¾ãç¤ºããããã¯å³ã§ãããFIG. 12 is a block diagram showing a configuration example of a 2-channel audio decoding device in the case where the present invention is not used.

ãç¬¦å·ã®èª¬æã[Explanation of symbols]

ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ æ··åå¨ ï¼ï¼ï¼ å¦çãã¼ã¿æ½åºå¨ ï¼ï¼ï¼ ç¬¦å·å¨ ï¼ï¼ï¼ ãã«ããã¬ã¯ãµ ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ ããã«ããã¬ã¯ãµ ï¼ï¼ï¼ å¾©å·å¨ ï¼ï¼ï¼ åæãã¼ã¿ä½æå¨ ï¼ï¼ï¼ åæå¨ ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ å¸¯ååå²ãã£ã«ã¿ ï¼ï¼ï¼ ãã©ã¡ã¼ã¿è¨é²ã¡ã¢ãª ï¼ï¼ï¼ å¦çãã¼ã¿åæå¨ ï¼ï¼ï¼ å¦çãã¼ã¿ä½æå¨ ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ å¸¯ååå²ãã£ã«ã¿ ï¼ï¼ï¼ ãã©ã¡ã¼ã¿è§£æå¨ ï¼ï¼ï¼ åç¾ãã¼ã¿ä½æå¨ ï¼ï¼ï¼ åç¾ãã¼ã¿åæå¨ ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ å¸¯ååå²ãã£ã«ã¿ ï¼ï¼ï¼ï¼¬ ä½åï¼ï¼¤ï¼£ï¼´åè·¯ ï¼ï¼ï¼ï¼ ä¸åï¼ï¼¤ï¼£ï¼´åè·¯ ï¼ï¼ï¼ï¼¨ é«åï¼ï¼¤ï¼£ï¼´åè·¯ ï¼ï¼ï¼ ãããã¯ãµã¤ãºè©ä¾¡å¨ ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼¨ æ£è¦ååè·¯ ï¼ï¼ï¼ ãããéåå¨ ï¼ï¼ï¼ åéååå¨ ï¼ï¼ï¼ ãã©ã¼ããã¿ ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ å¸¯åæ¯ã®ã¨ãã«ã®ç®åºåè·¯ ï¼ï¼ï¼ ç³è¾¼ã¿ãã£ã«ã¿åè·¯ ï¼ï¼ï¼ å¼ç®å¨ ï¼ï¼ï¼ ï¼ï½âï½ï½ï¼é¢æ°çºçåè·¯ ï¼ï¼ï¼ å²ç®å¨ ï¼ï¼ï¼ åæåè·¯ ï¼ï¼ï¼ æ¸ç®å¨ ï¼ï¼ï¼ éå»¶åè·¯ ï¼ï¼ï¼ è¨±å®¹éé³è£æ£åè·¯ ï¼ï¼ï¼ åºåç«¯å ï¼ï¼ï¼ æå°å¯è´ã«ã¼ãçºçåè·¯ ï¼ï¼ï¼ è£æ£æå ±åºååè·¯ ï¼ï¼ï¼ å¥åç«¯å ï¼ï¼ï¼ ããã©ã¼ããã¿ ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼¨ å¾©å·ååè·¯ ï¼ï¼ï¼ï¼¬ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼ï¼¨ ï¼©ï¼ï¼¤ï¼£ï¼´åè·¯ ï¼ï¼ï¼ å¸¯ååæåè·¯ ï¼ï¼ï¼ åºåç«¯åÂ 101 Input Terminal 102 Mixer 103 Processed Data Extractor 105 Encoder 106 Multiplexer 107 Output Terminal 131 Input Terminal 132 Demultiplexer 133 Decoder 134 Synthetic Data Generator 135 Synthesizer 136 Output Terminal 201 Input Terminal 202 Band Division Filter 203 Parameter Recording Memory 204 Processed data analyzer 205 Processed data generator 206 Output terminal 211 Input terminal 212 Band division filter 213 Parameter analyzer 214 Reproduction data generator 215 Reproduction data synthesizer 216 Output terminal 401 Band division filter 402L Low band MDCT circuit 402M Medium band MDCT Circuit 402H High-frequency MDCT circuit 403 Block size evaluator 404L, 404M, 404H Normalization circuit 405 Bit distributor 406 Requantizer 407 Formatter 424 Input terminal 425 Output terminal 521 Input terminal 522 Energy calculation circuit for each band 523 Convolution filter circuit 524 Subtractor 525 (n-ai) function generation circuit 526 Divider 527 Synthesis circuit 528 Subtractor 529 Delay circuit 530 Allowable Noise correction circuit 531 Output terminal 532 Minimum audible curve generation circuit 533 Correction information output circuit 601 Input terminal 602 Deformatter 603L, 603M, 603H Decoding circuit 604L, 604M, 604H IMDCT circuit 605 Band synthesis circuit 606 Output terminal

âââââââââââââââââââââââââââââââââââââââââââââââââââââ ããã³ããã¼ã¸ã®ç¶ã (51)Int.Cl.⁶ èå¥è¨å· åºåæ´ççªå· ï¼¦ï¼© æè¡è¡¨ç¤ºç®æ ï¼¨ï¼ï¼ï¼® 7/24 ï¼¨ï¼ï¼ï¼® 7/13 ï¼º ââââââââââââââââââââââââââââââââââââââââââââââââââç¶ ã Continued on the front page (51) Int.Cl. ⁶ Identification code Agency reference number FI Technical display location H04N 7/24 H04N 7/13 Z

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4