ãï¼ï¼ï¼ï¼ã[0001]
ãçºæã®å±ããæè¡åéãæ¬çºæã¯ããã£ã¸ã¿ã«ä¿¡å·å¦
çæ¹æ³åã³å¦çè£
ç½®ããã£ã¸ã¿ã«ä¿¡å·è¨é²æ¹æ³åã³è¨é²
è£
ç½®ãè¨é²åªä½ä¸¦ã³ã«ãã£ã¸ã¿ã«ä¿¡å·éä¿¡æ¹æ³åã³éä¿¡
è£
ç½®ã«é¢ãããBACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a digital signal processing method and processing apparatus, a digital signal recording method and recording apparatus, a recording medium, a digital signal transmitting method and a transmitting apparatus.
ãï¼ï¼ï¼ï¼ã[0002]
ã徿¥ã®æè¡ããªã¼ãã£ãªä¿¡å·ã®é«è½ç符å·åã®å¾æ¥ã®
æ¹æ³åã³è£
ç½®ã«ã¯ç¨®ã
ãããã以ä¸ã«å¾æ¥ä¾ã®ãã®äºã
ä¸ã®ä¾ã説æãããæéé åã®ãªã¼ãã£ãªä¿¡å·ãå使
鿝ã«ãããã¯åãããã®ãããã¯æ¯ã®æé軸ã®ä¿¡å·ã
卿³¢æ°è»¸ä¸ã®ä¿¡å·ã«å¤æï¼ç´äº¤å¤æï¼ãã¦è¤æ°ã®å¨æ³¢æ°
帯åã«åå²ããå叝忝ã«ç¬¦å·åãããããã¯å卿³¢æ°
帯åå岿¹æ³ã®ä¸ã¤ã§ããå¤æç¬¦å·åæ¹æ³ããããæé
é åã®ãªã¼ãã£ãªä¿¡å·ãå使鿝ã«ãããã¯åããªã
ã§ãè¤æ°ã®å¨æ³¢æ°å¸¯åã«åå²ãã¦ç¬¦å·åããéãããã¯
å卿³¢æ°å¸¯åå岿¹æ³ã®ä¸ã¤ã§ãã帯ååå²ç¬¦å·åï½ãµ
ãã»ãã³ãã»ã³ã¼ãã£ã³ã°ï¼ï¼³ï¼¢ï¼£ï¼Subband Codin
gï¼ï½æ¹æ³ããããåãä¸è¿°ã®å¸¯ååå²ç¬¦å·åã¨å¤æç¬¦
å·åã¨ãçµã¿åãããé«è½ç符å·åæ¹æ³ãããããã®å ´
åã«ã¯ãä¾ãã°ãä¸è¿°ã®å¸¯ååå²ç¬¦å·åæ¹æ³ã§å¸¯ååå²
ãè¡ã£ãå¾ããã®å叝忝ã®ä¿¡å·ãä¸è¿°ã®å¤æç¬¦å·åæ¹
æ³ã§å¨æ³¢æ°é åã®ä¿¡å·ã«ç´äº¤å¤æãããã®ç´äº¤å¤æãã
ãå叝忝ã«ç¬¦å·åãæ½ããã¨ã«ãªãã2. Description of the Related Art There are various conventional methods and apparatuses for highly efficient encoding of audio signals.
Three examples will be described. The audio signal in the time domain is divided into blocks for each unit time, the signal on the time axis for each block is converted into a signal on the frequency axis (orthogonal transform), divided into a plurality of frequency bands, and encoded for each band. There is a transform coding method which is one of the blocking frequency band division methods. Sub-band coding (SBC), which is one of the non-blocking frequency band division methods for dividing and encoding a time-domain audio signal into a plurality of frequency bands without dividing the signal into unit frequency units. : Subband Codin
g) There is a method. There is also a high-efficiency coding method that combines the above-described band division coding and transform coding. In this case, for example, after performing band division by the above-described band division encoding method, the signal for each band is orthogonally transformed into a signal in the frequency domain by the above-described transformation encoding method, and the orthogonal transformation is performed. Encoding is performed for each band.
ãï¼ï¼ï¼ï¼ãããã§ãä¸è¿°ãã帯ååå²ç¬¦å·åæ¹æ³ã«ä½¿
ç¨ããã帯ååå²ç¨ãã£ã«ã¿ã¨ãã¦ã¯ãä¾ãã°ãã¯ã¯ã
ã©ãã£ã»ãã©ã¼ã»ãã£ã«ã¿ï¼ï¼±ï¼ï¼¦ï¼Quadrature Mirro
rFilter) çã®ãã£ã«ã¿ããããããã¯1976 R.E.Crochi
ere Digital coding ofspeech in subbands Bell Sys
t.Tech. J. Vol.55, No.8 1976 ã«ãè¿°ã¹ããã¦ããã
åãICASSP 83, BOSTON Polyphase Quadrature filter
s-A new subbandcoding technique Joseph H. Rothweil
er ã«ã¯ãããªãã§ã¼ãºã»ã¯ã¯ãã©ãã£ã»ãã£ã«ã¿ï¼ï¼°
ï¼±ï¼¦ï¼ Polyphase Quadrature filter) ãªã©ã®çãã³ã
å¹
ã®ãã£ã«ã¿å岿¹æ³åã³è£
ç½®ãè¿°ã¹ããã¦ãããHere, as a filter for band division used in the above-described band division encoding method, for example, a quadrature mirror filter (QMF) is used.
rFilter), which is a 1976 RECrochi
ere Digital coding ofspeech in subbands Bell Sys
This is described in t.Tech. J. Vol.55, No.8 1976.
Also, ICASSP 83, BOSTON Polyphase Quadrature filter
sA new subbandcoding technique Joseph H. Rothweil
er has a polyphase quadrature filter (P
An equal bandwidth filter splitting method and apparatus such as QF (Polyphase Quadrature filter) is described.
ãï¼ï¼ï¼ï¼ãåãä¸è¿°ããç´äº¤å¤æã¨ãã¦ã¯ãä¾ãã°ã
å
¥åãªã¼ãã£ãªä¿¡å·ãæå®å使éï¼ãã¬ã¼ã ï¼ã§ãã
ãã¯åãããã®ãããã¯æ¯ã«é«éãã¼ãªã¨å¤æï¼ï¼¦ï¼¦
ï¼´ï¼ã颿£ã³ãµã¤ã³å¤æï¼ï¼¤ï¼£ï¼´ï¼ãã¢ãã£ãã¡ã¤ãD
ï¼£ï¼´å¤æï¼ï¼ï¼¤ï¼£ï¼´ï¼ãªã©ãè¡ããã¨ã§æé軸ã卿³¢æ°
軸ã«å¤æãããããªæ¹æ³ããããä¸è¿°ã®ï¼ï¼¤ï¼£ï¼´ã«ã¤ã
ã¦ã¯ãICASSP 1987 Subband/Transform Coding Using F
ilter Bank DesignsBased on Time Domain Aliasing Ca
ncellation J.P.Princen A.B. BradleyUniv. of Surre
y Royal Melbourne Inst.of Tech. ã«è¿°ã¹ããã¦ããã[0004] The above-mentioned orthogonal transform includes, for example,
The input audio signal is divided into blocks in a predetermined unit time (frame), and a fast Fourier transform (FF) is performed for each block.
T), Discrete Cosine Transform (DCT), Modified D
There is a method of converting a time axis into a frequency axis by performing CT conversion (MDCT) or the like. Regarding the above MDCT, ICASSP 1987 Subband / Transform Coding Using F
ilter Bank DesignsBased on Time Domain Aliasing Ca
ncellation JPPrincen AB BradleyUniv. of Surre
y Royal Melbourne Inst. of Tech.
ãï¼ï¼ï¼ï¼ãæ´ã«ã卿³¢æ°å¸¯ååå²ãããå卿³¢æ°æå
ãéååããå ´åã®å¨æ³¢æ°åå²å¹
ã¨ãã¦ã人éã®è´è¦ç¹
æ§ãèæ
®ãã帯ååå²ããããããªãã¡ãä¸è¬ã«è¨ç帯
åï¼ã¯ãªãã£ã«ã«ãã³ãï¼ã¨å¼ã°ãã¦ããé«åç¨å¸¯åå¹
ãåºããªããããªå¸¯åå¹
ã§ããªã¼ãã£ãªä¿¡å·ãè¤æ°ãã³
ãï¼ä¾ãã°ï¼ï¼ãã³ãï¼ã®å¸¯åã«åå²ãããã¨ãããã
åããã®æã®å叝忝ã®ãã¼ã¿ã符å·åããéã«ã¯ãå
å¸¯åæ¯ã«æå®ã®ãããé
åæãã¯ãå叝忝ã«é©å¿çãª
ãããé
åã«ãã符å·åãè¡ããããä¾ãã°ãä¸è¿°ã®ï¼
DCTå¦çããã¦å¾ãããï¼ï¼¤ï¼£ï¼´ä¿æ°ãã¼ã¿ãä¸è¿°ã®
ãããé
åã«ãã£ã¦ç¬¦å·åããéã«ã¯ãä¸è¿°ã®åããã
ã¯æ¯ã®ï¼ï¼¤ï¼£ï¼´å¦çã«ããå¾ãããå叝忝ã®ï¼ï¼¤ï¼£ï¼´
ä¿æ°ãã¼ã¿ã«å¯¾ãã¦ãé©å¿çãªé
åãããæ°ã§ç¬¦å·åã
è¡ããããã¨ã«ãªãã[0005] Further, as a frequency division width when quantizing each frequency component divided into frequency bands, there is band division in consideration of human auditory characteristics. That is, an audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth generally called a critical band (critical band) such that the bandwidth becomes wider as the band becomes higher.
When encoding data for each band at this time, predetermined bits are allocated to each band, or encoding is performed by adaptive bit allocation to each band. For example, M
When encoding the MDCT coefficient data obtained by the DCT processing by the above-described bit allocation, the MDCT coefficient for each band obtained by the above-described MDCT processing for each block is used.
Coding is performed on the coefficient data with an adaptive number of allocated bits.
ãï¼ï¼ï¼ï¼ãæ´ã«ãå叝忝ã®ç¬¦å·åã«éãã¦ã¯ãå帯
åæ¯ã«æ£è¦åãè¡ã£ã¦éååãè¡ããã¨ã«ãããããå¹
ççãªç¬¦å·åãå®ç¾ãããããããããã¯ããã¼ãã£ã³
ã°ï¼Block Floating: ãããã¯ã»ããã¼ãã£ã³ã°) å¦ç
ãè¡ããããå³ã¡ãä¸è¿°ã®ï¼ï¼¤ï¼£ï¼´å¦çããã¦å¾ããã
ï¼ï¼¤ï¼£ï¼´ä¿æ°ãã¼ã¿ã符å·åããéã«ã¯ãå叝忝ã«ä¸
è¿°ã®ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤çã«å¯¾å¿ããæ£è¦å
ãè¡ã£ã¦éååãè¡ããããã«ãããããå¹ççãªç¬¦å·
åãè¡ããããFurther, when encoding for each band, a so-called block floating (Block Floating) for realizing more efficient encoding is performed by normalizing and quantizing each band. Processing is performed. That is, when encoding the MDCT coefficient data obtained by the above-described MDCT processing, quantization is performed by performing normalization corresponding to the maximum value of the absolute value of the MDCT coefficient described above for each band. Thereby, more efficient encoding is performed.
ãï¼ï¼ï¼ï¼ãä¸è¿°ã®ãããé
åæ¹æ³ã¨ãã¦ã¯ã徿¥ã次
ã®ï¼æ¹æ³ãç¥ããã¦ãããIEEE Transactions of Accou
stics,Speech,and Signal Processing,vol.ASSP-25,No.
4,August 1977 ã§ã¯ãå叝忝ã®ä¿¡å·ã®å¤§ããããã¨
ã«ããããé
åãè¡ã£ã¦ãããåãICASSP 1980 The cr
itical band coder â digital encoding ofthe percep
tual requirements of the auditory system M.A. Kran
sner MITã§ã¯ãè´è¦ãã¹ãã³ã°ãå©ç¨ãããã¨ã§ãå帯
忝ã«å¿
è¦ãªä¿¡å·å¯¾é鳿¯ãå¾ã¦åºå®çãªãããé
åã
è¡ãæ¹æ³ãè¿°ã¹ããã¦ãããConventionally, the following two methods are known as the above-mentioned bit allocation method. IEEE Transactions of Accou
stics, Speech, and Signal Processing, vol.ASSP-25, No.
In 4, August 1977, bits are allocated based on the magnitude of the signal for each band. Also, ICASSP 1980 The cr
itical band coder â digital encoding of the percep
tual requirements of the auditory system MA Kran
The sner MIT describes a method of obtaining a required signal-to-noise ratio for each band and performing fixed bit allocation by using auditory masking.
ãï¼ï¼ï¼ï¼ã[0008]
ãçºæã解決ãããã¨ãã課é¡ãã¨ããã§ãä¸è¿°ããå¾
æ¥ã®é«è½ç符å·åæ¹æ³ã«ããã¦ãéååãè¡ãå叝忝
ã®ãããé
åéãç®åºããéã«ãå叝忝ã®ä¿¡å·æåã¨
æ£è¦åã®æ¡ä»¶ã«ãã£ã¦ã¯ãããç¨åº¦ä»¥ä¸ã®ãããé
åã
è¡ããªããã°ãä¿¡å·æåãå
¨ã¦ï¼ã«éååããã¦ãã¾ã
å¯è½æ§ãããããããèæ
®ããã«ãããé
åãè¡ã£ãå ´
åãå¹ççãªç¬¦å·åãè¡ããªãå ´åãçãããIn the above-described conventional high-efficiency coding method, when calculating the bit allocation amount for each band to be quantized, the signal component for each band and the condition for normalization are calculated. In some cases, if the bit allocation is not performed to a certain degree or more, the signal components may be all quantized to 0, and if the bit allocation is performed without taking this into account, efficient encoding cannot be performed. Cases arise.
ãï¼ï¼ï¼ï¼ããããç¹ã«éã¿ãæ¬çºæã¯ãå
¥åãã£ã¸ã¿
ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯åæåã«åå²ãã¦ãæéã¨å¨æ³¢
æ°ã«é¢ããè¤æ°ã®ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåãå¾ãä¸
è¿°ã®æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ã
ããã¯å
ã®ä¿¡å·æåãåºã«æ£è¦åãè¡ã£ã¦æ£è¦åãã¼ã¿
ãå¾ãä¸è¿°ã®æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«
ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåã®ç¹å¾´ã表ãéååä¿æ°ã
æ±ãããã®éååä¿æ°ãåºã«ãããé
åéãæ±ºå®ããä¸
è¿°ã®æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ä¸è¨æ£è¦
åãã¼ã¿ã¨ãããé
åéã¨ã«ãããããã¯å
ã®ä¿¡å·æå
ãéååãã¦æ
å ±å§ç¸®ããã¨å
±ã«ãä¸è¨æéã¨å¨æ³¢æ°ã«
é¢ããï¼æ¬¡å
ãããã¯æ¯ã®æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¾ã
ããã«ãããã£ã¸ã¿ã«ä¿¡å·å¦çæ¹æ³ããã£ã¸ã¿ã«ä¿¡å·å¦
çè£
ç½®ããã£ã¸ã¿ã«ä¿¡å·è¨é²æ¹æ³ããã£ã¸ã¿ã«ä¿¡å·è¨é²
è£
ç½®ãè¨é²åªä½ããã£ã¸ã¿ã«ä¿¡å·éä¿¡æ¹æ³ãåã¯ããã£
ã¸ã¿ã«ä¿¡å·éä¿¡è£
ç½®ã«ããã¦ãä½åãªãããã使ç¨ãã
ãã¨ã鲿¢ããããå¹çã®è¯ã符å·åãå®ç¾ãå¾ãéç¹
æ§ãä¿¡å·å質ã®åä¸ãå³ããã¨ãã§ããè¨é²åªä½ã«ãã
ãè¨é²å®¹éãä¼éè·¯ã«ãããä¼é容éã®æå¹å©ç¨ãè¡ã
ãã¨ã®ã§ãããã®ãææ¡ãããã¨ãããã®ã§ãããIn view of the above, the present invention divides an input digital signal into a plurality of frequency band components, obtains signal components in a plurality of two-dimensional blocks relating to time and frequency, and obtains the two-dimensional components relating to time and frequency. Normalization is performed for each block based on the signal components in the two-dimensional block to obtain normalized data, and the quantization coefficient representing the characteristic of the signal component in the two-dimensional block is calculated for each of the two-dimensional blocks with respect to time and frequency. Then, a bit allocation amount is determined based on the quantization coefficient, a signal component in the block is quantized by the normalized data and the bit allocation amount for each of the two-dimensional blocks related to time and frequency, and information is compressed. , Digital signal processing apparatus, and digital signal processing method for obtaining information compression parameters for each two-dimensional block relating to time and frequency In a signal recording method, a digital signal recording device, a recording medium, a digital signal transmission method, or a digital signal transmission device, it is possible to prevent the use of extra bits and realize more efficient encoding, and to realize the static characteristic and the like. An object of the present invention is to propose a device capable of improving signal quality and capable of effectively utilizing a recording capacity of a recording medium and a transmission capacity of a transmission path.
ãï¼ï¼ï¼ï¼ã[0010]
ã課é¡ã解決ããããã®ææ®µãæ¬çºæã«ãããã£ã¸ã¿ã«
ä¿¡å·å¦çæ¹æ³ã¯ãå
¥åãã£ã¸ã¿ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯
åæåã«åå²ãã¦ãæéã¨å¨æ³¢æ°ã«é¢ããè¤æ°ã®ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåãå¾ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡
å
ãããã¯æ¯ã«ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåãåºã«æ£è¦
åãè¡ã£ã¦æ£è¦åãã¼ã¿ãå¾ãæéã¨å¨æ³¢æ°ã«é¢ããï¼
次å
ãããã¯æ¯ã«ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåã®ç¹å¾´ã
表ãéååä¿æ°ãæ±ãããã®éååä¿æ°ãåºã«ãããé
åéãæ±ºå®ããæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯
ã«æ£è¦åãã¼ã¿ã¨ãããé
åéã¨ã«ãããããã¯å
ã®ä¿¡
å·æåãéååãã¦æ
å ±å§ç¸®ããã¨å
±ã«ãæéã¨å¨æ³¢æ°
ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã®æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¾
ãããã«ãããã£ã¸ã¿ã«ä¿¡å·å¦çæ¹æ³ã«ããã¦ãæéã¨
卿³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ãæ£è¦åãã¼ã¿ã¨ã
ããã¯å
ä¿¡å·æåã«ããããããã¯å
ä¿¡å·æåã®å°ãªã
ã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååããããã®æä½ãããå²
ãå½ã¦ãç®åºãããã®ç®åºçµæããèæ
®ãã¦ãããé
å
éã®æ±ºå®ãè¡ããã®ã§ãããA digital signal processing method according to the present invention divides an input digital signal into a plurality of frequency band components to obtain signal components in a plurality of two-dimensional blocks relating to time and frequency. For each two-dimensional block related to frequency, normalization is performed based on signal components in the two-dimensional block to obtain normalized data.
A quantization coefficient representing a characteristic of a signal component in the two-dimensional block is obtained for each two-dimensional block, a bit allocation amount is determined based on the quantization coefficient, and normalized data and bit allocation are determined for each two-dimensional block with respect to time and frequency. In a digital signal processing method in which a signal component in a block is quantized according to the amount and information is compressed, and an information compression parameter for each two-dimensional block relating to time and frequency is obtained, for each two-dimensional block relating to time and frequency, Based on the normalized data and the signal components in the block, the lowest bit allocation for quantizing at least one of the signal components in the block to a value other than 0 is calculated, and the bit allocation amount is determined in consideration of the calculation result. Is what you do.
ãï¼ï¼ï¼ï¼ããããæ¬çºæãã£ã¸ã¿ã«ä¿¡å·å¦çæ¹æ³ã«ã
ãã°ãä½åãªãããã使ç¨ãããã¨ã鲿¢ããããå¹ç
ã®è¯ã符å·åãå®ç¾ããéç¹æ§ãä¿¡å·å質ã®åä¸ãå³ã
ãã¨ãã§ãããAccording to the digital signal processing method of the present invention, use of extra bits can be prevented, more efficient coding can be realized, and static characteristics and signal quality can be improved.
ãï¼ï¼ï¼ï¼ã[0012]
ãçºæã®å®æ½ã®å½¢æ
ã以ä¸ã«ãå³é¢ãåç
§ããæ¬çºæã®
宿½ã®å½¢æ
ã«ã¤ãã¦èª¬æããããã®å®æ½ã®å½¢æ
ã§ã¯ããª
ã¼ãã£ãªï¼°ï¼£ï¼ä¿¡å·çã®å
¥åãã£ã¸ã¿ã«ä¿¡å·ãã帯åå
å²ç¬¦å·åï¼ï¼³ï¼¢ï¼£ï¼ãé©å¿å¤æç¬¦å·åï½ã¢ãããã£ãã»
ãã©ã³ã¹ãã©ã¼ã ã»ã³ã¼ãã£ã³ã°ï¼ï¼¡ï¼´ï¼£ï¼ï¼¡daptiv T
ransform Coding ï¼ï½åã³é©å¿ãããå²å½ã¦ã®åæè¡ã
ç¨ãã¦é«è½ç符å·åããããã®æè¡ã«ã¤ãã¦ãå³ï¼ä»¥é
ãåç
§ããªãã説æãããEmbodiments of the present invention will be described below with reference to the drawings. In this embodiment, an input digital signal such as an audio PCM signal is subjected to band division coding (SBC), adaptive conversion coding, and adaptive adaptive coding.
Transform coding (ATC: Adaptiv T
ransform Coding) Performs high-efficiency coding using the techniques ofï½ and adaptive bit allocation. This technique will be described with reference to FIG.
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ç¤ºãå
·ä½çãªé«è½ç符å·åã¨ã³ã³ã¼
ãã§ã¯ãå
¥åãã£ã¸ã¿ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯åã«åå²
ããã¨å
±ã«ãå卿³¢æ°å¸¯åæ¯ã«ç´äº¤å¤æãè¡ã£ã¦ãå¾ã
ãã卿³¢æ°è»¸ã®ã¹ãã¯ãã«ãã¼ã¿ããä½åã§ã¯ãå¾è¿°ã
ã人éã®è´è¦ç¹æ§ãèæ
®ããããããè¨ç帯åå¹
ï¼ã¯ãª
ãã¤ã«ã«ãã³ãï¼æ¯ã«ãä¸é«åã§ã¯ãããã¯ããã¼ãã£
ã³ã°å¹çãèæ
®ãã¦è¨ç帯åå¹
ãç´°ååãã叝忝ã«ã
é©å¿çã«ãããå²å½ãã¦ç¬¦å·åãã¦ãããé常ãã®ãã
ãã¯ãéååéé³çºçãããã¯ã¨ãªããæ´ã«ãæ¬çºæã®
宿½ã®å½¢æ
ã«ããã¦ã¯ãç´äº¤å¤æã®åã«å
¥åãã£ã¸ã¿ã«
ä¿¡å·ã«å¿ãã¦é©å¿çã«ãããã¯ãµã¤ãºï¼ãããã¯é·ï¼ã
å¤åããã¦ãããIn the specific high-efficiency encoding encoder shown in FIG. 1, the input digital signal is divided into a plurality of frequency bands, and at the same time, orthogonal transform is performed for each frequency band, and the obtained spectrum data on the frequency axis is converted. In the low frequency range, for each so-called critical bandwidth (critical band) in consideration of human hearing characteristics described later, in the middle and high frequency ranges, each critical bandwidth is subdivided in consideration of the block floating efficiency.
Bits are adaptively allocated and encoded. Usually, this block is a quantization noise generating block. Further, in the embodiment of the present invention, the block size (block length) is adaptively changed according to the input digital signal before the orthogonal transform.
ãï¼ï¼ï¼ï¼ãå³ã¡ãå³ï¼ã«ããã¦ãä¾ãã°ãï¼ãï¼ï¼ï½
Hï½ã®å¨æ³¢æ°å¸¯åã®ãªã¼ãã£ãªä¿¡å·ãããµã³ããªã³ã°å¨
æ³¢æ°ï¼ï¼ï¼ï¼ï½ï¼¨ï½ã以ã¦ãµã³ããªã³ã°ãããå¾ãï¼°ï¼£
ï¼åããã¦å¾ããããªã¼ãã£ãªï¼°ï¼£ï¼ä¿¡å·ããå
¥å端å
ï¼ï¼ï¼ã«ä¾çµ¦ãããããã®å
¥åãªã¼ãã£ãªï¼°ï¼£ï¼ä¿¡å·
ã¯ãä¾ãã°ãï¼±ï¼ï¼¦ï¼ã¯ã¯ãã©ãã£ã»ãã©ã¼ã»ãã£ã«
ã¿ï¼çã®å¸¯ååå²ãã£ã«ã¿ï¼å¸¯ååå²ææ®µï¼ï¼ï¼ï¼ã«ã
ããï¼ãï¼ï¼ï½ï¼¨ï½ã®å¸¯åã¨ãï¼ï¼ï½ï¼¨ï½ãï¼ï¼ï½ï¼¨ï½
ã®å¸¯åã¨ã«åå²ããããæ´ã«ãï¼ãï¼ï¼ï½ï¼¨ï½ã®å¸¯åã®
ä¿¡å·ã¯ãåããï¼±ï¼ï¼¦çã®å¸¯ååå²ãã£ã«ã¿ï¼å¸¯ååå²
ææ®µï¼ï¼ï¼ï¼ã«ããï¼ãï¼. ï¼ï½ï¼¨ï½ã®å¸¯åã¨ï¼. ï¼ï½
Hï½ãï¼ï¼ï½ï¼¨ï½ã®å¸¯åã¨ã«åå²ããããThat is, in FIG.
After the audio signal in the frequency band of Hz is sampled at a sampling frequency of 44.1 kHz,
The audio PCM signal obtained by the M conversion is supplied to the input terminal 100. This input audio PCM signal is subjected to a band of 0 to 11 kHz and a band of 11 kHz to 22 kHz by a band dividing filter (band dividing means) 101 such as a QMF (quadrature mirror filter).
And divided into bands. Further, the signal in the band of 0 to 11 kHz is similarly divided into a band of 0 to 5.5 kHz and a band of 5.5 kHz by a band division filter (band division means) 102 such as QMF.
Hz to 11 kHz.
ãï¼ï¼ï¼ï¼ãä¸è¿°ã®å¸¯ååå²ãã£ã«ã¿ï¼ï¼ï¼ããã®ï¼ï¼
ï½ï¼¨ï½ãï¼ï¼ï½ï¼¨ï½ã®å¸¯åã®ä¿¡å·ã¯ãç´äº¤å¤æåè·¯ã®ä¸
ä¾ã§ããï¼ï¼¤ï¼£ï¼´åè·¯ï¼ã¢ãã£ãã¡ã¤ãã»é¢æ£ã³ãµã¤ã³
夿åè·¯ï¼ï¼ç´äº¤å¤æææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ãããã帯å
åå²ãã£ã«ã¿ï¼ï¼ï¼ããã®ï¼. ï¼ï½ï¼¨ï½ãï¼ï¼ï½ï¼¨ï½ã®
帯åã®ä¿¡å·ã¯ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ã¢ãã£ãã¡ã¤ãã»é¢æ£ã³
ãµã¤ã³å¤æææ®µï¼ï¼ç´äº¤å¤æææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ãã
ãã帯ååå²ãã£ã«ã¿ï¼ï¼ï¼ããã®ï¼ãï¼. ï¼ï½ï¼¨ï½ã®
帯åã®ä¿¡å·ã¯ãï¼ï¼¤ï¼£ï¼´åè·¯ï¼ã¢ãã£ãã¡ã¤ãã»é¢æ£ã³
ãµã¤ã³å¤æææ®µï¼ï¼ç´äº¤å¤æææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ããã
ãã¨ã«ãããããããï¼ï¼¤ï¼£ï¼´å¦çãããããªããåï¼
DCTåè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã§ã¯ãå叝忝ã«è¨
ãããããã¯æ±ºå®åè·¯ï¼ãããã¯æ±ºå®ææ®µï¼ï¼ï¼ï¼ãï¼
ï¼ï¼ãï¼ï¼ï¼ã«ããæ±ºå®ããããããã¯ãµã¤ãºï¼å¦çã
ããã¯ã®é·ãï¼ï¼æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ï¼ã«åºã¥ãã¦ï¼ï¼¤
CTå¦çããªãããã11 from the above-mentioned band division filter 101
A signal in the band of 22 kHz to 22 kHz is supplied to an MDCT circuit (modified / discrete cosine transform circuit) (orthogonal transform means) 103 which is an example of an orthogonal transform circuit. The signal in the band of 5.5 kHz to 11 kHz from the band division filter 102 is supplied to an MDCT circuit (modified / discrete cosine transform means) (orthogonal transform means) 104. The signals in the band of 0 to 5.5 kHz from the band division filter 102 are supplied to an MDCT circuit (modified / discrete cosine transform means) (orthogonal transform means) 105 to be subjected to MDCT processing. Note that each M
In the DCT circuits 103, 104, and 105, block decision circuits (block decision means) 109, 1 provided for each band are provided.
MD based on the block size (length of processing block) (information compression parameter) determined by 10, 111
CT processing is performed.
ãï¼ï¼ï¼ï¼ãä¸è¿°ããããã«ãå
¥åãã£ã¸ã¿ã«ä¿¡å·ãè¤
æ°ã®å¨æ³¢æ°å¸¯åã«åå²ããææ®µã¨ãã¦ã¯ãä¾ãã°ãï¼±ï¼
Fãã£ã«ã¿ãããããããã«ã¤ãã¦ã¯ã1976 R.E.Croch
iereDigital Coding of Speech In Subbands Bell Sys
t.Tech. J. Vol.55,No.8 1976 ã«è¿°ã¹ããã¦ãããåã
ICASSP 83,Boston Polyphase Quadrature Filters-ANew
Subband Coding Technique Joseph H. Rothweiler ã«
ã¯ãçãã³ãå¹
ã®ãã£ã«ã¿å岿¹æ³ãè¿°ã¹ããã¦ããã
ããã§ãä¸è¿°ããç´äº¤å¤æã¨ãã¦ã¯ãä¾ãã°ãå
¥åãªã¼
ãã£ãªä¿¡å·ãæå®å使éï¼ãã¬ã¼ã ï¼ã§ãããã¯å
ãããã®ãããã¯æ¯ã«é«éãã¼ãªã¨å¤æï¼ï¼¦ï¼¦ï¼´ï¼ãé¢
æ£ã³ãµã¤ã³å¤æï¼ï¼¤ï¼£ï¼´ï¼ãã¢ãã£ãã¡ã¤ãï¼¤ï¼£ï¼´å¤æ
ï¼ï¼ï¼¤ï¼£ï¼´ï¼çãè¡ããã¨ã§ãæé軸ã卿³¢æ°è»¸ã«å¤æ
ããããã«ããç´äº¤å¤æããããï¼ï¼¤ï¼£ï¼´ã«ã¤ãã¦ã¯IC
ASSP 1987 Subband/TransformCoding Using Filter Ban
k Designs Based On Time Domain AliasingCancellatio
n J.P.Princen A.B.Bradley Univ.of Surrey Royal Mel
bourneInst. Of Tech.ã«è¿°ã¹ããã¦ãããAs described above, means for dividing an input digital signal into a plurality of frequency bands includes, for example, QM
There is an F filter, which is described in 1976 RECroch
iereDigital Coding of Speech In Subbands Bell Sys
t.Tech. J. Vol.55, No.8, 1976. or,
ICASSP 83, Boston Polyphase Quadrature Filters-ANew
Subband Coding Technique Joseph H. Rothweiler describes an equal bandwidth filter splitting method.
Here, as the above-described orthogonal transform, for example, an input audio signal is divided into blocks in a predetermined unit time (frame), and a fast Fourier transform (FFT), a discrete cosine transform (DCT), and a modified DCT transform (MDCT) are performed for each block. By performing such operations, there is an orthogonal transformation in which the time axis is transformed into the frequency axis. IC for MDCT
ASSP 1987 Subband / TransformCoding Using Filter Ban
k Designs Based On Time Domain AliasingCancellatio
n JPPrincen ABBradley Univ. of Surrey Royal Mel
bourneInst. Of Tech.
ãï¼ï¼ï¼ï¼ãããã§ãåï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ã
ï¼ï¼ï¼ã«ä¾çµ¦ããå叝忝ã®ãããã¯ã«ã¤ãã¦ã®æ¨æºç
ãªå
¥åãã£ã¸ã¿ã«ä¿¡å·ã«å¯¾ããå
·ä½ä¾ãå³ï¼ã«ç¤ºããã
ã®å³ï¼ã®å
·ä½ä¾ã«ããã¦ã¯ãï¼ã¤ã®ãã£ã«ã¿åºåä¿¡å·
ã¯ãå帯åãã¨ã«ç¬ç«ã«åã
è¤æ°ã®ç´äº¤å¤æãããã¯ãµ
ã¤ãºãæã¡ãä¿¡å·ã®æéç¹æ§ã卿³¢æ°åå¸çã«ããæé
åè§£è½ãåãæããããããã«ãã¦ãããä¿¡å·ãæéç
ã«æºå®å¸¸çã§ããå ´åã«ã¯ãå³ï¼ï¼¡ã®ãã³ã°ã¢ã¼ãã®ã
ãã«ãç´äº¤å¤æãããã¯ãµã¤ãºãï¼ï¼ï¼ï¼ï½ï¼³ã¨å¤§ãã
ãããä¿¡å·ãéå®å¸¸çã§ããå ´åã«ã¯ãç´äº¤å¤æããã
ã¯ãµã¤ãºãæ´ã«ï¼åå²ãï¼åå²ã¨ãããå³ã¡ãå³ï¼ï¼¢ã®
ã·ã§ã¼ãã¢ã¼ãã®ããã«ããã¹ã¦ãï¼åå²ãå³ã¡ãï¼ï¼
ï¼ï½ï¼³ã®æéåè§£è½ã¨ããå ´åããå³ï¼ï¼£ã®ããã«ã¢ã¼
ãAãå³ï¼ï¼¤ã®ããã«ã¢ã¼ãï¼¢ã®ããã«ãä¸é¨ãï¼å
å²ãå³ã¡ãï¼ï¼ï¼ï½ï¼³ãä»ã®ï¼é¨ãï¼åå²ãå³ã¡ãï¼ï¼
ï¼ï½ï¼³ã®æéåè§£è½ã¨ãããã¨ã§ãå®éã®è¤éãªå
¥åã
ã£ã¸ã¿ã«ä¿¡å·ã«é©å¿ããããã«ãªã£ã¦ããããã®ç´äº¤å¤
æãããã¯ãµã¤ãºã®åå²ã¯å¦çè£
ç½®ã®è¦æ¨¡ã許ãã°ãæ´
ã«è¤éãªåå²ãè¡ãªãã¨ããã广çãªãã¨ã¯æç½ã§ã
ããHere, each of the MDCT circuits 103, 104,
FIG. 2 shows a specific example of a standard input digital signal for a block for each band to be supplied to 105. In the specific example of FIG. 2, the three filter output signals have a plurality of orthogonal transform block sizes independently for each band, and the time resolution can be switched according to the time characteristics, frequency distribution, and the like of the signals. . If the signal is quasi-stationary in time, the orthogonal transform block size is increased to 11.6 mS as in the long mode in FIG. 2A. If the signal is non-stationary, the orthogonal transform block size is further divided into two and four. That is, as shown in the short mode of FIG.
When the time resolution is 9 mS, or as in the middle mode A in FIG. 2C or the middle mode B in FIG. 2D, a part is divided into two, that is, 5.8 mS, and another part is divided into four, that is, 2.
With a time resolution of 9 mS, it is adapted to an actual complex input digital signal. It is clear that the division of the orthogonal transform block size is more effective if more complicated division is performed if the scale of the processing device allows.
ãï¼ï¼ï¼ï¼ããã®ç´äº¤å¤æãããã¯ãµã¤ãºã®æ±ºå®ã¯ãç´
äº¤å¤æãããã¯ãµã¤ãºã®æ±ºå®åè·¯ï¼ç´äº¤å¤æãããã¯ãµ
ã¤ãºã®æ±ºå®ææ®µï¼ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã§è¡ãããã
ã®æ±ºå®çµæã¯åï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼å
ã³ãããå²å½ç®åºåè·¯ï¼ï¼ï¼ã«ä¾çµ¦ãããã¨å
±ã«ããã
ãã¯ã®ãããã¯ãµã¤ãºæ
å ±ï¼å¦çãããã¯ã®é·ãã®æ
å ±ï¼ï¼æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ï¼ã¨ãã¦åºå端åï¼ï¼ï¼ãï¼
ï¼ï¼ãï¼ï¼ï¼ããåºåããããThe orthogonal transform block size is determined by orthogonal transform block size determining circuits (orthogonal transform block size determining means) 109, 110, and 111, and the result of the determination is determined by each of the MDCT circuits 103, 104, 105 and In addition to being supplied to the bit allocation calculation circuit 118, the output terminals 113, 1
15 and 117 are output.
ãï¼ï¼ï¼ï¼ã次ã«ãç´äº¤å¤æãããã¯ãµã¤ãºã®æ±ºå®åè·¯
ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã®è©³ç´°ã説æããããããã§
ã¯ãç´äº¤å¤æãããã¯ãµã¤ãºã®æ±ºå®åè·¯ï¼ï¼ï¼ãä¾ã«æ¡
ã£ã¦ãå³ï¼ãåç
§ãã¦èª¬æãããå³ï¼ã«ãããï¼±ï¼ï¼¦ï¼
ï¼ï¼ã®åºåã®ãã¡ãï¼ï¼ï½ï¼¨ï½ãï¼ï¼ï½ï¼¨ï½ã®å¸¯åã®åº
åããå³ï¼ã«ãããå
¥å端åï¼ï¼ï¼ãä»ãã¦ãã¯ã¼ç®åº
åè·¯ï¼ãã¯ã¼ç®åºææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ãããæ´ã«ãå³ï¼
ã«ãããï¼±ï¼ï¼¦ï¼ï¼ï¼ã®åºåã®ãã¡ãï¼ï¼ï¼ï½ï¼¨ï½ãï¼
ï¼ï½ï¼¨ï½ã®å¸¯åã®åºåãå³ï¼ã«ãããå
¥å端åï¼ï¼ï¼ã
ä»ãã¦ãã¯ã¼ç®åºåè·¯ï¼ãã¯ã¼ç®åºææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦
ããï¼ãï¼ï¼ï¼ï½ï¼¨ï½ã®å¸¯åã®åºåãå³ï¼ã«ãããå
¥å
端åï¼ï¼ï¼ãä»ãã¦ãã¯ã¼ç®åºåè·¯ï¼ãã¯ã¼ç®åºææ®µï¼
ï¼ï¼ï¼ã«ä¾çµ¦ãããå°ãå³ï¼ã«ããããããã¯ãµã¤ãºæ±º
å®åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ã¯ãå³ï¼ã«ãããå
¥å端åï¼ï¼
ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã¸å
¥åãããä¿¡å·ããããã¯ãµã¤ãº
決å®åè·¯ï¼ï¼ï¼ã®å ´åã¨ç°ãªãã ãã§ãåä½ã¯åä¸ã§ã
ãã®ã§éè¤èª¬æã¯çç¥ãããNext, details of the orthogonal transform block size determining circuits 109, 110 and 111 will be described. Here, the orthogonal transform block size determining circuit 109 will be described with reference to FIG. . QMF1 in FIG.
Out of the output 01, the output in the band of 11 kHz to 22 kHz is supplied to the power calculation circuit (power calculation means) 304 via the input terminal 301 in FIG. Further, FIG.
Of the output of the QMF 102 at 5.5 kHz to 1
The output in the 1 kHz band is supplied to the power calculation circuit (power calculation means) 305 via the input terminal 302 in FIG. 3, and the output in the 0 to 5.5 kHz band is output via the input terminal 303 in FIG. (Power calculation means)
306. Note that the block size determination circuits 110 and 111 in FIG.
The operations are the same, except that the signals input to 1, 302, and 303 are different from those of the block size determination circuit 109.
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ãããåãããã¯ãµã¤ãºæ±ºå®åè·¯ï¼
ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããããã®å
¥å端åï¼ï¼ï¼ãï¼ï¼
ï¼ãï¼ï¼ï¼ã¯ãããªã¯ã¹æ§æã¨ãªã£ã¦ãããå³ã¡ããã
ãã¯ãµã¤ãºæ±ºå®åè·¯ï¼ï¼ï¼ã®å
¥å端åï¼ï¼ï¼ã«ã¯å³ï¼ã«
ãããï¼±ï¼ï¼¦ï¼ï¼ï¼ããã®ï¼ï¼ï¼ï½ï¼¨ï½ãï¼ï¼ï½ï¼¨ï½ã®
帯åã®åºåãä¾çµ¦ããã¦ãããåæ§ã«ãå
¥å端åï¼ï¼ï¼
ã«ã¯ï¼±ï¼ï¼¦ï¼ï¼ï¼ããã®ï¼ãï¼ï¼ï¼ï½ï¼¨ï½ã®å¸¯åã®åºå
ãä¾çµ¦ããããEach block size determining circuit 1 in FIG.
09, 110, 111 input terminals 301, 30 respectively
2 and 303 have a matrix configuration, that is, the input terminal 301 of the block size determination circuit 110 is supplied with the output of the band of 5.5 kHz to 11 kHz from the QMF 102 in FIG. 302
Is supplied with the output of the band of 0 to 5.5 kHz from the QMF 102.
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ããã¦ãåãã¯ã¼ç®åºåè·¯ï¼ï¼ï¼ã
ï¼ï¼ï¼ãï¼ï¼ï¼ã¯å
¥åãããæé波形ãä¸å®æéãç©å
ãããã¨ã«ãã£ã¦ãå卿³¢æ°å¸¯åã®ãã¯ã¼ãç®åºãã¦ã
ãããã®éãç©åããæéå¹
ã¯ä¸è¿°ã®ç´äº¤å¤æãããã¯
ãµã¤ãºã®ãã¡ãæå°æéãããã¯ä»¥ä¸ã§ããå¿
è¦ãã
ããåãä¸è¿°ã®ç®åºæ³ä»¥å¤ã«ã¯ãä¾ãã°ç´äº¤å¤æããã
ã¯ãµã¤ãºã®æå°æéå¹
å
ã®æå¤§æ¯å¹
ã®çµ¶å¯¾å¤ãããã¯æ¯
å¹
ã®å¹³åå¤ã代表ãã¯ã¼ã¨ãã¦ç¨ãããã¨ããããIn FIG. 3, each power calculation circuit 304,
Reference numerals 305 and 306 calculate the power of each frequency band by integrating the input time waveform for a certain period of time. At this time, the time width for integration needs to be equal to or smaller than the minimum time block among the orthogonal transform block sizes described above. In addition to the above calculation method, for example, the absolute value or the average value of the maximum amplitude within the minimum time width of the orthogonal transform block size may be used as the representative power.
ãï¼ï¼ï¼ï¼ããã¯ã¼ç®åºåè·¯ï¼ï¼ï¼ã®åºåã¯å¤ååæ½åº
åè·¯ï¼å¤ååæ½åºææ®µï¼ï¼ï¼ï¼åã³ãã¯ã¼æ¯è¼åè·¯ï¼ã
ã¯ã¼æ¯è¼ææ®µï¼ï¼ï¼ï¼ã«ããã¯ã¼ç®åºåè·¯ï¼ãã¯ã¼ç®åº
ææ®µï¼ï¼ï¼ï¼ãï¼ï¼ï¼ã®åºåã¯ãã¯ã¼æ¯è¼åè·¯ï¼ï¼ï¼ã«
ããããä¾çµ¦ããããå¤ååæ½åºåè·¯ï¼ï¼ï¼ã§ã¯ãã¯ã¼
ç®åºåè·¯ï¼ï¼ï¼ããä¾çµ¦ããããã¯ã¼ã®å¾®ä¿æ°ãæ±ãã¦
ãã¯ã¼ã®å¤åæ
å ±ã¨ãã¦ããããã¯ãµã¤ãºï¼æ¬¡æ±ºå®åè·¯
ï¼ãããã¯ãµã¤ãºï¼æ¬¡æ±ºå®ææ®µï¼ï¼ï¼ï¼åã³ã¡ã¢ãªï¼è¨
æ¶ææ®µï¼ï¼ï¼ï¼ã¸ä¾çµ¦ãããã¡ã¢ãªï¼ï¼ï¼ã§ã¯ãå¤åå
æ½åºåè·¯ï¼å¤ååæ½åºææ®µï¼ï¼ï¼ï¼ããä¾çµ¦ããããã¯
ã¼ã®å¤åæ
å ±ãä¸è¿°ã®ç´äº¤å¤æãããã¯ãµã¤ãºã®æå¤§æ
é以ä¸ãèç©ãããããã¯æéçã«é£æ¥ããç´äº¤å¤æã
ããã¯ãç´äº¤å¤æã®éã®ã¦ã£ã³ãã¦å¦çã«ãããäºãã«
å½±é¿ãä¸ãåããããæéçã«é£æ¥ããï¼ã¤åã®ããã
ã¯ã®ãã¯ã¼å¤åæ
å ±ããããã¯ãµã¤ãºï¼æ¬¡æ±ºå®åè·¯ï¼ï¼
ï¼ã«ããã¦å¿
è¦ã¨ããããã§ãããThe output of the power calculating circuit 304 is sent to a change extracting circuit (change extracting means) 308 and a power comparing circuit (power comparing means) 309, and the output of the power calculating circuits (power calculating means) 305 and 306 is sent to the power comparing circuit. 309 respectively. The change extraction circuit 308 obtains a differential coefficient of the power supplied from the power calculation circuit 304 and outputs it to the block size primary determination circuit (primary block size determination means) 310 and the memory (storage means) 307 as power change information. Supply. In the memory 307, the power change information supplied from the change extraction circuit (change extraction means) 308 is stored for the maximum time of the orthogonal transform block size or more. This is because the temporally adjacent orthogonal transform blocks mutually affect each other by window processing at the time of orthogonal transform, so that the power change information of the immediately preceding temporally adjacent block is used as the block size primary decision circuit 31.
This is because it is necessary at 0.
ãï¼ï¼ï¼ï¼ããããã¯ãµã¤ãºï¼æ¬¡æ±ºå®åè·¯ï¼ï¼ï¼ã§ã¯å¤
ååæ½åºåè·¯ï¼ï¼ï¼ããä¾çµ¦ããããããã¯ã®ãã¯ã¼å¤
åæ
å ±ã¨ã¡ã¢ãªï¼ï¼ï¼ããä¾çµ¦ãããæéçã«é£æ¥ãã
ãããã¯ã®ï¼ã¤åã®ãããã¯ã®ãã¯ã¼å¤åæ
å ±ãåºã«ã
ã卿³¢æ°å¸¯åå
ã®ãã¯ã¼ã®æéçå¤ä½ããã卿³¢æ°å¸¯å
ã®ç´äº¤å¤æãããã¯ãµã¤ãºã決å®ããããã®éãä¸å®ä»¥
ä¸ã®å¤ä½ãèªããããå ´åãããæéçã«çãç´äº¤å¤æ
ãããã¯ã¤ãµã¤ãºã鏿ããããã§ãããããã®å¤ä½ç¹
ã¯åºå®ã§ã广ã¯å¾ããããæ´ã«å¨æ³¢æ°ã«æ¯ä¾ããå¤ã
å³ã¡ã卿³¢æ°ãé«ãå ´åã¯å¤§ããªå¤ä½ã«ãã£ã¦æéçã«
çããããã¯ãµã¤ãºã¨ãªãã卿³¢æ°ãä½ãå ´åã¯ãé«ã
å ´åã®ããã«æ¯ã¹å°ããªå¤ä½ã§æéçã«çããããã¯ãµ
ã¤ãºã«æ±ºå®ãããã¨ããã广çã§ããããã®å¤ã¯ãªã
ããã«å¤åãããã¨ãæã¾ããããè¤æ°æ®µéã®é段ç¶ã®
å¤åã§ãã£ã¦ããæ§ããªãã以ä¸ã®ããã«æ±ºå®ãããã
ããã¯ãµã¤ãºã¯ãããã¯ãµã¤ãºä¿®æ£åè·¯ï¼ãããã¯ãµã¤
ãºä¿®æ£ææ®µï¼ï¼ï¼ï¼ã¸ä¾çµ¦ããããThe block size primary decision circuit 310 uses the power change information of the block supplied from the change extraction circuit 308 and the power change information of the block immediately before the temporally adjacent block supplied from the memory 307. The orthogonal transform block size of the frequency band is determined from the temporal displacement of the power within the frequency band to be obtained. At this time, when a displacement equal to or more than a certain value is recognized, a shorter orthogonal transform block size is selected. However, even if the displacement point is fixed, the effect can be obtained. Furthermore, a value proportional to the frequency,
That is, when the frequency is high, a large displacement causes a temporally short block size, and when the frequency is low, it is more effective to determine the temporally short block size with a small displacement as compared to that when the frequency is high. This value desirably changes smoothly, but may be a stepwise change in a plurality of stages. The block size determined as described above is supplied to a block size correction circuit (block size correction means) 311.
ãï¼ï¼ï¼ï¼ã䏿¹ããã¯ã¼æ¯è¼åè·¯ï¼ãã¯ã¼æ¯è¼ææ®µï¼
ï¼ï¼ï¼ã«ããã¦ãåãã¯ã¼ç®åºåè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼
ï¼ï¼ããä¾çµ¦ãããå卿³¢æ°å¸¯åã®ãã¯ã¼æ
å ±ããåæ
å»åã³æé軸ä¸ã§ãã¹ãã³ã°å¹æã®çºçããæéå¹
ã§æ¯
è¼ãè¡ãªãããã¯ã¼ç®åºåè·¯ï¼ï¼ï¼ã®åºå卿³¢æ°å¸¯åã«
åã¼ãä»ã®å¨æ³¢æ°å¸¯åã®å½±é¿ãæ±ãããããã¯ãµã¤ãºä¿®
æ£åè·¯ï¼ï¼ï¼ã¸ä¾çµ¦ããããããã¯ãµã¤ãºä¿®æ£åè·¯ï¼ï¼
ï¼ã§ã¯ããã¯ã¼æ¯è¼åè·¯ï¼ï¼ï¼ããä¾çµ¦ããããã¹ãã³
ã°æ
å ±åã³é æ¬¡ç¸¦ç¶æ¥ç¶ãããé
å»¶åè·¯ï¼é
å»¶ææ®µï¼
ï¼é
å»¶è¨æ¶ææ®µï¼ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã®åã¿ããã
ãä¾çµ¦ãããéå»ã®ãããã¯ãµã¤ãºæ
å ±ãåºã«ãããã
ã¯ãµã¤ãºï¼æ¬¡æ±ºå®åè·¯ï¼ãããã¯ãµã¤ãºï¼æ¬¡æ±ºå®ææ®µï¼
ï¼ï¼ï¼ããä¾çµ¦ããããããã¯ãµã¤ãºã«ããæéçã«é·
ããããã¯ãµã¤ãºã鏿ããããã«ä¿®æ£ããããé
å»¶å
è·¯ï¼ï¼ï¼åã³ã¦ã£ã³ãã¦å½¢ç¶æ±ºå®åè·¯ï¼ã¦ã£ã³ãã¦å½¢ç¶
æ±ºå®ææ®µï¼ï¼ï¼ï¼ã¸ä¾çµ¦ãã¦ãããOn the other hand, a power comparison circuit (power comparison means)
At 309, each power calculation circuit 304, 305, 3
The power information of each frequency band supplied from 06 is compared with the time width at which the masking effect occurs at the same time and on the time axis, and the influence of other frequency bands on the output frequency band of the power calculation circuit 304 is obtained. , To the block size correction circuit 311. Block size correction circuit 31
In 1, the masking information supplied from the power comparison circuit 309 and the delay circuits (delay means) sequentially connected in cascade
(Delay storage means) Primary block size determination circuit (Primary block size determination means) based on past block size information supplied from each tap of 312, 313, 314
Correction is made so that a longer block size is selected based on the block size supplied from 310 and supplied to the delay circuit 312 and the window shape determination circuit (window shape determination means) 315.
ãï¼ï¼ï¼ï¼ããããã¯ãµã¤ãºä¿®æ£åè·¯ï¼ï¼ï¼ã«ãããä½
ç¨ã¯ãããç¹å®ã®å¨æ³¢æ°å¸¯åã«ããã¦ããªã¨ã³ã¼ãåé¡
ã¨ãªãå ´åã§ããä»ã®å¨æ³¢æ°å¸¯åãç¹ã«ç¹å®ã®å¨æ³¢æ°å¸¯
åããä½ã帯åã«ããã¦ã大ããªæ¯å¹
ãæã¤ä¿¡å·ãåå¨
ããå ´åããã®ãã¹ãã³ã°å¹æã«ãããããªã¨ã³ã¼ãè´
æä¸åé¡ã¨ãªããªãããããã¯åé¡ã軽æ¸ãããå ´åã
ããã¨ããç¹æ§ãå©ç¨ãã¦ããããªããä¸è¿°ã®ãã¹ãã³
ã°ã¨ã¯ã人éã®è´è¦ä¸ã®ç¹æ§ã«ãããããä¿¡å·ã«ãã£ã¦
ä»ã®ä¿¡å·ããã¹ã¯ããã¦èãããªããªãç¾è±¡ããããã®
ã§ããããã®ãã¹ãã³ã°å¹æã«ã¯ãæé軸ä¸ã®ãªã¼ãã¤
ãªä¿¡å·ã«ããæé軸ãã¹ãã³ã°å¹æã¨ã卿³¢æ°è»¸ä¸ã®ä¿¡
å·ã«ããåæå»ãã¹ãã³ã°å¹æã¨ãããããããã®ãã¹
ãã³ã°å¹æã«ããããã¹ãã³ã°ãããé¨åã«ãã¤ãºãã
ã£ãã¨ãã¦ãããã®ãã¤ãºã¯èãããªããã¨ã«ãªããã
ã®ãããå®éã®ãªã¼ãã¤ãªä¿¡å·ã§ã¯ããã®ãã¹ãã³ã°ã
ããç¯å²å
ã®ãã¤ãºã¯è¨±å®¹å¯è½ãªãã¤ãºã¨ããããThe operation of the block size correction circuit 311 is performed when a signal having a large amplitude exists in another frequency band, particularly in a band lower than the specific frequency band, even when a pre-echo is a problem in a specific frequency band. The characteristic that the pre-echo does not cause a problem in audibility due to the masking effect or the problem may be reduced. The above-mentioned masking refers to a phenomenon in which a certain signal masks another signal and becomes inaudible due to human auditory characteristics. The masking effect includes a time axis based on an audio signal on a time axis. There are a masking effect and a simultaneous masking effect by a signal on the frequency axis. Due to these masking effects, even if there is noise in the masked portion, this noise will not be heard. For this reason, in an actual audio signal, noise within the masked range is regarded as acceptable noise.
ãï¼ï¼ï¼ï¼ã次ã«ãé æ¬¡ç¸¦ç¶æ¥ç¶ãããé
å»¶åè·¯ï¼ï¼
ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã«ãéå»ã®ç´äº¤å¤æãããã¯ãµã¤ãº
ãé ã«è¨æ¶ãã¦ãããåã¿ãããå³ã¡ãé
å»¶åè·¯ï¼ï¼
ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã®åºåå´ããããã®ç´äº¤äº¤æããã
ã¯ãµã¤ãºããããã¯ãµã¤ãºä¿®æ£åè·¯ï¼ï¼ï¼ã¸ä¾çµ¦ããã
ãã«ãã¦ãããåæã«ãé
å»¶åè·¯ï¼ï¼ï¼ã®åºåå´ã¯åºå
端åï¼ï¼ï¼ã«ãé
å»¶åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ã®åºåå´ã¯ã¦ã£
ã³ãã¦å½¢ç¶æ±ºå®åè·¯ï¼ï¼ï¼ã«æ¥ç¶ãã¦ããããã®é
å»¶å
è·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®åºåã¯ããããã¯ãµã¤
ãºä¿®æ£åè·¯ï¼ï¼ï¼ã«ããã¦ãããé·ãæéå¹
ã§ã®ããã
ã¯ãµã¤ãºã®å¤åããããã¯ã®ãããã¯ãµã¤ãºã®æ±ºå®ã«å½¹
ç«ã¦ãåããä¾ãã°ãéå»é »ç¹ã«ãããæéçã«çãã
ããã¯ãµã¤ãºã鏿ããã¦ããå ´åã¯ãæéçã«çãã
ããã¯ãµã¤ãºã®é¸æãå¢ãããéå»ã«ããã¦æéçã«ç
ããããã¯ãµã¤ãºã®é¸æããªããã¦ãªãå ´åã«ããã¦
ã¯ãæéçã«é·ããããã¯ãµã¤ãºã®é¸æãå¢ããçã®å¤
æãå¯è½ã¨ãã¦ãããNext, the delay circuits 31 sequentially connected in cascade
The past orthogonal transform block sizes are sequentially stored in 2, 313, and 314, and each tap, that is, the delay circuit 31 is stored.
From the outputs of 2, 313 and 314, the orthogonal exchange block size is supplied to a block size correction circuit 311. At the same time, the output side of the delay circuit 312 is connected to the output terminal 317, and the output side of the delay circuits 312 and 313 is connected to the window shape determination circuit 315. The outputs from the delay circuits 312, 313, and 314 are used by the block size correction circuit 311 to help determine the block size of the block based on the change in the block size over a longer time period. If a short block size is selected, increase the selection of the short block size in time, and if no short block size is selected in the past, select a long block size in the past. It is possible to make decisions such as increasing the number.
ãï¼ï¼ï¼ï¼ããªãããã®é
å»¶åè·¯ã¯ã¦ã£ã³ãã¦æ±ºå®åè·¯
ï¼ï¼ï¼åã³åºå端åï¼ï¼ï¼ã«å¿
è¦ãªé
å»¶åè·¯ï¼ï¼ï¼ãï¼
ï¼ï¼ãé¤ãã°ããã®ã¿ããæ°ã¯è£
ç½®ã®å®éçãªæ§æãè¦
模ã«ãã墿¸ããã¦ç¨ããããå ´åããããã¦ã£ã³ãã¦
å½¢ç¶æ±ºå®åè·¯ï¼ï¼ï¼ã§ã¯ããããã¯ãµã¤ãºä¿®æ£åè·¯ï¼ï¼
ï¼ã®åºåãå³ã¡ããããã¯ã®æéçã«é£æ¥ããï¼ã¤å¾ã®
ãããã¯ãµã¤ãºã¨é
å»¶åè·¯ï¼ï¼ï¼ã®åºåãå³ã¡ãããã
ã¯ã®ãããã¯ãµã¤ãºã¨é
å»¶åè·¯ï¼ï¼ï¼ã®åºåãå³ã¡ãã
ããã¯ã®æéçã«é£æ¥ããï¼ã¤åã®ãããã¯ãµã¤ãºã¨ã
ããä¸è¿°ã®å³ï¼ã«ãããåï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼ï¼ãï¼ï¼
ï¼ãï¼ï¼ï¼ã«ããã¦ä½¿ç¨ãããã¦ã£ã³ãã¦ã®å½¢ç¶ã決å®
ããåºå端åï¼ï¼ï¼ã¸åºåãããå³ï¼ã«ãããåºå端å
ï¼ï¼ï¼ã®åºåãå³ã¡ããããã¯ãµã¤ãºæ
å ±ã¨ãåºå端å
ï¼ï¼ï¼ã®åºåãå³ã¡ãã¦ã£ã³ãã¦å½¢ç¶æ
å ±ã¨ããå³ï¼ã«
ããããããã¯ãµã¤ãºæ±ºå®åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼
ã®åºåã¨ãã¦åé¨ã¸ä¾çµ¦ããããThis delay circuit includes delay circuits 312, 312 necessary for a window determination circuit 315 and an output terminal 317.
Except for 13, the tap number may be increased or decreased depending on the actual configuration and scale of the device. In the window shape determination circuit 315, the block size correction circuit 31
1, the output of the delay circuit 312, that is, the block size of the block next to the block temporally adjacent to the block, ie, the output of the block size of the block and the output of the delay circuit 313, that is, one block that is temporally adjacent to the block. From the previous block size, each MDCT circuit 103, 10 in FIG.
The window shape used in steps 4 and 105 is determined and output to the output terminal 317. The output of the output terminal 316 in FIG. 3, that is, the block size information, and the output of the output terminal 317, that is, the window shape information are used as the block size determination circuits 109, 110, and 111 in FIG.
Is supplied to each section as the output of
ãï¼ï¼ï¼ï¼ãããã§ã¦ã£ã³ãã¦å½¢ç¶æ±ºå®åè·¯ï¼ï¼ï¼ã«ã
ãã¦æ±ºå®ãããã¦ã£ã³ãã¦ã®å½¢ç¶ã«ã¤ãã¦ã飿¥ããã
ããã¯ã¨ã¦ã£ã³ãã¦ã®å½¢ç¶ã®æ§åã示ãå³ï¼ãåç
§ãã¦
説æãããå³ï¼ï¼¡ãï¼¢ãï¼£ã¯ãããããæ¨ªè»¸ãæéã縦
軸ãã¬ãã«ã¨ããã¨ãã®ãã¦ã£ã³ãã¦ã示ããå®ç·ã®æ²
ç·ã¯ç¾å¨ã®ã¦ã£ã³ãã¦ãç ´ç·ã®æ²ç·ï¼å·¦å´ï¼ã¯éå»ã®ã¦
ã£ã³ãã¦ãç ´ç·ã®æ²ç·ï¼å³å´ï¼ã¯æªæ¥ã®ã¦ã£ã³ãã¦ãã
ããã示ããHere, the window shape determined by the window shape determination circuit 315 will be described with reference to FIG. 4 showing the state of the adjacent blocks and the window shape. 4A, 4B, and 4C show windows when the horizontal axis represents time and the vertical axis represents level, respectively, wherein the solid curve is the current window, the broken curve (left side) is the past window, (Right) show future windows, respectively.
ãï¼ï¼ï¼ï¼ãå³ï¼ããå¤ãããã«ãç´äº¤å¤æã«ä½¿ç¨ãã
ãã¦ã£ã³ãã¦ã¯æéçã«é£æ¥ãããããã¯ã¨ã®éã§éè¤
ããé¨åãããããã®å®æ½ã®å½¢æ
ã§ã¯ã飿¥ããããã
ã¯ã®ä¸å¿ã¾ã§éè¤ããå½¢ç¶ãæ¡ç¨ãã¦ããããã飿¥ã
ããããã¯ã®ç´äº¤å¤æãµã¤ãºã«ããã¦ã£ã³ãã¦ã®å½¢ç¶ã
å¤åãããAs can be seen from FIG. 4, the window used for the orthogonal transformation has a portion overlapping with the temporally adjacent block. In this embodiment, the window overlapping the center of the adjacent block has a shape. Since this is adopted, the shape of the window changes depending on the orthogonal transform size of the adjacent block.
ãï¼ï¼ï¼ï¼ã以ä¸ã®èª¬æããæãããªæ§ã«ãç´äº¤å¤æã«
使ç¨ããã¦ã£ã³ãã¦ã®å½¢ç¶ã¯æéçã«é£ç¶ããï¼ããã
ã¯åã®ç´äº¤å¤æãµã¤ãºã確å®ããå¾ã決å®ããããå¾ã£
ã¦ãå
¥å端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããå
¥åãããä¿¡
å·ã®ãããã¯ã¨ãåºå端åï¼ï¼ï¼ãï¼ï¼ï¼ããåºåãã
ãä¿¡å·ã®ãããã¯ã¨ã¯ããã®å®æ½ã®å½¢æ
ã«ããã¦ï¼ãã
ãã¯åã®å·®ç°ãçãã¦ãããAs is clear from the above description, the shape of the window used for the orthogonal transform is determined after the orthogonal transform size of three temporally continuous blocks is determined. Therefore, in the present embodiment, a block of a signal input from the input terminals 301, 302, and 303 differs from a block of a signal output from the output terminals 316 and 317 by one block.
ãï¼ï¼ï¼ï¼ãåãå³ï¼ã«ããããã¯ã¼ç®åºåè·¯ï¼ï¼ï¼ã
ï¼ï¼ï¼åã³ãã¯ã¼æ¯è¼åè·¯ï¼ï¼ï¼ãçç¥ãã¦ãå³ï¼ã«ã
ãããããã¯ãµã¤ãºæ±ºå®åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã
æ§æãããã¨ã¯å¯è½ã§ãããæ´ã«ãã¦ã£ã³ãã¦ã®å½¢ç¶ã
ç´äº¤å¤æãããã¯ã®åãå¾ãæéçã«æå°ã®ãããã¯ãµ
ã¤ãºã«åºå®ãããã¨ã«ãã£ã¦ãã®ç¨®é¡ãï¼ç¨®é¡ã¨ããå³
ï¼ã«ãããé
å»¶åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼åã³ããã
ã¯ãµã¤ãºä¿®æ£åè·¯ï¼ï¼ï¼ãªãã³ã«ã¦ã£ã³ãã¦å½¢ç¶æ±ºå®å
è·¯ï¼ï¼ï¼ãçç¥ãã¦æ§æãããã¨ãå¯è½ã§ãããç¹ã«ã
å¦çæéã®é
å»¶ã好ã¾ãªãå¿ç¨ä¾ã«ããã¦ã¯ä¸è¿°ã®çç¥
ã«ããé
å»¶ã®å°ãªãæ§æã¨ãªããæå¹ã«ä½ç¨ãããThe power calculation circuit 305 in FIG.
Even if the power comparison circuit 306 and the power comparison circuit 309 are omitted, it is possible to configure the block size determination circuits 109, 110, and 111 in FIG. Further, by fixing the shape of the window to the smallest block size that the orthogonal transformation block can take in time, the type is made one, and the delay circuits 312, 313, 314, the block size correction circuit 311 and the window shape shown in FIG. It is also possible to omit the decision circuit 315. Especially,
In an application example in which a delay in the processing time is not desired, the above-described omission causes a configuration with a small delay, which effectively operates.
ãï¼ï¼ï¼ï¼ãåã³å³ï¼ã«æ»ã£ã¦èª¬æãããåï¼ï¼¤ï¼£ï¼´å
è·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã«ãã£ã¦ï¼ï¼¤ï¼£ï¼´å¦çããã¦
å¾ããã卿³¢æ°è»¸ä¸ã®ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿
æ°ãã¼ã¿ï¼æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯å
ã®ä¿¡
å·æåï¼ã¯ãä½åã¯ããããè¨ç帯åï¼ã¯ãªãã£ã«ã«ã
ã³ãï¼æ¯ã«ã¾ã¨ããããä¸é«åã¯ãããã¯ããã¼ãã£ã³
ã°ã®æå¹æ§ãèæ
®ãã¦ãè¨ç帯åå¹
ãç´°ååãã¦é©å¿ã
ããå²å½ç¬¦å·ååè·¯ï¼é©å¿ãããå²å½ç¬¦å·åææ®µï¼ï¼ï¼
ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼åã³ãããå²ãå½ã¦ç®åºåè·¯ï¼ãã
ãå²ãå½ã¦ç®åºææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ããããReturning to FIG. 1, the description will be continued. The spectrum data or MDCT coefficient data (signal components in a two-dimensional block relating to time and frequency) on the frequency axis obtained by performing MDCT processing by each of the MDCT circuits 103, 104, and 105 has a low band called a critical band (critical band). ), And in the middle and high frequency bands, the critical bandwidth is subdivided in consideration of the effectiveness of block floating, and the adaptive bit allocation coding circuit (adaptive bit allocation coding means) 10
6, 107, 108 and a bit allocation calculation circuit (bit allocation calculation means) 118.
ãï¼ï¼ï¼ï¼ããã®ã¯ãªãã¤ã«ã«ãã³ãã¨ã¯ã人éã®è´è¦
ç¹æ§ãèæ
®ãã¦åå²ããã卿³¢æ°å¸¯åã§ãããããç´é³
ã®å¨æ³¢æ°è¿åã®åãå¼·ãã®ç帯åãã³ããã¤ãºã«ãã£ã¦
ç´é³ããã¹ã¯ãããã¨ãã®ãã®ãã¤ãºã®æã¤å¸¯åã®ãã¨
ã§ããããã®è¨ç帯åï¼ã¯ãªãã£ã«ã«ãã³ãï¼ã¯ãé«å
ã»ã©å¸¯åå¹
ãåºããªã£ã¦ãããä¸è¿°ã®ï¼ãï¼ï¼ï½ï¼¨ï½ã®
å
¨å¨æ³¢æ°å¸¯åã¯ä¾ãã°ï¼ï¼ã®ã¯ãªãã£ã«ã«ãã³ãã«åå²
ããã¦ãããThe critical band is a frequency band divided in consideration of human auditory characteristics, and the noise when a pure tone is masked by a narrow band noise of the same intensity near the frequency of a certain pure tone. It is the band that has. This critical band (critical band) has a wider bandwidth as the frequency becomes higher, and the entire frequency band of 0 to 22 kHz is divided into, for example, 25 critical bands.
ãï¼ï¼ï¼ï¼ããããå²å½ç®åºåè·¯ï¼ï¼ï¼ã¯ãä¸è¿°ã®ãã
ãã¯ãµã¤ãºæ
å ±åã³ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°
ãã¼ã¿ã«åºã¥ãããããããã¹ãã³ã°å¹æçãèæ
®ã
ã¦ãä¸è¿°ã®è¨ç帯ååã³ãããã¯ããã¼ãã£ã³ã°ãèæ
®
ããååå²å¸¯åæ¯ã®ããã¹ãã³ã°éãåã³ãååå²å¸¯å
æ¯ã®ã¨ãã«ã®ã¼ãããã¯ãã¼ã¯å¤çãç®åºãããã®çµæ
ã«åºã¥ããå叝忝ã«å²å½ãããæ°ãæ±ããé©å¿ããã
å²å½ç¬¦å·ååè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã¸ä¾çµ¦ãã¦ã
ãããããã®é©å¿ãããå²å½ç¬¦å·ååè·¯ï¼ï¼ï¼ãï¼ï¼
ï¼ãï¼ï¼ï¼ã§ã¯ãä¸è¿°ã®ãããã¯ãµã¤ãºæ
å ±ãè¨ç帯å
åã³ãããã¯ããã¼ãã£ã³ã°ãèæ
®ããååå²å¸¯åæ¯ã«
å²ãå½ã¦ããããããæ°ã«å¿ãã¦ãåã¹ãã¯ãã«ãã¼ã¿
åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ãã¼ã¿ãåéååï¼æ£è¦åãã¦éå
åï¼ããããã«ãã¦ããããã®ããã«ãã¦ç¬¦å·åããã
ãã¼ã¿ã¯ãåºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ãä»ãã¦å
ãåºãããã以ä¸èª¬æã®ä¾¿å®ä¸ããããå²å½ã®åä½ã¨ãª
ããä¸è¿°ã®è¨ç帯ååã³ãããã¯ããã¼ãã£ã³ã°ãèæ
®
ããååå²å¸¯åããåä½ãããã¯ã¨ç§°ãããã¨ã«ãããThe bit allocation calculating circuit 118 calculates the critical band and the block floating for each divided band in consideration of the so-called masking effect and the like based on the block size information and the spectrum data or the MDCT coefficient data. The masking amount and the energy or peak value for each of the divided bands are calculated, and based on the calculation result, the number of bits to be allocated is obtained for each band and supplied to the adaptive bit allocation coding circuits 106, 107, and 108. . These adaptive bit allocation encoding circuits 106, 10
In Steps 7 and 108, each spectrum data or MDCT coefficient data is requantized (normalized and quantized) according to the number of bits allocated to each divided band in consideration of the block size information, the critical band, and the block floating. ). The data encoded in this way is taken out via the output terminals 112, 114, 116. For the sake of convenience in the following description, each of the above-mentioned divided bands that take into account the critical band and the block floating, which is a unit of bit allocation, will be referred to as a unit block.
ãï¼ï¼ï¼ï¼ã次ã«ãä¸è¿°ã®å³ï¼ã«ããããããå²ãå½ã¦
ç®åºåè·¯ï¼ï¼ï¼ã§è¡ããããããå²å½ã®å
·ä½çãªææ®µã«
ã¤ãã¦ãå³ï¼ãåç
§ãã¦èª¬æãããå³ï¼ã¯ä¸è¿°ã®å³ï¼ã«
ããããããå²ãå½ã¦ç®åºåè·¯ï¼ï¼ï¼ã®ä¸å
·ä½ä¾ã§ãã
ãããé
åæ¼ç®ææ®µï¼ãããé
åç®åºææ®µï¼ã®æ¦ç¥æ§æ
ã示ããããã¯åè·¯å³ã§ããããã®å³ï¼ã«ããã¦ãå
¥å
端åï¼ï¼ï¼ã«ã¯ãä¸è¿°ã®å³ï¼ã«ãããï¼ï¼¤ï¼£ï¼´åè·¯ï¼ï¼
ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®å¨æ³¢æ°è»¸ä¸ã®ã¹ãã¯ãã«ãã¼
ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ãåã³ãä¸è¿°ã®å³ï¼ã«ãããããã
ã¯æ±ºå®åè·¯ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®ãããã¯ãµã¤
ãºæ
å ±ãä¾çµ¦ããã¦ããã以å¾ãå³ï¼ã§ç¤ºããããä¸è¿°
ã®å³ï¼ã«ããããããå²ãå½ã¦ç®åºåè·¯ï¼ï¼ï¼ã«ãã
ã¦ãä¸è¿°ã®ãããã¯ãµã¤ãºæ
å ±ã«é©å¿ããã宿°ãéã¿
ä»ã颿°çãç¨ãã¦å¦çãã¦ãããNext, specific means of bit allocation performed by the bit allocation calculating circuit 118 in FIG. 1 will be described with reference to FIG. FIG. 5 is a block circuit diagram showing a schematic configuration of a bit allocation calculating means (bit allocation calculating means) which is a specific example of the bit allocation calculating circuit 118 in FIG. In FIG. 5, an input terminal 601 is connected to the MDCT circuit 10 shown in FIG.
The spectrum data or MDCT coefficients on the frequency axis from 3, 104 and 105 and the block size information from the block determination circuits 109, 110 and 111 in FIG. Thereafter, the above-described bit allocation calculation circuit 118 shown in FIG. 1 shown in FIG.
ãï¼ï¼ï¼ï¼ãå³ï¼ã«ããã¦ãå
¥å端åï¼ï¼ï¼ããå
¥åã
ã卿³¢æ°è»¸ä¸ã®ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã¯ã
叝忝ã¨ãã«ã®ã¼ç®åºåè·¯ï¼å¸¯åæ¯ã¨ãã«ã®ã¼ç®åºæ
段ï¼ï¼ï¼ï¼ã«ä¾çµ¦ããã¦ãåä½ãããã¯æ¯ã®ã¨ãã«ã®ã¼
ããä¾ãã°åä½ãããã¯å
ã§ã®åæ¯å¹
å¤ã®ç·åãè¨ç®ã
ããã¨çã«ããæ±ããããããã®åãã³ãæ¯ã®ã¨ãã«ã®
ã¼ã®ä»£ããã«ãæ¯å¹
å¤ã®ãã¼ã¯å¤ãå¹³åå¤çãç¨ããã
ããã¨ãããããã®ã¨ãã«ã®ã¼ç®åºåè·¯ï¼ï¼ï¼ããã®åº
åã¨ãã¦ãä¾ãã°åãã³ãã®ç·åå¤ã®ã¹ãã¯ãã«ãå³ï¼
ã«ï¼³ï¼¢ã¨ãã¦ç¤ºãã¦ãããä½ãããã®å³ï¼ã§ã¯ãå³ç¤ºã
ç°¡ç¥åãããããåä½ãããã¯ã«ããå岿°ãï¼ï¼ãã
ãã¯ï¼ï¼¢1 ãï¼¢12ï¼ã§è¡¨ç¾ãã¦ãããå°ãå³ï¼ã®ç ´ç·
ã¯ãåãã³ãã®ç·åå¤ã®ã¹ãã¯ãã«ï¼³ï¼¢ãä»ã®é¨åã«å
ã¼ãå½±é¿ã示ããã³ã³ããªã¥ã¼ã·ã§ã³ã®éã¿ä»ãã«å¯¾å¿
ãããIn FIG. 5, spectrum data or MDCT coefficients on the frequency axis input from an input terminal 601 are:
The energy is supplied to the energy calculation circuit for each band (energy calculation means for each band) 602, and the energy for each unit block is calculated by, for example, calculating the sum of the amplitude values in the unit block. Instead of the energy for each band, a peak value or an average value of the amplitude value may be used. As an output from the energy calculation circuit 602, for example, the spectrum of the sum value of each band is shown in FIG.
Are shown as SB. However, in FIG. 6, the number of divisions by unit blocks is represented by 12 blocks (B1 to B12) to simplify the illustration. Note that the broken line in FIG. 6 indicates the effect of the spectrum SB of the total value of each band on other parts, and corresponds to convolution weighting.
ãï¼ï¼ï¼ï¼ãåãã¨ãã«ã®ã¼ç®åºåè·¯ï¼ï¼ï¼ã«ããã¦
ã¯ãåä½ãããã¯ã®ãããã¯ããã¼ãã£ã³ã°ã®ç¶æ
ã示
ããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ï¼æ£è¦åãã¼ã¿ï¼ï¼æ
å ±å§ç¸®ã
ã©ã¡ã¼ã¿ï¼ã®å¤ã«ã¤ãã¦ã決å®ãããã®ã¨ãããå
·ä½ç
ã«ã¯ãä¾ãã°ãããããã¹ã±ã¼ã«ãã¡ã¯ã¿å¤ã®åè£ã¨ã
ã¦å¹¾ã¤ãã®æ£ã®å¤ãç¨æãããã®ä¸ããåä½ãããã¯å
ã®ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤
以ä¸ã®å¤ãã¨ãä¸ã§ãæå°ã®ãã®ãåä½ãããã¯ã®ã¹ã±
ã¼ã«ãã¡ã¯ã¿å¤ã¨ãã¦æ¡ç¨ãããã¹ã±ã¼ã«ãã¡ã¯ã¿å¤ã«
ã¤ãã¦ã¯ãå®éã®å¤ã¨å¯¾å¿ããå½¢ã§ãæ°ããããç¨ã
ã¦ãçªå·ä»ããè¡ãããã®çªå·ãROï¼çï¼å³ç¤ºããï¼
ã«ããè¨æ¶ããã¦ããã°ãããåãããåä½ãããã¯ã«
ããã¦ä¸è¿°ã®æ¹æ³ã§æ±ºå®ãããã¹ã±ã¼ã«ãã¡ã¯ã¿å¤ã¯ã
決å®ãããå¤ã«å¯¾å¿ããä¸è¿°ã®ããããç¨ãã¦ä»ããã
ãçªå·ãåä½ãããã¯ã®ã¹ã±ã¼ã«ãã¡ã¯ã¿ã示ããµãæ
å ±ã¨ãã¦ä½¿ç¨ãããThe energy calculation circuit 602 also determines a value of a scale factor (normalized data) (information compression parameter) indicating a block floating state of a unit block. Specifically, for example, several positive values are prepared in advance as scale factor value candidates, and a minimum value is selected from among them, taking a value equal to or more than the maximum value of the absolute value of the spectral data or MDCT coefficient in the unit block. Is adopted as the scale factor value of the unit block. The scale factor value is numbered using several bits in a form corresponding to the actual value, and the number is stored in a ROM or the like (not shown).
May be stored in the memory. Also, the scale factor value determined by the above method in a certain unit block is:
The number assigned using the above-mentioned bits corresponding to the determined value is used as sub-information indicating the scale factor of the unit block.
ãï¼ï¼ï¼ï¼ã次ã«ãä¸è¿°ã®ã¨ãã«ã®ã¼ç®åºåè·¯ï¼ï¼ï¼ã§
æ±ããããä¸è¿°ã®ã¹ãã¯ãã«ï¼³ï¼¢ã®ãããããã¹ãã³ã°
ã«ãããå½±é¿ãèæ
®ããããã«ããã®ã¹ãã¯ãã«ï¼³ï¼¢ã«
æå®ã®éã¿ä»ã颿°ãæãã¦å ç®ãããããªç³è¾¼ã¿ï¼ã³
ã³ããªã¦ã¼ã·ã§ã³ï¼å¦çãæ½ãããã®ãããä¸è¿°ã®å¸¯å
æ¯ã®ã¨ãã«ã®ã¼ç®åºåè·¯ï¼ï¼ï¼ã®åºåãå³ã¡ããã®ã¹ã
ã¯ãã«ï¼³ï¼¢ã®åå¤ã¯ãç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ç³è¾¼ã¿ãã£
ã«ã¿ææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ãããããã®ç³è¾¼ã¿ãã£ã«ã¿å
è·¯ï¼ï¼ï¼ã¯ãä¾ãã°ãå
¥åãã¼ã¿ãé æ¬¡é
å»¶ãããè¤æ°
ã®é
å»¶ç´ åã¨ããããé
å»¶ç´ åããã®åºåã«ãã£ã«ã¿ä¿
æ°ï¼éã¿ä»ã颿°ï¼ãä¹ç®ããè¤æ°ã®ä¹ç®å¨ã¨ãåä¹ç®
å¨åºåã®ç·åãã¨ãç·åå ç®å¨ã¨ããæ§æããããã®ã§
ããããã®ç³è¾¼ã¿å¦çã«ãããå³ï¼ä¸ç¹ç·ã§ç¤ºãé¨åã®
ç·åãå¾ããããNext, in order to take into account the so-called masking effect of the above-mentioned spectrum SB obtained by the above-described energy calculation circuit 602, a convolution (combo) such that the spectrum SB is multiplied by a predetermined weighting function and added. (Reuse) processing. Therefore, the output of the energy calculation circuit 602 for each band, that is, each value of the spectrum SB, is supplied to a convolution filter circuit (convolution filter means) 603. The convolution filter circuit 603 includes, for example, a plurality of delay elements for sequentially delaying input data, a plurality of multipliers for multiplying outputs from these delay elements by a filter coefficient (weighting function), and a sum of outputs of the respective multipliers. And a sum adder which takes By this convolution processing, the sum of the parts indicated by the dotted lines in FIG. 6 is obtained.
ãï¼ï¼ï¼ï¼ã次ã«ãä¸è¿°ã®ç³è¾¼ã¿ãã£ã«ã¿åè·¯ï¼ï¼ï¼ã®
åºåã¯å¼ç®å¨ï¼åæå¨ï¼ï¼åæææ®µï¼ï¼å¼ç®ææ®µï¼ï¼ï¼
ï¼ã«ä¾çµ¦ãããããã®å¼ç®å¨ï¼ï¼ï¼ã¯ãä¸è¿°ã®ç³è¾¼ãã
é åã§ã®å¾è¿°ãã許容å¯è½ãªãã¤ãºã¬ãã«ã«å¯¾å¿ããã¬
ãã«Î±ãæ±ãããã®ã§ããããªãã許容å¯è½ãªãã¤ãºã¬
ãã«ï¼éååä¿æ°ï¼ã«å¯¾å¿ããã¬ãã«Î±ã¯ãå¾è¿°ããã
ãã«ãéã³ã³ããªã¦ã¼ã·ã§ã³å¦çãè¡ããã¨ã«ãã£ã¦ã
ã¯ãªãã¤ã«ã«ãã³ãã®åãã³ãæ¯ã®è¨±å®¹ãã¤ãºã¬ãã«
ï¼éååä¿æ°ï¼ã¨ãªããããªã¬ãã«ã§ãããããã§ãä¸
è¿°ã®å¼ç®å¨ï¼ï¼ï¼ã«ã¯ãä¸è¿°ã®ã¬ãã«Î±ãæ±ããããã®
è¨±å®¹é¢æ°ï¼ãã¹ãã³ã°ã¬ãã«ã表ç¾ãã颿°ï¼ãä¾çµ¦ã
ããããã®è¨±å®¹é¢æ°ã墿¸ããããã¨ã§ä¸è¿°ã®ã¬ãã«Î±
ã®å¶å¾¡ãè¡ã£ã¦ãããè¨±å®¹é¢æ°ã¯ã次ã«èª¬æãããããª
ï¼ï½âï½ï½ï¼é¢æ°çºçåè·¯ï½ï¼ï½âï½ï½ï¼é¢æ°çºçæ
段ï½ï¼ï¼ï¼ããä¾çµ¦ããã¦ãããã®ã§ãããNext, the output of the above-described convolution filter circuit 603 is output to a subtractor (synthesizer) (synthesis means) (subtraction means) 60.
4 is supplied. The subtracter 604 obtains a level α corresponding to an allowable noise level described later in the convolved region. Note that the level α corresponding to the allowable noise level (quantization coefficient) is determined by performing inverse convolution processing as described later.
This is a level that becomes an allowable noise level (quantization coefficient) for each band of the critical band. Here, the above-described subtractor 604 is supplied with an allowance function (a function expressing a masking level) for obtaining the above-mentioned level α. By increasing or decreasing this tolerance function, the level α
Is controlled. The permissible function is supplied from a (n-ai) function generating circuit {(n-ai) function generating means # 605 as described below.
ãï¼ï¼ï¼ï¼ãå³ã¡ã許容ãã¤ãºã¬ãã«ã«å¯¾å¿ããã¬ãã«
αã¯ãã¯ãªãã¤ã«ã«ãã³ãã®ãã³ãã®ä½åããé ã«ä¸ã
ãããçªå·ãï½ã¨ããã¨ãæ¬¡ã®æ°ï¼ã®å¼ã§æ±ãããã¨ã
ã§ãããThat is, the level α corresponding to the allowable noise level can be obtained by the following equation (1), where i is a number sequentially given from the lower band of the critical band.
ãï¼ï¼ï¼ï¼ã[0041]
ãæ°ï¼ãαï¼ï¼³âï¼ï½âï½ï½ï¼Î = Sâ (nâai)
ãï¼ï¼ï¼ï¼ããã®æ°ï¼ã®å¼ã«ããã¦ãï½ãï½ã¯å®æ°ã§ã
ï½ï¼ï¼ãï¼³ã¯ç³è¾¼ã¿å¦çããããã¼ã¯ã¹ãã¯ãã«(Bark
Spectrum) ã®å¼·åº¦ã§ãããæ°ï¼ã®å¼ä¸ï¼ï½âï½ï½ï¼ã許
容颿°ã¨ãªããä¸ä¾ã¨ãã¦ï½ï¼ï¼ï¼ï¼ï½ï¼ï¼ãç¨ããã
ã¨ãã§ãããå°ããã¼ã¯ã¯ãä¸è¬ã«ã¯ãªãã£ã«ã«ãã³ã
ã®åä½ãæå³ããããã¼ã¯ã¹ãã¯ãã«ã¯ãï¼ã¤ã®ã¯ãªã
ã£ã«ã«ãã³ãã«å¯¾ããï¼ã¤ã®ã¹ãã¯ãã«ã代表ãããã
ã¨ãæå³ãããIn the equation (1), n and a are constants.
a> 0, S is the convolution processed Bark spectrum (Bark
Spectrum), and (n-ai) in Equation 1 is an allowable function. As an example, n = 38 and a = 1 can be used. Note that bark generally means a unit of a critical band. The bark spectrum means that one spectrum is represented for one critical band.
ãï¼ï¼ï¼ï¼ããã®ããã«ãã¦ãä¸è¿°ã®ã¬ãã«Î±ãæ±ãã
ãããã®ãã¼ã¿ã¯å²ç®å¨ï¼å²ç®ææ®µï¼ï¼ï¼ï¼ã«ä¾çµ¦ãã
ããå²ç®å¨ï¼ï¼ï¼ã§ã¯ãä¸è¿°ã®ç³è¾¼ã¿ãããé åã§ã®ä¸
è¿°ã®ã¬ãã«Î±ãéã³ã³ããªã¦ã¼ã·ã§ã³ããããã®ãã®ã§
ãããå¾ã£ã¦ããã®éã³ã³ããªã¦ã¼ã·ã§ã³å¦çãè¡ãã
ã¨ã«ãããä¸è¿°ã®ã¬ãã«Î±ãããã¹ãã³ã°ã¹ãã¯ãã«ã
å¾ãããããã«ãªããå³ã¡ããã®ãã¹ãã³ã°ã¹ãã¯ãã«
ã許容ãã¤ãºã¹ãã¯ãã«ã¨ãªãããªããä¸è¿°ã®éã³ã³ã
ãªã¦ã¼ã·ã§ã³å¦çã¯ãè¤éãªæ¼ç®ãå¿
è¦ã¨ãããããã®
宿½ã®å½¢æ
ã§ã¯ç°¡ç¥åããå²ç®å¨ï¼ï¼ï¼ãç¨ãã¦éã³ã³
ããªã¦ã¼ã·ã§ã³ãè¡ã£ã¦ãããIn this way, the above-mentioned level α is obtained, and this data is supplied to a divider (division means) 606. The divider 606 is for performing inverse convolution of the level α in the convolved region. Therefore, by performing the inverse convolution processing, a masking spectrum can be obtained from the level α. That is, this masking spectrum becomes an allowable noise spectrum. Note that the above-described inverse convolution processing requires a complicated operation, but in this embodiment, the inverse convolution is performed using the simplified divider 606.
ãï¼ï¼ï¼ï¼ã次ã«ãä¸è¿°ã®ãã¹ãã³ã°ã¹ãã¯ãã«ã¯ãå
æåè·¯ï¼åæææ®µï¼ï¼ï¼ï¼ãä»ãã¦æ¸ç®å¨ï¼æ¸ç®ææ®µï¼
ï¼ï¼ï¼ã«ä¾çµ¦ããããããã§ãæ¸ç®å¨ï¼ï¼ï¼ã«ã¯ãä¸è¿°
ã®å¸¯åæ¯ã®ã¨ãã«ã®ã¼ç®åºåè·¯ï¼ï¼ï¼ããã®åºåãå³
ã¡ãåè¿°ããã¹ãã¯ãã«ï¼³ï¼¢ããé
å»¶åè·¯ï¼é
å»¶ææ®µï¼
ï¼ï¼ï¼ãä»ãã¦ä¾çµ¦ããã¦ãããå¾ã£ã¦ããã®æ¸ç®å¨ï¼
ï¼ï¼ã§ä¸è¿°ã®ãã¹ãã³ã°ã¹ãã¯ãã«ã¨ã¹ãã¯ãã«ï¼³ï¼¢ã¨
ã®æ¸ç®æ¼ç®ãè¡ããããã¨ã§ãå³ï¼ã«ç¤ºãããã«ãä¸è¿°
ã®ã¹ãã¯ãã«ï¼³ï¼¢ã¯ããã®ãã¹ãã³ã°ã¹ãã¯ãã«ï¼ï¼³ã®
ã¬ãã«ã§ç¤ºãã¬ãã«ä»¥ä¸ããã¹ãã³ã°ããããã¨ã«ãª
ããNext, the above-mentioned masking spectrum is subtracted by a subtracter (subtraction means) via a synthesis circuit (synthesis means) 607.
608. Here, the output from the above-described energy calculation circuit 602 for each band, that is, the above-mentioned spectrum SB is supplied to the subtracter 608 by a delay circuit (delay means).
609. Therefore, this subtractor 6
By performing the subtraction operation of the above-mentioned masking spectrum and the spectrum SB at 08, as shown in FIG. 7, the above-mentioned spectrum SB is masked below the level indicated by the level of the masking spectrum MS.
ãï¼ï¼ï¼ï¼ãã¨ããã§ãä¸è¿°ããåæåè·¯ï¼ï¼ï¼ã§ã®å
æã®éã«ã¯ãæå°å¯è´ã«ã¼ãçºçåè·¯ï¼æå°å¯è´ã«ã¼ã
çºçææ®µï¼ï¼ï¼ï¼ããä¾çµ¦ãããå³ï¼ã«ç¤ºããããªäººé
ã®è´è¦ç¹æ§ã§ããããããæå°å¯è´ã«ã¼ãRCã示ãã
ã¼ã¿ã¨ãä¸è¿°ã®ãã¹ãã³ã°ã¹ãã¯ãã«ï¼ï¼³ã¨ãåæãã
ãã¨ãã§ããããã®æå°å¯è´ã«ã¼ãã«ããã¦ãéé³çµ¶å¯¾
ã¬ãã«ããã®æå°å¯è´ã«ã¼ã以ä¸ãªãã°ãã®éé³ã¯èã
ããªããã¨ã«ãªãããã®æå°å¯è´ã«ã¼ãã¯ãã³ã¼ãã¤ã³
ã°ãåãã§ãã£ã¦ãä¾ãã°åçæã®åçããªã¦ã¼ã ã®é
ãã§ç°ãªããã®ã¨ãªãããç¾å®çãªãã£ã¸ã¿ã«ã·ã¹ãã
ã§ã¯ãä¾ãã°ï¼ï¼ããããã¤ãããã¯ã¬ã³ã¸ã¸ã®é³æ¥½ã®
å
¥ãæ¹ã«ã¯ãã»ã©éãããªãã®ã§ãä¾ãã°ï¼ï½ï¼¨ï½ä»è¿
ã®æãè³ã«èããããã卿³¢æ°å¸¯åã®éååéé³ãèã
ããªãã¨ããã°ãä»ã®å¨æ³¢æ°å¸¯åã§ã¯ãã®æå°å¯è´ã«ã¼
ãã®ã¬ãã«ä»¥ä¸ã®éååéé³ã¯èãããªãã¨èããã
ããå¾ã£ã¦ããã®ããã«ä¾ãã°ã·ã¹ãã ã®æã¤ã¯ã¼ãã¬
ã³ã°ã¹ã®ï¼ï½ï¼¨ï½ä»è¿ã®éé³ãèãããªãä½¿ãæ¹ããã
ã¨ä»®å®ãããã®æå°å¯è´ã«ã¼ãRCã¨ãã¹ãã³ã°ã¹ãã¯
ãã«ï¼ï¼³ã¨ãå
±ã«åæãããã¨ã§è¨±å®¹ãã¤ãºã¬ãã«ãå¾
ãããã«ããã¨ããã®å ´åã®è¨±å®¹ãã¤ãºã¬ãã«ã¯ãå³ï¼
ä¸ã®æç·ã§ç¤ºãé¨åã¾ã§ã¨ãããã¨ãã§ããããã«ãª
ãããªãããã®å®æ½ã®å½¢æ
ã§ã¯ãä¸è¿°ã®æå°å¯è´ã«ã¼ã
ã®ï¼ï½ï¼¨ï½ã®ã¬ãã«ããä¾ãã°ï¼ï¼ãããç¸å½ã®æä½ã¬
ãã«ã«åããã¦ãããåããã®å³ï¼ã¯ãä¿¡å·ã¹ãã¯ãã«
SSãåæã«ç¤ºãã¦ãããBy the way, at the time of the synthesizing by the synthesizing circuit 607, the so-called minimum audible characteristic which is a human auditory characteristic as shown in FIG. Data indicating the curve RC and the above-described masking spectrum MS can be synthesized. At this minimum audible curve, if the absolute noise level is below this minimum audible curve, the noise will not be heard. This minimum audible curve will be different depending on the playback volume during playback, for example, even if the coding is the same, but in a realistic digital system, for example, the way music enters the 16-bit dynamic range will be different. Since there is not much difference, if quantization noise in the most audible frequency band around 4 kHz is not heard, for example, it is considered that quantization noise below the level of the minimum audible curve is not heard in other frequency bands. . Therefore, assuming that the system is used in such a manner that noise around 4 kHz of the word length of the system cannot be heard, and an allowable noise level is obtained by synthesizing the minimum audible curve RC and the masking spectrum MS together, The allowable noise level in this case is shown in FIG.
It becomes possible to extend to the part shown by the oblique line in the middle. In this embodiment, the 4 kHz level of the above-mentioned minimum audible curve is adjusted to the lowest level corresponding to, for example, 20 bits. FIG. 8 also shows the signal spectrum SS.
ãï¼ï¼ï¼ï¼ããã®å¾ã許容éé³è£æ£åè·¯ï¼è¨±å®¹éé³è£æ£
ææ®µï¼ï¼ï¼ï¼ã§ã¯ãè£æ£æ
å ±åºååè·¯ï¼ï¼ï¼ããä¾çµ¦ã
ããä¾ãã°çã©ã¦ããã¹ã«ã¼ãã®æ
å ±ã«åºã¥ãã¦ãä¸è¿°
ã®æ¸ç®å¨ï¼ï¼ï¼ããã®åºåã«ããã許容éé³ã¬ãã«ãè£
æ£ãã¦ãããããã§ãçã©ã¦ããã¹ã«ã¼ãã¨ã¯ã人éã®
è´è¦ç¹æ§ã«é¢ããç¹æ§æ²ç·ã§ãããä¾ãã°ï¼ï½ï¼¨ï½ã®ç´
é³ã¨åã大ããã«èãããå卿³¢æ°ã§ã®é³ã®é³å§ãæ±ã
ã¦æ²ç·ã§çµãã ãã®ã§ãã©ã¦ããã¹ã®çæåº¦æ²ç·ã¨ãå¼
ã°ãããåããã®çã©ã¦ããã¹æ²ç·ã¯ãå³ï¼ã«ç¤ºããæ
å°å¯è´ã«ã¼ãRCã¨ç¥åãæ²ç·ãæããã®ã§ããããã®
çã©ã¦ããã¹æ²ç·ã«ããã¦ã¯ãä¾ãã°ï¼ï½ï¼¨ï½ä»è¿ã§ã¯
ï¼ï½ï¼¨ï½ã®ã¨ããããé³å§ãï¼ãï¼ï¼ï½ï¼¢ä¸ãã£ã¦ãï¼
ï½ï¼¨ï½ã¨åã大ããã«èãããéã«ãï¼ï¼ï¼¨ï½ä»è¿ã§ã¯
ï¼ï½ï¼¨ï½ã§ã®é³å§ãããç´ï¼ï¼ï½ï¼¢é«ããªãã¨åã大ã
ãã«èãããªãããã®ãããä¸è¿°ã®æå°å¯è´ã«ã¼ãã®ã¬
ãã«ãè¶ããéé³ï¼è¨±å®¹ãã¤ãºã¬ãã«ï¼ã¯ããã®çã©ã¦
ããã¹æ²ç·ã«å¿ããã«ã¼ãã§ä¸ãããã卿³¢æ°ç¹æ§ãæ
ã¤ããã«ããã®ãè¯ããã¨ããããããã®ãããªãã¨ã
ããä¸è¿°ã®çã©ã¦ããã¹æ²ç·ãèæ
®ãã¦ä¸è¿°ã®è¨±å®¹ãã¤
ãºã¬ãã«ãè£æ£ãããã¨ã¯ã人éã®è´è¦ç¹æ§ã«é©åãã¦
ãããã¨ãåãããThereafter, in the allowable noise correction circuit (allowable noise correction means) 610, the allowable noise level in the output from the above-described subtractor 608 based on, for example, information on the equal loudness curve supplied from the correction information output circuit 613. Has been corrected. Here, the equal loudness curve is a characteristic curve relating to human auditory characteristics. For example, the loudness curve is obtained by calculating the sound pressure of sound at each frequency that sounds as loud as a pure tone of 1 kHz, and is connected by a curve. Also called a sensitivity curve. Further, this equal loudness curve draws substantially the same curve as the minimum audible curve RC shown in FIG. In this equal loudness curve, for example, around 4 kHz, even if the sound pressure falls by 8 to 10 dB from the place of 1 kHz, 1
It sounds the same size as kHz, and conversely, it does not sound the same at around 50 Hz unless it is about 15 dB higher than the sound pressure at 1 kHz. For this reason, it can be seen that noise exceeding the level of the minimum audible curve (allowable noise level) should have a frequency characteristic given by a curve corresponding to the loudness curve. From this, it can be seen that correcting the above-mentioned allowable noise level in consideration of the above-mentioned equal loudness curve is suitable for human auditory characteristics.
ãï¼ï¼ï¼ï¼ãããã§ãè£æ£æ
å ±åºååè·¯ï¼ï¼ï¼ã¨ãã¦ã
ä¸è¿°ã®å³ï¼ã«ãããé©å¿ãããå²å½ç¬¦å·ååè·¯ï¼ï¼ï¼ã
ï¼ï¼ï¼ãï¼ï¼ï¼ã§ã®éååã®éã®åºåæ
å ±éï¼ãã¼ã¿
éï¼ã®æ¤åºåºåã¨ãæçµç¬¦å·åãã¼ã¿ã®ãããã¬ã¼ãç®
æ¨å¤ã¨ã®éã®èª¤å·®ã®æ
å ±ã«åºã¥ãã¦ãä¸è¿°ã®è¨±å®¹ãã¤ãº
ã¬ãã«ãè£æ£ããããã«ãã¦ããããããã¯ãå
¨ã¦ã®ã
ããå²å½åä½ãããã¯ã«å¯¾ãã¦äºã䏿çãªé©å¿ããã
å²å½ãè¡ã£ã¦å¾ãããç·ãããæ°ããæçµçãªç¬¦å·ååº
åãã¼ã¿ã®ãããã¬ã¼ãã«ãã£ã¦å®ã¾ãä¸å®ã®ãããæ°
ï¼ç®æ¨å¤ï¼ã«å¯¾ãã¦èª¤å·®ãæã¤ãã¨ãããããã®èª¤å·®å
ãï¼ã¨ããããã«å度ãããå²å½ããããã®ã§ãããå³
ã¡ãç®æ¨å¤ãããç·å²å½ãããæ°ãå°ãªãã¨ãã«ã¯ãå·®
ã®ãããæ°ãååä½ãããã¯ã«å²ãæ¯ã£ã¦ä»å ãããã
ã«ããç®æ¨å¤ãããç·å²å½ãããæ°ãå¤ãã¨ãã«ã¯ãå·®
ã®ãããæ°ãååä½ãããã¯ã«å²ãæ¯ã£ã¦åãããã«ã
ãããã§ãããHere, as the correction information output circuit 613,
The adaptive bit allocation encoding circuit 106 in FIG.
The above-mentioned allowable noise level is corrected based on the error information between the detection output of the output information amount (data amount) at the time of quantization in 107 and 108 and the bit rate target value of the final encoded data. It may be. This is because the total number of bits obtained by previously performing temporary adaptive bit allocation for all the bit allocation unit blocks becomes a fixed number of bits (target value) determined by the bit rate of the final encoded output data. In some cases, there is an error, and bit allocation is performed again so that the error is set to zero. That is, when the total number of allocated bits is smaller than the target value, the difference bit number is allocated to each unit block and added. It is to be allocated and cut.
ãï¼ï¼ï¼ï¼ããã®ãããªãã¨ãè¡ããããä¸è¿°ã®ç·å²å½
ãããæ°ã®ä¸è¿°ã®ç®æ¨å¤ããã®èª¤å·®ãæ¤åºãããã®èª¤å·®
ãã¼ã¿ã«å¿ãã¦è£æ£æ
å ±åºååè·¯ï¼ï¼ï¼ãåå²å½ããã
æ°ãè£æ£ããããã®è£æ£ãã¼ã¿ãåºåãããããã§ãä¸
è¿°ã®èª¤å·®ãã¼ã¿ããããæ°ä¸è¶³ã示ãå ´åã¯ãä¸è¿°ã®å
ä½ãããã¯å½ããå¤ãã®ãããæ°ã使ããããã¨ã§ä¸è¿°
ã®ãã¼ã¿éãä¸è¿°ã®ç®æ¨å¤ãããå¤ããªã£ã¦ããå ´åã
èãããã¨ãã§ãããåãä¸è¿°ã®èª¤å·®ãã¼ã¿ããããã
æ°ä½ãã示ããã¼ã¿ã¨ãªãå ´åã¯ãä¸è¿°ã®åä½ãããã¯
å½ããå°ãªããããæ°ã§æ¸ã¿ãä¸è¿°ã®ãã¼ã¿éãä¸è¿°ã®
ç®æ¨å¤ãããå°ãªããªã£ã¦ããå ´åãèãããã¨ãã§ã
ããIn order to do this, an error of the total number of allocated bits from the above-described target value is detected, and the correction information output circuit 613 corrects each of the allocated number of bits according to the error data. Outputs correction data. Here, when the above-mentioned error data indicates that the number of bits is insufficient, it is possible to consider a case where the above-described data amount is larger than the above-described target value because a large number of bits are used per unit block. it can. Further, when the above-mentioned error data is data indicating the remainder of the number of bits, the case where the number of bits per unit block is small and the amount of data is smaller than the above-described target value may be considered. it can.
ãï¼ï¼ï¼ï¼ãå¾ã£ã¦ãä¸è¿°ã®è£æ£æ
å ±åºååè·¯ï¼ï¼ï¼ã
ãã¯ããã®èª¤å·®ãã¼ã¿ã«å¿ãã¦ãä¸è¿°ã®æ¸ç®å¨ï¼ï¼ï¼ã
ãã®åºåã«ããã許容ãã¤ãºã¬ãã«ï¼éååä¿æ°ï¼ãã
ä¾ãã°ä¸è¿°ã®çã©ã¦ããã¹æ²ç·ã®æ
å ±ãã¼ã¿ã«åºã¥ãã¦
è£æ£ãããããã®ä¸è¿°ã®è£æ£å¤ã®ãã¼ã¿ãåºåãããã
ãã«ãªããä¸è¿°ã®ãããªè£æ£å¤ããä¸è¿°ã®è¨±å®¹éé³è£æ£
åè·¯ï¼ï¼ï¼ã«ä¾çµ¦ããããã¨ã§ãä¸è¿°ã®æ¸ç®å¨ï¼ï¼ï¼ã
ãã®è¨±å®¹ãã¤ãºã¬ãã«ãè£æ£ãããããã«ãªãã以ä¸èª¬
æãè¡ã£ãä¸è¿°ã®è¨±å®¹éé³è£æ£åè·¯ï¼ï¼ï¼ã¾ã§ã®å¦çã
è¡ããã¨ã«ãã£ã¦ãã¡ã¤ã³æ
å ±ã¨ãã¦ç´äº¤å¤æåºåã¹ã
ã¯ãã«ããµãæ
å ±ã«ããå¦çãããâã¿ã¨ããµãæ
å ±ã¨
ãã¦ãããã¯ããâãã£ã³ã°ã®ç¶æ
ã示ãã¹ã±âã«ãã¡
ã¯ã¿âåã³èªé·ã示ãã¯âãã¬ã³ã°ã¹ãå¾ããããTherefore, the above-mentioned correction information output circuit 613 determines the allowable noise level (quantization coefficient) at the output from the above-mentioned subtractor 608 according to this error data.
For example, the data of the above-described correction value for performing the correction based on the information data of the above-mentioned equal loudness curve is output. The above-described correction value is supplied to the above-described allowable noise correction circuit 610, so that the above-described allowable noise level from the subtracter 608 is corrected. By performing the processing up to the above-described allowable noise correction circuit 610 described above, the data obtained by processing the orthogonal transform output spectrum with the sub-information as the main information and the schedule indicating the state of the block floating as the sub-information. The word length indicating the word factor and word length is obtained.
ãï¼ï¼ï¼ï¼ãããã§ãä¸è¿°ã®è¨±å®¹éé³è£æ£åè·¯ï¼ï¼ï¼ã¾
ã§ã®å¦çãè¡ããã¨ã«ãã£ã¦ãã¡ã¤ã³æ
å ±ã¨ãã¦å¾ãã
ãç´äº¤å¤æåºåã¹ãã¯ãã«ããµãæ
å ±ã«ããå¦çããã
âã¿ã¨ããµãæ
å ±ã¨ãã¦å¾ããããããã¯ããâãã£ã³
ã°ã®ç¶æ
ã示ãã¹ã±âã«ãã¡ã¯ã¿âåã³èªé·ã示ãã¯â
ãã¬ã³ã°ã¹ã«ãã符å·åã®å
·ä½ä¾ãå³ï¼ãç¨ãã¦èª¬æã
ããå³ï¼ã¯ããããå²å½ãï¼ãããã¨ãªã£ãå ´åã®åä½
ãããã¯ã®æ§åã示ããä¾ã§ããã縦軸ã¯ä¸å¿ãï¼ã¨ã
ãã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®å¤§ããã示ãã
横軸ã¯å¨æ³¢æ°ã示ãã¦ããããã®ä¾ã§ã¯åä½ãããã¯å
ã«ã¯ï½ãï½ãï½ãï½ãï½
ãï½ãï½ãï½ã§ç¤ºããããï¼æ¬
ã®ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ãåå¨ãã¦ããã
ããããï¼ããæ£æ¹ååã¯è² æ¹åã«å¤§ãããæã£ã¦ã
ããä¸è¿°ããéããããã¯ããâãã£ã³ã°ã®ç¶æ
ã示ã
ã¹ã±âã«ãã¡ã¯ã¿âã¯ãããããå¹¾ã¤ãã®å¤§ããã§æ£ã®
å¤ãç¨æãããã®ä¸ããåä½ãããã¯å
ã®ã¹ãã¯ãã«ã
ã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤ä»¥ä¸ã®å¤ãåã
ä¸ã§æå°ã®ãã®ãæ¡ç¨ããåä½ãããã¯ã®ã¹ã±ã¼ã«ãã¡
ã¯ã¿ã¼ã¨ãããHere, by performing the processing up to the permissible noise correction circuit 610, the data obtained by processing the orthogonal transform output spectrum obtained as the main information with the sub-information and the block floating obtained as the sub-information Scale factor indicating the state of the word and word indicating the word length
A specific example of encoding by length will be described with reference to FIG. FIG. 9 is an example showing a state of a unit block when the bit allocation is 3 bits. The vertical axis indicates the magnitude of the spectrum data or MDCT coefficient with the center at 0,
The horizontal axis indicates frequency. In this example, eight spectral data or MDCT coefficients represented by a, b, c, d, e, f, g, and h exist in the unit block,
Each has a magnitude in the positive or negative direction from 0. As described above, the scale factor indicating the state of the block floating is prepared in advance with positive values of several magnitudes, and among them, a value greater than the maximum value of the absolute value of the spectral data or MDCT coefficient in the unit block is selected from among them. Use the smallest value among the values, and use it as the unit block scale factor.
ãï¼ï¼ï¼ï¼ãå³ï¼ã§ã¯ã絶対å¤ã®æå¤§å¤ã示ãã¹ãã¯ã
ã«ï½ã«ãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼å¤ã鏿ãããããã®
ã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã¨ãããå²ãå½ã¦ã®å¤§ããã«ããã
åä½ãããã¯å
ã®éååå¹
ãæ±ºå®ããããå³ï¼ã®ä¾ã§ã¯
ãããå²å½ãï¼ãããã®å ´åã示ãã¦ããããæ¬æ¥ï¼ã
ããã§ç¬¦å·åï¼éååï¼ããå ´åï¼å¤ã表ç¾ãããã¨ã
å¯è½ã ããããã§ã¯ï¼ãä¸å¿ã«æ£æ¹åã¨è² æ¹åã«çåå²
ã®éååå¹
ãï¼å¤ã¥ã¤ã¨ããï¼ã¨ãããã¦ï¼å¤ã®éåå
å¤ãã¨ããï¼ãããã§è¡¨ç¾å¯è½ãªããä¸ã¤ã®ç¬¦å·ã¯æªä½¿
ç¨ã¨ãã¦ãããããã§ãåä½ãããã¯å
ã®ã¹ã±ã¼ã«ãã¡
ã¯ã¿ã¼å¤ã¨ãããå²å½å¤ãããéååå¤ã決å®ãããå
ä½ãããã¯å
ã®ã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã¯ã
æãè¿ãéååå¤ã«éååããããå³ï¼ã«ãããé»ä¸¸ã®
é¨åã¯åä½ãããã¯å
ã®ããããã®ã¹ãã¯ãã«ãã¼ã¿å
ã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ãéååãããå¤ã示ãããã®ã§ããã
å³ã¡ãå³ï¼ã¯ãåéååï¼æ£è¦åãã¦éååï¼ã®ä¸ä¾ã
示ãããã®ã§ãããIn FIG. 9, the scale factor value is selected based on the spectrum a indicating the maximum absolute value. With this scale factor and the size of the bit allocation,
The quantization width in the unit block is determined. The example of FIG. 9 shows a case where the bit allocation is 3 bits. However, when encoding (quantization) is originally performed with 3 bits, it is possible to express eight values. In the direction, a quantization width of equal division is taken by three values, a seven-value quantization value is added together with 0, and another code that can be expressed by 3 bits is unused. Here, the quantization value is determined from the scale factor value and the bit allocation value in the unit block, and the spectral data or MDCT coefficient in the unit block is
It is quantized to the nearest quantization value. The black circles in FIG. 9 indicate the quantized values of the respective spectral data or MDCT coefficients in the unit block.
That is, FIG. 9 shows an example of requantization (normalization and quantization).
ãï¼ï¼ï¼ï¼ãä¸è¬ã«ãå³ï¼ã§ç¤ºãããããªæ¹æ³ã§ãï¼ã
ä¸å¿ã¨ãã¦æ£æ¹åã¨è² æ¹åã«çåå²ã®éååå¹
ãæã¤ã
ããªå½¢ã§éååãè¡ãå ´åã®éååå¹
ãQVã¨ããã¨ã
ããåä½ãããã¯ã®éååå¹
QVã¯ãååä½ãããã¯ã®
ã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®å¤ãSFããããå²å½æ°ãï¼®ï½ã¨
ããã¨ãã«ã以ä¸ã®æ°ï¼ã®å¼ã«ãã£ã¦æ±ãããã¨ãåºæ¥
ããIn general, the quantization width when performing quantization in such a manner as to have equal quantization widths in the positive direction and the negative direction centering on 0 using the method shown in FIG. ,
The quantization width QV of a certain unit block can be obtained by the following equation (2), where the value of the scale factor of the same unit block is SF and the number of allocated bits is Nb.
ãï¼ï¼ï¼ï¼ã[0053]
ãæ°ï¼ãQVï¼ï¼³ï¼¦ï¼ï¼ï¼(Nb-1)âï¼ï¼ ä½ããï¼®ï½â§ï¼QV = SF / (2 (Nb-1) -1) where Nb â§ 2
ãï¼ï¼ï¼ï¼ãã¨ããã§ãä¸è¿°ã®æ§ãªæé ãè¸ãã§åéå
åãè¡ãå ´åãããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼
ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤ãããããããç¨æããã
ã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ããå°ããå ´å
ã¯ãåä½ãããã¯ã®ã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼å¤ã¯ããããã
ç¨æãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ãæ¡ç¨
ããããã¨ã¨ãªãããã®ããã«ãããããç¨æãããã¹
ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ãæ¡ç¨ããåä½ãã
ãã¯ã«ã¤ãã¦ã¯ãåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤
ï¼£ï¼´ä¿æ°ã«ãããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤
ï¼£ï¼´ä¿æ°ã®ãã¡å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååã
ããããã«æä½éå¿
è¦ãªãããæ°ãåå¨ãããBy the way, when requantization is performed according to the above procedure, the spectrum or M
If the maximum value of the absolute value of the DCT coefficient is smaller than the smallest one among the prepared scale factors, the smallest one among the prepared scale factors is adopted as the scale factor value of the unit block. It will be. As for the unit block employing the smallest one among the scale factors prepared in advance, the spectrum in the unit block or the MD
According to the CT coefficient, the spectrum or MD in a unit block
There is a minimum number of bits required for at least one of the CT coefficients to be quantized to a value other than zero.
ãï¼ï¼ï¼ï¼ãè¨ãæããã°ããããããç¨æãããã¹ã±
ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ãæ¡ç¨ããåä½ããã
ã¯ã«ã¤ãã¦ã¯ãï¼ããã以ä¸ã®ãããå²å½ãããã¦ãã¦
ãããã®ãããå²ãå½ã¦æ°ãä¸è¿°ã®æä½éå¿
è¦ãªããã
æ°ããå°ãªããããå²ãå½ã¦ã®å ´åããããå²å½ããã
ã¦ããã«ãé¢ããããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯
ï¼ï¼¤ï¼£ï¼´ä¿æ°ããã¹ã¦ï¼ã«éååããã¦ãã¾ãããã®ã
ããªå ´åããããå²ãå½ã¦ãï¼ã«ãããã¨ãå¯è½ã¨ãª
ããããã«ããã¹ãã¯ãã«ãã¼ã¿åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®ç¬¦
å·åã«ä½¿ç¨ãã¦ããããããçç¥ããããå¹ççãªé
å
ãè¡ããã¨ãå¯è½ã¨ãªããå³ï¼ã«ããã符å·åè£æ£åè·¯
ï¼ï¼ï¼ã§ã¯ããã®ãããªå ´åã«ãããå¹ççãªå½¢ã§ç¬¦å·
åããããã®è£æ£ãè¡ããã®ã§ãããIn other words, for a unit block employing the smallest scale factor prepared in advance, even if bits of 2 bits or more are allocated, the number of allocated bits is the minimum required bit. If the number of bits is smaller than the number, all the spectra or MDCT coefficients in the unit block are quantized to 0, even though the bits are allocated. In such a case, it is possible to set the bit allocation to 0, thereby omitting the bits used for coding the spectral data or MDCT coefficients, and performing more efficient allocation. In such a case, the encoding correction circuit 614 in FIG. 5 performs correction for encoding in a more efficient manner.
ãï¼ï¼ï¼ï¼ã以ä¸ã«ãå³ï¼ã«ããã符å·åè£æ£åè·¯ï¼ï¼
ï¼ã«ããã符å·åã®è£æ£ã«ã¤ãã¦ãå³ï¼ï¼ãç¨ãã¦èª¬æ
ãããå³ï¼ï¼ã¯å³ï¼ã¨åæ§ã«ããåä½ãããã¯ã®åéå
åã®æ§åã示ãã¦ããã縦æ¹åã¯ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£
ï¼´ä¿æ°ã®å¤§ããã示ããæ¨ªæ¹åã¯å¨æ³¢æ°ã示ããã®ã¨
ããåä½ãããã¯å
ã«ã¯ï¼æ¬ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´
ä¿æ°ãåå¨ãã¦ããããã®ä¾ã§ã¯ãåä½ãããã¯å
ã®ã¹
ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤ããããã
ããç¨æãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ã
ãå°ããããã®åä½ãããã¯ã®ã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼å¤ã¯
ãããããç¨æãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®
ãã®ãæ¡ç¨ããã¦ããããããå²ãå½ã¦ã¯ï¼ãããã§ã
å³ï¼ï¼ã«ç¤ºããéããï¼ã¨ãæ£æ¹åã¨è² æ¹åã«ï¼å¤ã¥
ã¤ãè¨ï¼å¤ã®éååå¤ãæã¤ãã®ã¨ããããããï¼ãã
ãå²ãå½ã¦ã®å ´åå³ï¼ï¼ã®ããã«åä½ãããã¯å
ã®ã¹ã
ã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤ããåå³ï¼ï¼
ã«ã®ç¹ç·ã§ç¤ºãããéååå¹
ã®ååã®å¤ããå°ããå ´å
ã¯ãåä½ãããã¯å
ã®å
¨ã¦ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿
æ°ã¯ãï¼ã«éååããããã¤ã¾ããï½ãï½ã®ï¼æ¬ã®ã¹ã
ã¯ãã«ãå
¨ã¦ãï¼ï¼ãã§ç¬¦å·åãããå°ãªãã¨ãã¹ãã¯
ãã«ã®è¨é²ã«ï¼ï¼ããããå¿
è¦ã¨ããããéååå¤ãã
ã¹ã¦ï¼ã¨ãªããHereinafter, the encoding correction circuit 61 in FIG.
The encoding correction in 4 will be described with reference to FIG. FIG. 10 shows a state of requantization of a certain unit block as in FIG.
It indicates the magnitude of the T coefficient, the horizontal direction indicates the frequency, and eight spectra or MDCTs are contained in a unit block.
Coefficient exists. In this example, the maximum value of the absolute value of the spectrum or the MDCT coefficient in the unit block is smaller than the smallest one of the prepared scale factors, and the scale factor value of this unit block is the scale factor value of the prepared scale factor. The smallest one is adopted, the bit allocation is 2 bits,
As shown in FIG. 10, it is assumed that there are a total of three quantized values, that is, 0 and one value each in the positive direction and the negative direction. However, in the case of 2-bit allocation, the maximum value of the absolute value of the spectrum or MDCT coefficient in the unit block as shown in FIG.
In the case where the value is smaller than half the quantization width, as indicated by the dotted line in FIG. That is, the eight spectra a to h are all encoded with â00â, and at least 16 bits are required for recording the spectra, but the quantization values are all zero.
ãï¼ï¼ï¼ï¼ããã®å ´åãµãæ
å ±çã«ãããåä½ãããã¯
ã«ã¤ãã¦ã¯è¨é²ãããå³ã¡ããããå²å½ãï¼ãããã¨å¤
æ´ãããã¨ã«ãããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼
ï¼¤ï¼£ï¼´ä¿æ°ã®å¤ã¯å
¨ã¦ï¼ã¨ã¿ãªãäºãå¯è½ã¨ãªãã®ã§ã
ä¸è¿°ã®ï¼ãããå²ãå½ã¦ã®å ´åã«ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£
ï¼´ä¿æ°ã®éååå¤ãï¼ï¼ãã«ä½¿ç¨ãã¦ããï¼ï¼ãããå
ã使ããã«ã¾ã£ããåæ§ã®ç¬¦å·åãè¡ããã¨ãå¯è½ã¨ãª
ããå³ã¡ãããåä½ãããã¯å
ã§ï¼ããã以ä¸ã®ããã
å²å½ãããã«ãé¢ãããã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®
éååå¤ããã¹ã¦ï¼ã¨ãªãæ§ãªå ´åãåä½ãããã¯ã®ã
ããå²å½ãï¼ã¨ãããã¨ã«ãããã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£
ï¼´ä¿æ°ã®ç¬¦å·åã«ä½¿ç¨ãã¦ããããããçç¥ããã¾ã£ã
ãåæ§ã®ç¬¦å·åãè¡ããã¨ãå¯è½ã§ãããIn this case, the unit block is not recorded due to the sub-information or the like, that is, by changing the bit allocation to 0 bits, the spectrum or M
Since all DCT coefficient values can be regarded as 0,
Spectrum or MDC in case of 2-bit allocation described above
Exactly the same encoding can be performed without using the 16 bits used for the quantization value â00â of the T coefficient. That is, in a case where the quantization value of the spectrum or the MDCT coefficient is all 0 in spite of the fact that there is a bit allocation of 2 bits or more in a certain unit block, the bit allocation of the unit block is set to 0, MDC
It is possible to omit the bits used for encoding the T coefficient and perform exactly the same encoding.
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã«ç¤ºããæ§ãªãï¼ãããå²ãå½ã¦ã§
ãªãå ´åã«ããä¸è¬ã«ãããããç¨æãããã¹ã±ã¼ã«ã
ã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ãã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼å¤ã¨ã
ã¦æ¡ç¨ãããåä½ãããã¯ã«ã¤ãã¦ãåä½ãããã¯å
ã®
ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®çµ¶å¯¾å¤ã®æå¤§å¤ãSPï½
ï½ï½ã¨ãã¦ãä¸è¿°ã®æ°ï¼ã®å¼ã«ããæ±ããããåä½ãã
ãã¯ã®éååå¹
QVããã¡ãã¦ã次ã«ç¤ºãæ°ï¼ã®å¼ã®æ¡
ä»¶ãæºããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿
æ°ã®éååå¤ã¯ãã¹ã¦ï¼ã¨ãªããåãéã«æ°ï¼ã®å¼ãæº
ãããªããããªï¼±ï¼¶ãä¸è¿°ã®æ°ï¼ã®å¼ã®ï¼®ï½ããæ±ãã
ããã®ã§ãããã«ãããåä½ãããã¯å
ã®ã¹ãã¯ãã«å
ã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®ãã¡å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«é
ååãããããã«æä½éå¿
è¦ãªãããæ°ãæ±ãããã¨ã
ã§ãããAs shown in FIG. 10, even in the case where the allocation is not 2-bit, in general, a unit block in which the smallest one of the prepared scale factors is adopted as the scale factor value has the spectrum or the spectrum within the unit block. The maximum value of the absolute value of the MDCT coefficient is SPm
Using the quantization width QV of the unit block obtained by the above equation (2) as ax, the quantization value of the spectrum or MDCT coefficient in the unit block satisfying the condition of the following equation (3) is 0. Become. On the other hand, a QV that does not satisfy the equation (3) can be obtained from Nb in the above equation (2), whereby the quantum or the value of at least one of the spectrum or MDCT coefficient in the unit block becomes a value other than 0. The minimum required number of bits to be converted can be obtained.
ãï¼ï¼ï¼ï¼ã[0059]
ãæ°ï¼ãSPï½ï½ï½ï¼ï¼±ï¼¶ï¼ï¼## EQU3 ## SPmax <QV / 2
ãï¼ï¼ï¼ï¼ããã®å®æ½ã®å½¢æ
ã«ããã¦ã¯ãå³ï¼ã«ããã
符å·åè£æ£åè·¯ï¼ï¼ï¼ã§ã¯ãå
¨ã¦ã®åä½ãããã¯ã®ä¸ã
ãä¸è¿°ã®æ°ï¼ã®å¼ãæºãããããªåä½ãããã¯ãæ¤åº
ããåä½ãããã¯ã®ãµãæ
å ±ã§ããããå²å½å¤ãï¼ã¨å¤
æ´ããå°ãªãã¨ãåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤
ï¼£ï¼´ä¿æ°ã®ç¬¦å·åã«ä½¿ç¨ãã¦ãããããã®çç¥ãè¡ã£ã¦
ãããåã符å·åã®ãã©ã¼ãããã«ãããããä¾ãã°å®
質çãªãããå²å½éï¼æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ï¼ã示ããµã
æ
å ±ã§ãããå²å½ãï¼ã¨ããæ¹æ³ä»¥å¤ã«ãåä½ãããã¯
ã®æå¹æ§ãã¤ã¾ãåä½ãããã¯ãè¨é²ãããããªããã
示ããµãæ
å ±ãããå ´åããã®åä½ãããã¯ã®æå¹æ§ã
示ããµãæ
å ±ã«ããåä½ãããã¯ã®ç¬¦å·åãè¡ããªãäº
ã示ãã°ãå¦çãããã¯ã®ãµãæ
å ±ã§ããã¹ã±ã¼ã«ãã¡
ã¯ã¿åã³ãããå²å½ã«ä½¿ç¨ããã¦ãããµãæ
å ±ã®ããã
ã®çç¥ãå¯è½ã¨ãªãã®ã§ããã®ãããªå ´åã«ã¤ãã¦ãã
符å·åè£æ£åè·¯ï¼ï¼ï¼ã«ãããé©å¿ããå½¢ã«ãµãæ
å ±ã
夿´ãããããã®çç¥ãè¡ããIn this embodiment, the coding correction circuit 614 shown in FIG. 5 detects a unit block satisfying the above-mentioned equation (3) from all the unit blocks, and uses the sub-information of the unit block to Change the bit allocation value to 0 and at least the spectrum or MD in the unit block.
The bits used for encoding the CT coefficients are omitted. Although depending on the encoding format, for example, other than the method of setting the bit allocation to 0 with sub-information indicating the actual bit allocation amount (information compression parameter), the validity of the unit block, that is, the unit block is recorded. If there is sub-information indicating whether or not to do so, if the sub-information indicating the validity of the unit block indicates that coding of the unit block is not to be performed, the sub-information is useful for the scale factor and bit allocation as sub-information of the processing block. It is also possible to omit the bits of the sub information that was used, so even in such a case,
The sub-information is changed to an adaptive form by the encoding correction circuit 614, and bits are omitted.
ãï¼ï¼ï¼ï¼ãä¸è¿°ãããããªæ¹æ³ã«ãããåä½ãããã¯
å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®ç¬¦å·åã«ä½¿ç¨ãã¦ã
ããããããã¹ã±ã¼ã«ãã¡ã¯ã¿ããããå²å½çã®ãµãæ
å ±ã«ä½¿ç¨ãã¦ãããããã®çç¥ãè¡ããå ´åã¯ãããã
ã®çç¥ãããããéã«å¿ãã¦ãçç¥ãè¡ã£ãåä½ããã
ã¯ä»¥å¤ã®åä½ãããã¯ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®
符å·åã«ä½¿ç¨ããäºãå¯è½ã¨ãªãããã®å ´åãä¸è¿°ã®æ¹
æ³ã§çç¥ãè¡ãããããããã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿
æ°ã®ç¬¦å·åã®ããã«æ°ãã«ä½¿ç¨ããåä½ãããã¯ã«ã¤ã
ã¦ã¯ããããå²å½ãå¢å ãããã¨ã«ãªãã®ã§ãéååã®
精度ãåä¸ãããã¨ã«ãªããWhen the bits used for coding the spectrum or MDCT coefficients in the unit block and the bits used for the sub-information such as the scale factor and the bit allocation can be omitted by the method described above. Can be used for coding the spectrum or MDCT coefficients of unit blocks other than the omitted unit block according to the omitted bit amounts. In this case, the bit allocation is increased for a unit block newly used for coding a spectrum or MDCT coefficients of bits that could be omitted in the above-described method, so that the accuracy of quantization is improved. Will be.
ãï¼ï¼ï¼ï¼ãåãä¸è¿°ã®ãããã®çç¥ãè¡ããåä½ãã
ãã¯ãè¤æ°åå¨ããå ´åã«ã¯ãããçç¥å¯è½ãªåä½ãã
ãã¯ã®ãããã¯çç¥ãè¡ãããã®çç¥ãããããããä»
ã®çç¥å¯è½ãªåä½ãããã¯ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿
æ°ã®ç¬¦å·åã«ä½¿ç¨ãããã¨ã«ãã£ã¦ãï¼ä»¥å¤ã®å¤ã«éå
åãããã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ãåå¨ãããæå¹
ãªåä½ãããã¯ã¨ãããã¨ãå¯è½ã§ããããã®ãããªå½¢
ã§ãå³ï¼ã«ããã符å·åè£æ£åè·¯ï¼ï¼ï¼ã§ã¯ãä¸è¿°ã®ã
ããçç¥ãè¡ã£ãå ´åãçç¥ãè¡ãããããéã«ããã
ãã¹ã¦ã®åä½ãããã¯ã®ç¶æ
ãæ¤ç´¢ããä¸ã§ãæã广
çã¨æãããåä½ãããã¯ã«ã¤ãã¦é
åãè¡ãããã«
ããçç¥ããããããåé
ååºæ¥ãªãéã«ãªãã¾ã§ãã
ã®æä½ãç¹°ãè¿ãããã®ã¨ãã符å·åè£æ£åè·¯ï¼ï¼ï¼ã§
ã¯ããããããç¨æãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æ
å°ã®ãã®ãã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼å¤ã¨ãã¦æ¡ç¨ããåä½ã
ããã¯ã«ã¤ãã¦ã¯ãæ°ï¼ãï¼ã®å¼ã®æ¡ä»¶ã使ç¨ãããã¨
ã«ãããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°
ã®ãã¡å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååãããæå¹
ãªç¬¦å·åãè¡ãããã«ã¯ãæä½ä½ãããå¿
è¦ã§ãããã
ç®åºããç¡é§ãªé
åãé¿ããæ§ã«ãããWhen there are a plurality of unit blocks in which the above-mentioned bits can be omitted, the bits of a certain omissible unit block are omitted, and the omitted bits are replaced with the other omissible unit blocks. By using the spectrum or MDCT coefficients for coding, it is also possible to have a valid unit block in which the spectrum or MDCT coefficients quantized to a value other than 0 exists. In such a manner, in the encoding correction circuit 614 in FIG. 5, when the above-described bit omission is performed,
After retrieving the state of all unit blocks, allocation is performed for the unit block considered to be the most effective, and this operation is repeated until the number of omitted bits reaches a level that cannot be redistributed. At this time, the encoding correction circuit 614 uses the conditions of the equations (2) and (3) for the unit block in which the smallest one among the prepared scale factors is adopted as the scale factor value. In order to perform effective coding in which at least one of the spectrum or the MDCT coefficient in is quantized to a value other than 0, it is necessary to calculate how many bits are required at least to avoid unnecessary distribution.
ãï¼ï¼ï¼ï¼ããã®ããã«ãã¦ã符å·åè£æ£åè·¯ï¼ï¼ï¼ã«
ããè£æ£ãè¡ããã¨ã«ãããããå¹çã®è¯ã符å·åãè¡
ããéååãã¤ãºã使¸ããããã¨ã«ããã鳿¥½ãã¼ã¿
ã®é³è³ªåä¸ãéç¹æ§ã®åä¸ãè¨ããã¨ãå¯è½ã¨ãªãã
åãããã§ã¯ãå³ï¼ã®ãããé
åæ¼ç®ææ®µã®æçµæ®µéã§
符å·åè£æ£åè·¯ï¼ï¼ï¼ã«ããè£æ£ãè¡ãä¾ã示ãããã
æ°ï¼ãï¼ã®å¼ã®æ¡ä»¶ã使ç¨ãããã¨ã«ãããããããã
ç¨æãããã¹ã±ã¼ã«ãã¡ã¯ã¿ã¼ã®ä¸ã§æå°ã®ãã®ãã¹ã±
ã¼ã«ãã¡ã¯ã¿ã¼å¤ã¨ãã¦æ¡ç¨ããåä½ãããã¯ã«ã¤ãã¦
ã¯ããã©ã¼ãããä¸ã¨ãããå
¨ã¦ã®ãããå²å½ã«å¯¾å¿ã
ããåä½ãããã¯å
ã®ã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®ã
ã¡å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååãããæå¹ãªç¬¦
å·åãè¡ãããã®ãã¹ãã¯ãã«åã¯ï¼ï¼¤ï¼£ï¼´ä¿æ°ã®æ¡ä»¶
ãããããããç®åºãã¦ãããã¨ãå¯è½ã¨ãªãããã®ã
ãããã®ç®åºéãROï¼çã«è¨æ¶ãã¦ãããå³ï¼ã«ãã
ããããé
åæ¼ç®ææ®µã§ç¬¦å·åè£æ£åè·¯ï¼ï¼ï¼ãã以å
ã®æ®µéã§ãä¸è¿°ã®ç®åºéãåç
§ãã¦ç¡é§ã®ç¡ããããé
åãè¡ã£ã¦ãããã¨ãå¯è½ã§ãããAs described above, by performing correction by the coding correction circuit 614, more efficient coding is performed, and quantization noise is reduced, thereby improving the sound quality and static characteristics of music data. Will be able to
Also, here, an example is shown in which the correction by the encoding correction circuit 614 is performed in the final stage of the bit allocation calculation means in FIG.
By using the conditions of the formulas 2 and 3, a unit block in which the smallest one of the prepared scale factors is adopted as the scale factor value corresponds to all possible bit allocations in the format. It is possible to calculate in advance the conditions of the spectrum or the MDCT coefficient for performing effective coding in which at least one of the spectrum or the MDCT coefficient in the unit block is quantized to a value other than 0. For this reason, the calculated amount is stored in a ROM or the like, and the bit allocation calculation means in FIG. 5 performs the bit allocation without waste by referring to the calculated amount at a stage before the encoding correction circuit 614. It is also possible to go.
ãï¼ï¼ï¼ï¼ãä¸è¬ã«ããã®ãããªç¬¦å·åã«ãããæ£è¦å
åã³éååã®æ¹æ³ã¯æ§ã
ãªãã©ã¼ãããããããããã
ããã®ãã©ã¼ãããã«åºã¥ãã¦ãä¿¡å·æåã®å°ãªãã¨ã
ä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååãããããã«ã¯ãæä½éä½ã
ããå¿
è¦ã§ããããç®åºãããããèæ
®ãããã¨ã«ã
ããä¸è¿°ãã説æã¨åæ§ã«ãããå¹çã®è¯ã符å·åãè¡
ãäºãå¯è½ã¨ãªãã®ã¯æç½ã§ãããIn general, there are various formats for normalization and quantization in such encoding. However, since at least one of the signal components is quantized to a value other than 0 based on each format, It is obvious that, by calculating the minimum number of bits required and taking this into account, more efficient encoding can be performed as in the above description.
ãï¼ï¼ï¼ï¼ã以ä¸èª¬æããããã«ãå³ï¼ã«ãããããã
å²å½ç®åºåè·¯ï¼ï¼ï¼ã§ã¯ãå³ï¼ã«ç¤ºãããããé
åæ¼ç®
ææ®µã«ãããã¡ã¤ã³æ
å ±ã¨ãã¦ç´äº¤å¤æåºåã¹ãã¯ãã«
ããµãæ
å ±ã«ããå¦çãããâã¿ã¨ããµãæ
å ±ã¨ãã¦ã
ããã¯ããâãã£ã³ã°ã®ç¶æ
ã示ãã¹ã±âã«ãã¡ã¯ã¿â
åã³èªé·ã示ãã¯âãã¬ã³ã°ã¹ãå¾ããããããåºã«ã
å³ï¼ã«ããããé©å¿ãããå²å½ç¬¦å·ååè·¯ï¼ï¼ï¼ãï¼ï¼
ï¼ãï¼ï¼ï¼ã«ããã¦ãå®éã«åéååãè¡ãã符å·åã
ã©ã¼ãããã«åããå½¢ã§ç¬¦å·åãããAs described above, in the bit allocation calculating circuit 118 in FIG. 1, the data obtained by processing the orthogonal transform output spectrum as the main information by the sub information by the bit allocation calculating means shown in FIG. Scale factor indicating the state of block floating as
And a word length indicating the word length are obtained.
The adaptive bit allocation coding circuits 106, 10 in FIG.
In steps 7 and 108, requantization is actually performed, and encoding is performed in a form conforming to the encoding format.
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ãåç
§ãã¦ãä¸è¿°ããå³ï¼ã§ç¤ºãã
ãã¨ã³ã³ã¼ãã«ããé«è½ç符å·åãããä¿¡å·ã®ãã³ã¼ã
ã«ã¤ãã¦èª¬æãããå帯åã®éååãããï¼ï¼¤ï¼£ï¼´ä¿æ°
ï¼æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯ï¼ãå³ã¡ãå³ï¼
ã«ãããåºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã®åºåä¿¡å·ã¨
ç価ã®ãã¼ã¿ã¯ãå³ï¼ï¼ãããå
¥å端åï¼ï¼ï¼ã«ä¾çµ¦ã
ãã使ç¨ããããããã¯ãµã¤ãºæ
å ±ï¼å¦çãããã¯ã®é·
ãï¼ï¼æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ï¼ãå³ã¡ãå³ï¼ã«ãããåºå
端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã®åºåä¿¡å·ã¨ç価ã®ãã¼ã¿
ã¯ãå³ï¼ï¼ã«ãããå
¥å端åï¼ï¼ï¼ã«ä¾çµ¦ããããé©å¿
ãããå²å½å¾©å·ååè·¯ï¼é©å¿ãããå²å½å¾©å·åææ®µï¼ï¼
ï¼ï¼ã§ã¯é©å¿ãããå²å½æ
å ±ãç¨ãã¦ãããå²å½ãè§£é¤
ãããæ¬¡ã«éç´äº¤å¤æï¼ï¼©ï¼ï¼¤ï¼£ï¼´ï¼åè·¯ï¼éç´äº¤å¤æ
ææ®µï¼ï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã§ã¯å¨æ³¢æ°è»¸ä¸ã®ä¿¡å·ã
æé軸ä¸ã®ä¿¡å·ã«å¤æãããããããã®é¨å帯åã®æé
軸ä¸ä¿¡å·ã¯ãå³ï¼ï¼ã«ããã帯ååæãã£ã«ã¿ï¼ï¼©ï¼±ï¼
Fï¼åè·¯ï¼å¸¯ååæææ®µï¼ï¼ï¼ï¼ãï¼ï¼ï¼ã«ãããå
¨å¸¯
åä¿¡å·ã«å¾©å·åããããReferring to FIG. 11, a description will be given of a decoder for a signal which has been encoded by the encoder shown in FIG. 1 with high efficiency. The quantized MDCT coefficients (two-dimensional blocks in time and frequency) of each band, that is, FIG.
Are supplied to the input terminal 707 in FIG. 11 and used block size information (processing block length) (information compression parameter), that is, FIG. The data equivalent to the output signals of the output terminals 113, 115, and 117 are supplied to the input terminal 708 in FIG. Adaptive bit allocation decoding circuit (adaptive bit allocation decoding means) 7
In 06, the bit allocation is released using the adaptive bit allocation information. Next, inverse orthogonal transform (IMDCT) circuits (inverse orthogonal transform means) 703, 704, and 705 convert signals on the frequency axis into signals on the time axis. The signals on the time axis of these partial bands are output from the band synthesis filter (IQM
F) The signals are decoded by the circuits (band combining means) 702 and 701 into full-band signals.
ãï¼ï¼ï¼ï¼ã次ã«ãå³ï¼ï¼ãå³ï¼ï¼ãåç
§ãã¦ãæ¬çºæ
ã®ãã£ã¸ã¿ã«ä¿¡å·è¨é²è£
ç½®ï¼æ¹æ³ï¼ããã£ã¸ã¿ã«ä¿¡å·å
çè£
ç½®ï¼æ¹æ³ï¼ããã£ã¸ã¿ã«ä¿¡å·éä¿¡è£
ç½®ï¼æ¹æ³ï¼åã³
ãã£ã¸ã¿ã«ä¿¡å·åä¿¡è£
ç½®ï¼æ¹æ³ï¼ã®å®æ½ã®å½¢æ
ã説æã
ããå³ï¼ï¼ãå³ï¼ï¼ã«ããã¦ãENCã¯å³ï¼ã®ã¨ã³ã³ã¼
ãã示ããï¼´inã¯ãã®å
¥å端åï¼ï¼ï¼ã示ããDECã¯
å³ï¼ï¼ã®ãã³ã¼ãã示ããï¼´out ã¯ãã®åºå端åï¼ï¼ï¼
ã示ããNext, with reference to FIGS. 12 to 15, the digital signal recording device (method), digital signal reproducing device (method), digital signal transmitting device (method) and digital signal receiving device (method) of the present invention. An embodiment will be described. 12 to 13, ENC indicates the encoder of FIG. 1, Tin indicates its input terminal 100, DEC indicates the decoder of FIG. 11, and Tout indicates its output terminal 700.
Is shown.
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã®è¨é²è£
ç½®ã§ã¯ãå
¥å端åï¼´inãã
ã®å
¥åãã£ã¸ã¿ã«ä¿¡å·ãã¨ã³ã³ã¼ãENCã«ä¾çµ¦ãã¦ã¨
ã³ã³ã¼ããããã®ã¨ã³ã³ã¼ãENCã®åºåãå³ã¡ãå³ï¼
ã®ã¨ã³ã³ã¼ãã®åºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼åã³ï¼
ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®åºåä¿¡å·ããå¤èª¿ææ®µï¼ï¼¯
Dã«ä¾çµ¦ãã¦ãå¤éåãã徿å®ã®å¤èª¿ãããããååº
åä¿¡å·ãããããå¤èª¿ããå¾ãå¤éååã¯åå¤èª¿ããã
å¤èª¿ææ®µï¼ï¼¯ï¼¤ããã®è¢«å¤èª¿ä¿¡å·ãè¨é²ææ®µï¼ç£æ°ãã
ããå
å¦ãããçï¼ã«ãã£ã¦ãè¨é²åªä½ï¼ã«è¨é²ãããIn the recording apparatus of FIG. 12, an input digital signal from the input terminal Tin is supplied to the encoder ENC for encoding, and the output of the encoder ENC, that is, the output of FIG.
Output terminals 112, 114, 116 and 1 of the encoders
13, 115, 117 are output from the modulating means MO.
D, and then multiplexes and performs a predetermined modulation, or modulates each output signal and then multiplexes or remodulates.
The modulated signal from the modulating means MOD is recorded on the recording medium M by the recording means (magnetic head, optical head, etc.).
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã®åçè£
ç½®ã§ã¯ãåçææ®µï¼ç£æ°ã
ãããå
å¦ãããçï¼ï¼°ã«ãã£ã¦ãå³ï¼ï¼ã®è¨é²åªä½ï¼
ã®è¨é²ä¿¡å·ãåçãããã®åçä¿¡å·ãå¾©èª¿ææ®µï¼¤ï¼¥ï¼ã«
ãã£ã¦ãå¤èª¿ææ®µï¼ï¼¯ï¼¤ã«ããå¤èª¿ã«å¿ãã復調ãè¡
ããå¾©èª¿ææ®µï¼¤ï¼¥ï¼ããã®å¾©èª¿åºåãå³ã¡ãå³ï¼ã®ã¨ã³
ã³ã¼ãã®åºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®åºåã«
対å¿ããä¿¡å·ãå³ï¼ã®ãã³ã¼ãã®å
¥å端åï¼ï¼ï¼ã«ä¾çµ¦
ããã¨å
±ã«ãå³ï¼ã®ã¨ã³ã³ã¼ãã®åºå端åï¼ï¼ï¼ãï¼ï¼
ï¼ãï¼ï¼ï¼ããã®åºåã«å¯¾å¿ããä¿¡å·ãå³ï¼ã®å
¥åï¼ï¼
ï¼ã«ä¾çµ¦ãã¦ãã³ã¼ããã¦ãåºå端åï¼´out ã«ãå
¥åã
ã£ã¸ã¿ã«ä¿¡å·ã«å¯¾å¿ããåºåãã£ã¸ã¿ã«ä¿¡å·ãåºåãã
ããIn the reproducing apparatus shown in FIG. 13, the recording medium M shown in FIG.
And the demodulation means DEM demodulates the reproduction signal according to the modulation by the modulation means MOD. A signal corresponding to the demodulated output from the demodulating means DEM, that is, a signal corresponding to the output from the output terminals 112, 114, and 116 of the encoder of FIG. 1 is supplied to the input terminal 707 of the decoder of FIG. 113, 11
The signals corresponding to the outputs from 5, 117 are input to the input 70 of FIG.
The output digital signal corresponding to the input digital signal is output to an output terminal Tout.
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã®éä¿¡è£
ç½®ã§ã¯ãå
¥å端åï¼´inãã
ã®å
¥åãã£ã¸ã¿ã«ä¿¡å·ãã¨ã³ã³ã¼ãENCã«ä¾çµ¦ãã¦ã¨
ã³ã³ã¼ããããã®ã¨ã³ã³ã¼ãENCã®åºåãå³ã¡ãå³ï¼
ã®ã¨ã³ã³ã¼ãã®åºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼åã³ï¼
ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®åºåä¿¡å·ããå¤èª¿ææ®µï¼ï¼¯
Dã«ä¾çµ¦ãã¦ãå¤éåãã徿å®ã®å¤èª¿ãããããååº
åä¿¡å·ãããããå¤èª¿ããå¾ãå¤éååã¯åå¤èª¿ããã
å¤èª¿ææ®µï¼ï¼¯ï¼¤ããã®è¢«å¤èª¿ä¿¡å·ãéä¿¡ææ®µï¼´ï¼¸ã«ä¾çµ¦
ãã¦ã卿³¢æ°å¤æãå¢å¹
çãè¡ã£ã¦éä¿¡ä¿¡å·ãä½ããã
ã®éä¿¡ä¿¡å·ãéä¿¡ææ®µï¼´ï¼¸ã®ä¸é¨ã§ããéä¿¡ã¢ã³ããA
NTâï¼´ã«ãã£ã¦éä¿¡ãããIn the transmitting apparatus shown in FIG. 14, an input digital signal from the input terminal Tin is supplied to the encoder ENC for encoding, and the output of the encoder ENC, that is, FIG.
Output terminals 112, 114, 116 and 1 of the encoders
13, 115, 117 are output from the modulating means MO.
D, and then multiplexes and performs a predetermined modulation, or modulates each output signal and then multiplexes or remodulates.
The modulated signal from the modulating unit MOD is supplied to the transmitting unit TX to perform frequency conversion, amplification, etc., to generate a transmitting signal, and the transmitting signal is transmitted to the transmitting antenna A which is a part of the transmitting unit TX.
Transmit by NT-T.
ãï¼ï¼ï¼ï¼ãå³ï¼ï¼ã®åçè£
ç½®ã§ã¯ãåä¿¡ææ®µï¼²ï¼¸ã®ä¸
é¨ã§ããåä¿¡ã¢ã³ããANTâï¼²ã«ãã£ã¦ãå³ï¼ï¼ã®é
ä¿¡ã¢ã³ããANTâï¼´ããã®éä¿¡ä¿¡å·ãåä¿¡ããã¨å
±
ã«ããã®åä¿¡ä¿¡å·ãåä¿¡ææ®µï¼²ï¼¸ã«ãã£ã¦ãå¢å¹
ãéå¨
æ³¢æ°å¤æçãè¡ããåä¿¡ææ®µï¼²ï¼¸ããã®åä¿¡ä¿¡å·ã復調
ææ®µï¼¤ï¼¥ï¼ã«ãã£ã¦ãå¤èª¿ææ®µï¼ï¼¯ï¼¤ã«ããå¤èª¿ã«å¿ã
ã復調ãè¡ããå¾©èª¿ææ®µï¼¤ï¼¥ï¼ããã®å¾©èª¿åºåãå³ã¡ã
å³ï¼ã®ã¨ã³ã³ã¼ãã®åºå端åï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã
ãã®åºåã«å¯¾å¿ããä¿¡å·ãå³ï¼ã®ãã³ã¼ãã®å
¥å端åï¼
ï¼ï¼ã«ä¾çµ¦ããã¨å
±ã«ãå³ï¼ã®ã¨ã³ã³ã¼ãã®åºå端åï¼
ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ããã®åºåã«å¯¾å¿ããä¿¡å·ãå³ï¼
ã®å
¥åï¼ï¼ï¼ã«ä¾çµ¦ãã¦ãã³ã¼ããã¦ãåºå端åï¼´out
ã«ãå
¥åãã£ã¸ã¿ã«ä¿¡å·ã«å¯¾å¿ããåºåãã£ã¸ã¿ã«ä¿¡å·
ãåºåããããIn the reproducing apparatus of FIG. 15, the receiving signal from the transmitting antenna ANT-T of FIG. 15 is received by the receiving antenna ANT-R which is a part of the receiving means RX, and the received signal is received by the receiving means RX. , Amplification, inverse frequency conversion, etc. The demodulation unit DEM demodulates the reception signal from the reception unit RX according to the modulation by the modulation unit MOD. Demodulated output from demodulating means DEM, ie,
A signal corresponding to the output from the output terminals 112, 114 and 116 of the encoder of FIG.
07 and output terminal 1 of the encoder of FIG.
The signals corresponding to the outputs from 13, 115 and 117 are shown in FIG.
Supplied to the input 708 of the terminal and decoded, and the output terminal Tout
Then, an output digital signal corresponding to the input digital signal is output.
ãï¼ï¼ï¼ï¼ãæ¬çºæã¯ãä¸è¿°ã®å®æ½ã®å½¢æ
ã«éå®ããã
ãã®ã§ã¯ãªãã種ã
ã®å¤å½¢ã夿´ãå¯è½ã§ãããã¨ã³ã³
ã¼ãåã³ãã³ã¼ãã¯å¥ä½ã§ãä¸ä½ã§ãè¯ããè¨é²è£
ç½®å
ã³åçè£
ç½®ã¯å¥ä½ã§ãä¸ä½ã§ãè¯ããè¨é²åªä½ã¯ç£æ°ã
ã¼ããç£æ°ãã£ã¹ã¯ãå
ç£æ°ãã£ã¹ã¯çãå¯è½ã§ããã
åãè¨é²åªä½ã®ä»£ããã«ãICã¡ã¢ãªãã¡ã¢ãªã«ã¼ãç
ã®è¨æ¶ææ®µã§ãã£ã¦ãè¯ããéä¿¡è£
ç½®åã³åä¿¡è£
ç½®éã®
ä¼éè·¯ã¯ãç¡ç·ä¼éè·¯ï½é»æ³¢ãå
ï¼èµ¤å¤ç·çï¼çï½ã§ã
æç·ä¼éè·¯ï¼å°ç·ãå
ã±ã¼ãã«çï¼ã§ãè¯ããä¾ãã°ã
å
¥åãã£ã¸ã¿ã«ä¿¡å·ã¨ãã¦ã¯ããã£ã¸ã¿ã«ãªã¼ãã£ãªä¿¡
å·ï¼ãªã¼ãã£ãªä¿¡å·ã¯ã人ã®è©±å£°ãæå£°ã楽å¨ã®é³çã®
å種ã®é³ã®ä¿¡å·ãå¯è½ã§ããï¼ããã£ã¸ã¿ã«ãããªä¿¡å·
çãå¯è½ã§ãããæ¬çºæã¯ããã£ã¸ã¿ã«ä¿¡å·è¨é²åçæ¹
æ³ï¼åã¯è£
ç½®ï¼ããã£ã¸ã¿ã«ä¿¡å·éåä¿¡æ¹æ³ï¼åã¯è£
ç½®ï¼ããã£ã¸ã¿ã«ä¿¡å·åä¿¡æ¹æ³ï¼åã¯è£
ç½®ï¼çã«é©ç¨ã
ããã¨ãã§ãããThe present invention is not limited to the above embodiment, and various modifications and changes can be made. The encoder and the decoder may be separate or integrated. The recording device and the reproducing device may be separate or integrated. The recording medium can be a magnetic tape, a magnetic disk, a magneto-optical disk, or the like.
Further, instead of the recording medium, a storage means such as an IC memory or a memory card may be used. The transmission path between the transmitting apparatus and the receiving apparatus may be a wireless transmission path {radio waves, light (infrared rays, etc.)} or a wired transmission path (conductor, optical cable, etc.). For example,
As the input digital signal, a digital audio signal (an audio signal can be a signal of various sounds such as a human voice, a singing voice, and a sound of a musical instrument), a digital video signal, and the like can be used. The present invention can be applied to a digital signal recording / reproducing method (or device), a digital signal transmitting / receiving method (or device), a digital signal receiving method (or device), and the like.
ãï¼ï¼ï¼ï¼ã[0073]
ãçºæã®å¹æãä¸è¿°ããæ¬çºæã«ããã°ãå
¥åãã£ã¸ã¿
ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯åæåã«åå²ãã¦ãæéã¨å¨æ³¢
æ°ã«é¢ããè¤æ°ã®ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåãå¾ãæ
éã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ãããã¯
å
ã®ä¿¡å·æåãåºã«æ£è¦åãè¡ã£ã¦æ£è¦åãã¼ã¿ãå¾ã
æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ããã
ã¯å
ã®ä¿¡å·æåã®ç¹å¾´ã表ãéååä¿æ°ãæ±ãã該éå
åä¿æ°ãåºã«ãããé
åéãæ±ºå®ããæéã¨å¨æ³¢æ°ã«é¢
ããï¼æ¬¡å
ãããã¯æ¯ã«æ£è¦åãã¼ã¿ã¨ãããé
åéã¨
ã«ãããããã¯å
ã®ä¿¡å·æåãéååãã¦æ
å ±å§ç¸®ãã
ã¨å
±ã«ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã®æ
å ±
å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¾ãããã«ãããã£ã¸ã¿ã«ä¿¡å·å¦ç
æ¹æ³ããã£ã¸ã¿ã«ä¿¡å·å¦çè£
ç½®ããã£ã¹ã¯ä¿¡å·è¨é²æ¹
æ³ããã£ã¸ã¿ã«ä¿¡å·è¨é²è£
ç½®ãè¨é²åªä½ããã£ã¸ã¿ã«ä¿¡
å·éä¿¡æ¹æ³ãåã¯ããã£ã¸ã¿ã«ä¿¡å·éä¿¡è£
ç½®ã«ããã¦ã
æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ãæ£è¦åãã¼
ã¿ã¨ãããã¯å
ä¿¡å·æåã«ããããããã¯å
ä¿¡å·æåã®
å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éååããããã®æä½ã
ããå²ãå½ã¦ãç®åºãããã®ç®åºçµæããèæ
®ãã¦ãã
ãé
åéã®æ±ºå®ãè¡ãããã«ããã®ã§ãä½åãªãããã
使ç¨ãããã¨ã鲿¢ããããå¹çã®è¯ã符å·åãå®ç¾ã
å¾ãéç¹æ§ãä¿¡å·å質ã®åä¸ãå³ããã¨ãã§ããè¨é²åª
ä½ã«ãããè¨é²å®¹éãä¼éè·¯ã«ãããä¼é容éã®æå¹å©
ç¨ãè¡ããã¨ã®ã§ãããã®ãå¾ããã¨ãã§ãããAccording to the present invention described above, an input digital signal is divided into a plurality of frequency band components to obtain signal components in a plurality of two-dimensional blocks relating to time and frequency, and two-dimensional blocks relating to time and frequency are obtained. For each time, normalization is performed based on the signal components in the two-dimensional block to obtain normalized data,
For each two-dimensional block relating to time and frequency, a quantization coefficient representing the characteristic of a signal component in the two-dimensional block is obtained, and a bit allocation amount is determined based on the quantization coefficient. SIGNAL PROCESSING METHOD, DIGITAL SIGNAL PROCESSOR, DISC In a signal recording method, a digital signal recording device, a recording medium, a digital signal transmission method, or a digital signal transmission device,
For each two-dimensional block relating to time and frequency, a minimum bit allocation for quantizing at least one of the signal components in the block to a value other than 0 is calculated based on the normalized data and the signal component in the block, and the calculation result is The bit allocation amount is determined in consideration of this, so that extra bits can be prevented from being used, more efficient coding can be achieved, and static characteristics and signal quality can be improved. Thus, it is possible to obtain a recording medium capable of effectively utilizing the recording capacity of the recording medium and the transmission capacity of the transmission path.
ãï¼ï¼ï¼ï¼ãåãä¸è¿°ããæ¬çºæã«ããã°ãå
¥åãã£ã¸
ã¿ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯åæåã«åå²ãã¦ãæéã¨å¨
æ³¢æ°ã«é¢ããè¤æ°ã®ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåãå¾ã
æéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ããã
ã¯å
ã®ä¿¡å·æåãåºã«æ£è¦åãè¡ã£ã¦æ£è¦åãã¼ã¿ã
å¾ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ã
ããã¯å
ã®ä¿¡å·æåã®ç¹å¾´ã表ãéååä¿æ°ãæ±ãã該
éååä¿æ°ãåºã«ãããé
åéãæ±ºå®ããæéã¨å¨æ³¢æ°
ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«æ£è¦åãã¼ã¿ã¨ãããé
å
éã¨ã«ãããããã¯å
ã®ä¿¡å·æåãéååãã¦æ
å ±å§ç¸®
ããã¨å
±ã«ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã®
æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¾ãããã«ãããã£ã¸ã¿ã«ä¿¡å·
å¦çæ¹æ³ããã£ã¸ã¿ã«ä¿¡å·å¦çè£
ç½®ããã£ã¹ã¯ä¿¡å·è¨é²
æ¹æ³ããã£ã¸ã¿ã«ä¿¡å·è¨é²è£
ç½®ãè¨é²åªä½ããã£ã¸ã¿ã«
ä¿¡å·éä¿¡æ¹æ³ãåã¯ããã£ã¸ã¿ã«ä¿¡å·éä¿¡è£
ç½®ã«ãã
ã¦ãæ£è¦åãã¼ã¿ã¨ãããå²ãå½ã¦éã«å¯¾å¿ããããã
ãã¯å
ã®ä¿¡å·æåã®å°ãªãã¨ãä¸ã¤ãï¼ä»¥å¤ã®å¤ã«éå
åãããããã®ä¿¡å·æåã®æ¡ä»¶ããã¡ã¢ãªã«è¨æ¶ãã¦ã
ãããã®ã¡ã¢ãªã«è¨æ¶ããã¦ããä¸è¨ä¿¡å·æåã®æ¡ä»¶ã
ãèæ
®ãã¦ä¸è¨ãããé
åéã®æ±ºå®ãè¡ãããã«ããã®
ã§ãä½åãªãããã使ç¨ãããã¨ã鲿¢ããããå¹çã®
è¯ã符å·åãå®ç¾ãå¾ãéç¹æ§ãä¿¡å·å質ã®åä¸ãå³ã
ãã¨ãã§ããè¨é²åªä½ã«ãããè¨é²å®¹éãä¼éè·¯ã«ãã
ãä¼é容éã®æå¹å©ç¨ãè¡ããã¨ã®ã§ãããã®ãå¾ãã
ã¨ãã§ãããFurther, according to the present invention described above, the input digital signal is divided into a plurality of frequency band components to obtain signal components in a plurality of two-dimensional blocks relating to time and frequency.
For each two-dimensional block relating to time and frequency, normalization is performed based on the signal component within the two-dimensional block to obtain normalized data, and the characteristic of the signal component within the two-dimensional block is represented for each two-dimensional block relating to time and frequency. A quantization coefficient is obtained, a bit allocation amount is determined based on the quantization coefficient, and a signal component in the block is quantized by the normalized data and the bit allocation amount for each two-dimensional block relating to time and frequency to compress information. A digital signal processing method, a digital signal processing device, a disk signal recording method, a digital signal recording device, a recording medium, a digital signal transmission method, or a digital signal processing method for obtaining information compression parameters for each two-dimensional block relating to time and frequency. In the signal transmitting device, the signal component in the block corresponding to the normalized data and the bit allocation amount is reduced. At least one of the conditions of the signal component for quantizing to a value other than 0 is stored in a memory, and the condition of the signal component stored in the memory is also considered in consideration of the bit allocation amount. Since the decision is made, the use of extra bits can be prevented, more efficient encoding can be achieved, the static characteristics and signal quality can be improved, and the recording capacity and transmission capacity of the recording medium can be improved. It is possible to obtain a device that can make effective use of the transmission capacity of the path.
ãï¼ï¼ï¼ï¼ãæ´ã«ãä¸è¿°ããæ¬çºæã«ããã°ãå
¥åãã£
ã¸ã¿ã«ä¿¡å·ãè¤æ°ã®å¨æ³¢æ°å¸¯åæåã«åå²ãã¦ãæéã¨
卿³¢æ°ã«é¢ããè¤æ°ã®ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåã
å¾ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ã
ããã¯å
ã®ä¿¡å·æåãåºã«æ£è¦åãè¡ã£ã¦æ£è¦åãã¼ã¿
ãå¾ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«ï¼æ¬¡å
ãããã¯å
ã®ä¿¡å·æåã®ç¹å¾´ã表ãéååä¿æ°ãæ±ãã
該éååä¿æ°ãåºã«ãããé
åéãæ±ºå®ããæéã¨å¨æ³¢
æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯ã«æ£è¦åãã¼ã¿ã¨ãããé
åéã¨ã«ãããããã¯å
ã®ä¿¡å·æåãéååãã¦æ
å ±å§
縮ããã¨å
±ã«ãæéã¨å¨æ³¢æ°ã«é¢ããï¼æ¬¡å
ãããã¯æ¯
ã®æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¾ãããã«ãããã£ã¸ã¿ã«ä¿¡
å·å¦çæ¹æ³ããã£ã¸ã¿ã«ä¿¡å·å¦çè£
ç½®ããã£ã¹ã¯ä¿¡å·è¨
鲿¹æ³ããã£ã¸ã¿ã«ä¿¡å·è¨é²è£
ç½®ãè¨é²åªä½ããã£ã¸ã¿
ã«ä¿¡å·éä¿¡æ¹æ³ãåã¯ããã£ã¸ã¿ã«ä¿¡å·éä¿¡è£
ç½®ã«ãã
ã¦ãæ¢ã«ãããé
åãè¡ãããï¼æ¬¡å
ãããã¯ã«ã¤ã
ã¦ããããå²å½ãï¼ä»¥å¤ã§ãããªãããã¹ã¦ã®ä¿¡å·æå
ã®éååå¤ãï¼ã¨ãªãï¼æ¬¡å
ãããã¯ãæ¤åºããè©²ï¼æ¬¡
å
ãããã¯ã®ãããå²å½åã¯æ
å ±å§ç¸®ãã©ã¡ã¼ã¿ã¼ãå¤
æ´ãããããã®åé
åãè¡ãããã«ããã®ã§ãä½åãªã
ããã使ç¨ãããã¨ã鲿¢ããããå¹çã®è¯ã符å·åã
å®ç¾ãå¾ãéç¹æ§ãä¿¡å·å質ã®åä¸ãå³ããã¨ãã§ãã
è¨é²åªä½ã«ãããè¨é²å®¹éãä¼éè·¯ã«ãããä¼é容éã®
æå¹å©ç¨ãè¡ããã¨ã®ã§ãããã®ãå¾ããã¨ãã§ãããFurther, according to the present invention described above, the input digital signal is divided into a plurality of frequency band components to obtain signal components in a plurality of two-dimensional blocks relating to time and frequency. Normalization is performed based on the signal components in the two-dimensional block every time to obtain normalized data, and quantization coefficients representing the characteristics of the signal components in the two-dimensional block are obtained for each two-dimensional block with respect to time and frequency.
A bit allocation amount is determined based on the quantized coefficient, and a signal component in the block is quantized by the normalized data and the bit allocation amount for each two-dimensional block relating to time and frequency to compress information, and to further reduce time and frequency. In a digital signal processing method, a digital signal processing device, a disk signal recording method, a digital signal recording device, a recording medium, a digital signal transmission method, or a digital signal transmission device for obtaining an information compression parameter for each two-dimensional block, For the two-dimensional block to which the bit allocation has been performed, a two-dimensional block in which the quantization values of all signal components are 0 while the bit allocation is other than 0 is detected, and the bit allocation or the information compression parameter of the two-dimensional block is determined. Changed and redistributed bits, so use extra bits Preventing, give to achieve more efficient coding, it is possible to improve the static characteristics and signal quality,
It is possible to obtain a device capable of effectively utilizing the recording capacity of the recording medium and the transmission capacity of the transmission path.
ãå³ï¼ãæ¬çºæã®å®æ½ã®å½¢æ
ã®ãããã¬ã¼ãå§ç¸®ç¬¦å·å
ã«ä½¿ç¨å¯è½ãªé«è½çå§ç¸®ç¬¦å·åã¨ã³ã³âãã®ä¸å
·ä½ä¾ã
示ããããã¯åè·¯å³ã§ãããFIG. 1 is a block diagram showing a specific example of a high-efficiency compression encoding encoder that can be used for bitrate compression encoding according to an embodiment of the present invention.
ãå³ï¼ããããå§ç¸®ã®éã®ç´äº¤å¤æãããã¯ã®æ§é ã表
ãå³ã§ãããFIG. 2 is a diagram illustrating a structure of an orthogonal transform block at the time of bit compression.
ãå³ï¼ãç´äº¤å¤æãããã¯ãµã¤ãºã決å®ããåè·¯ã®æ§æ
ä¾ã示ããããã¯ç·å³ã§ãããFIG. 3 is a block diagram illustrating a configuration example of a circuit that determines an orthogonal transform block size.
ãå³ï¼ãæéçã«é£æ¥ããç´äº¤å¤æãããã¯ã®æéçé·
ãã®å¤åã¨ç´äº¤å¤ææã«ç¨ããã¦ã£ã³ãã¦å½¢ç¶ã®é¢ä¿ã
示ãå³ã§ãããFIG. 4 is a diagram showing a relationship between a change in a temporal length of an orthogonal transform block adjacent in time and a window shape used at the time of orthogonal transform.
ãå³ï¼ããããé
åæ¼ç®æ©è½ã®ä¾ã示ããããã¯ç·å³ã§
ãããFIG. 5 is a block diagram showing an example of a bit allocation operation function.
ãå³ï¼ãåè¨ç帯ååã³ãããã¯ããâãã£ã³ã°ãèæ
®
ãã¦åå²ããã帯åã®ã¹ãã¯ãã«ã示ãå³ã§ãããFIG. 6 is a diagram illustrating spectra of each critical band and a band divided in consideration of block floating.
ãå³ï¼ããã¹ãã³ã°ã¹ãã¯ãã«ã示ãå³ã§ãããFIG. 7 is a diagram showing a masking spectrum.
ãå³ï¼ãæå°å¯è´ã«ã¼ãããã¹ãã³ã°ã¹ãã¯ãã«ãåæ
ããå³ã§ãããFIG. 8 is a diagram in which a minimum audible curve and a masking spectrum are combined.
ãå³ï¼ããããå²ãå½ã¦åä½ãããã¯ã«ãããä¿¡å·æå
ã®éååã®ä¾ã示ãå³ã§ãããFIG. 9 is a diagram illustrating an example of quantization of a signal component in a bit allocation unit block.
ãå³ï¼ï¼ããããå²ãå½ã¦åä½ãããã¯ã«ããã¦ãä¿¡å·
æåãå
¨ã¦ï¼ã«éååãããä¾ã示ãå³ã§ãããFIG. 10 is a diagram illustrating an example in which all signal components are quantized to 0 in a bit allocation unit block.
ãå³ï¼ï¼ãä¸è¿°ã®å®æ½ä¾ã®ãããã¬ã¼ãå§ç¸®ç¬¦å·åä¿¡å·
ã®ãã³âãã®ä¸å
·ä½ä¾ã示ããããã¯ç·å³ã§ãããFIG. 11 is a block diagram showing a specific example of a bitrate compression-encoded signal decoder according to the above embodiment.
ãå³ï¼ï¼ãæ¬çºæã®å®æ½ã®å½¢æ
ã®è¨é²è£
ç½®ã示ãããã
ã¯ç·å³ã§ãããFIG. 12 is a block diagram illustrating a recording apparatus according to an embodiment of the present invention.
ãå³ï¼ï¼ãæ¬çºæã®å®æ½ã®å½¢æ
ã®åçè£
ç½®ã示ãããã
ã¯ç·å³ã§ãããFIG. 13 is a block diagram illustrating a playback device according to an embodiment of the present invention.
ãå³ï¼ï¼ãæ¬çºæã®å®æ½ã®å½¢æ
ã®éä¿¡è£
ç½®ã示ãããã
ã¯ç·å³ã§ãããFIG. 14 is a block diagram showing a transmission device according to an embodiment of the present invention.
ãå³ï¼ï¼ãæ¬çºæã®å®æ½ã®å½¢æ
ã®åä¿¡è£
ç½®ã示ãããã
ã¯ç·å³ã§ãããFIG. 15 is a block diagram showing a receiving device according to an embodiment of the present invention.
ï¼ï¼ï¼ãï¼ï¼ï¼ 帯ååå²ãã£ã«ã¿ãï¼ï¼ï¼ãï¼ï¼ï¼ã
ï¼ï¼ï¼ ç´äº¤å¤æåè·¯ï¼ï¼ï¼¤ï¼£ï¼´ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ã
ï¼ï¼ï¼ ãããã¯æ±ºå®åè·¯ãï¼ï¼ï¼ ãããå²ãå½ã¦ç®
åºåè·¯ãï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ é©å¿ãããå²å½ç¬¦å·
ååè·¯ãï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ ãã¯ã¼ç®åºåè·¯ãï¼
ï¼ï¼ ã¡ã¢ãªãï¼ï¼ï¼ å¤ååæ½åºåè·¯ãï¼ï¼ï¼ ãã¯
ã¼æ¯è¼åè·¯ãï¼ï¼ï¼ ãããã¯ãµã¤ãºï¼æ¬¡æ±ºå®åè·¯ãï¼
ï¼ï¼ ãããã¯ãµã¤ãºä¿®æ£åè·¯ãï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼
ï¼ é
å»¶åè·¯ãï¼ï¼ï¼ ã¦ã£ã³ãã¦å½¢ç¶æ±ºå®åè·¯ãï¼ï¼
ï¼ å¸¯åæ¯ã¨ãã«ã®ã¼ç®åºå¨ãï¼ï¼ï¼ ç³è¾¼ã¿ãã£ã«
ã¿ãï¼ï¼ï¼ å ç®å¨ãï¼ï¼ï¼ 颿°çºçå¨ãï¼ï¼ï¼ å²
ãç®å¨ãï¼ï¼ï¼ åæå¨ãï¼ï¼ï¼ æ¸ç®å¨ãï¼ï¼ï¼ é
å»¶åè·¯ãï¼ï¼ï¼ 許容éé³è£æ£å¨ãï¼ï¼ï¼ æå°å¯è´ã«
ã¼ãçºçå¨ãï¼ï¼ï¼ã»ã»ã»ã»ã»ã»ã»ã»ã»ã»è£æ£æ
å ±åºå
å¨ãï¼ï¼ï¼ 符å·åè£æ£å¨ãï¼ï¼ï¼ãï¼ï¼ï¼ 帯ååæ
ãã£ã«ã¿ï¼ï¼©ï¼±ï¼ï¼¦ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ãï¼ï¼ï¼ éç´
äº¤å¤æåè·¯ï¼ï¼©ï¼ï¼¤ï¼£ï¼´ï¼ãï¼ï¼ï¼ é©å¿ãããå²å½å¾©
å·ååè·¯ãENC ã¨ã³ã³ã¼ããï¼ï¼¯ï¼¤ å¤èª¿ææ®µãï¼²
EC è¨é²ææ®µãï¼° åçææ®µãDEï¼å¾©èª¿ææ®µãDE
ï¼£ ãã³ã¼ããTX éä¿¡ææ®µãRX åä¿¡ææ®µ101, 102 band division filter, 103, 104,
105 orthogonal transform circuit (MDCT), 109, 110,
111 block determination circuit, 118 bit allocation calculation circuit, 106, 107, 108 adaptive bit allocation coding circuit, 304, 305, 306 power calculation circuit, 3
07 memory, 308 change extraction circuit, 309 power comparison circuit, 310 block size primary decision circuit, 3
11 Block size correction circuit, 312, 313, 31
4 delay circuit, 315 window shape determination circuit, 60
2 energy calculator per band, 603 convolution filter, 604 adder, 605 function generator, 606 divider, 607 synthesizer, 608 subtractor, 609 delay circuit, 610 allowable noise corrector, 612 minimum audible curve generator, 613... Correction information output unit, 614 coding correction unit, 701, 702 band synthesis filter (IQMF), 703, 704, 705 inverse orthogonal transform circuit (IMDCT), 706 adaptive bit allocation decoding Circuit, ENC encoder, MOD modulation means, R
EC recording means, P reproduction means, DEM demodulation means, DE
C decoder, TX transmitting means, RX receiving means
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4