The present invention relates to an apparatus and method for decoding a multi-channel audio signal using cross-correlation. It aims to provide such an apparatus and method that generate a multi-channel audio signal from a downmixed stereo audio signal using the cross-correlation value between the left and right channels, and adjust the generated multi-channel audio signal using encoding information (cross-correlation information and virtual sound source direction information), thereby accurately restoring the center channel and surround channel signals of the multi-channel audio signal.
To this end, a multi-channel audio signal decoding apparatus using cross-correlation according to the present invention comprises: multi-channel signal generating means for generating a plurality of per-channel audio signals from a downmixed stereo audio signal using the cross-correlation value between the left and right channels; and multi-channel signal adjusting means for adjusting the cross-correlation values and the per-subband power values of the generated per-channel audio signals using the inter-channel cross-correlation information and the virtual sound source direction information of the original signal, so that the original signal of the downmixed stereo audio signal can be restored.
Downmixed stereo audio signal, multi-channel audio signal, surround left channel audio signal, surround right channel audio signal, cross-correlation, virtual sound source direction information
Description (translated from Korean)

Multi-channel audio signal decoding apparatus using cross-correlation and method thereof {APPARATUS AND METHOD FOR DECODING MULTI-CHANNEL AUDIO SIGNAL USING CROSS-CORRELATION}

The present invention relates to an apparatus and method for decoding a multi-channel audio signal using cross-correlation. More particularly, it relates to an apparatus and method that generate a multi-channel audio signal from a downmixed stereo audio signal using the cross-correlation value between the left and right channels, and adjust the generated multi-channel audio signal using encoding information (cross-correlation information and virtual sound source direction information), thereby accurately restoring the center channel and surround channel signals of the multi-channel audio signal.
The present invention is derived from research conducted as part of the IT next-generation core technology development project of the Ministry of Information and Communication and the Institute for Information Technology Advancement [Project management number: 2005-S-403-02, Project title: Development of intelligent integrated information broadcasting (SmarTV) technology].
With the recent spread of home theater systems, the 5.1-channel audio format is becoming the mainstream of home audio. In portable audio devices as well, a three-dimensional audio effect function that reproduces virtual surround through headphones or small built-in speakers is becoming an essential feature. Given this trend, the 5.1-channel audio format can be expected to become the basic audio playback format for both home and portable audio equipment.
However, conventional 5.1-channel audio technology has the problem that the amount of data grows with the number of channels. A multi-channel coding scheme that can compress this data effectively therefore plays an important role. For example, MPEG (Moving Picture Experts Group)-2 and MPEG-4 standardize multi-channel coding schemes based on perceptual coding. By their nature, however, the bit rate of these schemes increases in proportion to the number of channels.
Recently, the Binaural Cue Coding (BCC) scheme has been developed, in which the bit rate hardly increases even as the number of channels increases. BCC has a relatively simple structure: after downmixing the multi-channel audio to stereo or mono, it calculates the parameters needed to restore the multi-channel audio signal from the downmix. These parameters may include the Inter-Channel Level Difference (ICLD), the Inter-Channel Time Difference (ICTD), and the Inter-Channel Cross-correlation (ICC).
Dolby Pro Logic is a representative technology for restoring a multi-channel audio signal from a stereo audio signal. With Dolby Pro Logic, however, signal components may be unnecessarily removed or amplified in the spectrum, depending on the cross-correlation between the stereo signals. In particular, when a multi-channel audio signal is restored from a stereo audio signal, the surround signal components are not accurately restored by simple addition and subtraction of the signals.
Accordingly, when the conventional techniques described above restore the original multi-channel audio signal from a downmixed stereo audio signal, the center channel, surround left channel, and surround right channel signal components are unnecessarily removed or amplified in the spectrum, so that the center channel and surround channel signal components cannot be faithfully restored. Solving this problem is the task of the present invention.
Accordingly, an object of the present invention is to provide an apparatus and method for decoding a multi-channel audio signal using cross-correlation that generate a multi-channel audio signal from a downmixed stereo audio signal using the cross-correlation value between the left and right channels, and adjust the generated multi-channel audio signal using encoding information (cross-correlation information and virtual sound source direction information), thereby accurately restoring the center channel and surround channel signals of the multi-channel audio signal.
The objects of the present invention are not limited to those mentioned above. Other objects and advantages of the present invention that are not mentioned can be understood from the following description and will become more apparent from the embodiments of the present invention. It will also be readily appreciated that the objects and advantages of the present invention can be realized by the means set forth in the claims and combinations thereof.
The present invention has been proposed to solve the above problems. It is characterized in that it generates a multi-channel audio signal from a downmixed stereo audio signal using the cross-correlation value between the left and right channels, and adjusts the generated multi-channel audio signal using encoding information (cross-correlation information and virtual sound source direction information).
More specifically, the present invention provides a multi-channel audio signal decoding apparatus using cross-correlation, comprising: multi-channel signal generating means for generating a plurality of per-channel audio signals from a downmixed stereo audio signal using the cross-correlation value between the left and right channels; and multi-channel signal adjusting means for adjusting the cross-correlation values and the per-subband power values of the generated per-channel audio signals using the inter-channel cross-correlation information and the virtual sound source direction information of the original signal, so that the original signal of the downmixed stereo audio signal can be restored.
The present invention also provides a multi-channel audio signal decoding method using cross-correlation, comprising: a multi-channel signal generation step of generating a plurality of per-channel audio signals from a downmixed stereo audio signal using the cross-correlation value between the left and right channels; and a multi-channel signal adjustment step of adjusting the cross-correlation values and the per-subband power values of the generated per-channel audio signals using the inter-channel cross-correlation information and the virtual sound source direction information of the original signal, so that the original signal of the downmixed stereo audio signal can be restored.
As described above, the present invention generates a multi-channel audio signal from a downmixed stereo audio signal according to the cross-correlation value between the left and right channels, and adjusts the multi-channel audio signal using spatial acoustic perceptual cues consisting of inter-channel cross-correlation and virtual sound source direction information. It thereby has the effect of enabling the center channel and surround channel signals of the multi-channel audio signal to be accurately restored.
In addition, by adjusting the multi-channel audio signal using spatial acoustic perceptual cues consisting of inter-channel cross-correlation and virtual sound source direction information, the present invention has the effect of mitigating spectral distortion.
The above objects, features, and advantages will become more apparent from the following detailed description given with reference to the accompanying drawings, so that those of ordinary skill in the art to which the present invention pertains will be able to easily practice the technical idea of the present invention. In describing the present invention, a detailed description of known technology related to the present invention is omitted where it is judged that it could unnecessarily obscure the gist of the present invention. Hereinafter, preferred embodiments of the present invention are described in detail with reference to the accompanying drawings.
FIG. 1 is a block diagram of an embodiment of a multi-channel audio signal decoding apparatus using cross-correlation according to the present invention, shown together with a multi-channel audio signal encoding apparatus.
As shown in FIG. 1, the multi-channel audio signal decoding apparatus 120 using cross-correlation according to the present invention includes a multi-channel signal generation unit 121 and a multi-channel signal adjustment unit 122. For reference, the encoding apparatus 110 includes a downmixing unit 112 and a spatial acoustic perceptual cue analysis unit 111.
Hereinafter, each component of the encoding apparatus 110 and the decoding apparatus 120 is described in detail.
The spatial acoustic perceptual cue analysis unit 111 receives a multi-channel audio signal and subband-filters it, channel by channel, into the subbands associated with each channel. It then analyzes the level differences and cross-correlation between adjacent channels in the subband-filtered audio signal of each channel to extract spatial acoustic perceptual cues. Here, the spatial acoustic perceptual cues include inter-channel cross-correlation values and virtual sound source direction information.
The downmixing unit 112 compresses the multi-channel audio signal received from the spatial acoustic perceptual cue analysis unit 111 into a stereo audio signal. That is, the downmixing unit 112 mixes the multi-channel audio spectrum subband-filtered by the spatial acoustic perceptual cue analysis unit 111 into a downmixed stereo audio signal, and converts the downmixed stereo audio signal into a time-domain signal.
A typical downmixing matrix used by the downmixing unit 112 is given by [Equation 1] below.
$$L_{dm} = L + SL + \frac{\sqrt{2}}{2} \times C$$
$$R_{dm} = R + SR + \frac{\sqrt{2}}{2} \times C$$
Here, L_dm and R_dm are the downmix left channel and downmix right channel stereo signals, respectively; L and R are the left and right channel signals of the multi-channel audio signal; SL and SR are the surround left and surround right channel signals; and C is the center channel signal. The commonly used low-frequency channel (LFE: Low Frequency Effect) signal can be handled in the same way as C, by dividing it and adding it to both downmix channels.
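As an illustration only, [Equation 1] can be sketched in Python as follows. The function name `downmix_to_stereo` and the use of plain sample lists are assumptions made for this sketch; the downmixing unit 112 described above mixes subband spectra, but because the mix is linear the same weights apply to either representation. The LFE channel is left out because no weight for it is given.

```python
import math

# Weight applied to the center channel in Equation 1.
CENTER_GAIN = math.sqrt(2) / 2


def downmix_to_stereo(left, right, surround_left, surround_right, center):
    """Downmix five equal-length channels to stereo following Equation 1.

    Hypothetical helper for illustration; each argument is a list of samples
    (or spectral coefficients) of the same length.
    """
    l_dm = [l + sl + CENTER_GAIN * c
            for l, sl, c in zip(left, surround_left, center)]
    r_dm = [r + sr + CENTER_GAIN * c
            for r, sr, c in zip(right, surround_right, center)]
    return l_dm, r_dm
```

With silent front and surround channels, both downmix channels carry the same weighted center signal, which is the common component that the decoder later tries to separate.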
Hereinafter, each component of the multi-channel audio signal decoding apparatus 120 using cross-correlation according to the present invention is described in detail.
The multi-channel signal generation unit 121 generates a downmix multi-channel audio signal by separating the downmixed stereo audio signal, into which the encoding apparatus 110 downmixed the multi-channel audio signal (the original signal), according to the cross-correlation value between the channels. Using adaptive filters, it adaptively extracts the signal common to the two channels and the signals independent of each other from the received downmixed stereo audio signal, and thereby partially separates the surround channel signals and the center channel signal. That is, the multi-channel signal generation unit 121 generates a downmix center channel signal and downmix surround channel signals from the downmixed stereo (left channel and right channel) audio signal of the encoding apparatus 110, using adaptive filters and the sum and difference signals of the two channels.
The multi-channel signal adjustment unit 122 adjusts the inter-channel cross-correlation value of the downmix multi-channel audio signal generated by the multi-channel signal generation unit 121 to match the inter-channel cross-correlation information of the original signal received from the encoding apparatus 110, and adjusts the per-subband power values of the adjusted multi-channel audio signal to match the virtual sound source direction information of the original signal received from the encoding apparatus 110. In other words, using the inter-channel cross-correlation and virtual sound source direction information extracted by the encoding apparatus 110, the multi-channel signal adjustment unit 122 adjusts the cross-correlation and power of the per-subband spectra of the downmix multi-channel audio signal generated by the multi-channel signal generation unit 121. By adjusting the cross-correlation and subband power of the multi-channel audio signal output from the multi-channel signal generation unit 121 in this way, it restores the original multi-channel signal. The multi-channel signal adjustment unit 122 then outputs the multi-channel audio signal whose cross-correlation and power have been adjusted.
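The power adjustment step can be pictured with the following Python sketch. It is not taken from this description: the mapping from a transmitted angle to a power share (linear interpolation between the two speaker angles), the function names, and the choice to keep the total power of the pair unchanged are all assumptions made for illustration. The cross-correlation adjustment is not shown.

```python
import math


def subband_power(coeffs):
    """Sum of squared magnitudes of one subband's spectral coefficients."""
    return sum(abs(x) ** 2 for x in coeffs)


def rebalance_pair(ch_a, ch_b, angle_deg, angle_a_deg, angle_b_deg):
    """Rescale one subband of two adjacent channels to match a source angle.

    Assumption: the share of power given to channel b grows linearly as the
    virtual source angle moves from speaker a's angle to speaker b's angle.
    The total power of the pair is preserved when both channels are non-silent.
    """
    p_a, p_b = subband_power(ch_a), subband_power(ch_b)
    total = p_a + p_b
    share_b = (angle_deg - angle_a_deg) / (angle_b_deg - angle_a_deg)
    share_b = min(1.0, max(0.0, share_b))
    # A silent channel cannot be scaled up, so its gain is left at zero.
    gain_a = math.sqrt((1.0 - share_b) * total / p_a) if p_a > 0.0 else 0.0
    gain_b = math.sqrt(share_b * total / p_b) if p_b > 0.0 else 0.0
    return [gain_a * x for x in ch_a], [gain_b * x for x in ch_b]
```

For example, a source angle midway between the two speakers would give each channel half of the pair's power under this assumed mapping.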
FIG. 2 is a detailed block diagram of an embodiment of the spatial acoustic perceptual cue analysis unit of FIG. 1 used in the present invention.
As shown in FIG. 2, the spatial acoustic perceptual cue analysis unit 111 includes first to fifth subband filtering units 201 to 205, one for each channel, and a spatial acoustic perceptual cue extraction unit 206.
The first to fifth subband filtering units 201 to 205 subband-filter the multi-channel audio signal input from outside, dividing each channel into subbands based on human auditory characteristics. They then pass the subband-filtered first to fifth channel audio signals to the spatial acoustic perceptual cue extraction unit 206.
The spatial acoustic perceptual cue extraction unit 206 analyzes the first to fifth channel audio signals subband-filtered by the first to fifth subband filtering units 201 to 205, and extracts spatial acoustic perceptual cues that include cross-correlation information between adjacent channels and virtual sound source direction information. That is, the spatial acoustic perceptual cue extraction unit 206 generates inter-channel cross-correlation information and virtual sound source direction information for each subband. It then passes the first to fifth channel audio signals to the downmixing unit 112 and transmits the generated inter-channel cross-correlation and virtual sound source direction information to the decoding apparatus 120.
Here, the inter-channel cross-correlation information may be calculated in the frequency domain for each subband signal. The virtual sound source direction information may be calculated, from the subband power ratio of adjacent channel signals, as an angle lying between the placement angles of the adjacent channels' speakers.
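A minimal Python sketch of these two cues for a single subband is given below. The exact formulas are not stated in this description, so both are assumptions: the cross-correlation is taken as the normalized magnitude of the complex inner product of the two subband spectra, and the angle is obtained by linear interpolation between the two speaker angles according to each channel's share of the power (a panning law such as the tangent law would be another plausible choice). All function names are hypothetical.

```python
import math


def subband_power(coeffs):
    """Sum of squared magnitudes of one subband's spectral coefficients."""
    return sum(abs(x) ** 2 for x in coeffs)


def inter_channel_correlation(ch_a, ch_b):
    """Normalized cross-correlation magnitude of two subband spectra (0 to 1)."""
    cross = sum(a * b.conjugate() for a, b in zip(ch_a, ch_b))
    denom = math.sqrt(subband_power(ch_a) * subband_power(ch_b))
    return abs(cross) / denom if denom > 0.0 else 0.0


def virtual_source_angle(ch_a, ch_b, angle_a_deg, angle_b_deg):
    """Angle between two adjacent speakers, weighted by subband power.

    Assumption: linear interpolation by power share; equal power places the
    virtual source midway between the two speakers.
    """
    p_a, p_b = subband_power(ch_a), subband_power(ch_b)
    total = p_a + p_b
    if total == 0.0:
        return 0.5 * (angle_a_deg + angle_b_deg)
    return angle_a_deg + (angle_b_deg - angle_a_deg) * (p_b / total)
```

Under these assumptions, two identical subband spectra give a correlation of 1 and an angle midway between the speakers, while a silent second channel places the virtual source at the first speaker.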
FIG. 3 is a detailed block diagram of an embodiment of the multi-channel signal generation unit of FIG. 1 according to the present invention.
As shown in FIG. 3, the multi-channel signal generation unit 121 includes a first surround channel signal generation unit 310, a second surround channel signal generation unit 320, and a center channel signal generation unit 330. The first surround channel signal generation unit 310 includes a first adaptive filter 311 and first and second subtractors 312 and 313. The second surround channel signal generation unit 320 includes a second adaptive filter 321 and third and fourth subtractors 322 and 323. The center channel signal generation unit 330 includes an adder 331 and a divider 332.
The multi-channel signal generation unit 121 generates a downmix multi-channel audio signal by separating the downmixed stereo audio signal, into which the encoding apparatus 110 downmixed the multi-channel audio signal (the original signal), according to the cross-correlation value between the channels. Here, the surround channel signal components of the downmix multi-channel audio signal are obtained by using the difference between the downmix left channel signal and the downmix right channel signal to update the coefficients of the first and second adaptive filters 311 and 321. Through the smoothing effect of the adaptive filters, the multi-channel signal generation unit 121 can eliminate the phenomenon in which particular spectral components are distorted by phase differences.
Hereinafter, each component of the multi-channel signal generation unit 121 is described in detail.
The first surround channel signal generation unit 310 generates a downmix surround left channel signal by using the cross-correlation value to remove the center channel signal component and the surround right channel signal component from the downmix left channel audio signal of the downmixed stereo audio signal. That is, the first surround channel signal generation unit 310 receives the downmix right channel signal and the downmix left channel signal, and generates the downmix surround left channel signal from them using the first adaptive filter 311 and the first and second subtractors 312 and 313.
Here, the first adaptive filter 311 suppresses the center channel signal component, which is the common signal component, and passes the surround signal, which is the independent signal component. The first subtractor 312 outputs an error signal by subtracting the downmix left channel signal that has passed through the first adaptive filter 311 from the downmix right channel signal. This error signal is used to update the coefficients of the first adaptive filter 311. The second subtractor 313 then generates the downmix surround left channel signal by subtracting the output signal of the first subtractor 312 from the downmix left channel signal. Here, the output signal of the first subtractor 312 is subtracted from the downmix surround left channel signal in order to maximize the front-rear cross-correlation.
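The wiring of the first adaptive filter 311 and the two subtractors 312 and 313 can be sketched in Python as shown below. The sketch follows the signal flow of this paragraph literally; the coefficient update rule is not specified in this description, so a normalized LMS (NLMS) update is assumed here, and the tap count, step size `mu`, and function name are hypothetical.

```python
def surround_left_sketch(l_dm, r_dm, taps=8, mu=0.5, eps=1e-8):
    """Sample-by-sample sketch of the first surround channel signal path.

    l_dm, r_dm: equal-length lists of downmix left/right samples.
    Assumption: NLMS adaptation; the description only says "adaptive filter".
    """
    weights = [0.0] * taps   # coefficients of adaptive filter 311
    history = [0.0] * taps   # most recent downmix left samples, newest first
    output = []
    for l, r in zip(l_dm, r_dm):
        history = [l] + history[:-1]
        # Adaptive filter 311 applied to the downmix left channel signal.
        filtered = sum(w * x for w, x in zip(weights, history))
        # First subtractor 312: downmix right minus filtered downmix left.
        error = r - filtered
        # The error signal drives the coefficient update (assumed NLMS rule).
        norm = eps + sum(x * x for x in history)
        weights = [w + mu * error * x / norm for w, x in zip(weights, history)]
        # Second subtractor 313: downmix left minus the output of 312.
        output.append(l - error)
    return output
```

When the right channel is silent, the error stays at zero, the coefficients never move, and the left channel passes through unchanged, which is a simple sanity check on the wiring rather than a statement about separation quality.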
The second surround channel signal generation unit 320 generates a downmix surround right channel signal by using the cross-correlation value to remove the center channel signal component and the surround left channel signal component from the downmix right channel audio signal of the downmixed stereo audio signal. That is, the second surround channel signal generation unit 320 receives the downmix left channel signal and the downmix right channel signal, and generates the downmix surround right channel signal from them using the second adaptive filter 321 and the third and fourth subtractors 322 and 323.
ì¬ê¸°ì, ì 2 ì ìíí°(321)ë ê³µíµë ì í¸ ì±ë¶ì¸ ì¤ìì±ë ì í¸ ì±ë¶ì ìµì íê³ ë 립ë ì í¸ ì±ë¶ì¸ ìë¼ì´ë ì í¸ë¥¼ íµê³¼ìí¤ë 기ë¥ì ìííë¤. ì 3 ê°ì°ê¸°(322)ë ë¤ì´ë¯¹ì± ì¢ì±ë ì í¸ìì ë¤ì´ë¯¹ì± ì°ì±ë ì í¸ë¥¼ ë¹¼ì ì¤ì°¨ì í¸ë¥¼ ì¶ë ¥íë¤. ì´ë, ì¶ë ¥ë ì¤ì°¨ì í¸ë ì 2 ì ìíí°(321)ì ê³ì를 ê°±ì íëë° ì¬ì©ëë¤. ê·¸ë¦¬ê³ ì 4 ê°ì°ê¸°(323)ë ë¤ì´ë¯¹ì± ì°ì±ë ì í¸ìì ì 2 ê°ì°ê¸°(322)ì ì¶ë ¥ ì í¸ë¥¼ ë¹¼ì ë¤ì´ë¯¹ì± ìë¼ì´ë ì°ì±ë ì í¸ë¥¼ ì¶ë ¥íë¤. ì¬ê¸°ì, ì 3 ê°ì°ê¸°(322)ì ì¶ë ¥ ì í¸ë¥¼ ë¤ì´ë¯¹ì± ìë¼ì´ë ì°ì±ë ì í¸ìì ë¹¼ë ê²ì ì íë°© ìí¸ìê´ì ìµëíí기 ìí¨ì´ë¤.Here, the second adaptive filter 321 suppresses the central channel signal component that is a common signal component and passes the surround signal that is an independent signal component. The third subtractor 322 subtracts the downmixing right channel signal from the downmixing left channel signal to output an error signal. In this case, the output error signal is used to update the coefficient of the second adaptive filter 321. The fourth subtractor 323 subtracts the output signal of the second subtractor 322 from the downmixing right channel signal to output the downmixing surround right channel signal. Here, subtracting the output signal of the third subtractor 322 from the downmix surround right channel signal is for maximizing forward and backward cross-correlation.
ì¤ìì±ë ì í¸ ìì±ë¶(330)ë ë¤ì´ë¯¹ì± ì¤í ë ì¤ ì¤ëì¤ ì í¸ ì¤ ì¢ì±ë ì¤ ëì¤ ì í¸ì ì°ì±ë ì¤ëì¤ ì í¸ë¥¼ ê²°í©íì¬ ë¤ì´ë¯¹ì± ì¤ìì±ë ì í¸ë¥¼ ìì±íë¤. ì¦, ì¤ìì±ë ì í¸ ìì±ë¶(330)ë ë¤ì´ë¯¹ì± ì¢ì±ë ì í¸ì ë¤ì´ë¯¹ì± ì°ì±ë ì í¸ë¥¼ ì ë ¥ë°ê³ , ë¤ì´ë¯¹ì± ë ì±ë ì í¸ë¥¼ ëí í ë°ì¼ë¡ ëëì´ì ë¤ì´ë¯¹ì± ì¤ìì±ë ì í¸ë¥¼ ìì±íë¤.The center channel signal generator 330 generates a downmixed center channel signal by combining a left channel audio signal and a right channel audio signal among the downmixed stereo audio signals. That is, the center channel signal generator 330 receives the downmixing left channel signal and the downmixing right channel signal, adds the downmixing two channel signals, and divides them in half to generate the downmixing central channel signal.
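The center channel rule stated above, adding the two downmixed channels and halving the sum, amounts to a per-sample average. A minimal sketch, in which the function name `center_channel` is illustrative rather than taken from the patent:

```python
def center_channel(left, right):
    """Average the downmixed left and right channels sample by sample."""
    if len(left) != len(right):
        raise ValueError("left and right must have the same length")
    return [(l + r) / 2.0 for l, r in zip(left, right)]
```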
FIG. 4 is a detailed configuration diagram of an embodiment of the multi-channel signal adjusting unit of FIG. 1 according to the present invention.
As shown in FIG. 4, the multi-channel signal adjusting unit 122 includes sixth to tenth subband filtering units 401 to 405, first and second cross-correlation adjustment units 406 and 407, a multi-channel power ratio calculator 408, and a signal converter 409.
The multi-channel signal adjusting unit 122 adjusts the inter-channel cross-correlation values of the downmixed multi-channel audio signal generated by the multi-channel signal generator 121 to match the inter-channel cross-correlation information of the original signal, and then adjusts the power value of each subband of the adjusted multi-channel audio signal to match the virtual sound source direction information of the original signal. That is, the multi-channel signal adjusting unit 122 restores the original multi-channel audio signal by adjusting the cross-correlation and subband power of the multi-channel audio signal generated by the multi-channel signal generator 121.
Each component of the multi-channel signal adjusting unit 122 is described in detail below.
The sixth to tenth subband filtering units 401 to 405 each perform subband filtering on the downmixed multi-channel audio signal generated by the multi-channel signal generator 121.
The multi-channel power ratio calculator 408 calculates the power ratio for each subband of the multi-channel signal from the virtual sound source direction information received from the encoding apparatus 110.
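This section does not state how a direction value is mapped to a power ratio. Purely as an illustration, the sketch below assumes a constant-power (sine/cosine) panning law between two adjacent channels, with the direction given as an angle `theta` in the range 0 to π/2. The function name `pair_power_ratio` and the panning law itself are assumptions, not the method of the patent.

```python
import math

def pair_power_ratio(theta):
    """Power shares of two adjacent channels under an ASSUMED constant-power pan.

    theta = 0 puts all power in the first channel, theta = pi/2 puts all
    power in the second. The two shares always sum to 1.
    """
    if not 0.0 <= theta <= math.pi / 2:
        raise ValueError("theta must lie in [0, pi/2]")
    g1, g2 = math.cos(theta), math.sin(theta)  # amplitude gains
    return g1 * g1, g2 * g2                    # power shares
```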
The first and second cross-correlation adjustment units 406 and 407 adjust the inter-channel cross-correlation values of the downmixed multi-channel audio signal, subband-filtered by the sixth to tenth subband filtering units 401 to 405, to match the inter-channel cross-correlation information of the original signal.
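The text does not describe the internal operation of units 406 and 407. One possible way to bring a pair of subband signals to a target correlation is sketched below: the signal is split into a part parallel to the reference and an orthogonal residual, and the two parts are remixed so that the zero-lag normalized correlation equals the target while the signal power is unchanged. The function names and the decomposition approach are assumptions made for illustration.

```python
import math

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def correlation(x, y):
    """Zero-lag normalized cross-correlation of two equal-length signals."""
    return dot(x, y) / math.sqrt(dot(x, x) * dot(y, y))

def adjust_correlation(ref, sig, target):
    """Return a version of sig whose correlation with ref equals target.

    The power of sig is preserved. Illustrative approach only; requires
    that ref is nonzero and sig is not an exact scaled copy of ref.
    """
    if not -1.0 <= target <= 1.0:
        raise ValueError("target correlation must lie in [-1, 1]")
    scale = dot(ref, sig) / dot(ref, ref)
    residual = [s - scale * r for r, s in zip(ref, sig)]  # part of sig orthogonal to ref
    ref_norm = math.sqrt(dot(ref, ref))
    sig_norm = math.sqrt(dot(sig, sig))
    res_norm = math.sqrt(dot(residual, residual))
    if res_norm == 0.0:
        raise ValueError("sig has no component independent of ref")
    a = target * sig_norm / ref_norm
    b = math.sqrt(1.0 - target * target) * sig_norm / res_norm
    return [a * r + b * u for r, u in zip(ref, residual)]
```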
The signal converter 409 adjusts the power value of each subband of the multi-channel audio signal, whose cross-correlation has been adjusted by the first and second cross-correlation adjustment units 406 and 407, to match the multi-channel power ratio calculated by the multi-channel power ratio calculator 408, and converts the result to the time domain. That is, in accordance with the power ratio calculated by the multi-channel power ratio calculator 408, the signal converter 409 adjusts, subband by subband, the power value of the multi-channel audio signal corresponding to the power of the surround signals output by the first and second cross-correlation adjustment units 406 and 407, and converts the power-adjusted signal to the time domain.
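The power adjustment within one subband can be sketched as a per-channel gain that redistributes the total subband power according to the target ratios. The sketch assumes the ratios are given as fractions that sum to 1 and that the total power across channels is kept constant. Both are assumptions, as are the function names, and the conversion back to the time domain is not shown.

```python
import math

def power(x):
    return sum(v * v for v in x)

def match_power_ratios(channels, ratios):
    """Scale each channel's subband signal so its power share equals its ratio.

    channels: list of per-channel sample lists for ONE subband.
    ratios:   target power shares, assumed to sum to 1.
    The total power across channels is preserved.
    """
    if len(channels) != len(ratios):
        raise ValueError("one ratio per channel is required")
    total = sum(power(ch) for ch in channels)
    adjusted = []
    for ch, ratio in zip(channels, ratios):
        p = power(ch)
        # A silent channel cannot be scaled up to a nonzero power.
        gain = math.sqrt(ratio * total / p) if p > 0.0 else 0.0
        adjusted.append([gain * v for v in ch])
    return adjusted
```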
FIG. 5 is a flowchart of an embodiment of a method of decoding a multi-channel audio signal using cross-correlation according to the present invention.
First, regarding the encoding method, the spatial acoustic perceptual cue analysis unit 111 subband-filters the multi-channel audio signal and extracts, from each filtered channel audio signal, spatial acoustic perceptual cues that include the cross-correlation between adjacent channels and the virtual sound source direction information.
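As an illustration of the adjacent-channel cross-correlation cue, the sketch below computes a zero-lag normalized cross-correlation for each pair of neighboring channels within one subband. The channel ordering, the zero-lag definition, and the function names are assumptions; the text does not give the exact formula.

```python
import math

def normalized_cross_correlation(x, y):
    """Zero-lag normalized cross-correlation; 0.0 if either signal is silent."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0.0 else 0.0

def adjacent_channel_correlations(subband_channels):
    """subband_channels: per-channel sample lists, in loudspeaker order (assumed)."""
    return [
        normalized_cross_correlation(a, b)
        for a, b in zip(subband_channels, subband_channels[1:])
    ]
```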
The downmixer 112 downmixes the filtered channel audio signals into a stereo audio signal and encodes it.
The multi-channel audio signal decoding method according to the present invention is as follows.
The multi-channel signal generator 121 takes the downmixed stereo signal received from the encoding apparatus 110, that is, the stereo audio signal into which the multi-channel audio signal (the original signal) was downmixed, and separates it according to the inter-channel cross-correlation value to generate a downmixed multi-channel audio signal (502).
The multi-channel signal adjusting unit 122 then adjusts the inter-channel cross-correlation values of the generated downmixed multi-channel audio signal to match the inter-channel cross-correlation information of the original signal (504).
Thereafter, the multi-channel signal adjusting unit 122 adjusts the power value of each subband of the adjusted multi-channel audio signal to match the virtual sound source direction information of the original signal, and decodes the signal (506).
Meanwhile, the method of the present invention described above can be written as a computer program. The code and code segments constituting the program can be readily inferred by a computer programmer skilled in the art. The written program is stored in a computer-readable recording medium (information storage medium) and is read and executed by a computer to implement the method of the present invention. The recording medium includes any type of computer-readable recording medium.
Those of ordinary skill in the art to which the present invention pertains can make various substitutions, modifications, and changes without departing from the technical spirit of the present invention, so the invention described above is not limited by the foregoing embodiments and the accompanying drawings.
FIG. 1 is a configuration diagram of an embodiment of an apparatus for decoding a multi-channel audio signal using cross-correlation according to the present invention;
FIG. 2 is a detailed configuration diagram of an embodiment of the spatial acoustic perceptual cue analysis unit of FIG. 1 used in the present invention;
FIG. 3 is a detailed configuration diagram of an embodiment of the multi-channel signal generator of FIG. 1 according to the present invention;
FIG. 4 is a detailed configuration diagram of an embodiment of the multi-channel signal adjusting unit of FIG. 1 according to the present invention;
FIG. 5 is a flowchart of an embodiment of a method of decoding a multi-channel audio signal using cross-correlation according to the present invention.
* Explanation of reference numerals for the main parts of the drawings *
120: decoding apparatus
121: multi-channel signal generator
122: multi-channel signal adjusting unit
310: first surround channel signal generator
320: second surround channel signal generator
311: first adaptive filter
321: second adaptive filter
330: center channel signal generator
401 to 405: sixth to tenth subband filtering units
406: first cross-correlation adjustment unit
407: second cross-correlation adjustment unit
408: multi-channel power ratio calculator
409: signal converter
Claims (8), translated from Korean

1. A multi-channel audio signal decoding apparatus using cross-correlation, comprising:
multi-channel signal generating means for generating a plurality of channel-specific audio signals from a downmixed stereo audio signal using a cross-correlation value between the left and right channels; and
multi-channel signal adjusting means for adjusting the cross-correlation values and subband power values of the generated plurality of channel-specific audio signals using inter-channel cross-correlation information and virtual sound source direction information of an original signal, so that the original signal of the downmixed stereo audio signal can be restored.

2. The apparatus of claim 1, wherein the multi-channel signal generating means comprises:
first surround channel signal generating means for generating a downmixed surround left channel signal by removing, using the cross-correlation value, a center channel signal component and a surround right channel signal component from the downmixed left channel audio signal of the downmixed stereo audio signal;
second surround channel signal generating means for generating a downmixed surround right channel signal by removing, using the cross-correlation value, the center channel signal component and a surround left channel signal component from the downmixed right channel audio signal of the downmixed stereo audio signal; and
center channel signal generating means for generating a downmixed center channel signal by combining the left channel audio signal and the right channel audio signal of the downmixed stereo audio signal.

3. The apparatus of claim 2, wherein the first surround channel signal generating means comprises an adaptive filter and a subtractor that remove the center channel signal component and the surround right channel signal component from the downmixed left channel audio signal using the cross-correlation value.

4. The apparatus of claim 2, wherein the second surround channel signal generating means comprises an adaptive filter and a subtractor that remove the center channel signal component and the surround left channel signal component from the downmixed right channel audio signal using the cross-correlation value.

5. The apparatus according to any one of claims 1 to 4, wherein the multi-channel signal adjusting means comprises:
multi-channel power ratio calculating means for calculating a multi-channel power ratio from the virtual sound source direction information received from the encoding apparatus;
a plurality of subband filtering means for subband-filtering each of the downmixed multi-channel audio signals generated by the multi-channel signal generating means;
a plurality of cross-correlation adjusting means for adjusting the inter-channel cross-correlation values of the subband-filtered downmixed multi-channel audio signals to match the inter-channel cross-correlation information of the original signal; and
signal converting means for adjusting the power value of each subband of the cross-correlation-adjusted multi-channel audio signal to match the calculated multi-channel power ratio and converting the result to the time domain.

6. A multi-channel audio signal decoding method using cross-correlation, comprising:
a multi-channel signal generating step of generating a plurality of channel-specific audio signals from a downmixed stereo audio signal using a cross-correlation value between the left and right channels; and
a multi-channel signal adjusting step of adjusting the cross-correlation values and subband power values of the generated plurality of channel-specific audio signals using inter-channel cross-correlation information and virtual sound source direction information of an original signal, so that the original signal of the downmixed stereo audio signal can be restored.

7. The method of claim 6, wherein the multi-channel signal generating step comprises:
generating a downmixed surround left channel signal by removing, using the cross-correlation value, a center channel signal component and a surround right channel signal component from the downmixed left channel audio signal of the downmixed stereo audio signal;
generating a downmixed surround right channel signal by removing, using the cross-correlation value, the center channel signal component and a surround left channel signal component from the downmixed right channel audio signal of the downmixed stereo audio signal; and
generating a downmixed center channel signal by combining the left channel audio signal and the right channel audio signal of the downmixed stereo audio signal.

8. The method according to claim 6 or 7, wherein the multi-channel signal adjusting step comprises:
a multi-channel power ratio calculating step of calculating a multi-channel power ratio from the virtual sound source direction information received from the encoding apparatus;
a subband filtering step of subband-filtering each of the downmixed multi-channel audio signals generated in the multi-channel signal generating step;
a cross-correlation adjusting step of adjusting the inter-channel cross-correlation values of the subband-filtered downmixed multi-channel audio signals to match the inter-channel cross-correlation information of the original signal; and
a signal converting step of adjusting the power value of each subband of the cross-correlation-adjusted multi-channel audio signal to match the calculated multi-channel power ratio and converting the result to the time domain.
Application KR1020070107406A: Apparatus and method for decoding multi-channel audio signal using cross-correlation
Priority date: 2006-12-04
Filing date: 2007-10-24
Publication: KR100917845B1 (en)
Status: Expired - Fee Related
Applications claiming priority: KR1020060121683 (2006-12-04), KR20060121683 (2006-12-04)
Family ID: 39806185

Legal events
2007-10-24 Patent application (event code PA01091R01D)
2007-10-24 PA0201 Request for examination
2008-06-10 PG1501 Laying open of application
2009-08-31 E701 Decision to grant or registration of patent right
2009-08-31 PE0701 Decision of registration (event code PE07011S01D)
2009-09-10 GRNT Written decision to grant
2009-09-10 PR0701 Registration of establishment (event code PR07011E01D)
2009-09-10 PR1002 Payment of registration fee (payment date 2009-09-11; annual numbers 1 to 3)
2009-09-18 PG1601 Publication of registration
2012-09-11 LAPS Lapse due to unpaid annual fee
2012-09-11 PC1903 Unpaid annual fee