Showing content from https://patents.google.com/patent/CN100353813C/en below:

CN100353813C - Stereo human voice cancellation method and related device

Publication number
CN100353813C
Authority
CN
China
Prior art keywords
signal
pass
low
frequency band
channel
Prior art date
2003-09-01
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB031557627A
Other languages
Chinese (zh)
Other versions
CN1592499A (en
Inventor
陈互志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
MediaTek Inc
Original Assignee
MediaTek Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-09-01
Filing date
2003-09-01
Publication date
2007-12-05
2003-09-01 Application filed by MediaTek Inc
2003-09-01 Priority to CNB031557627A
2005-03-09 Publication of CN1592499A
2007-12-05 Application granted
2007-12-05 Publication of CN100353813C
2023-09-01 Anticipated expiration
Status: Expired - Fee Related
Abstract

The invention provides a stereo vocal cancellation method and related device that cancel the vocal in two stereo channel signals, provide two corresponding output signals, and enhance the stereo effect of those output signals. The method comprises: generating a mono signal from the combination of the two channel signals; performing vocal cancellation on each of the two channel signals according to the difference between that channel signal and the mono signal; and applying low-frequency and high-frequency band compensation to generate the two output signals. Because the method performs vocal cancellation separately on the two channels, the two output signals differ substantially in frequency bands other than the high-frequency band, which enhances the stereo effect.

Description (translated from Chinese): Stereo vocal cancellation method and related device

Technical Field

The invention provides a method and related device for enhancing the stereo effect during vocal cancellation, and in particular a method and related device that perform vocal cancellation separately on each channel signal so as to enhance the stereo effect.

Background Art

With the advance and spread of information and electronic technology, forms of entertainment in modern society have become increasingly diverse. For example, a sing-along system, commonly known as karaoke, plays the background music of a song so that users can sing along without a live band and enjoy a professional-grade entertainment environment. In general, however, the songs released by the entertainment industry include a lead vocal. To meet the needs of sing-along systems, vocal cancellation techniques have therefore been developed to suppress the vocal in a song and leave the background music for the sing-along system to use.

Please refer to FIG. 1, a schematic diagram of the functional blocks involved when a player 10 performs vocal cancellation using the known technique. Modern playback systems can generally reproduce stereo sound with two (or more) channels, using separate speaker modules to play the signals of the different channels so that the listener hears music with a greater sense of presence. The player 10 includes a sound source circuit 12 that provides two channel signals (such as left and right channels), a signal module 14 that performs vocal cancellation, and two speaker modules 16A, 16B that reproduce the stereo sound. The sound source circuit 12 may be an optical disc reading mechanism that uses a read head 18 to access song data on an optical disc 20 and parse it (for example, by appropriate demodulation and decoding). To support stereo playback, entertainment media (such as optical discs storing songs) are recorded with signals for the different channels. After accessing the data on the optical disc 20, the sound source circuit 12 recovers the two stereo channel signals PLi and PRi. The signal module 14 performs vocal cancellation on the channel signals PLi and PRi to generate the output signals PLo and PRo, respectively. The speaker modules 16A, 16B may each include their own digital-to-analog converter, power amplifier, speaker, and other circuits to convert the output signals PLo and PRo into sound waves.

To perform vocal cancellation, the known signal module 14 includes two high-pass modules 26A, 26B, a low-pass module 28, and a vocal cancellation module 22. The high-pass modules 26A, 26B high-pass filter the channel signals PLi and PRi, respectively, to generate the corresponding high-pass signals PLh and PRh; the low-pass module 28 low-pass filters a signal Ps to generate the corresponding low-pass signal Pl. The vocal cancellation module 22 subtracts the two channel signals PLi and PRi from each other to generate a vocal-cancelled intermediate signal PVC. Summing the high-pass signal PLh corresponding to the channel signal PLi, the low-pass signal Pl, and the intermediate signal PVC yields the output signal PLo; summing the high-pass signal PRh corresponding to the channel signal PRi, the low-pass signal Pl, and the intermediate signal PVC yields the output signal PRo.
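The mixing stage of the known signal module 14 can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `prior_art_mix` is hypothetical, the signals are plain lists of samples, the filtered signals Pl, PLh, and PRh are taken as already computed, and the subtraction order PLi - PRi is assumed because the text does not state it.

```python
def prior_art_mix(pli, pri, pl, plh, prh):
    """Mix the outputs of the known scheme (hypothetical helper).

    pli, pri: input channel samples (PLi, PRi)
    pl:       low-pass signal Pl, assumed already filtered
    plh, prh: high-pass signals PLh, PRh, assumed already filtered
    """
    # One shared intermediate signal; the order PLi - PRi is an assumption.
    pvc = [a - b for a, b in zip(pli, pri)]
    # Both outputs reuse the same PVC and the same Pl.
    plo = [v + lo + hi for v, lo, hi in zip(pvc, pl, plh)]
    pro = [v + lo + hi for v, lo, hi in zip(pvc, pl, prh)]
    return plo, pro
```

Because both outputs are built from the same `pvc` and `pl`, any difference between `plo` and `pro` can come only from `plh` and `prh`.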

To explain the principle of the known vocal cancellation technique, refer first to FIG. 2 (together with FIG. 1). FIG. 2 is a schematic diagram of typical spectra of the stereo channel signals and related signals. In each spectrum of FIG. 2 the horizontal axis is frequency and the vertical axis is the spectral magnitude (for example, the absolute value).

As those skilled in the art know, songs released by the entertainment industry generally create the stereo effect by mixing different background music signals into the different channel signals, while the vocal, treated as the main signal, is usually mixed into every channel with equal strength. When the user plays the different channels through separate speaker modules, the vocal seems to come from directly in front (because its components in the two channels are equal), while the differing background music in each channel produces the stereo effect, as if the sources of the background music surrounded the listener. In FIG. 2, the spectrum Vf represents the spectrum of the vocal signal, and the distinct spectra Lmf and Rmf represent the spectra of the two different background music signals. As described above, adding the background music spectrum Lmf to the vocal spectrum Vf gives the spectrum Lf of one stereo channel signal, and adding the background music spectrum Rmf to the vocal spectrum Vf gives the spectrum Rf of the other channel signal. The channel signals PLi and PRi obtained by the sound source circuit 12 in FIG. 1, for example, may have spectra like Lf and Rf. Because of the physiological limits of human voice production, the voice cannot go very low in frequency or above a certain high frequency, so the spectrum of the vocal signal is usually confined to a certain frequency range. The frequencies fl and fh marked in FIG. 2 represent the lower and upper frequency limits of the vocal signal, and the vocal spectrum Vf is concentrated in the mid-frequency band BM between fl and fh. In contrast to the vocal spectrum Vf, which is confined to the mid-frequency band BM, the combined spectrum of the various instruments in the background music extends over a wider frequency range; as FIG. 2 shows, the background music spectra Lmf and Rmf are present even in the low-frequency band BL below fl and the high-frequency band BH above fh. Consequently, the spectra Lf and Rf of the channel signals also extend into the low-frequency band BL and the high-frequency band BH in addition to the mid-frequency band BM where the vocal spectrum lies.

Because the vocal component is equal in each channel signal, the known signal module 14 (see FIG. 1) subtracts the two channel signals PLi and PRi in the vocal cancellation module 22 so as to remove the vocal common to both channels, producing the vocal-cancelled intermediate signal PVC. In subtracting PLi and PRi, however, the components of the channel signals in the low-frequency band BL and the high-frequency band BH are also subtracted, whereas the intent of vocal cancellation is to preserve the background music components that extend into BL and BH. The signal module 14 therefore also uses the high-pass modules 26A, 26B and the low-pass module 28 for high-frequency and low-frequency compensation. The high-pass module 26A extracts the components of the channel signal PLi in the high-frequency band BH as the high-pass signal PLh. The source signal Ps of the low-pass module 28 may be either of the channel signals PLi and PRi; the low-pass module 28 extracts the components of Ps in the low-frequency band BL as the low-pass signal Pl, which is equivalent to extracting the low-frequency components of the channel signal. Summing the high-pass signal PLh, the low-pass signal Pl, and the intermediate signal PVC compensates for the high- and low-frequency components that PVC lost during vocal cancellation and yields the output signal PLo.
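The cancellation step can be written out under the idealised model described for FIG. 2. The symbols $L_m$, $R_m$, and $V$ are introduced here only for illustration, as the signals whose spectra are Lmf, Rmf, and Vf; the subtraction order is likewise an assumption.

```latex
\begin{aligned}
P_{Li} &= L_m + V, \qquad P_{Ri} = R_m + V, \\
P_{VC} &= P_{Li} - P_{Ri} = (L_m + V) - (R_m + V) = L_m - R_m .
\end{aligned}
```

The vocal $V$ drops out only to the extent that it really is mixed identically into both channels. Whatever background music the two channels share drops out with it, which is why the low- and high-frequency compensation described above is needed.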

Similarly, after the high-pass module 26B extracts the high-frequency components of the channel signal PRi as the high-pass signal PRh, the signal module 14 uses the high-pass signal PRh and the low-pass signal Pl to apply high- and low-frequency compensation to the intermediate signal PVC and generate the output signal PRo. In general, the components of the channel signals in the low-frequency band BL are not very directional, and differences between the low-frequency components of PRi and PLi contribute little to the stereo effect, so the signal module 14 applies the same low-pass signal Pl as low-frequency compensation to both output signals PLo and PRo. By contrast, the components of PRi and PLi in the high-frequency band BH are more directional, and differences between the two channels in that band make the stereo effect more noticeable; the signal module 14 therefore uses the separately high-pass-filtered signals PRh and PLh for high-frequency compensation, so that the difference between the high-frequency components of the output signals PRo and PLo conveys the stereo effect. In summary, after receiving the two channel signals PLi and PRi, the known signal module 14 uses the single intermediate signal PVC generated by the vocal cancellation module 22 as the basic vocal-cancelled result for both channels, then applies the low-pass signal Pl and the high-pass signals PLh and PRh as low- and high-frequency compensation to generate the two output signals PLo and PRo, which serve as the vocal-cancelled versions of PLi and PRi. In this way the existing signal module 14 suppresses the vocal in the two channel signals and attempts to preserve the stereo effect of the background music in the two output signals PLo and PRo.

Please refer to FIG. 3, a schematic diagram of the spectra of the relevant signals when the signal module 14 of FIG. 1 operates. Continuing the example of FIG. 2, if the spectra of the channel signals PLi and PRi in FIG. 1 are the spectra Lf and Rf of FIG. 2, then after the signal module 14 operates, the spectra of the output signals PLo and PRo are as shown by the spectra PLof and PRof in FIG. 3. In FIG. 3 the horizontal axis is again frequency and the vertical axis is spectral magnitude (for example, the absolute value). The frequencies fl and fh and the low-, mid-, and high-frequency bands BL, BM, and BH marked in FIG. 3 have the same meaning as in FIG. 2 and the related description. To make the two spectra easier to compare, FIG. 3 also draws the spectrum PRof as a dashed line in the same coordinate system as the solid-line spectrum PLof.

Because the output signals PLo and PRo generated by the known signal module 14 of FIG. 1 both contain the same intermediate signal PVC and the same low-pass signal Pl, and differ only in that they are compensated with different high-pass signals PLh and PRh, FIG. 3 shows that the main difference between the spectra PLof and PRof is concentrated in the high-frequency band BH; in the mid-frequency band BM and the low-frequency band BL the two spectra are almost identical. Although high-frequency components are the more directional ones in a stereo image, most of the signal energy of PLof and PRof is still concentrated in the mid- and low-frequency bands BM and BL, with less energy in the high-frequency band BH, so the overall difference between PLof and PRof is small. When the player 10 plays the output signals PLo and PRo, the stereo effect it can deliver is greatly reduced because the two differ so little. This is one of the drawbacks of the known technique. In other words, because both output signals PLo and PRo of the known signal module 14 use the same intermediate signal PVC as the basic vocal-cancelled signal and only the high-frequency compensation uses the different high-pass signals PLh and PRh, the difference between PLo and PRo is confined to the high-frequency, low-energy part of the spectrum. Not enough of the difference between the original channel signals PLi and PRi is retained to produce a pronounced stereo effect, so the stereo effect after vocal cancellation is greatly reduced and the user cannot enjoy accompaniment music with a surround effect.
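The limitation follows directly from the mixing equations of the known signal module. The expressions below use only the signal names defined for FIG. 1 and treat the mixing as a plain sum, as the text describes.

```latex
\begin{aligned}
P_{Lo} &= P_{VC} + P_{l} + P_{Lh}, \\
P_{Ro} &= P_{VC} + P_{l} + P_{Rh}, \\
P_{Lo} - P_{Ro} &= P_{Lh} - P_{Rh} .
\end{aligned}
```

The shared terms cancel exactly in the difference, so the two outputs can differ only where the high-pass signals differ, that is, in the band BH passed by the high-pass modules.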

Summary of the Invention

The object of the present invention is therefore to provide an improved vocal cancellation method and related device in which the channel signals of different channels retain a considerable degree of signal difference after vocal cancellation, producing a better stereo effect and overcoming the drawbacks of the known technique.

In the known technique, a single intermediate signal is generated from the two channel signals as the main vocal-cancelled signal; the two output signals are produced from this intermediate signal after low-frequency compensation and channel-specific high-frequency compensation. Because both output signals are based on the same intermediate signal and differ only in their high-frequency components, they cannot deliver a good stereo effect.

In the present invention, a mono signal is generated from the average of the two channel signals, and the difference between each channel signal and this mono signal serves as the vocal-cancelled intermediate signal for that channel; the output signal for each channel is generated after low-frequency compensation and the corresponding high-frequency compensation are applied to that channel's intermediate signal. Because the intermediate signal for each channel is derived from the difference between that channel's signal and the mono signal, the intermediate signals of the two output signals differ from each other. As a result, the differences between the channel signals in the low- and mid-frequency bands are retained to a considerable degree even after vocal cancellation, so the output signals deliver a better stereo effect and the user can enjoy accompaniment music with a pronounced surround effect.

Brief Description of the Drawings

FIG. 1 is a schematic diagram of the functional blocks with which a known player performs vocal cancellation.

FIG. 2 is a schematic diagram of typical spectra of the stereo channel signals and related signals.

FIG. 3 is a schematic diagram of the spectra of the output signals after the player of FIG. 1 performs vocal cancellation.

FIG. 4 is a schematic diagram of the functional blocks with which the present invention implements vocal cancellation in a player.

FIG. 5 is a schematic diagram of the spectra of the output signals after the player of FIG. 4 performs vocal cancellation.

FIG. 6 is a schematic diagram of the function of the signal module of FIG. 4 implemented as program code.

Description of Reference Symbols

10, 30: player;

12, 32: sound source circuit;

14, 34: signal module;

16A-16B, 36A-36B: speaker module;

18, 38: read head;

20, 40: optical disc;

22, 42A-42B: vocal cancellation module;

26A-26B, 46A-46B: high-pass module;

28, 48: low-pass module;

50: mono processing module;

52A, 52B: mixing unit;

100: program code;

PRi, PLi, Ri, Li: channel signal;

PVC, LVC, RVC: intermediate signal;

PLh, PRh, Lh, Rh: high-pass signal;

Pl, Sl: low-pass signal;

PLo, PRo, Lo, Ro: output signal;

Ps, S: signal;

M: mono signal;

Lmf, Rmf, Vf, Lf, Rf, PLof, PRof: spectrum;

BL, BM, BH: frequency band;

fl, fh: frequency.

Detailed Description

Please refer to FIG. 4, a schematic diagram of the functional blocks with which the technique of the present invention implements vocal cancellation in a player 30. The player 30 includes a sound source circuit 32, a signal module 34, and speaker modules 36A, 36B for reproducing the stereo sound. The sound source circuit 32 may be an optical disc reading mechanism that uses a read head 38 to read the signal data of a song from an optical disc 40 and recover the stereo channel signals Li and Ri. The signal module 34 implements the vocal cancellation function of the present invention, generating the two vocal-cancelled output signals Lo and Ro from the two channel signals Li and Ri. The signal module 34 includes a mono processing module 50 and a low-pass module 48; corresponding to the two channel signals Li and Ri, it also includes two vocal cancellation modules 42A, 42B and two high-pass modules 46A, 46B. The speaker modules 36A, 36B may each include a digital-to-analog converter, a power amplifier, a speaker, and so on, to convert the output signals Lo and Ro into sound waves.

Vocal cancellation by the signal module 34 proceeds as follows. The mono processing module 50 in the signal module 34 computes the average of the two channel signals Li and Ri to generate a mono (single-channel) signal M; that is, M = (Li + Ri)/2. The present invention uses this mono signal M to perform vocal cancellation separately on each channel signal. In the vocal cancellation module 42A, which corresponds to the channel signal Li, the mono signal M is subtracted from Li, and the difference forms the intermediate signal LVC (that is, LVC = Li - M). In the vocal cancellation module 42B, which corresponds to the channel signal Ri, the difference between Ri and the mono signal M forms the intermediate signal RVC (that is, RVC = Ri - M).
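The three formulas above translate directly into sample-wise arithmetic. The following is a minimal sketch, assuming the channel signals are equal-length lists of samples; the function name `vocal_cancel_per_channel` and the sample values in the usage lines are hypothetical.

```python
def vocal_cancel_per_channel(li, ri):
    """Return (M, LVC, RVC) for two equal-length lists of channel samples."""
    # Mono processing module 50: M = (Li + Ri) / 2
    m = [(a + b) / 2 for a, b in zip(li, ri)]
    # Vocal cancellation module 42A: LVC = Li - M
    lvc = [a - c for a, c in zip(li, m)]
    # Vocal cancellation module 42B: RVC = Ri - M
    rvc = [b - c for b, c in zip(ri, m)]
    return m, lvc, rvc


# Usage with made-up samples: a vocal shared by both channels plus
# different background music in each channel.
vocal = [1.0, 1.0, -1.0]
music_l = [0.25, -0.5, 0.5]
music_r = [-0.25, 0.5, 0.0]
li = [x + v for x, v in zip(music_l, vocal)]
ri = [x + v for x, v in zip(music_r, vocal)]
m, lvc, rvc = vocal_cancel_per_channel(li, ri)
```

In this example the shared `vocal` term is absent from `lvc` and `rvc`, which contain only half the difference between the two background music signals.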

As discussed with FIG. 2 and the related description, the vocal is usually mixed into each stereo channel signal with equal strength, so the mono signal M generated by averaging the two channel signals Li and Ri should contain the same vocal component as each channel signal. The present invention uses the vocal cancellation module for each channel to subtract this mono signal from that channel's signal, thereby performing vocal cancellation on each channel separately and suppressing the vocal in each channel signal. Unlike the known technique, the present invention performs vocal cancellation on each channel signal individually, so the intermediate signals generated from the different channel signals naturally differ. In the embodiment of FIG. 4, the intermediate signal LVC generated from the channel signal Li equals (Li - M), and the intermediate signal RVC generated from the other channel signal Ri equals (Ri - M), which naturally differs from LVC. As discussed earlier, the stereo effect is conveyed by the signal differences between the channel signals; in the present invention, the signal difference between Li and Ri that originally produced the stereo effect is retained in the vocal-cancelled intermediate signals LVC and RVC. The present invention mainly uses this signal difference between LVC and RVC to deliver a richer stereo effect after vocal cancellation than the known technique does. Note that, as FIG. 1 shows, the known vocal cancellation technique uses the same vocal cancellation module for the channel signals of both channels and takes a single intermediate signal as the basic vocal-cancelled result. By contrast, the present invention performs vocal cancellation separately on the channel signals of the different channels and generates different intermediate signals, which better preserves the signal differences that convey the stereo effect in the original channel signals.
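The cancellation can be checked under the idealised model of FIG. 2. The symbols $L_m$, $R_m$, and $V$ are introduced here only for illustration, as the background music and vocal signals whose spectra are Lmf, Rmf, and Vf.

```latex
\begin{aligned}
L_i &= L_m + V, \qquad R_i = R_m + V, \\
M &= \tfrac{1}{2}(L_i + R_i) = \tfrac{1}{2}(L_m + R_m) + V, \\
L_{VC} &= L_i - M = \tfrac{1}{2}(L_m - R_m), \\
R_{VC} &= R_i - M = \tfrac{1}{2}(R_m - L_m) = -L_{VC} .
\end{aligned}
```

Under this model the vocal $V$ is absent from both intermediate signals. The last line also shows that the two intermediate signals have equal magnitude and opposite polarity, which is the sense in which they differ from each other.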

As shown in FIG. 4, after the intermediate signals LVC and RVC are generated from the channel signals Li and Ri respectively, the signal module 34 can apply high-frequency and low-frequency compensation to LVC and RVC to generate the output signals Lo and Ro. The high-pass module 46A extracts the part of the channel signal Li that belongs to the high-frequency band (mainly the part above the mid-frequency band of the human voice; see FIG. 2 and the related description) as the high-pass signal Lh; the low-pass module 48 extracts the part of the signal S that belongs to the low-frequency band as the low-pass signal Sl. The signal S can be one of the channel signals Li and Ri, or the mono signal M. Mixing the high-pass signal Lh corresponding to the channel signal Li, the intermediate signal LVC, and the low-pass signal Sl in the mixing unit 52A is equivalent to performing high-frequency and low-frequency compensation on LVC, and produces the output signal Lo corresponding to the channel signal Li (that is, Lo=LVC+Sl+Lh). Similarly, the high-pass module 46B extracts the part of the channel signal Ri that belongs to the high-frequency band as the high-pass signal Rh, which provides high-frequency compensation for the intermediate signal RVC. Adding the intermediate signal RVC corresponding to the channel signal Ri, the high-pass signal Rh, and the low-pass signal Sl in the mixing unit 52B applies high-frequency and low-frequency compensation to RVC and forms the output signal Ro (that is, Ro=RVC+Sl+Rh).
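The mixing step described above is a sample-wise sum. The following minimal Python sketch illustrates only that arithmetic; the helper name `mix` and all sample values are made up for illustration and do not come from the patent.

```python
def mix(*signals):
    """Sample-wise sum of equal-length signals, as a mixing unit would do."""
    return [sum(samples) for samples in zip(*signals)]

# Made-up sample values, chosen only to illustrate the arithmetic.
LVC = [0.5, -0.5, 0.25]      # intermediate signal for the left channel
RVC = [-0.5, 0.5, -0.25]     # intermediate signal for the right channel
Sl = [0.125, 0.125, 0.125]   # low-pass signal shared by both outputs
Lh = [0.0, 0.25, -0.25]      # high-pass signal taken from Li
Rh = [0.25, 0.0, 0.5]        # high-pass signal taken from Ri

Lo = mix(LVC, Sl, Lh)  # Lo = LVC + Sl + Lh (mixing unit 52A)
Ro = mix(RVC, Sl, Rh)  # Ro = RVC + Sl + Rh (mixing unit 52B)
print(Lo, Ro)
```

Because the same Sl is added to both outputs, the low-frequency compensation itself adds no difference between Lo and Ro; the differences come from the intermediate and high-pass signals.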

Please continue with FIG. 5 (and refer to FIG. 4 as well). FIG. 5 is a schematic diagram of the spectra of the output signals Lo and Ro generated by the signal module 34 of FIG. 4; the horizontal axis of FIG. 5 is frequency and the vertical axis is spectral magnitude. Continuing the example of FIG. 2, and assuming that the spectra of the channel signals Li and Ri in FIG. 4 are the spectra Lf and Rf shown in FIG. 2, the spectra of the output signals Lo and Ro are the spectra Lof and Rof shown in FIG. 5 (for ease of comparison, FIG. 5 draws the spectrum Rof as a dashed line in the same coordinate system as the solid-line spectrum Lof; the frequencies fl and fh and the frequency bands BL, BM, and BH of FIG. 2 are also marked in FIG. 5). As FIG. 5 shows, because the present invention performs vocal cancellation separately for each channel signal, the signal differences that originally existed between the channel signals in the low-frequency band BL and the mid-frequency band BM are also retained between the output signals Lo and Ro, so the output signals differ not only in the high-frequency band BH but also in the low-frequency and mid-frequency bands. As a result, when the player 30 plays the output signals Lo and Ro of each channel through the speaker modules 36A and 36B respectively, the user hears accompaniment music that is richer and has a stronger stereo vocal effect than with the known technology, and enjoys a better sing-along environment.

Each functional block of the signal module 34 in FIG. 4 can be implemented as a hardware circuit, as firmware, or as software. For example, a typical player has a programmable signal processing circuit, so the present invention can be implemented as firmware: the program code that implements the technique is stored in the player's memory (for example, non-volatile memory), and the vocal cancellation function is realized when the signal processing circuit executes that code. Likewise, computers commonly use a playback program together with suitable peripheral devices (such as a sound card or optical drive) to play songs; the present invention can also be implemented as software in such a playback program to remove vocals and provide background music for singing along. Please refer to FIG. 6 (and to FIG. 4 as well). The program code 100 in FIG. 6 implements the vocal cancellation function of the present invention. The array variables x_L and x_R represent the channel signals Li and Ri of the different channels (see FIG. 4), and the array variable Mono represents the mono signal M. The subroutine Hi_Pass implements the high-pass filter module and Low_Pass implements the low-pass module. The array variables h_L and h_R represent the high-pass signals Lh and Rh, the array variable low represents the low-pass signal Sl, and the array variables L_out and R_out represent the output signals Lo and Ro. The integer index j in the program code 100 selects the j-th element of an array variable, that is, the sample of the corresponding signal at the j-th time point. As the program code 100 shows, the mono signal represented by Mono is the average of the channel-signal variables x_L and x_R, and the high-pass filtering results of x_L and x_R are stored in h_L and h_R respectively. The channel signal Ri, represented by x_R, serves as the signal S of FIG. 4, and the low-pass signal Sl produced by low-pass filtering it is represented by low. Finally, the intermediate signals LVC and RVC are realized by the operations x_L[j]-Mono[j] and x_R[j]-Mono[j] in the program code 100; adding the low-frequency compensation variable low and the high-frequency compensation variables h_L and h_R yields the vocal-cancelled output signals, which are stored in L_out and R_out respectively.
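The processing described for program code 100 can be sketched in Python as follows. This is not the code of FIG. 6, which is not reproduced in the text; it is a minimal sketch under stated assumptions. The variable names x_L, x_R, h_L, h_R, low, L_out, and R_out follow the description, while the one-pole filters, the function names `low_pass`, `hi_pass`, and `cancel_vocals`, and the coefficients `lp_alpha` and `hp_alpha` are illustrative stand-ins, since the patent does not specify how Hi_Pass and Low_Pass are designed.

```python
def low_pass(x, alpha):
    """One-pole low-pass filter (illustrative stand-in for Low_Pass)."""
    y, state = [], 0.0
    for sample in x:
        state += alpha * (sample - state)
        y.append(state)
    return y


def hi_pass(x, alpha):
    """Input minus its one-pole low-pass (illustrative stand-in for Hi_Pass)."""
    return [s - l for s, l in zip(x, low_pass(x, alpha))]


def cancel_vocals(x_L, x_R, lp_alpha=0.05, hp_alpha=0.5):
    """Return (L_out, R_out) for two equal-length lists of samples."""
    # Mono signal M: average of the two channel signals.
    mono = [(l + r) / 2 for l, r in zip(x_L, x_R)]
    # High-frequency compensation signals Lh and Rh.
    h_L = hi_pass(x_L, hp_alpha)
    h_R = hi_pass(x_R, hp_alpha)
    # Low-frequency compensation Sl; x_R plays the role of signal S.
    low = low_pass(x_R, lp_alpha)
    # Intermediate signals (x - mono) plus low- and high-frequency compensation.
    L_out = [x_L[j] - mono[j] + low[j] + h_L[j] for j in range(len(mono))]
    R_out = [x_R[j] - mono[j] + low[j] + h_R[j] for j in range(len(mono))]
    return L_out, R_out


L_out, R_out = cancel_vocals([1.0, 0.0, -1.0, 0.5], [0.0, 1.0, 0.5, -1.0])
```

In a real player the filter cutoffs would be chosen so that the high-pass band lies above the vocal range and the low-pass band below it; the coefficients here are arbitrary placeholders.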

In the known vocal cancellation technology, the output signals of the different channels all use the same intermediate signal as the main result of vocal cancellation. Apart from the signal difference introduced by high-frequency compensation, the output signals therefore show no obvious difference in the low-frequency and mid-frequency bands, so the channel output signals produced by the prior art cannot present a good stereo vocal effect. In contrast, the present invention performs vocal cancellation on the signal of each channel separately, so the signal differences between the original channel signals are preserved more completely in the output signals of each channel. When the output signals of the different channels are played by different speaker modules, they present a stereo vocal effect better than that of the prior art, letting the user enjoy better stereo accompaniment in a sing-along system. Besides the optical disc player of FIG. 4, the technique of the present invention can be applied to other kinds of players; for example, the sound source circuit in FIG. 4 can be a network module that obtains song data signals over a wired or wireless network and parses out the channel signals.
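The comparison can be made concrete by writing out the difference between the two outputs in each scheme. This is an illustrative sketch: V is a hypothetical symbol for the shared intermediate signal of the known technique, whose exact form is not restated here, and the same low-pass signal Sl is assumed to be mixed into both outputs in both schemes.

```latex
\begin{aligned}
\text{Known technique:}\quad
L_o - R_o &= (V + S_l + L_h) - (V + S_l + R_h) = L_h - R_h \\[4pt]
\text{Present invention:}\quad
L_o - R_o &= (L_{VC} + S_l + L_h) - (R_{VC} + S_l + R_h) \\
          &= (L_i - M) - (R_i - M) + (L_h - R_h) \\
          &= (L_i - R_i) + (L_h - R_h)
\end{aligned}
```

Under these assumptions, the outputs of the known technique differ only by the high-pass terms, whereas the outputs of the present invention also carry the full-band difference between Li and Ri, whatever form the mono signal M takes.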

The above descriptions are only preferred embodiments of the present invention; all equivalent changes and modifications made according to the scope of the claims of the present invention shall fall within the scope of the patent of the present invention.

Claims (12) Translated from Chinese

1. A method for producing stereo sound during vocal cancellation, which provides a first output signal and a second output signal according to a first channel signal and a second channel signal, the method comprising:
generating a mono signal according to a synthesis result of the first channel signal and the second channel signal;
a first high-pass filtering step of high-pass filtering the first channel signal according to a preset high-frequency band and generating a corresponding first high-pass signal, so that the signal frequency of the first high-pass signal is substantially concentrated in the high-frequency band;
a second high-pass filtering step of high-pass filtering the second channel signal according to the high-frequency band and generating a corresponding second high-pass signal, so that the signal frequency of the second high-pass signal is substantially concentrated in the high-frequency band;
a first vocal cancellation step of generating a first intermediate signal according to the difference between the first channel signal and the mono signal;
a second vocal cancellation step of generating a second intermediate signal according to the difference between the second channel signal and the mono signal;
a first mixing step of mixing the first intermediate signal and the first high-pass signal to generate the first output signal; and
a second mixing step of mixing the second intermediate signal and the second high-pass signal to generate the second output signal, so that the first output signal and the second output signal also differ substantially in the part of their frequencies outside the high-frequency band.

2. The method of claim 1, further comprising:
generating a low-pass signal according to a preset low-frequency band, so that the signal frequency of the low-pass signal is substantially concentrated in the low-frequency band;
wherein the first mixing step mixes the first intermediate signal, the first high-pass signal, and the low-pass signal to generate the first output signal, and the second mixing step mixes the second intermediate signal, the second high-pass signal, and the low-pass signal to generate the second output signal.

3. The method of claim 2, wherein, when the low-pass signal is generated according to the low-frequency band, the first channel signal or the second channel signal is low-pass filtered according to the low-frequency band to generate the low-pass signal.

4. The method of claim 2, wherein, when the low-pass signal is generated according to the low-frequency band, the mono signal is low-pass filtered according to the low-frequency band to generate the low-pass signal.

5. The method of claim 1, wherein the frequency range of the high-frequency band is higher than the frequency range of the human voice.

6. A player comprising:
a sound source circuit for providing a first channel signal and a second channel signal; and
a signal module for performing vocal cancellation processing on the first channel signal and the second channel signal and providing a stereo first output signal and second output signal, the signal module comprising:
a mono processing module for generating a mono signal according to a synthesis result of the first channel signal and the second channel signal;
a first high-pass module for high-pass filtering the first channel signal according to a preset high-frequency band and generating a corresponding first high-pass signal, so that the signal frequency of the first high-pass signal is substantially concentrated in the high-frequency band;
a second high-pass module for high-pass filtering the second channel signal according to the high-frequency band and generating a corresponding second high-pass signal, so that the signal frequency of the second high-pass signal is substantially concentrated in the high-frequency band;
a first vocal cancellation module for generating a first intermediate signal according to the difference between the first channel signal and the mono signal;
a second vocal cancellation module for generating a second intermediate signal according to the difference between the second channel signal and the mono signal;
a first mixing unit for mixing the first intermediate signal and the first high-pass signal to generate the first output signal; and
a second mixing unit for mixing the second intermediate signal and the second high-pass signal to generate the second output signal, so that the first output signal and the second output signal also differ substantially in the part of their frequencies outside the high-frequency band.

7. The player of claim 6, further comprising:
a low-pass module for generating a low-pass signal according to a preset low-frequency band, the signal frequency of the low-pass signal being substantially concentrated in the low-frequency band;
wherein the first mixing unit mixes the first intermediate signal, the first high-pass signal, and the low-pass signal to generate the first output signal, and the second mixing unit mixes the second intermediate signal, the second high-pass signal, and the low-pass signal to generate the second output signal.

8. The player of claim 7, wherein the low-pass module low-pass filters the first channel signal or the second channel signal according to the low-frequency band to generate the low-pass signal.

9. The player of claim 7, wherein the low-pass module low-pass filters the mono signal according to the low-frequency band to generate the low-pass signal.

10. The player of claim 6, wherein the frequency range of the high-frequency band is higher than the frequency range of the human voice.

11. The player of claim 6, wherein the sound source circuit can read signals from an optical disc to form the first channel signal and the second channel signal.

12. The player of claim 6, further comprising:
a first speaker module for converting the first output signal into sound waves for playback; and
a second speaker module for converting the second output signal into sound waves for playback.

CNB031557627A 2003-09-01 2003-09-01 Stereo human voice cancellation method and related device Expired - Fee Related CN100353813C (en)

Priority Applications (1)
Application Number Priority Date Filing Date Title
CNB031557627A CN100353813C (en) 2003-09-01 2003-09-01 Stereo human voice cancellation method and related device

Applications Claiming Priority (1)
Application Number Priority Date Filing Date Title
CNB031557627A CN100353813C (en) 2003-09-01 2003-09-01 Stereo human voice cancellation method and related device

Family ID=34598192

Family Applications (1)
Application Number Title Priority Date Filing Date
CNB031557627A Expired - Fee Related CN100353813C (en) 2003-09-01 2003-09-01 Stereo human voice cancellation method and related device

Families Citing this family (6)
* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100636248B1 (en) * 2005-09-26 2006-10-19 삼성전자주식회사 Vocal Removal Device and Method
CN101577117B (en) * 2009-03-12 2012-04-11 无锡中星微电子有限公司 Extraction method and device of accompaniment music
CN101944355B (en) * 2009-07-03 2013-05-08 深圳Tcl新技术有限公司 Obbligato music generation device and realization method thereof
CN101894559B (en) * 2010-08-05 2012-06-06 展讯通信(上海)有限公司 Audio processing method and device thereof
CN109429167B (en) * 2017-08-31 2020-10-13 瑞昱半导体股份有限公司 Audio enhancement device and method
US10491179B2 (en) * 2017-09-25 2019-11-26 Nuvoton Technology Corporation Asymmetric multi-channel audio dynamic range processing

Patent Citations (3)
* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW437256B (en) * 1999-03-12 2001-05-28 Ind Tech Res Inst Apparatus and method for virtual sound enhancement
CN1277530A (en) * 1999-06-09 2000-12-20 皇家菲利浦电子有限公司 Stereo signal processing device
CN1327705A (en) * 1999-06-22 2001-12-19 皇家菲利浦电子有限公司 Stereophonic signal processing apparatus

Similar Documents
Publication Publication Date Title
TWI489887B (en) 2015-06-21 Virtual audio processing for loudspeaker or headphone playback
KR100458021B1 (en) 2004-11-26 Multi-channel audio enhancement system for use in recording and playback and methods for providing same
EP2974010B1 (en) 2021-08-18 Automatic multi-channel music mix from multiple audio stems
KR101569032B1 (en) 2015-11-13 A method and an apparatus of decoding an audio signal
AU2012257865B2 (en) 2015-07-09 Apparatus and method and computer program for generating a stereo output signal for providing additional output channels
CN100353813C (en) 2007-12-05 Stereo human voice cancellation method and related device
JP2006025439A (en) 2006-01-26 Apparatus and method for creating 3d sound
JP7256164B2 (en) 2023-04-11 Audio processing device and audio processing method
EP2566195B1 (en) 2017-08-16 Speaker apparatus
KR100849030B1 (en) 2008-07-29 3D sound Reproduction Apparatus using Virtual Speaker Technique under Plural Channel Speaker Environments
US6909787B2 (en) 2005-06-21 Method and related apparatus for stereo vocal cancellation
Floros et al. 2011 Spatial enhancement for immersive stereo audio applications
KR100802339B1 (en) 2008-02-13 Stereo sound playback device and method using virtual speaker technology in stereo speaker environment
Kunchur 2021 3D imaging in two-channel stereo sound: Portrayal of elevation
Bhalani et al. 2012 Karaoke Machine implementation and validation using Out of Phase Stereo method
JP7332745B2 (en) 2023-08-23 Speech processing method and speech processing device
WO2024216494A1 (en) 2024-10-24 Method for multichannel audio reconstruction and speaker system using the method
Lopes 2023 Instrument Position in Immersive Audio: A Study on Good Practices and Comparison With Stereo Approaches
TWI262738B (en) 2006-09-21 Expansion method of multi-channel panoramic audio effect
Sjöholm 2023 Evaluation of ambisonic microphone techniques in conjunction with spot-microphones for 360-degree video within an acoustic environment
JP2023545547A (en) 2023-10-30 Sound reproduction by multi-order HRTF between the left and right ears
JP2005250199A (en) 2005-09-15 Audio equipment
RU2384973C1 (en) 2010-03-20 Device and method for synthesising three output channels using two input channels
Ciesla 0 Sound and Music for Games
JPH03274096A (en) 1991-12-05 'karaoke' (recorded orchestral accompaniment) player

Legal Events
Date Code Title Description
2005-03-09 C06 Publication
2005-03-09 PB01 Publication
2005-05-11 ASS Succession or assignment of patent right

Owner name: MEDIATEK INC.

Free format text: FORMER OWNER: YANGZHI SCIENCE + TECHNOLOGY CO. LTD.

Effective date: 20050408

2005-05-11 C10 Entry into substantive examination
2005-05-11 C41 Transfer of patent application or patent right or utility model
2005-05-11 SE01 Entry into force of request for substantive examination
2005-05-11 TA01 Transfer of patent application right

Effective date of registration: 20050408

Address after: Hsinchu Science Industrial Park, Hsinchu County, Taiwan

Applicant after: MEDIATEK Inc.

Address before: Taipei County of Taiwan Province

Applicant before: ALI CORPORATION

2007-12-05 C14 Grant of patent or utility model
2007-12-05 GR01 Patent grant
2023-09-15 CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20071205

