A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://patents.google.com/patent/CN103618986B/en below:

CN103618986B - The extracting method of source of sound acoustic image body and device in a kind of 3d space

CN103618986B - The extracting method of source of sound acoustic image body and device in a kind of 3d space - Google PatentsThe extracting method of source of sound acoustic image body and device in a kind of 3d space Download PDF Info
Publication number
CN103618986B
CN103618986B CN201310580928.7A CN201310580928A CN103618986B CN 103618986 B CN103618986 B CN 103618986B CN 201310580928 A CN201310580928 A CN 201310580928A CN 103618986 B CN103618986 B CN 103618986B
Authority
CN
China
Prior art keywords
acoustic image
centerdot
source
sound
eta
Prior art date
2013-11-19
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310580928.7A
Other languages
Chinese (zh)
Other versions
CN103618986A (en
Inventor
江游
黄莉苹
王恒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Xinyidai Information Technology Research Institute Co Ltd
Original Assignee
Shenzhen Xinyidai Information Technology Research Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2013-11-19
Filing date
2013-11-19
Publication date
2015-09-30
2013-11-19 Application filed by Shenzhen Xinyidai Information Technology Research Institute Co Ltd filed Critical Shenzhen Xinyidai Information Technology Research Institute Co Ltd
2013-11-19 Priority to CN201310580928.7A priority Critical patent/CN103618986B/en
2014-03-05 Publication of CN103618986A publication Critical patent/CN103618986A/en
2014-06-04 Priority to US14/422,070 priority patent/US9646617B2/en
2014-06-04 Priority to PCT/CN2014/079177 priority patent/WO2015074400A1/en
2015-09-30 Application granted granted Critical
2015-09-30 Publication of CN103618986B publication Critical patent/CN103618986B/en
Status Active legal-status Critical Current
2033-11-19 Anticipated expiration legal-status Critical
Links Classifications Landscapes Abstract

The invention provides extracting method and the device of source of sound acoustic image body in a kind of 3d space, comprise the locus determining source of sound acoustic image, according to the locus (ρ, μ, η) of gained source of sound acoustic image, determine the loud speaker of source of sound acoustic image place spatial proximity; Calculate the correlation of selected loud speaker each sound channel signal in the horizontal and vertical directions, obtain the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.The expression parameter that the present invention obtains acoustic image body is that the size recovering source of sound acoustic image in 3D live audio system accurately provides technical guarantee, solves the technical barrier that the acoustic image of current 3D Audio recovery is too narrow and small.

Description The extracting method of source of sound acoustic image body and device in a kind of 3d space

Technical field

The invention belongs to field of acoustics, particularly relate to extracting method and the device of source of sound acoustic image body in 3d space.

Background technology

In the end of the year 2009,3D film " A Fanda " climbs up top box-office value in more than 30 country in the whole world, and at the beginning of 2010 9 months, accumulative box office, the whole world is more than 2,700,000,000 dollars.Why " A Fanda " can obtain so brilliant box office achievement, be it have employed brand-new 3D special effect making technology and bring the effect of the shock on people's sense organ.The gorgeous picture that " A Fanda " represents and sound effect true to nature have not only shaken spectators, also made industry have the asserting of " film enters the 3D epoch ".Moreover, it also will expedite the emergence of technology and the standard of more relevant video display, recording, broadcasting aspect.In the international consumption electronic product exhibition that in January, 2010 holds at Las Vegas, US, the TV new product band that each colour TV giant reveals one after another gives people new expectation---and 3D has become the new focus of global Ge great colour TV manufacturer competition.Want to reach better audiovisual experience, need the 3D sound field auditory effect synchronous with 3D video content, could really reach hearing experience on the spot in person.Early stage 3D audio system (as Ambisonics system), due to its complex structure, requires higher to collection and playback apparatus, is difficult to promote practicality.Japanese NHK company is proposed 22.2 sound channel systems in recent years, by the 3D sound field that 24 loudspeaker reproduction are original.MPEG in 2011 sets about the international standard formulating 3D audio frequency, wishes by fewer loud speaker or earphone to reduce 3D sound field, so that can by this Technique Popularizing to ordinary family user while reaching certain code efficiency.The 3D audio frequency and video technology study hotspot having become multimedia technology field and the important directions further developed as can be seen here.

But traditional 3D audio frequency only focuses on locus or the physics sound field of recovering source of sound, and not for the size of the acoustic image of source of sound, particularly acoustic image body recovers.In order to reach better hearing effect, need the size recovering source of sound acoustic image accurately, simultaneously for the ease of the process of the systems such as encoding and decoding, also need to find the expression parameter expressing source of sound acoustic image body, so just by also perfectly original sound image can be recovered after the process of 3D audio system.

Summary of the invention

The present invention is directed to the deficiencies in the prior art, propose extracting method and the device of source of sound acoustic image body in a kind of 3d space.

Technical scheme provided by the invention provides the extracting method of source of sound acoustic image body in a kind of 3d space, comprises the following steps:

Step 1, determine the locus of source of sound acoustic image, implementation is as follows,

The signal of each sound channel is carried out time-frequency conversion, identical sub-band division is carried out to each sound channel; Take auditor as spheric coordinate system initial point, to being positioned at horizontal angle μ iwith elevation angle η iloud speaker, if vector p i(k, n) represents the time-frequency representation of corresponding signal,

p i ( k , n ) = g i ( k , n ) cos μ i · cos η i sin μ i · cos η i sin η i

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point;

The horizontal angle μ of source of sound acoustic image and elevation angle η adopts following formulae discovery,

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i

tan η ( k , n ) = [ Σ u = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of the n-th frame kth frequency band source of sound acoustic image and elevation angle η;

Source of sound acoustic image gets the average distance of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point;

Step 2, according to the locus (ρ, μ, η) of step 1 gained source of sound acoustic image, determines the loud speaker of source of sound acoustic image place spatial proximity;

Step 3, the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by calculation procedure 2, implementation is as follows:

Selected loud speaker is divided into left and right two parts according to acoustic image position, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, calculates the right and left signal component sum vertical with this projection plane respectively, be designated as P land P r, calculate the correlation IC of the right and left signal hit is as follows,

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R )

Selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, calculates the component sum that upper and lower both sides signal is vertical with this projection plane respectively, be designated as P uand P d, calculate the correlation IC of upper and lower both sides signal vit is as follows,

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D )

Step 4, obtains the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.

The present invention is the corresponding extraction element providing source of sound acoustic image body in a kind of 3d space also, comprises with lower unit:

Locus extraction unit, for determining the locus of source of sound acoustic image, implementation is as follows,

The signal of each sound channel is carried out time-frequency conversion, identical sub-band division is carried out to each sound channel; Take auditor as spheric coordinate system initial point, to being positioned at horizontal angle μ iwith elevation angle η iloud speaker, if vector p i(k, n) represents the time-frequency representation of corresponding signal,

p i ( k , n ) = g i ( k , n ) cos μ i · cos η i sin μ i · cos η i sin η i

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point;

The horizontal angle μ of source of sound acoustic image and elevation angle η adopts following formulae discovery,

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i

tan η ( k , n ) = [ Σ u = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of the n-th frame kth frequency band source of sound acoustic image and elevation angle η;

Source of sound acoustic image gets the average distance of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point;

Unit chosen by loud speaker, for the locus (ρ, μ, η) according to locus extraction unit gained source of sound acoustic image, determines the loud speaker of source of sound acoustic image place spatial proximity;

Correlation extraction unit, choose the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by unit for calculating loud speaker, implementation is as follows,

Selected loud speaker is divided into left and right two parts according to acoustic image position, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, calculates the right and left signal component sum vertical with this projection plane respectively, be designated as P land P r, calculate the correlation IC of the right and left signal hit is as follows,

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R )

Selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, calculates the component sum that upper and lower both sides signal is vertical with this projection plane respectively, be designated as P uand P d, calculate the correlation IC of upper and lower both sides signal vit is as follows,

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D )

Acoustic image bulk properties storage unit, for obtaining the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.

The acoustic image body of source of sound refers in the 3 d space relative to the size in the front and back/degree of depth of acoustic image auditor, left and right/length and up and down/height three dimensions.The present invention is directed to the 3D audio system of multichannel, by utilizing the correlation between different sound channel to describe the size of source of sound acoustic image body from three dimensions.The expression parameter that the present invention obtains acoustic image body is that the size recovering source of sound acoustic image in 3D live audio system accurately provides technical guarantee, solves the technical barrier that the acoustic image of current 3D Audio recovery is too narrow and small.

Accompanying drawing explanation

Fig. 1 is loudspeaker position and the calculated signals relation schematic diagram of the embodiment of the present invention.

Embodiment

Below in conjunction with drawings and Examples, the invention will be further described.

Technical scheme of the present invention can realize automatic operational process by those skilled in the art based on computer software technology.Described in the flow process of embodiment is specific as follows:

Step 1, determines the locus of source of sound acoustic image, take auditor as the origin of coordinates, and the spherical coordinate of loud speaker can be set to (ρ, μ, η), and ρ is the distance of loud speaker to spheric coordinate system initial point, and μ is horizontal angle, and η is elevation angle, as shown in Figure 1.

Take auditor as reference point, Orthogonal Decomposition is carried out to each sound channel signal of multi-channel system, obtain the X of each sound channel in 3d space cartesian coordinate system, the component on Y and Z axis.The component of each sound channel is the decomposition of former single-tone source in this sound channel.Therefore after the component on the X obtaining each sound channel, Y and Z axis, respectively each component is added, the component of former single-tone source for listener location can be obtained.Being implemented as follows of embodiment:

First the signal of each sound channel is carried out time-frequency conversion, carry out identical sub-band division to each sound channel, available prior art carries out time-frequency conversion and sub-band division.

Because generally there is multiple loud speaker, the spherical coordinate of each loud speaker (ρ, μ, η) can be pressed index value respectively as subscript, be designated as (ρ i, μ i, η i).Consider that is positioned at a horizontal angle μ i, elevation angle η iloud speaker, can with a vector p i(k, n) represents the time-frequency representation of the corresponding sound channel signal of loud speaker, computing formula as the formula (1):

p i ( k , n ) = g i ( k , n ) cos μ i · cos η i sin μ i · cos η i sin η i · · · ( 1 )

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point.The azimuth of source of sound acoustic image also can be divided into horizontal angle μ and elevation angle η, and through type (2), formula (3) calculate:

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i · · · ( 2 )

tan η ( k , n ) = [ Σ u = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i · · · ( 3 )

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of the n-th frame kth frequency band source of sound acoustic image and elevation angle η.

So just can obtain horizontal angle μ and the elevation angle η of source of sound acoustic image, because loud speaker is generally arrange centered by auditor, source of sound acoustic image roughly gets the distance ρ of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point imean value, usual ρ=ρ 1=ρ 2=...=ρ n.

Step 2, determines the loud speaker of source of sound acoustic image place spatial proximity.

After determining the locus (ρ, μ, η) rebuilding source of sound acoustic image, find out the loud speaker near it according to its position.During concrete enforcement, can first according to each loud speaker (ρ i, μ i, η i) sort from the near to the remote to source of sound acoustic image, the loud speaker that then selected distance is near, can select flexibly according to actual conditions, generally chooses 4-8 and is advisable.

Step 3, the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by calculation procedure 2, this correlation can represent acoustic image size in the horizontal and vertical directions.

Selected loud speaker is divided into left and right two parts according to acoustic image position, if P ifor the frequency domain value of i-th sound channel of source of sound, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, calculating the right and left signal component sum vertical with this projection plane respectively, is P land P r.Namely from loud speaker selected by step 2, be taken at all loud speakers on the left side, acoustic image position, obtain the respective tones thresholding P of each loud speaker icomponent vertical with this projection plane respectively, then summation obtains P l; From loud speaker selected by step 2, be taken at all loud speakers on the right of acoustic image position, obtain the respective tones thresholding P of each loud speaker icomponent vertical with this projection plane respectively, then summation obtains P r.Calculate the correlation IC of the right and left signal h, as the formula (4):

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R ) · · · ( 4 )

Equally selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, this plane is vertical with above-mentioned middle vertical plane, calculates the component sum that upper and lower both sides signal is vertical with this projection plane respectively, is P uand P d, from loud speaker selected by step 2, be namely taken at all loud speakers of top, acoustic image position, obtain the respective tones thresholding P of each loud speaker icomponent vertical with this projection plane respectively, then summation obtains P u; From loud speaker selected by step 2, be taken at the following all loud speakers in acoustic image position, obtain the respective tones thresholding P of each loud speaker icomponent vertical with this projection plane respectively, then summation obtains P d.Then the correlation IC of upper and lower both sides signal is calculated v, as the formula (5):

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D ) · · · ( 5 )

So just obtain the expression parameter of acoustic image size on horizontal and vertical direction, the perception of adjusting the distance due to people is sensitive not, and therefore distance parameter can IC hand IC vin smaller value represent, i.e. Min{IC h, IC v.

By above method, can according to the horizontal angle μ of the source of sound acoustic image of each frequency band of every frame signal and elevation angle η, the corresponding acoustic image body obtaining each frequency band of every frame signal.

During concrete enforcement, the acoustic image body available parameter collection { IC extracted h, IC v, Min{IC h, IC vrepresent and store, for recovery source of sound acoustic image.

Technical solution of the present invention also can adopt software modularity technology, is embodied as device.The corresponding extraction element providing source of sound acoustic image body in a kind of 3d space of the embodiment of the present invention, comprises with lower unit:

Locus extraction unit, for determining the locus of source of sound acoustic image, implementation is as follows,

The signal of each sound channel is carried out time-frequency conversion, identical sub-band division is carried out to each sound channel; Take auditor as spheric coordinate system initial point, to being positioned at horizontal angle μ iwith elevation angle η iloud speaker, if vector p i(k, n) represents the time-frequency representation of corresponding signal,

p i ( k , n ) = g i ( k , n ) cos μ i · cos η i sin μ i · cos η i sin η i

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point;

The horizontal angle μ of source of sound acoustic image and elevation angle η adopts following formulae discovery,

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i

tan η ( k , n ) = [ Σ u = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of source of sound acoustic image and elevation angle η;

Source of sound acoustic image gets the average distance of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point;

Unit chosen by loud speaker, for the locus (ρ, μ, η) according to locus extraction unit gained source of sound acoustic image, determines the loud speaker of source of sound acoustic image place spatial proximity;

Correlation extraction unit, choose the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by unit for calculating loud speaker, implementation is as follows,

Selected loud speaker is divided into left and right two parts according to acoustic image position, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, calculates the right and left signal component sum vertical with this projection plane respectively, be designated as P land P r, calculate the correlation IC of the right and left signal hit is as follows,

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R )

Selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, calculates the component sum that upper and lower both sides signal is vertical with this projection plane respectively, be designated as P uand P d, calculate the correlation IC of upper and lower both sides signal vit is as follows,

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D )

Acoustic image bulk properties storage unit, for obtaining the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.Adopt IC h, IC v, Min{IC h, IC videntify characteristic in the front and back/degree of depth of acoustic image, left and right/length and up and down/height three dimensions respectively.

Above-mentioned example of the present invention is only and illustrates that method of the present invention realizes; any people being familiar with this technology is in the technical scope disclosed by the present invention; all can expect its change easily and replace, therefore scope all should be encompassed within the protection range that limited by claims.

Claims (2)

1. the extracting method of source of sound acoustic image body in 3d space, is characterized in that, comprise the following steps:

Step 1, determine the locus of source of sound acoustic image, implementation is as follows,

The signal of each sound channel is carried out time-frequency conversion, identical sub-band division is carried out to each sound channel; Take auditor as spheric coordinate system initial point, to being positioned at horizontal angle μ iwith elevation angle η iloud speaker, if vector p i(k, n) represents the time-frequency representation of corresponding signal,

p i ( k , n ) = g i ( k , n ) · cos μ i · cos η i sin μ i · cos η i sin η i

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point;

The horizontal angle μ of source of sound acoustic image and elevation angle η adopts following formulae discovery,

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i

tan η ( k , n ) = [ Σ i = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of the n-th frame kth frequency band source of sound acoustic image and elevation angle η;

Source of sound acoustic image gets the average distance of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point;

Step 2, according to the locus (ρ, μ, η) of step 1 gained source of sound acoustic image, determines the loud speaker of source of sound acoustic image place spatial proximity;

Step 3, the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by calculation procedure 2, implementation is as follows,

Selected loud speaker is divided into left and right two parts according to acoustic image position, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, the component sum that the right and left signal that all loud speakers and all loud speakers on the right of acoustic image position that calculate the left side, acoustic image position respectively produce respectively is vertical with this projection plane, is designated as P land P r, calculate the correlation IC of the right and left signal hit is as follows,

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R )

Selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, the component sum that the signal of both sides up and down that all loud speakers that all loud speakers of calculating top, acoustic image position are following with acoustic image position respectively produce respectively is vertical with this projection plane, is designated as P uand P d, calculate the correlation IC of upper and lower both sides signal vit is as follows,

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D )

Step 4, obtains the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.

2. the extraction element of source of sound acoustic image body in 3d space, is characterized in that, comprise with lower unit:

Locus extraction unit, for determining the locus of source of sound acoustic image, implementation is as follows,

The signal of each sound channel is carried out time-frequency conversion, identical sub-band division is carried out to each sound channel; Take auditor as spheric coordinate system initial point, to being positioned at horizontal angle μ iwith elevation angle η iloud speaker, if vector p i(k, n) represents the time-frequency representation of corresponding signal,

p i ( k , n ) = g i ( k , n ) · cos μ i · cos η i sin μ i · cos η i sin η i

Wherein, i is the index value of loud speaker, and k is band index, and n is time domain frame number index, g i(k, n) is the strength information of frequency domain point;

The horizontal angle μ of source of sound acoustic image and elevation angle η adopts following formulae discovery,

tan μ ( k , n ) = Σ i = 1 N g i ( k , n ) · cos μ i · cos η i Σ i = 1 N g i ( k , n ) · sin μ i · cos η i

tan η ( k , n ) = [ Σ i = 1 N g i ( k , n ) · cos μ i · cos η i ] 2 + [ Σ i = 1 N g i ( k , n ) · sin μ i · cos η i ] 2 Σ i = 1 N g i ( k , n ) · sin η i

Wherein, N is the sum of loud speaker, and the value of i is 1,2 ... N, μ (k, n), η (k, n) the i.e. horizontal angle μ of the n-th frame kth frequency band source of sound acoustic image and elevation angle η;

Source of sound acoustic image gets the average distance of all loud speakers to auditor to the distance ρ of spheric coordinate system initial point;

Unit chosen by loud speaker, for the locus (ρ, μ, η) according to locus extraction unit gained source of sound acoustic image, determines the loud speaker of source of sound acoustic image place spatial proximity;

Correlation extraction unit, choose the correlation of each sound channel signal in the horizontal and vertical directions of loud speaker selected by unit for calculating loud speaker, implementation is as follows,

Selected loud speaker is divided into left and right two parts according to acoustic image position, with the middle vertical plane at source of sound acoustic image and auditor place for projection plane, the component sum that the right and left signal that all loud speakers and all loud speakers on the right of acoustic image position that calculate the left side, acoustic image position respectively produce respectively is vertical with this projection plane, is designated as P land P r, calculate the correlation IC of the right and left signal hit is as follows,

IC H = cov ( P L , P R ) cov ( P L , P L ) · cov ( P R , P R )

Selected loud speaker is divided into upper and lower two parts according to acoustic image position, with the plane at source of sound acoustic image and auditor place for projection plane, the component sum that the signal of both sides up and down that all loud speakers that all loud speakers of calculating top, acoustic image position are following with acoustic image position respectively produce respectively is vertical with this projection plane, is designated as P uand P d, calculate the correlation IC of upper and lower both sides signal vit is as follows,

IC V = cov ( P U , P D ) cov ( P U , P U ) · cov ( P D , P D )

Acoustic image bulk properties storage unit, for obtaining the parameter set { IC of acoustic image body h, IC v, Min{IC h, IC vand preserve, wherein Min{IC h, IC vbe IC hand IC vin smaller value.

CN201310580928.7A 2013-11-19 2013-11-19 The extracting method of source of sound acoustic image body and device in a kind of 3d space Active CN103618986B (en) Priority Applications (3) Application Number Priority Date Filing Date Title CN201310580928.7A CN103618986B (en) 2013-11-19 2013-11-19 The extracting method of source of sound acoustic image body and device in a kind of 3d space US14/422,070 US9646617B2 (en) 2013-11-19 2014-06-04 Method and device of extracting sound source acoustic image body in 3D space PCT/CN2014/079177 WO2015074400A1 (en) 2013-11-19 2014-06-04 Method and apparatus for extracting acoustic image body of sound source in 3d space Applications Claiming Priority (1) Application Number Priority Date Filing Date Title CN201310580928.7A CN103618986B (en) 2013-11-19 2013-11-19 The extracting method of source of sound acoustic image body and device in a kind of 3d space Publications (2) Family ID=50169690 Family Applications (1) Application Number Title Priority Date Filing Date CN201310580928.7A Active CN103618986B (en) 2013-11-19 2013-11-19 The extracting method of source of sound acoustic image body and device in a kind of 3d space Country Status (3) Families Citing this family (11) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title CN103618986B (en) * 2013-11-19 2015-09-30 深圳市新一代信息技术研究院有限公司 The extracting method of source of sound acoustic image body and device in a kind of 3d space CN104064194B (en) * 2014-06-30 2017-04-26 武汉大学 Parameter coding/decoding method and parameter coding/decoding system used for improving sense of space and sense of distance of three-dimensional audio frequency CN105657633A (en) 2014-09-04 2016-06-08 杜比实验室特许公司 Method for generating metadata aiming at audio object CN104270700B (en) * 2014-10-11 2017-09-22 武汉轻工大学 The generation method of pan, apparatus and system in 3D audios WO2016210174A1 (en) 2015-06-25 2016-12-29 Dolby Laboratories Licensing Corporation Audio panning transformation system and method US10579879B2 (en) * 2016-08-10 2020-03-03 Vivint, Inc. Sonic sensing WO2018076387A1 (en) * 2016-10-31 2018-05-03 华为技术有限公司 Directional sound recording device and electronic device US11341952B2 (en) 2019-08-06 2022-05-24 Insoundz, Ltd. System and method for generating audio featuring spatial representations of sound sources CN117061983A (en) * 2021-03-05 2023-11-14 华为技术有限公司 Virtual speaker set determining method and device CN114025287B (en) * 2021-10-29 2023-02-17 歌尔科技有限公司 Audio output control method, system and related components US12254540B2 (en) * 2022-08-31 2025-03-18 Sonaria 3D Music, Inc. Frequency interval visualization education and entertainment system and method Citations (3) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title WO2005079114A1 (en) * 2004-02-18 2005-08-25 Yamaha Corporation Acoustic reproduction device and loudspeaker position identification method CN102883246A (en) * 2012-10-24 2013-01-16 武汉大学 Simplifying and laying method for loudspeaker groups of three-dimensional multi-channel audio system CN103369453A (en) * 2012-03-30 2013-10-23 三星电子株式会社 Audio apparatus and method of converting audio signal thereof Family Cites Families (10) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title US6072878A (en) * 1997-09-24 2000-06-06 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics US8249283B2 (en) * 2006-01-19 2012-08-21 Nippon Hoso Kyokai Three-dimensional acoustic panning device JP5448451B2 (en) * 2006-10-19 2014-03-19 パナソニック株式会社 Sound image localization apparatus, sound image localization system, sound image localization method, program, and integrated circuit GB0712998D0 (en) * 2007-07-05 2007-08-15 Adaptive Audio Ltd Sound reproducing systems CN101889307B (en) 2007-10-04 2013-01-23 创新科技有限公司 Phase-amplitude 3-D stereo encoder and decoder EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data CN102026086B (en) * 2010-12-01 2013-08-21 国光电器股份有限公司 Method for mixing down multiple channels into 3-channel surrounding sound CA2819394C (en) * 2010-12-03 2016-07-05 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Sound acquisition via the extraction of geometrical information from direction of arrival estimates CN102790931B (en) * 2011-05-20 2015-03-18 中国科学院声学研究所 Distance sense synthetic method in three-dimensional sound field synthesis CN103618986B (en) 2013-11-19 2015-09-30 深圳市新一代信息技术研究院有限公司 The extracting method of source of sound acoustic image body and device in a kind of 3d space Patent Citations (3) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title WO2005079114A1 (en) * 2004-02-18 2005-08-25 Yamaha Corporation Acoustic reproduction device and loudspeaker position identification method CN103369453A (en) * 2012-03-30 2013-10-23 三星电子株式会社 Audio apparatus and method of converting audio signal thereof CN102883246A (en) * 2012-10-24 2013-01-16 武汉大学 Simplifying and laying method for loudspeaker groups of three-dimensional multi-channel audio system Also Published As Similar Documents Publication Publication Date Title CN103618986B (en) 2015-09-30 The extracting method of source of sound acoustic image body and device in a kind of 3d space CN104956695B (en) 2017-06-06 It is determined that the method and apparatus of the renderer for spherical harmonics coefficient CN106104680B (en) 2019-08-23 Voice-grade channel is inserted into the description of sound field US10097943B2 (en) 2018-10-09 Apparatus and method for reproducing recorded audio with correct spatial directionality US10117039B2 (en) 2018-10-30 Audio apparatus and method of converting audio signal thereof CN105264911A (en) 2016-01-20 Audio apparatus CN105981411A (en) 2016-09-28 Multiplet-based matrix mixing for high-channel count multichannel audio CN102422348A (en) 2012-04-18 Audio format transcoder US20160066118A1 (en) 2016-03-03 Audio signal processing method using generating virtual object US20170347218A1 (en) 2017-11-30 Method and apparatus for processing audio signal US20140372107A1 (en) 2014-12-18 Audio processing US20230305800A1 (en) 2023-09-28 Video-informed Spatial Audio Expansion US9838790B2 (en) 2017-12-05 Acquisition of spatialized sound data CN103826194A (en) 2014-05-28 Method and device for reconstructing sound source direction and distance in multi-channel system Lin et al. 2021 Exploiting audio-visual consistency with partial supervision for spatial audio generation US10547962B2 (en) 2020-01-28 Speaker arranged position presenting apparatus US20160044432A1 (en) 2016-02-11 Audio signal processing apparatus CN102883246A (en) 2013-01-16 Simplifying and laying method for loudspeaker groups of three-dimensional multi-channel audio system CN103065634A (en) 2013-04-24 Three-dimensional audio space parameter quantification method based on perception characteristic US20150310869A1 (en) 2015-10-29 Apparatus aligning audio signals in a shared audio scene WO2022170716A1 (en) 2022-08-18 Audio processing method and apparatus, and device, medium and program product CN110890100A (en) 2020-03-17 Voice enhancement method, multimedia data acquisition method, multimedia data playing method, device and monitoring system CN103607690A (en) 2014-02-26 Down conversion method for multichannel signals in 3D (Three Dimensional) voice frequency Luo et al. 2024 Multi-Modality Speech Recognition Driven by Background Visual Scenes CN105895106B (en) 2020-01-24 Panoramic sound coding method Legal Events Date Code Title Description 2014-03-05 PB01 Publication 2014-03-05 PB01 Publication 2014-04-02 C10 Entry into substantive examination 2014-04-02 SE01 Entry into force of request for substantive examination 2015-09-30 C14 Grant of patent or utility model 2015-09-30 GR01 Patent grant

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4