Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( )...

49
Microphone Array project in MSR: approach and results Ivan Tashev Microsoft Research June 2004

Transcript of Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( )...

Page 1: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Microphone Array project in MSR:approach and results

Ivan TashevMicrosoft ResearchJune 2004

Page 2: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Agenda

Microphone Array projectBeamformer design algorithmImplementation and hardware designsDemo

Page 3: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Motivation

PCs today have pretty bad “ears”; audio captured or recorded from PCs sounds terrible (especially with laptops) – unless a good headset is used.Sound will play more and more important role in human-computer interaction, especially in devices without keyboard (tablets, handhelds)Increases using computers in collaboration and communicationUsers don’t like headsets or other tethered microphones, especially in a video call.Existing wireless solutions do not provide enough good sound quality, you have to wear them

Page 4: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Microphone array project: goals

Far goal: sound capturing quality for untethered user the same as with close-up microphoneNear goal: Create technology for OS support and devices so cheap to become commodity on the marketBeamforming is ability to make the microphone array to listen to given location, suppressing the signals coming from other locations

Page 5: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Target scenarios

Real-time communications– Providing good sound capturing for Windows Messenger, MSN

Messenger, other applications built on top of the RTC stack– New applications for VoIP and enhanced telephony

Collaboration and groupware– High quality sound from meeting rooms for recording and

broadcasting purposes (OneNote)– Voice messaging

Speech recognition– Voice commands for Tablet PCs and handhelds– Voice control and dictation for PCs and laptops

Page 6: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Problems

“Wear nothing” approach requires using separate microphones: connected or integrated These microphones deliver poor sound capturing quality:

– Too much ambient and electronic noises– Reverberation and reflections – poor user experience and

bad speech recognition results Noise suppression and de-reverberation are difficult with a single microphone channel

Page 7: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

The solution

Using microphone arrays for capturing the sound– A set of close positioned microphones– Synchronous capturing of the signals

Microphone Array acts as an acoustic antenna – This is called spatial filtering or beamforming– Listens only to the direction of the speaker– Reduces the noises from other directions– Reduces the reverberation

Page 8: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamforming: known approaches

Fixed beam formation– Delay and sum – most intuitive, irregular beam

shape– Parametric solutions: very complex– Fast real-time execution

Adaptive beamformers– Generalized side lobe canceller– Vary with the target criteria (MVDR, etc.)– Slow adaptation, CPU time intensive

Page 9: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamforming: known approaches

Fixed beam formation– Delay and sum – most intuitive, irregular beam

shape– Parametric solutions: very complex– Fast real-time execution

Adaptive beamformers– Generalized side lobe canceller– Vary with the target criteria (MVDR, etc.)– Slow adaptation, CPU time intensive

Page 10: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamforming: known approaches

Fixed beam formation– Delay and sum – most intuitive, irregular beam

shape– Parametric solutions: very complex– Fast real-time execution

Adaptive beamformers– Generalized side lobe canceller– Vary with the target criteria (MVDR, etc.)– Slow adaptation, CPU time intensive

Page 11: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: canonical form

Canonical form of the beamformer:

M – number of microphonesXi(f) – spectrum of i-th channelW(f,i) – weight coefficients matrixY(f) – output signal

For each weight matrix we have corresponding shape of the beam - the array gain as function of directionThe goal is to find weight matrix to satisfy certain criteria

∑−

==

1

0)(),()(

M

ii fXifWfY

),,( fB θϕ

Page 12: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Array parameters

Noise = ambient + non-correlated + correlated (jammers and reverberation)Ambient noise gain

Non-correlated noise:

Correlated (from given direction):

The total noise gain is the combination of the first two

∫ ∫ ∫+

2

0

2

0

2

2

),,()(log20

Sf

dfddfBfNπ

π

πϕθθϕ

2

0

2

0

),,()(

),,()(log20

S

S

f

JJ

f

SS

dffBfJ

dffBfS

θϕ

θϕ

⎥⎥⎥

⎢⎢⎢

∫ ∑−

=

2

0

1

0

2),(log20

SfM

idfifW

Page 13: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Weights calculation

Weights calculation as optimization processMinimization criterion: the total noise gainMultidimensional optimization

– Slow, especially in real time (adaptive beamformers)– Can’t follow the changes

Multimodal 2M dimensional hypersurface – local minima In all cases the starting point is critical

Page 14: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Weights calculation (2)

Our approach:– Deterministic beam formation– Use as much prior info as possible– Do your homework: calculate the weights in

advance– Calculate set of beams to cover the work volume– Fast real-time engine: switches the beams on the

fly

Page 15: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Prior Info

Prerequisites:– Microphone array geometry – microphones coordinates and

orientation– Directivity response of the microphones Um(f,c)– Hardware noise model NI(f)– Ambient noise model NA(f)

Page 16: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Prior Info

Prerequisites:– Microphone array geometry – microphones coordinates and

orientation– Directivity response of the microphones Um(f,c)– Hardware noise model NI(f)– Ambient noise model NA(f)

Page 17: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Prior Info

Prerequisites:– Microphone array geometry – microphones coordinates and

orientation– Directivity response of the microphones Um(f,c)– Hardware noise model NI(f)– Ambient noise model NA(f)

Page 18: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Prior Info

Prerequisites:– Microphone array geometry – microphones coordinates and

orientation– Directivity response of the microphones Um(f,c)– Hardware noise model NI(f)– Ambient noise model NA(f)

0 1000 2000 3000 4000 5000 6000 7000 8000-100

-90

-80

-70

-60

-50

-40

-30

-20

Page 19: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Beamformer: Prior Info

Prerequisites:– Microphone array geometry – microphones coordinates and

orientation– Directivity response of the microphones Um(f,c)– Hardware noise model NI(f)– Ambient noise model NA(f)

Page 20: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 21: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Beams at 1250 Hz

-0.2

0

0.2

0.4

0.6

0.8

1

1.2

0 100 200 300 400

Angle, deg

Gai

n Desired Delay and sum

Page 22: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 23: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 24: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 25: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 26: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 27: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠Set of design beams

-0.2

0

0.2

0.4

0.6

0.8

1

1.2

0 50 100 150 200 250 300 350 400

Angle, deg

Gai

ns

Page 28: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 29: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 30: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 31: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 32: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 33: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Pattern synthesis

Design in the beamspaceDefine the target beam shape:

Define the weight functionCombine the microphone directivity patterns using weighted MMSE

T1xL = V1xLDMxLMMxLW1xM

Do the design in 3D

( ) ( ) ( )( , , , ) cos cos cosT T TTk

π ρ ρ π ϕ ϕ π θ θρ ϕ θ δδ δ δ− − −⎛ ⎞ ⎛ ⎞ ⎛ ⎞= ⎜ ⎟ ⎜ ⎟ ⎜ ⎟

⎝ ⎠ ⎝ ⎠ ⎝ ⎠

Page 34: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Dimensions reduction

Dimensions reduction: from 2M to 1Two controversial processes:– Narrow beam: better ambient noise reduction– Wide beam: better internal noise reduction

One dimensional search: beam widthCover the whole frequency bandCalculate set of beams

Page 35: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

On next charts:

Z-axis: noise gain in dBX-axis: frequency, logarithmic, 1-100Hz, 2-200 Hz, 3-400Hz, …7-6400HzY-axis: beam width, linear, 0 – 1800, every 50, 33-150.

Page 36: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Ambient noise gain

05

1015

2025

3035 1

2

3

4

5

6

7

-80

-60

-40

-20

0

Noise gain

Frequency

Beamwidth

Page 37: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Non-correlated noise gain

0

10

20

30

401

23

45

67

-40

-20

0

20

40

60

80

100

120

Noise gain

Frequency

Beamwidth

Page 38: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Total noise gain

05

1015

2025

3035

12

34

56

7

-40

-20

0

20

40

60

80

Noise gain

Frequency

Beamwidth

Page 39: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Dimensions reduction

Dimensions reduction: from 2M to 1Two controversial processes:– Narrow beam: better ambient noise reduction– Wide beam: better internal noise reduction

One dimensional search: beam widthCover the whole frequency bandCalculate set of beams

Page 40: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Implementation: overall

MASynthesis.exe

MicArr.INI Weights.dat

AEC MABeamformer Noise Suppression

Offline –Design the weights

Real time – just use pre-calculatedweights

Page 41: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Implementation: Real-time engine

Beamformer

SSL Gaincalibration

N-channelsinput stream

Mono outputstream

Geometry Weights

Beam selection Gains correction

Page 42: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Hardware designs

USB MicArray Prototypes– 4-mic desktop – 8-mic conference tabletop– Bus-powered (no power grid)– Compatible with USB audio (no device

drivers to install)

Integrated in laptops/monitors

Page 43: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Results: noise suppression

Microphone Array noise suppression– Provides itself 14-18 dB ambient noise suppression– Helps the noise suppressor to do better job– More at http://micarray

One of the best technologies on the market

27.82-33.86-61.68MSR 4 elements + WinXP NS

33.27-32.14-64.41MSR 4 elements + New NS

28.79-32.6-62.39Acoustic Magic, 8 element MA, $250

25.53-26.19-51.72Andrea DA 400 2.0, 4 el. MA, $135

34.42-30.04-64.46Close-Up Microphone

10.6-33.91-44.51Unidirectional Microphone

4.89-40.64-45.53Omni-directional Microphone

SNRSignalNoiseDevice

Page 44: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Results: speech recognition

Microphone Arrays for speech recognition– Linear processing, speech recognition friendly– Reduces ambient noises– Partial de-reverberation

Results

4 element array, Yakima SAPI 5.2374 utterances, 7 speakers(4 male, 3 female), age 25-53

2:356.171Close-up

3:3413.683MSR MicArray+NS

4:0314.22MSR MicArray

3:1717.9VoiceTracker

3:2520.391PC Mic

TimeError rate, %Device

Speech Recognition Error

0

5

10

15

20

25

PC Mic

VoiceTra

cker

MSR MicA

rray

MSR MicA

rray+

NS

Close-up

Device

Erro

r rat

e, %

Page 45: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Results: conclusions

Ambient noise suppression– The current technology provides good noise suppression

under the quality requirements constrains– Telecommunication scenario has good quality sound– Meetings recording for listening purposes – OK.

Speech recognition results– Need improvement– Reverberation as major reason– Important for recorded meetings search technology

Page 46: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Microphone Array - Example

Person speaking at 3 ft from microphones

MSR USB desktop array SNR=42.5dBCompetitor (HW DSP) SNR=34.4dB

PC mic + WinXP noise reduction SNR=18.4 dBTypical $10 PC microphone SNR=10.3 dB

Page 47: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Microphone array - demo

First demo:– Records in parallel the output of the microphone

array and a regular PC microphone.– After this merges both WAV files to one file …– … and plays it with CoolEdit.

Second demo: ClearMessage application

Page 48: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Take outs

Most of our projects are optimization in one way or another:Try carefully to define the optimization criterionReduce the number of dimensions as much as possibleChoose the method, especially if there are too many papers and no definite answer

Page 49: Microphone Array project in MSR: approach and results · PDF filezDo the design in 3D ( ) ( ) T( , , , ) cos cos cosTT T k π ρρπϕϕ πθ

Finally

Questions?

Contact: [email protected]: http://research.microsoft.com/users/ivantash/