TODO: Still many unknowns but we might have a good base to start with.

Note

This assumes the wave are planar otherwise beam forming won’t work

Let’s start with circular microphones. Let us suppose microphones at known position or Our goal is to estimate where the sound is coming from. First we assume that sound that we are getting is planar. This means the speaker is far away that sound is |||||||| instead of ))))))).

  • where time at which wavefront hits the reference point at the center. This is not required as it cancels out. as we are doing
  • which is unit direction of arrival. angle theta is made between source and the speaker
  • = where is transpose
  • is just projection of direction u. Geometrically it’s
  • since we are working on Time = distance / velocity we get

Now we don’t even need because we are doing and what we want is or .

Now the new equation for microphone signal : signal recorded at microphone at time . : desired (target) source waveform (e.g., the speaker we want to keep). : propagation delay from the desired source to microphone (under the planar-wave model, ). : waveform of the -th interfering source (another speaker/noise source). : propagation delay from interferer to microphone . : sum over all interfering sources present in the environment.

Now for the beam forming

: beamformer output signal after spatial filtering. : total number of microphones in the array. : weight applied to microphone (for delay-and-sum beamforming, ). : signal recorded at microphone . : predicted propagation delay for microphone . The hat is present because this is predicted assuming the sound arrives from direction (computed from array geometry). : unit direction-of-arrival vector of the desired source. Each microphone records a time-shifted version of the same desired signal: .

The time shifts depend on the spatial position of the microphone relative to the incoming wavefront. By delaying each by , signals arriving from direction are aligned in time. Once aligned, the desired signal adds constructively across microphones, increasing its amplitude. Signals arriving from other directions are misaligned and therefore add destructively, resulting in attenuation.