Sample Rate Conversion

Objectives

Learn about compression and expansion operations on discrete-time signals.
Learn about downsampling and upsampling operations on discrete-time signals.
Observe the effects of compression and expansion in the time and frequency domains.
Gain some experience designing filters and applying them to signals.

Mathematical Background

In class we have derived formulas for sampling and reconstruction and downsampling and upsampling. These derivations and visualizations are available in these slides. (See also these slides.) Here are slide summaries (math and diagrams) for the four operations.

We summarize these formulas here.

$F_s = \frac{1}{T}$ :

\begin{matrix} Y (F) = \frac{1}{T} \sum_{k = - \infty}^{\infty} X (F - \frac{k}{T}) (CT aliasing formula) \\ Z (f) = \frac{1}{T} \sum_{k = - \infty}^{\infty} X (\frac{f - k}{T}) (CT sampling formula) \end{matrix}

$f_s = \frac{1}{D}$ :

\begin{matrix} Y (f) = \frac{1}{D} \sum_{k = 0}^{D - 1} X (f - \frac{k}{D}) (DT aliasing formula) \\ Z (f) = \frac{1}{D} \sum_{k = 0}^{D - 1} X (\frac{f - k}{D}) (DT sampling formula) \end{matrix}

Note the strong resemblance between these formulas. There are two main differences.

$T$ $D$ [samples].
$X(F)$ $X(f)$ is periodic with period 1.

The formulas for reconstruction and upsampling are summarized here.

$F_s=\frac{1}{T}$ :

\begin{matrix} Y (F) = X (F T) (scale frequency axis) \\ Z (F) = {\begin{cases} X (F T), & | F | \leq \frac{1}{2 T} \\ 0, & \frac{1}{2 T} < | F | . \end{cases} (LPF to remove images) \end{matrix}

$f_s = \frac{1}{U}$ :

\begin{matrix} Y (f) = X (f U) (scale frequency axis) \\ Z (f) = {\begin{cases} X (f U), & | f | \leq \frac{1}{2 U} \\ 0, & \frac{1}{2 U} < | f | \leq \frac{1}{2} . \end{cases} (LPF to remove images) \end{matrix}

Note the strong resemblance between these formulas. The main difference is the periodic nature of the discrete-time spectra.

Sample Rate Conversion

$U$ $D$ $H(f)$ $\frac{1}{2U}$ $\frac{1}{2D}$ .

Sample rate conversion system

Observation 1: $x[n]$ $F_s$ $y[n]$ $\frac{U}{D} F_s$ $U$ $D$ (which must positive integers), the sample rate may be changed by any rational factor (at least in theory).

\begin{matrix} F_{s} (input sample rate) \\ \frac{U}{D} F_{s} (output sample rate) \end{matrix}

Observation 2: $x[n] = \cos(2\pi f_0 n)$ $f_0< \min\left(\frac{1}{2U}, \frac{1}{2D}\right)$ $U$ $f_0/U$ $\frac{D}{U} f_0$ $D$ $y[n] = \cos\left(2\pi \frac{D}{U}f_0 n\right)$ .

\begin{matrix} f_{0} (input frequency) \\ \frac{D}{U} f_{0} (output frequency) \end{matrix}

Observation 3: $\frac{D}{U}$ $\frac{U}{D}$ $F_0 = f_0 F_s$ $F_0 = \left(\frac{D}{U} f_0\right) \left(\frac{U}{D} F_s\right) = f_0 F_s$ Hz, which his the same frequency at input as at output.

Multistage Sample Rate Conversion

$x[n]$ $F_x = 11025$ $y[n]$ $F_y=8000$ S/s. Using the sample rate conversion system from above we have:

F_{y} = 8000 = \frac{U}{D} F_{x} = \frac{U}{D} 11025.

$D/U$ we obtain

\frac{D}{U} = \frac{11025}{8000} = \frac{441}{320} = \frac{3 \cdot 3 \cdot 7 \cdot 7}{2 \cdot 2 \cdot 2 \cdot 2 \cdot 2 \cdot 2 \cdot 5} = \frac{1}{2} \cdot \frac{3}{2} \cdot \frac{3}{4} \cdot \frac{7}{4} \cdot \frac{7}{5} .

$U=320$ $D=441$ $U=320$ $441$ $320$ may be factored into products of small primes suggesting that the sample rate may be converted using a series of five stages with rather low complexity. The assignment is to implement these five sample rate conversion stages. For each stage, use the low pass filter with the trapazoidally shaped magnitude response that we used in CA2.

Illustration

The figures below illustrate the signal spectrum for each of the five stages. In each figure, there are five subplots as follows.

Spectrum of the signal input to that stage.
Spectrum of the upsampled signal along with the magnitude response of the low pass filter.
Spectrum of the signal at the output of the low pass filter.
Spectrum of the aliased signal (aliasing formula = replication).
Spectrum of the signal output from that stage (downsampling formula = frequency scaling).

$U$ $D$ $f_p$ $f_s$ .

Note: In the example below, I assumed that the input signal had a triangular shaped spectrum.

Stage 1: U=2, D=1

Stage 1 sample rate conversion

Stage 2: U=2, D=3

Stage 2 sample rate conversion

Stage 3: U=4, D=3

Stage 3 sample rate conversion

Stage 4: U=4, D=7

Stage 4 sample rate conversion

Stage 5: U=5, D=7

Stage 5 sample rate conversion

$f_s = 0.101562$ $f_s = \frac{1}{2D} = \frac{1}{14} = 0.0714$ . This would be a small change to make. Aliasing was avoided in all prior stages.

$11025$ $8000$ S/s. The spectral plots are shown below in dB scaling.

Stage 1: U=2, D=1

Stage 1 sample rate conversion

Stage 2: U=2, D=3

Stage 2 sample rate conversion

Stage 3: U=4, D=3

Stage 3 sample rate conversion

Stage 4: U=4, D=7

Stage 4 sample rate conversion

Stage 5: U=5, D=7

Stage 5 sample rate conversion

Designing Filters

The tricky part of sample rate conversion is designing the filters. Let's figure this out.

There are three frequencies that we have to keep track of in each stage of sample rate conversion. These are as follows.

$f_p$ $|f|\leq f_p$ must be protected from aliasing and should be free attenuation free.
$f_m$ $f_p < |f| \leq f_m$ $f_m$ $f_m$ .
$f_s$ $H(f)$ $U$ $f_s = (1-f_m)/U$ $D$ $f_s = 1/2D$ . In general, we will let

f_{s} = min (\frac{1 - f_{m}}{U}, \frac{1}{2 D}) .

$U$ $D$ $U$ -fold expander maps the following critical frequencies:

\begin{aligned} f_{p} ⟶ & f_{p}^{'} = \frac{f_{p}}{U} \\ f_{m} ⟶ & f_{m}^{'} = \frac{f_{m}}{U} \\ 1 ⟶ & \frac{1}{U} & (center frequency of first image) \\ (1 - f_{m}) ⟶ & f_{s} = \frac{1 - f_{m}}{U} & (lowest frequency in first image) \\ f_{s} = \frac{1}{2 D} & (lowest frequency in aliasing band) \end{aligned}

$H(f)$ $f_p'$ $f_s$ $f_s$ $f_m'$ $f_s$ , whichever is smaller,

f_{m}^{'} = min (\frac{f_{m}}{U}, \frac{1}{2 D}) .

$D$ -fold compression the output pass and maximum frequencies are given by:

\begin{matrix} f_{p}^{″} = D f_{p}^{'} = \frac{D}{U} f_{p}, \\ f_{m}^{″} = D f_{m}^{'} = min (\frac{D}{U} f_{m}, \frac{1}{2}) . \end{matrix}

$f_p = 3800/11025 = 0.3447$ $f_m = 0.5$ cycles/sample.

$U=2, D=1$ $U=2$ fold expansion.

\begin{aligned} f_{p} = 0.3447 ⟶ & f_{p}^{'} = \frac{f_{p}}{U} = 0.1723 \\ f_{m} = 0.5 ⟶ & f_{m}^{'} = \frac{f_{m}}{U} = 0.25 \\ 1 ⟶ & \frac{1}{U} = 0.5 & (center frequency of first image) \\ (1 - f_{m}) = 0.5 ⟶ & f_{s} = \frac{1 - f_{m}}{U} = 0.25 & (lowest frequency in first image) \\ f_{s} = \frac{1}{2 D} = 0.5 & (lowest frequency in aliasing band) \end{aligned}

$H(f)$ should have a passbands edge frequency of 0.1723 cycles/sample and a stop band edge frequency of 0.25 cycles/sample. The filter is designed to reject images.

The critical frequencies in the filter output are

\begin{matrix} f_{p}^{'} = 0.1723, \\ f_{m}^{'} = 0.25 . \end{matrix}

$D=1$ fold compression, the critical frequencies at the output of this stage are the same

\begin{matrix} f_{p}^{″} = 0.1723, \\ f_{m}^{″} = 0.25 . \end{matrix}

Now we are ready to apply these principles to the subsequent stages.

Assignment

$F_x = 11025$ $F_y = 8000$ S/s.
Label the block diagram to indicate the sample rate at the output of each block in the system diagram.
$f_0=1000$ Hz. Label the block digram to indicate the frequency of this signal at the output of each block in the system diagram.
Write a Matlab function that performs one stage of sample rate conversion (SRC). The signature of the function should be:


xxxxxxxxxx
[y,fp,fm,Fs] = src(x,fp,fm,U,D,Fs)

$x$ $f_p$ $f_\text{max}$ $U$ $D$ $F_s$ $y$ $f_p, f_m, F_s$ $D/U$ $F_s$ $U/D$ by the input sample rate.

Your function should design and do all the processing for the sample rate conversion for that one stage.

Your function should make plots like those shown above.

Call your function five times to perform the five stage sample rate conversion on this audio file.
Turn in your plots of the signal spectra for each stage.
$f_p, f_m$ $f_s$ for each stage.
Include a spectrogram plot of the input signal and the final output signal.
Turn in your code.